Sr. System Engineer

Support Revolution

San Jose (CA)

On-site

USD 140,000 - 158,000

Full time

10 days ago

Job summary

A leading technology company is looking for a Senior System Engineer in San Jose, California. This role is critical in designing and deploying enterprise IT solutions, with responsibilities including system testing and infrastructure support. Candidates should have substantial experience with server configurations, cloud environments, and programming.

Qualifications

  • Experience in server/network/storage hardware configuration and troubleshooting (5-8 years).
  • Familiarity with AI/ML frameworks like PyTorch or TensorFlow.
  • Strong programming skills in Python and experience with DevOps.

Responsibilities

  • Deploy rack/cluster infrastructure and conduct system-level testing.
  • Write technical documents and deliver on-site deployment services.
  • Provide operational support for cluster and cloud infrastructure.

Skills

Python
Problem Solving
Communication
Teamwork

Education

BS/MS in Electrical Engineering or Computer Engineering

Tools

Docker
Kubernetes
Python
shell scripting
OpenStack
Azure
AWS

Job description

Location: San Jose, California, United States

About Supermicro:

Supermicro is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC, and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

As a global leader in server technologies, Supermicro has been growing rapidly in key markets such as Cloud Computing, Big Data, HPC, AI, and Storage. To meet market demand, Supermicro is developing end-to-end enterprise IT solutions with compute, storage, and networking integrated into full-rack or multi-rack systems. The Senior System Engineer plays an important role in designing, implementing, testing, and deploying rack system solutions for data center and enterprise customers.

Essential Duties and Responsibilities:

Includes the following essential duties and responsibilities (other duties may also be assigned):
• Deploy rack/cluster infrastructure and execute comprehensive system-level testing on the latest GPUs, CPUs, networking, and storage, covering functionality, compatibility, performance, stress, and reliability, using proprietary in-house tools
• Conduct proof-of-concept design and testing. Build expertise in HPC/AI applications and benchmarks, and deliver optimized benchmark results by fine-tuning system settings and OS/network configurations. Apply strong problem-solving skills to build robust processes and procedures for HPC/AI solutions
• Provide operational support for Cluster, Storage, HPC and Cloud infrastructure. Identify and document hardware and software quality issues. Collaborate with product management and other Engineering teams to integrate enhancements into future products
• Write technical documents covering test procedures, test reports, and troubleshooting procedures for server, network, and cluster software and hardware to facilitate knowledge sharing
• Deliver on-site deployment services to ensure customer acceptance verification and satisfaction
• Write automation tools for cluster deployment and test environments

Qualifications:

• BS/MS in Electrical Engineering, Computer Engineering or a related field, MS preferred
• 5-8+ years of work-related experience in server/network/storage hardware configuration, testing, debugging and troubleshooting
• 5-8+ years of work-related experience in DevOps or in cloud environments, including but not limited to Docker/Containers and Kubernetes
• Experience with AI/ML frameworks such as PyTorch or TensorFlow
• Familiarity with the TCP/IP protocol stack, UDP, IPv4/IPv6, DNS, DHCP, and other application protocols
• Familiarity with HPC, AI, or cloud benchmark tests and networking architecture
• Excellent programming skills in Python and shell scripting
• Strong communication skills and a strong sense of teamwork
• Familiarity with MLPerf Training/Inference benchmarks, LLMs, HPL-AI, or RCCL/NCCL is a plus
• CCNA, OpenStack, OpenShift, Azure, or AWS experience is a plus

Salary Range

$140,000 - $158,000

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
