Enable job alerts via email!

Distinguished Engineer – Data Center System Software Architect

NVIDIA

Santa Clara (CA)

On-site

USD 308,000 - 472,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a dynamic technical architect to lead the architecture of cutting-edge data center systems. This role involves collaborating with top cloud service providers and driving technological innovation. The ideal candidate will have extensive experience in system architecture, particularly in scalable server systems, and a deep understanding of system software for accelerators. Join a team at the forefront of AI and HPC advancements, where your contributions will shape the future of computing and technology. If you are passionate about creating impactful solutions and thrive in collaborative environments, this opportunity is for you.

Benefits

Equity
Comprehensive benefits package
Diversity and inclusion initiatives

Qualifications

  • 20+ years in system architecture and design with deep expertise in server systems.
  • Experience with complex system software for GPUs, FPGAs, and embedded systems.

Responsibilities

  • Lead technical innovation for next-gen data center products and engage with major customers.
  • Make critical technical decisions and mitigate risks through strategic collaborations.

Skills

System architecture
System software for accelerators
Linux kernel internals
Networking technologies
Device management protocols
Cross-functional project leadership

Education

BS in Computer Science
MS in Electrical Engineering

Tools

OpenBMC
CUDA
cuDNN
DOCA

Job description

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level.

Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.

What you’ll be doing:

  1. Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries.
  2. As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products.
  3. Align NVIDIA's roadmap with major customers' requirements through direct engagement.
  4. Develop and drive adoption of new technologies and protocols.
  5. Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies.

What we need to see:

  1. Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.
  2. Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).
  3. Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.
  4. Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI).
  5. Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts.
  6. Experience collaborating with platform security experts to define tradeoffs between security and ease of use.
  7. Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution.
  8. BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience).
  9. 20+ years in the area of System architecture and design.

Ways to stand out from the crowd:

  1. Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF.
  2. Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA).
  3. Knowledge of enterprise storage architectures and distributed parallel processing paradigms.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you!

NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

The base salary range is 308,000 USD - 471,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Architect-Cloud-CA or WA preferred

Juniper Networks, Inc

Sunnyvale

Remote

USD 284,000 - 409,000

8 days ago

Distinguished Engineer – Data Center System Software Architect

NVIDIA

Remote

USD 308,000 - 472,000

26 days ago

Senior Software Architect - Deep Learning and HPC Communications

NVIDIA Corporation

Santa Clara

On-site

USD 184,000 - 357,000

2 days ago
Be an early applicant

Distinguished Technologist, PreSales Architect

Hewlett Packard Enterprise Development LP

Oregon

Remote

USD 203,000 - 493,000

Today
Be an early applicant

Presale Solution Architect

KMS Technology

Orlando

Remote

USD 300,000 - 350,000

Yesterday
Be an early applicant

Senior Operating System Architect

NVIDIA

Remote

USD 184,000 - 357,000

2 days ago
Be an early applicant

Enterprise Architect, Front Office

Kyndryl

Durham

Remote

USD 173,000 - 330,000

2 days ago
Be an early applicant

Regional Chief Architect – Cloud and Partner Ecosystem

Thoughtworks

Chicago

Remote

USD 202,000 - 324,000

2 days ago
Be an early applicant

Principal Enterprise Architect

Addepar

Remote

USD 203,000 - 318,000

6 days ago
Be an early applicant