Senior Solutions Architect, HPC Systems Engineer

NVIDIA

United States

Remote

USD 184,000 - 287,500

Full time

Yesterday

Job summary

NVIDIA is seeking a skilled Solutions Architect & Engineer to lead deployments of AI hardware and software in customer data centers. You will serve as a trusted advisor, guiding technical discussions and demonstrating expertise in advanced GPU and network systems. Join a team dedicated to innovation and technology in this dynamic role.

Benefits

Equity options
Comprehensive benefits
Diverse work environment

Qualifications

  • 8+ years of Systems/Solution Engineering experience.
  • System level expertise in CPU/GPU architectures and Linux.
  • Knowledge of networking switches and data center infrastructure.

Responsibilities

  • Work with customers on GPU server and networking system deployments.
  • Provide technical guidance and support for system configurations.
  • Conduct technical meetings with customers to discuss product roadmaps.

Skills

Communication
Time Management
Debugging
Networking Switches
System Architecture
DevOps/MLOps
C/C++

Education

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, or similar

Job description

NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will drive the deployment and integration of our end-to-end technology solutions at some of NVIDIA's most strategic technology customers, and offer recommendations to business and engineering teams on our product roadmap.

What you will be doing:

  • Work with NVIDIA AI Native, Consumer Internet, and Enterprise customers on large data center GPU server and networking system deployments as a Solutions Architect Engineer. Guide customer discussions on network design and compute/storage, and support the bring-up of server/network/cluster deployments. You will need to visit customer data centers during the bring-up phase.

  • Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical advisor to NVIDIA's strategic customers. Bring customer-specific requirements to product teams to guide product roadmap features.

  • Identify new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the GPU/Network Systems Engineering, Product Management, and Sales teams.

  • Act as a trusted customer advisor, conducting regular technical meetings with customers to cover the product roadmap, debugging of cluster issues, feature discussions, and introductions to new technology solutions.

  • Build custom product demonstrations and POCs for solutions that address our customers' critical business needs.

  • Analyze and debug compute/network configuration and performance issues to deliver performant clusters.

  • We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site customer visits and industry events. We are open to remote work locations and look forward to having you join our team!

What we need to see:

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or another Engineering field, or equivalent experience.

  • This role is for an individual with the motivation and skills to drive the data center engineering process. The ideal candidate has 8+ years of experience in Systems/Solution Engineering or similar Engineering roles.

  • System-level expertise in CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers

  • Experience with networking switches for Ethernet/InfiniBand, and with data center infrastructure (power/cooling)

  • Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes

  • Effective time management and the ability to balance multiple tasks

  • Strong verbal and written communication skills, with the ability to share your ideas and code clearly through documents, presentations, etc.

Ways to stand out from the crowd:

  • External customer-facing background

  • Experience with the bring-up and deployment of large clusters

  • Systems engineering, coding, and debugging skills, including experience with C/C++ and the Linux kernel and drivers

  • Hands-on experience with NVIDIA GPU systems/SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g. NICs, RoCE, InfiniBand), and/or ARM CPU solutions

  • Familiarity with virtualization technology concepts

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 184,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.