Enable job alerts via email!

Senior Solutions Architect, HPC Systems Engineer

NVIDIA Corporation

Colorado

Remote

USD 184,000 - 288,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

NVIDIA is seeking an experienced Senior Solutions Architect, HPC Systems Engineer to support large data center GPU deployments. This role involves engagement with strategic customers, providing expert guidance on complex GPU and network systems, and requires over 8 years of related experience. You'll collaborate closely with various teams to improve product offerings and enhance customer experiences while participating in technical discussions and solution deployments.

Benefits

Equity opportunities
Comprehensive benefits

Qualifications

  • 8+ years of Systems/Solution Engineering experience.
  • Knowledge of networking switches and Data Center infrastructure.
  • Experience with DevOps/MLOps technologies.

Responsibilities

  • Guide customer discussions on network design and GPU system deployments.
  • Conduct regular technical customer meetings for product roadmaps.
  • Identify new project opportunities in AI applications.

Skills

System level expertise of CPU/GPU server architecture
Strong verbal/written communication skills
Effective time management

Education

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics

Tools

NVIDIA GPU systems/SDKs (e.g. CUDA)
Linux
Kubernetes

Job description

Senior Solutions Architect, HPC Systems Engineer page is loaded

Senior Solutions Architect, HPC Systems Engineer
Apply locations US, TX, Remote US, NC, Remote US, TN, Remote US, CO, Remote US, FL, Remote time type Full time posted on Posted 10 Days Ago job requisition id JR1998307

NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will be driving deployment of our end-to-end technology solutions integration at some of NVIDIA's most strategic technology customers, as well as offering recommendations to business and engineering teams on our product roadmap.

What you will be doing:

  • Working with NVIDIA AI Native, Consumer Internet and Enterprise customers on large data center GPU server and networking system deployments as Solution Architect Engineer. Guide customer discussions on network design, compute/storage and support bring up of server/network/cluster deployments. You will need to visit customer data center during bring up phase.

  • Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical advisor to NVIDIA's strategic customers. Bring customer-specific requirements to product teams to guide product roadmap features.

  • Identify new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the GPU/Network Systems Engineering, Product management and Sales teams

  • Work as customer trusted advisor conducting regular technical customer meetings for product roadmap, cluster issues debug, feature discussions and introduction to new technology solutions

  • Build custom product demonstrations and POCs for solutions that address critical business needs of our customers

  • Analyze and debug compute/network configuration, performance issues to deliver performant clusters

  • We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site visit to customers and industry events. We are open to remote work location and look forward to have you join our team!

What we need to see:

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.

  • This role is for an individual with the motivation and skills to drive the data center engineering process. Ideal candidate has 8+ years of Systems/Solution Engineering (or similar Engineering roles) experience

  • System level expertise of CPU/GPU server architecture, NICs, Linux, system software and kernel drivers

  • Experience with networking switches for Ethernet/Infiniband, and Data Center infrastructure (power/cooling)

  • Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes

  • Effective time management and capable of balancing multiple tasks

  • Strong verbal/written communication skills and share your ideas/code clearly through documents, presentation etc

Ways to stand out from the crowd:

  • External customer facing background

  • Experience with bringup and deployment of large clusters

  • Systems engineering, coding, and debugging skills including experience with C/C++, Linux kernel and drivers

  • Hands-on experience with NVIDIA GPU systems/SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g. NICs, RoCE, InfiniBand), and/or ARM CPU solutions

  • Familiarity with virtualization technology concepts

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 184,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Similar Jobs (5)
Senior Solutions Architect, HPC Systems Engineer
locations 6 Locations time type Full time posted on Posted 10 Days Ago
Solutions Architect, HPC Systems Engineer
locations 6 Locations time type Full time posted on Posted 30+ Days Ago
Senior Solutions Architect, Networking
locations 2 Locations time type Full time posted on Posted 30+ Days Ago

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Solutions Architect, HPC Systems Engineer

NVIDIA

Remote

USD 184,000 - 288,000

10 days ago

Software Engineer, Systems (REMSWE19)

AECOM

Austin

Remote

USD 211,000 - 241,000

4 days ago
Be an early applicant

Principal Power System Engineer

Electric Power Engineers

Austin

Remote

USD 132,000 - 198,000

5 days ago
Be an early applicant

Software Engineer, Systems

AECOM

Austin

Remote

USD 187,000 - 201,000

10 days ago

Infrastructure Systems Architect – DCI Engineering

Jabil Malaysia

Austin

Remote

USD 157,000 - 283,000

4 days ago
Be an early applicant

IT Solution Architect, Stf - 1LMX Integration - Asset Management

Lockheed Martin

Grand Prairie

Remote

USD 134,000 - 268,000

6 days ago
Be an early applicant

Lead Product Platform Engineer

Cortex

Austin

Remote

USD 215,000 - 250,000

6 days ago
Be an early applicant

Senior Software Engineer SME

SAIC

Colorado

Remote

USD 160,000 - 200,000

7 days ago
Be an early applicant

Data Architect Asc Mgr - 1LMX / Common Data Lakehouse

Lockheed Martin

Stratford

Remote

USD 113,000 - 201,000

5 days ago
Be an early applicant