Senior System Firmware Engineer, RAS - Platform Software

NVIDIA

Santa Clara (CA)

On-site

USD 184,000 - 356,500

Full time

2 days ago

Job summary

An innovative tech company is seeking a Sr Software Engineer specializing in RAS firmware for their cutting-edge Arm Data Center products. This role involves designing and developing firmware, debugging complex issues, and collaborating with cross-functional teams to enhance product reliability. Join a team that is at the forefront of AI computing, where your expertise will directly influence the future of technology. With a commitment to diversity and a culture of creativity, this position offers a unique opportunity to thrive in an environment that values innovation and problem-solving. If you're ready to make a significant impact, we want to hear from you!

Benefits

Equity options
Comprehensive health benefits
Flexible work hours
Professional development opportunities

Qualifications

  • 8+ years of relevant experience in firmware development.
  • Proven experience as a post-silicon debug engineer or in a similar role.
  • Strong programming skills in C, Python, and Perl.

Responsibilities

  • Design and develop RAS firmware for NVIDIA’s Arm Data Center products.
  • Triage and debug system and firmware-related issues.
  • Collaborate with hardware and software teams to resolve platform issues.

Skills

C
Python
Perl
Problem-solving
Attention to detail

Education

BS, MS, or PhD in Electrical Engineering, Computer Science, or a related field (or equivalent experience)

Tools

Linux
Ubuntu
RTOS
ARM-based platforms

Job description

We are looking for a Sr Software Engineer, RAS Firmware - Platform Software. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. NVIDIA DGX systems deliver the world's leading solutions for enterprise AI infrastructure at scale.

We are seeking a talented and experienced Datacenter CPU RAS (Reliability, Availability, and Serviceability) firmware engineer. As a CPU RAS Firmware Engineer, you will be responsible for designing and implementing firmware-level changes. You will collaborate closely with hardware engineers, system architects, and software developers to create designs that meet stringent reliability requirements and ensure exceptional customer experiences. Are you ready to influence the next generation of computing? Join us at the forefront of technological advancement.

What you’ll be doing:

  1. Design and develop RAS firmware for NVIDIA’s Arm Data Center products.
  2. Triage and debug system, SoC, board, and RAS firmware/UEFI-related issues on customer, reference, and production platforms.
  3. Collaborate with hardware, firmware, and software teams to design features and debug issues.
  4. Engage with customer partners to root-cause and resolve platform issues.
  5. Support manufacturing/RMA failure issues in coordination with the Quality & Reliability team.
  6. Debug and resolve hardware and firmware issues during SoC bring-up phases.
  7. Work with NVIDIA partners on RAS firmware issues to enhance their use of NVIDIA products.
  8. Contribute to all phases of product development, from definition and architecture to implementation, debugging, testing, and early customer support.

What we need to see:

  • BS, MS, or PhD in EE/CS or related field (or equivalent experience).
  • 8+ years of relevant experience.
  • Proven experience as a post-silicon debug engineer, hardware test engineer, or in a similar role.
  • Familiarity with Linux, Ubuntu, RTOS, and ARM-based platforms.
  • Understanding of datacenter server platforms and firmware.
  • Strong programming skills in C, Python, and Perl.
  • Excellent problem-solving skills and attention to detail.
  • Effective written and oral communication skills, a strong work ethic, teamwork orientation, dedication to quality, and commitment to completing tasks daily.
  • Self-motivated with a passion for creative solutions to complex problems.

Ways to stand out from the crowd:

  • Experience with ARM UEFI firmware development.
  • Background in Linux kernel development, especially writing device drivers.

NVIDIA is recognized as one of the most desirable employers in the tech industry, with innovative and dedicated professionals. If you're creative and autonomous, we want to hear from you!

The base salary range is $184,000 - $356,500, determined by location, experience, and current market rates. You will also be eligible for equity and benefits. NVIDIA accepts applications continuously.

We are committed to diversity and equal opportunity in employment. We do not discriminate based on race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, veteran status, disability, or any other legally protected characteristic.
