Senior Software Engineer, DGX Cloud Orchestration

NVIDIA

United States

Remote

USD 136,000 - 264,500

Full time

30+ days ago

Job summary

NVIDIA is seeking a Senior Software Engineer to join its DGX Cloud team. In this role, you will design and develop scalable automation solutions that support NVIDIA’s high-performance GPU infrastructure. Expertise in building GraphQL and REST APIs, along with proficiency in Go, Java, or Python, is central to optimizing cloud operations. The team works on technology for AI and cloud computing in a collaborative and diverse environment.

Benefits

Equity
Comprehensive benefits package
Flexible work hours
Diversity and inclusion initiatives

Qualifications

  • 5-9+ years of experience with a Bachelor's or Master's degree, or 2+ years with a PhD.
  • Expertise in building GraphQL and REST APIs, proficiency in Go, Java, or Python.

Responsibilities

  • Design and develop APIs to orchestrate and integrate operational workflows.
  • Build automation systems that streamline infrastructure lifecycle processes.

Skills

GraphQL
REST APIs
Go
Java
Python
JavaScript
Kubernetes
Docker
AWS
GCP
Azure

Education

Bachelor's degree
Master's degree
PhD

Tools

Prometheus
OpenTelemetry
Grafana

Job description

We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a critical role in designing scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations. NVIDIA is widely recognized as one of the most desirable employers, with some of the most talented people in the world working for us. If you're passionate about building scalable, efficient systems to power cloud operations, we invite you to join our team.

What You'll Be Doing
  • Design and develop APIs (GraphQL/REST) to orchestrate and integrate operational workflows.

  • Build state management and workflow automation systems that streamline infrastructure lifecycle processes.

  • Collaborate across teams to codify business processes into scalable, self-measuring systems.

  • Develop extensible, schema-driven platforms for reducing manual toil and ensuring operational consistency.

  • Drive integrations with container orchestration tools like Kubernetes and observability systems such as Prometheus, OpenTelemetry, and Grafana.

  • Optimize the reliability and efficiency of cloud operations through automated workflows and telemetry systems.

  • Lead and ship impactful technical projects, ensuring quality and scalability at every stage.

What We Need to See
  • 5-9+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience), or 2+ years with a PhD.

  • Expertise in building GraphQL and REST APIs.

  • Proficiency in programming languages such as Go, Java, or Python.

  • Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).

  • Strong understanding of cloud infrastructure (AWS, GCP, Azure) and container technologies like Docker and Kubernetes.

  • Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.

  • Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.

  • A passion for automating manual processes and driving system efficiency.

Ways to Stand Out from the Crowd
  • A track record of designing workflow orchestration systems for large-scale infrastructure.

  • Proven experience in reducing operational inefficiencies through automation and integration.

  • Strong debugging and problem-solving skills in distributed environments.

NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on groundbreaking technology that powers the future of AI and cloud computing. NVIDIA leads developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention, the GPU, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 136,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
