Enable job alerts via email!

Software Engineer, Infrastructure

OpenAI

Seattle (WA)

On-site

USD 210,000 - 405,000

Full time

9 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Software Engineer in the Infrastructure organization, where you'll tackle complex engineering challenges. In this dynamic role, you'll design and maintain scalable, reliable systems that power cutting-edge AI technologies. Collaborate closely with cross-functional teams to enhance internal tooling and improve developer experience. If you're passionate about distributed systems and thrive in a collaborative environment, this opportunity offers the chance to make a significant impact in the AI field while working on innovative projects that benefit humanity.

Qualifications

  • 4+ years of experience in software engineering with a focus on distributed systems.
  • Strong communication skills with experience leading complex projects.

Responsibilities

  • Design, build, and maintain reliable systems across engineering.
  • Collaborate with teams to define technical strategy and architecture.

Skills

Python
Go
Rust
Linux
Kubernetes
Terraform
CI/CD

Tools

Kubernetes
Terraform
CI/CD pipelines

Job description

About the Team

We're hiring Software Engineers to join our broader Infrastructure organization, which supports multiple high-impact teams. Depending on your interests and experience, you could work on one of several focus areas-including Core Distributed Systems, Reliability Engineering, Observability, Developer Productivity or Cloud Infrastructure.

About the Role

All teams are deeply collaborative, work on mission-critical services, and are responsible for building distributed, scalable infrastructure to bring OpenAI's technology to the world through products like ChatGPT and the OpenAI API. You'll work closely with stakeholders to understand infrastructure, data and compute needs, setting the technical strategy that supports cutting-edge research and product development. This is a critical role for someone who is passionate about solving complex engineering problems at scale, ensuring their performance, scalability and reliability

Team Focus Areas

  • Distributed Systems: Owning and building important, highly scalable, available, performant, and reliable distributed systems (and their building blocks) to power the entire stack at OpenAI

  • Systems Engineering: Work across layers of the stack-debugging system bottlenecks, evolving core infrastructure, and solving novel problems in performance and scalability.

  • Reliability Engineering: Build scalable, fault-tolerant systems and lead efforts around service health, incident response, and resilience.

  • Observability: Design and maintain observability tooling (metrics, logs, tracing) to give teams visibility into production systems at scale.

  • Developer Productivity: Create tools, environments, and workflows that help engineers ship high-quality software faster and more safely.

  • Cloud Infrastructure: Own the cloud-native infrastructure (compute, networking, storage) that underpins all services and research workloads.

In this role you will:

  • Design, build, and maintain reliable and performant systems used across engineering.
    Work with your team to define technical strategy, architecture, and long-term goals.

  • Collaborate with other engineers, product managers, and researchers to build infrastructure that meets evolving needs.

  • Improve internal tooling, automation, and developer experience.

  • Contribute to incident response, postmortems, and the development of best practices around system reliability and scalability.

You might thrive in this role if you:

  • Strong software engineering skills with experience in Python, Go, Rust, or similar languages.

  • Experience designing, operating, or scaling distributed systems or developer infrastructure.

  • Comfort working in Linux environments, and with tools like Kubernetes, Terraform, CI/CD pipelines, and modern observability stacks.

  • Ability to navigate complex systems and a willingness to dig deep when debugging tricky issues.

  • Excellent communication and collaboration skills, especially in cross-functional settings.

Qualifications:

  • 4+ years of relevant industry experience, with 2+ years leading large scale, complex projects or teams as an engineer or tech lead

  • A passion for distributed systems at scale with a focus on reliability, scalability, security, and continuous improvement.

  • Excellent communication skills, with ability to build consensus among stakeholders both internally and externally.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation Range: $210K - $405K

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Software Engineer, Infrastructure Team

Liftoff

Remote

USD 160,000 - 235,000

Yesterday
Be an early applicant

Senior Software Engineer, Infrastructure, Google Cloud Compute Infrastructure

Google

Seattle

On-site

USD 166,000 - 244,000

Today
Be an early applicant

Software Engineer - Infrastructure Observability Monitoring (Remote USA)

Cisco

California

Remote

USD 157,000 - 217,000

30+ days ago

Software Engineer, Infrastructure

Whatnot

Remote

USD 185,000 - 245,000

30+ days ago

Senior Infrastructure Engineer II

Boulevard

Remote

USD 165,000 - 236,000

Yesterday
Be an early applicant

Senior/Staff Engineer, Infrastructure (DevOps)

Pryon

Washington

Remote

USD 180,000 - 215,000

Yesterday
Be an early applicant

Cloud Infrastructure Engineer - GPU

Apple

Seattle

On-site

USD 166,000 - 297,000

3 days ago
Be an early applicant

Software Engineer, Infrastructure (2-8 YOE) San Francisco, CA; New York, NY

Agrisano Unternehmungen

Seattle

On-site

USD 148,000 - 222,000

30+ days ago

Senior Infrastructure Engineer II Remote - USA & Canada

Boulevard Labs, Inc.

Remote

USD 165,000 - 236,000

10 days ago