Enable job alerts via email!

Distributed Infrastructure Lead (Agent Networks)

ZipRecruiter

San Francisco (CA)

Remote

USD 120,000 - 180,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking an Infrastructure Lead to architect the foundational systems for the next wave of AI agent networks. This role offers a unique opportunity to shape the future of AI infrastructure at a massive scale, working with industry veterans and technical leaders. You'll design scalable systems that enable billions of AI agents to interact efficiently across distributed environments. If you're passionate about solving complex infrastructure challenges and have deep expertise in distributed systems, this is the perfect opportunity for you.

Benefits

Full medical, dental, and vision coverage
Flexible PTO policy
Learning and development budget
Conference attendance support
Significant equity stake

Qualifications

  • Proven experience building distributed systems at scale.
  • Expertise in performance optimization and system reliability.

Responsibilities

  • Design and implement scalable infrastructure for massive agent networks.
  • Build robust, distributed systems for agent deployment and execution.

Skills

Distributed Systems
Scalable Architecture
High-Performance Computing
Systems Programming (Go, Rust)
Containerization
AI/ML Deployment

Tools

Kubernetes
Monitoring Systems

Job description

Job DescriptionJob DescriptionInfrastructure Lead (Agent Networks)

About this role

We are seeking an exceptional Infrastructure Lead to architect and build the foundational systems that will power the next of AI agent networks at Naptha AI. This is a rare opportunity to shape the future of AI agent infrastructure at a massively ambitious scale, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.

We're building the foundational infrastructure for the next wave of AI companies, enabling frontier AI developers (many leaving labs like OpenAI, Anthropic, and DeepMind) to build products powered by enormous networks of highly capable next- AI agents. As our Infrastructure Lead, you'll design and implement the systems that will enable billions of AI agents to interact, coordinate, and scale efficiently across distributed environments.

Core Responsibilities

  • Design and implement scalable infrastructure for massive agent networks

  • Architect systems for efficient agent communication and coordination

  • Build robust, distributed systems for agent deployment and execution

  • Create monitoring, observability, and debugging systems for agent networks

  • Develop performance optimization strategies for large-scale agent operations

  • Design fault-tolerant systems for reliable agent interactions

  • Lead technical decisions around infrastructure architecture

Technical Challenges You'll Tackle

  • Designing distributed systems that can handle millions of concurrent agent interactions

  • Building efficient communication protocols for agent-to-agent interactions

  • Creating scalable orchestration systems for agent deployment

  • Implementing robust monitoring and debugging tools for complex agent networks

  • Optimizing resource utilization across distributed agent systems

  • Developing infrastructure that can adapt to emerging AI capabilities

You're a good fit if you have:

  • Deep expertise in distributed systems and scalable architecture

  • Strong experience with high-performance computing or large-scale systems

  • Track record of building reliable, production-grade infrastructure

  • Experience with modern cloud platforms and containerization

  • Strong coding abilities in systems programming

  • Understanding of AI/ML deployment challenges

  • Passion for solving complex infrastructure problems

Required Technical Experience:

  • Proven experience building distributed systems at scale

  • Expertise in performance optimization and system reliability

  • Strong programming skills (Go, Rust, or similar systems )

  • Experience with container orchestration (Kubernetes, etc.)

  • Understanding of network protocols and distributed computing

  • Experience with observability and monitoring systems

About the hiring process:

  • Technical architecture discussion

  • Systems design deep dive

  • Coding and problem-solving session

  • Team collaboration interview

  • Infrastructure vision presentation

Compensation & Benefits:

  • Highly competitive salary and significant equity stake

  • Remote-first work environment

  • Full medical, dental, and vision coverage

  • Flexible PTO policy

  • Learning and development budget

  • Conference attendance support

This is a unique opportunity to shape the infrastructure that will power the next of AI systems. You'll be working at the intersection of distributed systems, AI, and platform design, creating the foundation for how future AI agents will interact and scale.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal AI/ML Infra and Ops Engineering

UnitedHealth Group

San Francisco

Remote

USD 106,000 - 195,000

4 days ago
Be an early applicant

Principal AI/ML Infra and Ops Engineering

Optum

San Francisco

Remote

USD 106,000 - 195,000

6 days ago
Be an early applicant

Principal AI/ML Infra and Ops Engineering - 2287523

UnitedHealth Group

San Francisco

Remote

USD 106,000 - 195,000

7 days ago
Be an early applicant

Manager, AI, Infrastructure, & Tooling

Figma

San Francisco

Remote

USD 164,000 - 288,000

10 days ago

Manager, AI, Infrastructure, & Tooling

Figma

San Francisco

Remote

USD 164,000 - 288,000

10 days ago

Principal, Business Operations - Infrastructure and Operations (Remote)

CrowdStrike

Sunnyvale

Remote

USD 135,000 - 225,000

11 days ago

Infrastructure Lead (Remote Opportunity)

Veterans EZ Info Inc

Washington

Remote

USD 90,000 - 140,000

Today
Be an early applicant

Channel Partnership Lead - Infrastructure

Formula.Monks

Remote

USD 90,000 - 150,000

Today
Be an early applicant

Cloud Infrastructure Manager

Mukuru

Capetown

Remote

USD 90,000 - 150,000

Today
Be an early applicant