Senior Platform Engineer

ZipRecruiter

San Francisco (CA)

On-site

USD 150,000 - 210,000

Full time

2 days ago

Job summary

Join Genmo, a research lab building open, state-of-the-art video models. As a Senior Platform Engineer, you'll lead the development of a multi-cluster infrastructure layer so that new models deploy and scale efficiently. You'll apply your expertise in distributed systems and automation, and mentor others on the team.

Qualifications

  • 5+ years building production-grade distributed systems.
  • Fluency in Go or Rust plus Python.
  • Experience with real-time protocols is a plus.

Responsibilities

  • Architect a multi-cluster infrastructure layer.
  • Automate deployment and workflow processes.
  • Standardize infrastructure automation and CI/CD practices.

Skills

Distributed systems
Clear communication
Ownership mindset

Education

BS/MS/PhD in CS, EE, or related field

Tools

Kubernetes
Terraform
Helm
GitOps workflows
Service-mesh frameworks
CI/CD tooling
Observability stacks
GPU telemetry

Job description

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video, towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video.

What You’ll Do

  • Architect a multi‑cluster infrastructure layer that spans clouds and on‑prem GPU fleets.

  • Automate deployment, rollout, and autoscaling workflows so new models reach production with zero downtime.

  • Forecast & plan GPU capacity to meet latency SLOs while controlling cost.

  • Shape traffic policy for secure, low‑latency routing and global load balancing.

  • Instrument & observe—deliver end‑to‑end telemetry and debuggability for every model and cluster.

  • Standardize infrastructure automation, disaster‑recovery, and CI/CD practices across teams.

  • Drive reliability through post‑incident review and continuous improvement.

  • Mentor & lead—share distributed‑systems best practices and influence the long‑term roadmap.

You Have

  • BS/MS/PhD in CS, EE, or related field.

  • 5+ years building production‑grade distributed systems.

  • Fluency in a systems language (Go or Rust) plus Python.

  • Clear, concise communication and an ownership mindset.

Nice to Have

  • Experience tuning real‑time protocols (WebRTC, gRPC, HTTP/2) for high‑throughput inference.

  • Multi‑cloud or edge deployments spanning AWS, GCP, Azure, or bare‑metal providers.

  • Security and compliance for high‑performance, distributed AI platforms.

  • Hands‑on expertise with:

    • Kubernetes internals and multi‑cluster operations

    • Infrastructure‑as‑code tools (Terraform, Helm) and GitOps workflows (Argo CD or Flux)

    • Service‑mesh frameworks (Linkerd, Istio, or Envoy Gateway)

    • Observability stacks (Prometheus, Grafana, OpenTelemetry) and GPU telemetry (NVIDIA DCGM)

    • CI/CD tooling (GitHub Actions, BuildKit)

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to veteran status or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company, and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.
