Staff Cloud Availability Platform Engineer

ZipRecruiter

San Francisco (CA)

On-site

USD 180,000 - 210,000

Full time

27 days ago

Job summary

A leading technology company is seeking a Platform Engineer to design and operate Kubernetes infrastructure. The role emphasizes performance, reliability, and automation, making a significant impact in the AI cloud infrastructure space. Candidates should have extensive experience in infrastructure engineering, particularly with Kubernetes and event-driven systems.

Benefits

Industry competitive pay
Restricted Stock Units
Health insurance options
Employer contributions to HSA
Paid Parental Leave
Paid life insurance
401(k) with 100% match up to 4%
Generous paid time off
Cell phone reimbursement
Tuition reimbursement
Subscription to Calm app
Company paid commuter benefit

Qualifications

  • 7+ years of experience in platform, backend, or infrastructure engineering roles.
  • Deep hands-on experience with Kubernetes internals and operational tooling.
  • Strong understanding of networking, DNS, load balancing in containerized environments.

Responsibilities

  • Designing, deploying, and operating Kubernetes infrastructure for multi-tenant applications.
  • Collaborating with SREs, security engineers, and backend developers.
  • Contributing to infrastructure-as-code and CI/CD pipelines for automated deployments.

Skills

Kubernetes internals
Networking in containerized environments
Event-driven systems
API-driven infrastructure
Observability tooling
Infrastructure-as-code
CI/CD pipelines

Tools

Terraform
Helm

Job description

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:
We’re looking for an experienced Platform Engineer to help design, deploy, and operate robust Kubernetes environments and scalable infrastructure systems. This role is ideal for engineers who thrive at the intersection of infrastructure and developer experience, and who are passionate about performance, reliability, and automation.

What You’ll Be Working On:

  • Designing, deploying, and operating Kubernetes infrastructure for multi-tenant, distributed applications.

  • Implementing and optimizing the container and host networking stack, including CNI plugins, network policies, and service mesh integrations.

  • Building and evolving event-driven platforms using Kafka, NATS, or cloud pub/sub systems.

  • Developing and maintaining API interfaces (REST/gRPC) to power internal infrastructure services and developer tooling.

  • Driving improvements in system reliability, scalability, and observability through automation, instrumentation, and best practices.

  • Collaborating closely with SREs, security engineers, and backend developers to deliver infrastructure with strong operational maturity.

  • Contributing to and maintaining infrastructure-as-code and CI/CD pipelines for consistent, automated deployments.

What You’ll Bring to the Team:

  • 7+ years of experience in platform, backend, or infrastructure engineering roles.

  • Deep hands-on experience with Kubernetes internals, deployment patterns, and operational tooling.

  • Strong understanding of networking in containerized environments, including DNS, load balancing, and traffic routing.

  • Practical experience implementing and supporting event-driven systems at scale.

  • A proven ability to build and evolve API-driven infrastructure used by developers and systems alike.

  • Familiarity with observability tooling like Prometheus, Grafana, OpenTelemetry, and structured logging practices.

  • Working knowledge of infrastructure-as-code tools (Terraform, Helm) and CI/CD pipelines.

Bonus Points:

  • Experience contributing to open-source infrastructure projects.

  • Familiarity with multi-cloud environments and hybrid cloud patterns.

  • Exposure to zero-downtime deployment strategies and advanced rollback mechanisms.

Benefits:

  • Industry competitive pay

  • Restricted Stock Units in a fast-growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company-paid commuter benefit of $200 per month

Compensation:

Compensation will be paid in the range of $180,000 - $210,000. Restricted Stock Units are included in all offers. Compensation is determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to genetic information, citizenship, marital status, veteran status, or any other status protected by law or regulation.
