Enable job alerts via email!

Staff Cloud Availability Platform Engineer

Crusoe

San Francisco (CA)

On-site

USD 180,000 - 210,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading AI-first cloud infrastructure company as a Platform Engineer. You will design and manage Kubernetes environments while collaborating with cross-functional teams to enhance system performance and reliability. This role offers competitive compensation and extensive benefits, including stock options and health insurance.

Benefits

Health insurance package options
401(k) with a 100% match
Paid Parental Leave
Generous paid time off
Tuition reimbursement

Qualifications

  • 7+ years of experience in platform, backend, or infrastructure engineering roles.
  • Deep hands-on experience with Kubernetes internals and operational tooling.

Responsibilities

  • Designing, deploying, and operating Kubernetes infrastructure.
  • Building and evolving event-driven platforms using Kafka or cloud-native systems.

Skills

Kubernetes
Networking
API Development
Automation

Tools

Terraform
Helm
Prometheus
Grafana

Job description

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:
We’re looking for an experienced Platform Engineer to help design, deploy, and operate robust Kubernetes environments and scalable infrastructure systems. This role is ideal for engineers who thrive at the intersection of infrastructure and developer experience, and who are passionate about performance, reliability, and automation.

What You’ll Be Working On:

  • Designing, deploying, and operating Kubernetes infrastructure for multi-tenant, distributed applications.

  • Implementing and optimizing the container and host networking stack, including CNI plugins, network policies, and service mesh integrations.

  • Building and evolving event-driven platforms using Kafka, NATS, or cloud-native pub/sub systems.

  • Developing and maintaining API interfaces (REST/gRPC) to power internal infrastructure services and developer tooling.

  • Driving improvements in system reliability, scalability, and observability through automation, instrumentation, and best practices.

  • Collaborating closely with SREs, security engineers, and backend developers to deliver infrastructure with strong operational maturity.

  • Contributing to and maintaining infrastructure-as-code and CI/CD pipelines for consistent, automated deployments.

What You’ll Bring to the Team:

  • 7+ years of experience in platform, backend, or infrastructure engineering roles.

  • Deep hands-on experience with Kubernetes internals, deployment patterns, and operational tooling.

  • Strong understanding of networking in containerized environments, including DNS, load balancing, and traffic routing.
    Practical experience implementing and supporting event-driven systems at scale.

  • A proven ability to build and evolve API-driven infrastructure used by developers and systems alike.

  • Familiarity with observability tooling like Prometheus, Grafana, OpenTelemetry, and structured logging practices.

  • Working knowledge of infrastructure-as-code tools (Terraform, Helm) and CI/CD pipelines.

Bonus Points:

  • Experience contributing to open-source infrastructure projects.

  • Familiarity with multi-cloud environments and hybrid cloud patterns.

  • Exposure to zero-downtime deployment strategies and advanced rollback mechanisms.

Benefits:

  • Industry competitive pay

  • Restricted Stock Units in a fast growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term disability

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company paid commuter benefit; $200 per month

Compensation:

Compensation will be paid in the range of $180,000 - $210,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Cloud Availability Platform Engineer

Crusoe Energy Systems LLC

San Francisco

On-site

USD 180,000 - 210,000

6 days ago
Be an early applicant

Sr Staff BI and ML/Advanced Analytics Platform Architect

General Electric

San Ramon

Remote

USD 127,000 - 214,000

2 days ago
Be an early applicant

Staff Platform Engineer - USA

Inworld AI

Mountain View

On-site

USD 180,000 - 280,000

3 days ago
Be an early applicant

Senior/Staff Platform Engineer/SRE

慨正橡扯

Palo Alto

Hybrid

USD 180,000 - 275,000

3 days ago
Be an early applicant

Staff/Sr. Staff AI Engineer – AI Agent Platform - Remote

GEICO Tech

Remote

USD 115,000 - 260,000

2 days ago
Be an early applicant

Staff ML Platform Engineer – Large Scale Training (LLMOps/MLOps)

TrueFoundry

San Mateo

On-site

USD 167,000 - 251,000

7 days ago
Be an early applicant

Sr Staff BI and ML/Advanced Analytics Platform Architect

General Electric

Helena

Remote

USD 127,000 - 214,000

2 days ago
Be an early applicant

Sr Staff BI and ML/Advanced Analytics Platform Architect

General Electric

Phoenix

Remote

USD 127,000 - 214,000

2 days ago
Be an early applicant

Sr Staff BI and ML/Advanced Analytics Platform Architect

General Electric

Santa Fe

Remote

USD 127,000 - 214,000

2 days ago
Be an early applicant