Enable job alerts via email!

Senior Infrastructure Engineer - Ceph

JR United Kingdom

City Of London

On-site

GBP 60,000 - 90,000

Full time

4 days ago
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

A leading technology company seeks a Senior Infrastructure Engineer with expertise in Ceph to enhance their cloud solutions. The role will focus on scaling and maintaining Ceph storage infrastructures, driving automation and innovation in an open-source environment. Ideal candidates will have proven experience in managing Kubernetes clusters and a strong background in systems programming.

Qualifications

  • 4+ years of software development focused on infrastructure.
  • 2+ years of system design experience, particularly in scaling and reliability.
  • 1+ year managing production-grade Ceph clusters.

Responsibilities

  • Design, deploy, and maintain Ceph storage solutions with a focus on high availability.
  • Create automation frameworks for large-scale Ceph deployments.
  • Engage with the Ceph community and contribute to open-source projects.

Skills

Ceph
Automation
Kubernetes
System Programming
Networking

Education

Bachelor’s degree in Computer Science

Tools

Terraform
Kubernetes Operators

Job description

Social network you want to login/join with:

Senior Infrastructure Engineer - Ceph, london (city of london)

col-narrow-left

Client:

Andiamo

Location:

london (city of london), United Kingdom

Job Category:

Other

-

EU work permit required:

Yes

col-narrow-right

Job Views:

2

Posted:

27.06.2025

Expiry Date:

11.08.2025

col-wide

Job Description:

A Mission-Oriented Opportunity

This role is with a team that builds software enabling data-driven decision-making and operational effectiveness at global scale. Their platforms help partners solve real-world problems—from forecasting supply chain disruptions to accelerating medical breakthroughs.

The Role

A team focused on mission-critical production infrastructure—spanning hundreds of Kubernetes clusters across on-premise environments, from large data centers to edge devices—is seeking a Senior Infrastructure Engineer with deep expertise in Ceph. This individual will enhance the scale, reliability, and performance of ruggedized Kubernetes offerings operating under complex and novel constraints.Kubernetes offerings operati

Ideal candidates are passionate about infrastructure at scale, adept in Ceph, and eager to contribute to the broader open-source ecosystem.

Key Responsibilities

  • Manage Ceph at Scale: Design, deploy, and maintain Ceph storage solutions across a variety of hardware environments with an emphasis on high availability and performance.
  • Automate Deployments: Create automation frameworks and tooling to manage large-scale Ceph deployments, minimizing manual effort and maximizing operational efficiency.
  • Innovate and Contribute: Drive the integration of emerging tools and features from the Ceph and CNCF ecosystems, and contribute upstream to relevant open-source projects.
  • Community Engagement: Actively participate in the Ceph developer and CNCF communities through collaboration, contribution, and knowledge sharing.
  • Infrastructure Evolution: Partner with peers to architect and build scalable, secure, and resilient infrastructure for next-generation deployments.

Preferred Qualifications

  • Ceph & Rook Mastery: Proven experience managing Ceph clusters in production environments, ideally via Rook.
  • Automation Skills: Proficiency with tools like Terraform, Kubernetes Operators, and programming in Go, Java, or equivalent.
  • Systems Programming Experience: Background in Go, Rust, or C/C++ for system-level development.
  • Hardware & OS Knowledge: Strong familiarity with system hardware, Linux-based OS internals, and diagnostic tools.
  • Networking Insight: Understanding of network architectures and experience with CNIs or cloud networking solutions.
  • Data Center Experience: Hands-on experience managing on-premise hardware or serving as a sysadmin/Site Reliability Engineer in production environments.
  • 4+ years of software development focused on infrastructure and operational excellence
  • 2+ years of system design experience, particularly in scaling and reliability
  • 1+ year managing production-grade Ceph clusters
  • Bachelor’s degree in Computer Science or equivalent experience
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.