Enable job alerts via email!

Senior Site Reliability Engineer

Queen Square Recruitment Ltd

City Of London

Hybrid

GBP 80,000 - 100,000

Part time

Today
Be an early applicant

Job summary

A leading recruitment agency is seeking a Senior Site Reliability Engineer for a 6-month contract role in London. The position involves developing cloud infrastructure, implementing CI/CD pipelines, and collaborating with engineering teams to enhance automation. Candidates should have over 4 years of experience in SRE or DevOps, with strong Kubernetes and Docker skills. This role offers a hybrid working environment and participation in enterprise-scale cloud transformation projects.

Qualifications

  • 4+ years of experience in Site Reliability or DevOps roles.
  • Advanced Kubernetes experience with Helm and Kubectl.
  • Strong experience with containerized Java microservices (Docker).

Responsibilities

  • Develop and maintain cloud infrastructure using IaC.
  • Define and implement CI/CD pipelines and tooling.
  • Monitor and optimize systems for performance.

Skills

Site Reliability or DevOps experience
Kubernetes (EKS, GKE, AKS, RKE)
Containerized Java microservices
CI/CD tools (Jenkins, ArgoCD, Azure DevOps)
Observability tools (Grafana, Prometheus)
GitLab/GitHub proficiency

Tools

Terraform
Azure Key Vault
Veracode
Job description
Senior Site Reliability Engineer

Role: Senior Site Reliability Engineer
Location: London (Hybrid - 3 days onsite per week)
Contract Length: 6 months
Day Rate: Open to Market Rates (Inside IR35)

About the Role

We are seeking a Senior Site Reliability Engineer (SRE) to join a high-performing engineering team within a leading professional services organization. The team's mission is to drive technology excellence through automation, reliability, and innovation across large-scale cloud environments.

Key Responsibilities
  • Develop and maintain resilient, highly available cloud infrastructure using Infrastructure as Code (IaC).
  • Define and implement CI/CD pipelines, DevOps and DevSecOps tooling standards.
  • Monitor and optimize systems, ensuring performance, reliability, and security.
  • Manage observability tools (Grafana, Datadog, Splunk, etc.) and integrate alerting systems.
  • Troubleshoot complex issues and participate in root cause analysis to drive continuous improvement.
  • Collaborate closely with development and operations teams to enhance automation and deployment practices.
Required Skills & Experience
  • 4+ years of experience in Site Reliability or DevOps roles.
  • Advanced Kubernetes experience (EKS, GKE, AKS, or RKE) with Helm and Kubectl.
  • Strong experience with containerized Java microservices (Docker).
  • Hands‑on experience with CI/CD tools such as Jenkins, ArgoCD, Azure DevOps, or GitHub Actions.
  • Experience in observability and monitoring tools (Grafana, Prometheus, Datadog, Splunk, OpsGenie, etc.).
  • Proficient in GitLab/GitHub and branching strategies (GitFlow).
  • Strong troubleshooting and documentation skills.
Desirable Skills
  • Terraform or Pulumi (module-level experience preferred).
  • Cloud security tools (Azure Key Vault, HashiCorp Vault, etc.).
  • AppSec tools (Veracode, Qualys, Aqua, Twistlock).
  • Experience with scripting languages (Python, PowerShell, Go, or Java).
  • Familiarity with event‑driven architecture (Kafka, EventHub, RabbitMQ).
Why Join?

This is an excellent opportunity to work on enterprise‑scale cloud transformation projects, collaborate with top engineers, and contribute to building world‑class reliability engineering practices.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.