Enable job alerts via email!

Senior Site Reliability Engineer

ZipRecruiter

London

On-site

GBP 80,000 - 92,000

Full time

7 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

ZipRecruiter is looking for a motivated Senior Site Reliability Engineer to join their London team, focusing on service reliability and performance. The successful candidate will collaborate with development and operations teams, automating infrastructure and ensuring system resilience. Ideal candidates will possess extensive experience in cloud technologies, container orchestration, and scripting, with strong incident management skills.

Qualifications

  • 8+ years of relevant experience in SRE, DevOps or Infrastructure Engineering.
  • Strong practical experience with AWS and container technologies.
  • Bachelor's degree in a technical field or equivalent experience.

Responsibilities

  • Design and maintain resilient systems and services.
  • Automate tasks with infrastructure-as-code.
  • Drive incident management and continuous improvement.

Skills

Cloud Platforms
Container Orchestration
Scripting
Networking
Security Best Practices

Education

Bachelor’s degree in Computer Science

Tools

AWS
Terraform
Jenkins
PostgreSQL
Docker
Kubernetes

Job description

Job Description

Job Title: Senior Site Reliability Engineer (SRE)

Location: London, UK – Onsite (5 days/week)

Employment Type: Permanent

Salary: Up to £80,000 per annum (Gross)

About the Role:

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our London-based team. This role is ideal for someone passionate about service reliability, scalability, and performance. As an SRE, you will collaborate with development and operations teams to automate infrastructure, enhance observability, and reduce manual processes (TOIL) to improve overall system health.

Key Responsibilities:

  • Design, build, and maintain scalable, resilient systems and services.
  • Automate routine tasks and eliminate manual effort using scripting and infrastructure-as-code.
  • Collaborate with development teams to ensure best practices for deployment, monitoring, and performance tuning.
  • Drive incident management processes, root cause analysis, and continuous improvement of system reliability.
  • Maintain and improve observability using monitoring and logging tools.
  • Optimize cloud infrastructure usage and costs.

Primary Skills & Experience:

  • Strong hands-on experience with cloud platforms, especially AWS (experience with GCP or Azure is a plus).
  • Deep understanding of Container Orchestration technologies such as Kubernetes and Docker.
  • Proficiency in monitoring and logging tools including: Datadog, Splunk, Dynatrace, AppDynamics, Prometheus, Grafana, ELK Stack, CloudWatch, Gremlin, ThousandEyes.
  • Experience with Terraform, Jenkins, GitLab CI, PostgreSQL, Redis, and Kong API Gateway.
  • Solid understanding of networking, security best practices, and infrastructure automation.
  • Exposure to AWS ECS, Atlas, and internal tooling integrations.
  • Diagramming and documentation skills using Lucidchart and PlantUML.

Secondary Skills:

  • Familiarity with ServiceNow (SNOW) and JIRA for incident and task tracking.
  • Competency in Shell scripting, Linux system administration, Bitbucket, and Akamai.
  • Experience working within DevOps pipelines and CI/CD frameworks.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
  • 8+ years of relevant experience in SRE, DevOps, or Infrastructure Engineering roles.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

JR United Kingdom

Hounslow

Remote

GBP 70,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

TieTalent

London

Remote

GBP 70,000 - 85,000

21 days ago

Senior Site Reliability Engineer

JR United Kingdom

Colchester

Remote

GBP 70,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Chelmsford

Remote

GBP 70,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Hemel Hempstead

Remote

GBP 90,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Woking

Remote

GBP 70,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Watford

Remote

GBP 76,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Bedford

Remote

GBP 76,000 - 90,000

7 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Luton

Remote

GBP 70,000 - 90,000

7 days ago
Be an early applicant