Enable job alerts via email!

Site Reliability Engineer Lead

JR United Kingdom

Slough

Hybrid

GBP 70,000 - 100,000

Full time

10 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading transformation initiative as a Site Reliability Engineer (SRE) Lead, focusing on enhancing the SRE and Observability functions within a hybrid working environment. You will champion observability practices, collaborate with product teams, and leverage cutting-edge tools such as Datadog and AWS to drive impactful results.

Qualifications

  • Proven experience as a hands-on SRE Engineer.
  • Deep understanding of observability and monitoring practices.
  • Solid cloud engineering skills, especially with AWS.

Responsibilities

  • Lead the SRE function and promote observability-first thinking.
  • Define and implement the observability roadmap.
  • Partner with engineering squads for observability requirements.

Skills

SRE
Observability
DevOps
Cloud Engineering
Configuration Management

Education

AWS Certification

Tools

Datadog
GitHub
Terraform
Docker

Job description

Social network you want to login/join with:

Site Reliability Engineer (SRE) Lead – Observability

Location: London (Hybrid, 2 days on site per week)

Contract Role

Overview:

Join a high-impact team where you'll lead and shape the SRE and Observability function for a major transformation programme. This role goes beyond traditional SRE – you’ll champion best practices across product teams, drive observability strategy, and work hands-on with cutting-edge tools like Datadog and AWS.

Key Responsibilities:

  • Lead the SRE function and promote observability-first thinking across development and operations teams.
  • Define and implement the observability roadmap across product domains in collaboration with the client.
  • Be hands-on with Datadog for infrastructure and application-level monitoring.
  • Guide and review daily operations and improvements across observability platforms.
  • Partner with engineering squads to deliver on observability requirements in an agile, demand-led way.

Core Skills & Experience:

  • Proven experience as a hands-on SRE Engineer.
  • Deep understanding of observability and monitoring practices.
  • Practical experience with Datadog (or similar observability platforms).
  • Strong DevOps toolchain knowledge: GitHub, GitHub Actions, Jenkins, CodeQL, Nexus, CloudFormation, Terraform.
  • Solid cloud engineering skills, especially with AWS (EC2, ELB, ECS, S3, CloudTrail, Config, Lambda, VPC, EFS).
  • Exposure to container-based platforms (e.g., Docker).
  • Experience with configuration management tools like Chef.
  • AWS certification (or willingness to pursue one).
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

JR United Kingdom

Slough

Remote

GBP 90 000 - 90 000

13 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Slough

Remote

GBP 60 000 - 85 000

13 days ago

Site Reliability Engineer (Equity only 0.5%)

JR United Kingdom

Reading

Remote

GBP 70 000 - 90 000

9 days ago

Site Reliability Engineer (Equity only 0.5%)

JR United Kingdom

London

Remote

GBP 70 000 - 110 000

9 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Basildon

Remote

GBP 60 000 - 90 000

13 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Stevenage

Remote

GBP 70 000 - 100 000

13 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Crawley

Remote

GBP 70 000 - 100 000

13 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Swindon

Remote

GBP 60 000 - 90 000

13 days ago

Lead Platform Engineer for Defence-tech AI Startup

JR United Kingdom

Bedford

Remote

GBP 70 000 - 90 000

13 days ago