Senior Site Reliability Engineer

Yolk Recruitment Ltd

Cardiff

Remote

GBP 59,000 - 70,000

Full time

30+ days ago

Job summary

A Senior Site Reliability Engineer is sought to enhance the resilience and performance of web systems across a growing digital platform. This remote position offers a salary of up to £70,000 plus an annual share scheme, up to 38 days of holiday, and flexible working. Candidates should have strong experience supporting web applications, hands-on expertise in React and TypeScript, and a proven ability to implement observability practices. The organisation has a people-first culture focused on continuous improvement.

Benefits

Up to 38 days of holiday
Flexible working
Annual share scheme

Qualifications

  • Strong experience supporting complex web applications and distributed systems.
  • Hands-on expertise in React and TypeScript development, focusing on performance.
  • Proven ability to implement observability practices.

Responsibilities

  • Drive operational excellence, ensuring resilience and performance of web systems.
  • Design scalable infrastructure and automate operations.
  • Guide engineering standards and support incident management.

Skills

Web application support
React
TypeScript
Observability tools (Prometheus, Grafana, Azure Monitor)
Containerization (Docker, Kubernetes)
CI/CD pipelines
Cloud infrastructure (Azure, GCP)
SRE frameworks (SLOs, SLIs, error budgets)
Testing tools (Playwright, Vitest, Jest)
Infrastructure-as-code (Terraform)

Job description

Senior Site Reliability Engineer
Remote
Up to £70,000 + annual share scheme + excellent benefits

What You'll Do:

You'll take a lead role in driving operational excellence, ensuring the resilience, observability, and performance of web-based systems across a growing digital platform. Working within a collaborative, cross-functional environment, you'll design scalable infrastructure, automate operations, and embed SRE principles to improve reliability and reduce toil.

This is a highly influential role where you'll guide engineering standards, support incident management, and mentor others in building robust, cloud-native systems using modern DevOps practices.

What You'll Bring:

  • Strong experience supporting complex web applications and distributed systems, including Micro Frontends and BFFs
  • Hands-on expertise in React and TypeScript development with an eye for performance and resilience
  • Proven ability to implement observability practices using tools like Prometheus, Grafana, or Azure Monitor
  • Proficiency in containerisation and orchestration (Docker, Kubernetes - ideally AKS or GKE)
  • Experience building and maintaining CI/CD pipelines for frontend applications (e.g. Azure DevOps, GitHub Actions)
  • Solid grasp of cloud infrastructure (Azure or GCP), networking, and security best practices for web platforms
  • Knowledge of SRE frameworks including SLOs, SLIs, error budgets, and incident response
  • Familiarity with testing tools such as Playwright, Vitest, and Jest
  • Understanding of infrastructure-as-code (Terraform) and DevSecOps is a plus

Why You Should Apply:

You'll join an organisation with a people-first culture and a passion for continuous improvement. With up to 38 days holiday, flexible working, and an annual share scheme, this is a role where you'll feel valued and empowered. From shaping platform reliability to mentoring future engineers, you'll be part of something meaningful, modern, and rewarding.

Ready to Apply?

Contact Lewis Allen to find out more.


Please apply with a CV and a cover letter outlining why you're perfect for the role.
Know someone great for the job? We offer a referral; just get in touch!
Note: We do our best to respond to every application, but due to volume, we can't always guarantee it. If you haven't heard back within 7 days, unfortunately, you haven't been successful this time. Keep an eye on our site for new opportunities!
