Job Search and Career Advice Platform

Enable job alerts via email!

Senior Site Reliability Engineer

NatWest Group

Greater London

Hybrid

GBP 70,000 - 90,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading financial services organization seeks a Senior Site Reliability Engineer to enhance the operational aspects of its products. The role includes working with cloud-native technologies and collaborating closely with engineers to improve system reliability and performance. Responsibilities encompass proactive incident management and technical guidance within a supportive team culture. Hybrid working arrangements are available, emphasizing innovation and continuing professional development. The position entails 35 hours a week, with a job posting closing date of January 21, 2026.

Qualifications

  • Proven experience in identifying performance bottlenecks in systems.
  • Hands-on experience with monitoring and observability tools.
  • Strong understanding of IT Service Management practices.

Responsibilities

  • Improve operational characteristics of products and services.
  • Respond to incidents and manage production environments.
  • Provide technical expertise to establish product risk tolerance.

Skills

Cloud-native microservices
Kubernetes management
Azure
Infrastructure as Code
DevOps principles
Communication skills

Tools

PowerShell
Grafana Stack
Azure DevOps
JSON
ServiceNow
Job description

Join us as a Senior Site Reliability Engineer

  • In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
  • You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
  • This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
  • You’ll work from home some of the time, but you’ll also spend a significant amount of time working from an office or hub
What you’ll do

As a Senior Site Reliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You’ll define error budgets that support finding the right balance between risk and reliability.

You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll help scale systems sustainably through mechanisms like automation, evolving them by pushing for changes that improve reliability and velocity. We’ll also look to you to coach and provide guidance to colleagues and the wider team, leading where required.

In addition to this, you’ll:

  • Proactively contribute new ideas and innovations to meet short term and longer-term goals
  • Continually balance and manage any potential risks
  • Be accountable for the day‑to‑day health of both production and non‑production environments and respond to any incidents as required
  • Provide technical expertise and input to establish the risk tolerance of products and services
  • Communicate incident status updates clearly and frequently to other teams, customers and stakeholders
The skills you’ll need

We’re looking for an experienced Senior SRE with a proactive approach to spotting problems, areas for improvements and performance bottlenecks. You'll need experience working with cloud‑native microservices, including containerisation, management of Kubernetes workloads, and API management.

We’re also looking for:

  • Hands‑on experience of Azure, Infrastructure as Code, and technologies such as PowerShell, JSON, Azure Bicep, ARM and Azure DevOps
  • Experience with Full Stack Observability using tools such as Grafana Stack, Log Analytics, AppInsights
  • Excellent knowledge of DevOps processes and principles
  • Knowledge of IT Service Management and automation of IT fulfilment processes through Orchestration and ServiceNow
  • Strong communication skills with the ability to proactively engage with a wide range of stakeholders

Hours: 35

Job Posting Closing Date: 21/01/2026

Ways of Working: Remote First

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.