Job Search and Career Advice Platform

Enable job alerts via email!

Public Cloud & Google SRE Engineer

LLOYDS BANKING GROUP

Leeds

On-site

GBP 50,000 - 70,000

Full time

8 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading UK bank is looking for a Site Reliability Engineer to help improve its next-generation cloud platform. You will collaborate with engineering leads to automate processes, ensure operational health, and enhance platform reliability. The ideal candidate should have experience in Google Cloud Platform and be proficient in scripting and DevOps practices. This role offers flexible benefits, a generous pension, and up to 30 days annual leave.

Benefits

Flexible benefits allowance
Generous pension contribution
Private health cover
Up to 30 days annual leave

Qualifications

  • Strong experience in Google Cloud Platform (GCP).
  • Ability to write, update and maintain scripts in various languages.
  • Strong understanding of cloud security principles.

Responsibilities

  • Resolve production incidents and ensure operational health.
  • Maintain Infrastructure-as-Code and CI/CD-based services.
  • Drive operational perfection across incident management and reliability.

Skills

DevOps background
Cloud engineering
Problem-solving
Scripting (Python, Groovy, Bash)
Observability tools

Education

Certifications in GCP or equivalent

Tools

Terraform
CI/CD (Harness, GitHub Actions)
Dynatrace
Job description
Overview

Our Cloud Platform team is a well-established, solution-focused engineering community, delivering one of the UK's largest technology transformations. We're modernising the bank's next-generation cloud platform and partnering closely with product teams to enable secure, scalable and compliant cloud solutions across Analytics, GenAI/ML, Databases, Storage, Serverless HPC and Application workloads. Our engineering work spans product curation, data-platform capability, data segregation, automation, quality assurance, and embedding AI into our workflows. Everything we build aims to empower engineering teams, improve the developer experience and raise delivery standards across the Group.

We're looking for a Site Reliability Engineer with strong experience in Google Cloud Platform (GCP). You'll collaborate with Engineering Leads and Product Owners to shape and deliver our platform roadmap. You'll help plan and prioritise work, automate processes using both traditional and GenAI tooling, remove impediments, and contribute to our continuous improvement culture. You'll also have the opportunity to participate in technical communities, work with internal customers across multiple domains, and support early-career engineers through role-modelling and mentoring.

Responsibilities
  • Tooling: Terraform, CI/CD (Harness or GitHub Actions), Python, Git workflows, Backstage
  • Security & Policy-as-Code: Open Policy Agent, Organisation Policy, Security Health Analytics, Wiz
  • Observability: Dynatrace
  • In this role, you'll spend around half your time resolving production incidents and ensuring operational health, and the other half improving our platform through engineering and automation. Participation in an out-of-hours support rota is required.
  • Apply hands-on engineering to maintain Infrastructure-as-Code and CI/CD-based services
  • Deliver enhancements that improve reliability, scalability and customer experience
  • Reduce toil and improve efficiency through automation and new tooling adoption
  • Drive operational perfection across monitoring, incident management, problem resolution, cost optimisation and reliability
  • Be responsible for the health of production and non-production environments and lead incident response activities
  • Investigate and fix service-related issues using code-first engineering approaches
  • Contribute to Agile ceremonies and support continuous team improvement
  • Provide clear and regular communication of incident status to stakeholders
  • Apply SRE practices and introduce chaos engineering where appropriate to strengthen resilience
  • Collaborate with teams and mentor others; communicate effectively and share knowledge
Qualifications
  • Strong DevOps and cloud-engineering background, including IaC (Terraform) and CI/CD pipelines (Jenkins, Harness, Azure DevOps or similar)
  • Experience working with a broad range of public-cloud technologies
  • Ability to write, update and maintain scripts (Python, Groovy, PowerShell, Bash)
  • Strong understanding of cloud security principles
  • Excellent problem-solving skills and structured logical thinking
  • Experience with observability and monitoring tools
  • Desirable: Experience using SDKs and APIs to deliver automation
  • Desirable: Certifications in GCP or another cloud provider (e.g., Azure)
  • Transferable experience from sysadmin, software engineering or other technical subject areas
  • Technology-agnostic approach and willingness to adopt the best tool for the job
  • Curiosity and aim to learn continuously, applying emerging cloud best practices
Benefits
  • A flexible benefits allowance
  • A generous pension contribution
  • Private health cover
  • Up to 30 days annual leave (plus ability to purchase more)
  • Access to a range of colleague share schemes

If this sounds like the kind of work you want to be part of - we'd love to hear from you.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.