Job Search and Career Advice Platform

Enable job alerts via email!

Senior Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

Future plc

Remote

GBP 55,000 - 75,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading technology company in the UK is seeking a skilled Site Reliability Engineer (SRE) to champion core SRE practices and enhance system reliability through automation and monitoring. Candidates should have strong programming skills in Python or Go, knowledge of observability tools such as Prometheus and Grafana, and a solid understanding of Linux and networking fundamentals. This role offers the opportunity to collaborate with development teams and promote a DevOps culture within the organization.

Qualifications

  • Strong grounding in SRE principles and operational best practices.
  • Proficient with observability tools and telemetry pipelines.
  • Solid programming skills in Python and/or Go; Java experience is a plus.

Responsibilities

  • Champion core SRE practices including defining SLIs, SLOs, and SLAs.
  • Refine KPIs for data-driven decisions on reliability and availability.
  • Monitor systems to ensure optimal performance and capacity planning.

Skills

SRE principles
Observability tools
Python programming
Go programming
Linux fundamentals
Networking fundamentals
Infrastructure as Code
Containerisation

Tools

Prometheus
Grafana
Terraform
Docker
Kubernetes
GitHub Actions
Jenkins
Kafka
Job description
Responsibilities
  • Champion core SRE practices : define SLIs / SLOs / SLAs, reduce toil through automation, and plan for Disaster Recovery.
  • Refine KPIs to support data-driven decisions around reliability and availability.
  • Monitor systems to ensure optimal performance, cost-efficiency, and capacity planning.
  • Collaborate with dev teams to build resilient, observable, and maintainable features.
  • Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution.
Qualifications
  • Strong grounding in SRE principles and operational best practices.
  • Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines.
  • Solid programming skills in Python and / or Go; Java experience a plus.
  • Strong Linux and networking fundamentals (TCP, DNS, TLS, HTTP).
  • Familiarity with IaC (Terraform), CI / CD (GitHub Actions, Jenkins), and Agile workflows.
  • Experience with containerisation (Docker, Kubernetes) and stream processing (Kafka a plus).
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.