
Senior Site Reliability Engineer

Department for Business and Trade

Cardiff

On-site

GBP 50,000 - 70,000

Full time


Job summary

A government department in the UK is seeking a Senior Site Reliability Engineer to help build a cutting-edge developer platform on AWS. The role involves ensuring reliable internet services, supporting development teams with monitoring and CI/CD tools, and participating in an on-call rotation. Ideal candidates will have extensive AWS experience and familiarity with containerization technologies.

Qualifications

  • Experience with building and scaling global platforms.
  • Ability to improve observability tooling and incident response.

Responsibilities

  • Ensure internet services are reliable and performant.
  • Support development teams with monitoring and CI/CD tools.
  • Participate in scaling the platform and on-call support.

Skills

Amazon Web Services
Azure
AWS CodePipeline
Terraform
Docker
Python
Elasticsearch

Tools

AWS CodeBuild
Elastic Container Service (ECS)
PostgreSQL

Job description

Overview

We are on a mission to build a new cutting-edge developer platform in AWS and to support the Department for Business and Trade (DBT) services running on it. We are seeking Senior Site Reliability Engineers (SREs) to ensure our internet services work as users expect.

Responsibilities

  • Provide application performance monitoring, exception/log/metrics aggregation, dashboards, and declarative CI/CD pipelines to support development teams.
  • Evangelise service-level indicators, objectives, and error budgets with product teams and help negotiate them.
  • Build and scale the global product platform and participate in an on-call rota, for which an additional allowance is paid.
  • Roll out observability tooling to enhance system monitoring and incident response; streamline deployment processes to reduce downtime and speed up feature delivery.

Qualifications

  • Experience with Amazon Web Services and Azure.
  • AWS CodePipeline and AWS CodeBuild.
  • Terraform and AWS Copilot (CloudFormation).
  • Docker, Elastic Container Service (ECS) and Elastic Container Registry (ECR).
  • Elasticsearch/OpenSearch.
  • Python and Django framework.
  • PostgreSQL as a service (Amazon RDS).
  • Sentry and Redis/ElastiCache.

EEO / Disability

We are a proud member of the Disability Confident employer scheme. A Disability Confident employer will generally offer an interview to any applicant who declares they have a disability and meets the minimum criteria for the job as defined by the employer. For more details, please refer to the relevant government guidance.
