Enable job alerts via email!

Senior Site Reliability Engineer

Government Recruitment Service

Tees Valley

Hybrid

GBP 65,000 - 85,000

Full time

7 days ago
Be an early applicant

Job summary

A government agency in the UK seeks a Senior Site Reliability Engineer to enhance service reliability on a cutting-edge AWS platform. Your role will involve designing monitoring tools, coaching teams, and improving CI/CD processes. Ideal candidates have strong AWS, Docker, and Python skills and experience in mentoring junior colleagues. The position offers the chance to contribute to essential government services.

Qualifications

  • Strong experience in AWS and Azure.
  • Proficiency in implementing CI/CD pipelines.
  • Experience mentoring junior engineers.

Responsibilities

  • Ensure internet services work reliably for users.
  • Implement observability tools for system monitoring.
  • Coach and mentor junior colleagues.

Skills

Amazon Web Services
CloudFormation
Python
ElasticSearch
Docker
Redis
Incident Response

Tools

Terraform
AWS CodePipelines
Sentry
PostgreSQL
Django
Job description
If you would like to find out more about the role, the Site Reliability Engineering team and what it’s like to work at DBT, we are holding a Hiring Manager Q&A session for this role where you can virtually 'meet the team' on Friday 17th October at 12:30pm. Please click here to book your spot.

About us

The Department for Business and Trade (DBT) has a clear mission - to grow the economy. Our role is to help businesses invest, grow and export to create jobs and opportunities right across the country. We do this in three ways.

Firstly, we help to build a strong, competitive business environment, where consumers are protected and companies rewarded for treating their employees properly.

Secondly, we open international markets and ensure resilient supply chains. This can be through Free Trade Agreements, trade facilitation and multilateral agreements.

Finally, we work in partnership with businesses every day, providing advance, finance and deal-making support to those looking to start up, invest, export and grow.

The Digital, Data and Technology (DDaT) directorate develops and operates tools and services to support us in this mission.

About the role

We are on a mission to build a new cutting-edge developer platform in AWS and support DBT services running on the platform.

Can we rely on you to make us more reliable? We need Site Reliability Engineers (SREs) to make sure our internet services work as users expect.


As a Senior Site Reliability Engineer you will work to give development teams the tools for their job, including application performance monitoring, exception, log and metrics aggregation, dashboards, and declarative CI/CD (continuous integration/continuous delivery) pipelines.

You’ll evangelise product teams about service-level indicators, objectives, and error budgets, and negotiate them. You’ll help build and scale our global product platform and participate in an on-call rota for which you will receive an additional allowance.

Specific projects the team are working on include rolling out an observability tool to enhance system monitoring and incident response and streamlining deployment processes to reduce downtime and speed up feature delivery.

Out of the 4 positions available, one of these posts will have line management responsibilities but we expect all of our Senior Site Reliability Engineers to coach and mentor junior colleagues across DDaT.

You will be using:

  • Amazon Web Services
  • Azure
  • AWS CodePipelines and AWS CodeBuild
  • Terraform & AWS Copilot (CloudFormation)
  • Docker, Elastic Container Service (ECS) and Elastic Container Registry (ECR)
  • ElasticSearch/OpenSearch
  • Pythonand Django framework
  • PostgreSQLas a service (Amazon RDS)
  • Sentry
  • Redis/Elasticache
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.