Senior Site Reliability Engineer

BenevolentAI

London

On-site

GBP 70,000 - 100,000

Full time

14 days ago

Job summary

A leading company in AI-driven technology is seeking a Senior Site Reliability Engineer to enhance their cloud infrastructure and software engineering practices. The role requires strong coding abilities, experience with Kubernetes, and expertise in cloud technologies. As part of a collaborative team, you will ensure reliability, automate processes, and support the infrastructure that powers critical operations.

Qualifications

  • Fluency in at least one programming language (Python/Java/Go/C++ preferred).
  • Hands-on experience with Kubernetes and cloud technologies.
  • Experience with monitoring and alerting solutions.

Responsibilities

  • Implementing software solutions for cloud infrastructure.
  • Monitoring and handling incident responses.
  • Automating infrastructure and software deployments.

Skills

Cloud operations
Kubernetes
Infrastructure-as-code
Automation
Unix systems

Tools

AWS
Ansible
Terraform
Helm
Grafana
Prometheus
InfluxDB

Job description

Senior Site Reliability Engineer, London

Client:
Location:

London, United Kingdom

Job Category:

Other

EU work permit required:

Yes

Job Reference:

5c1405419a00

Job Views:

7

Posted:

02.06.2025

Expiry Date:

17.07.2025

Job Description:

As a Senior Site Reliability Engineer, you will work alongside our autonomous cross-functional squads. You will advocate high-quality engineering and best practice in production software, and provide the infrastructure to both build rapid prototypes and launch production-quality services. You must be a strong communicator who can explain what is required to build and deliver top-quality software products. You will be keen to work with the rest of the team and develop collaboratively.

You will promote test-driven development and other Agile best practices to ensure the software is resilient enough for our scientists to rely upon. You will be a core team member building and maintaining the underlying infrastructure that supports our AI-driven technology. You will also contribute to diverse areas such as authentication, network topology, sharded databases, scalable web services and interfaces to external data sources and APIs.

Responsibilities:

  • Implementing software solutions for cloud infrastructure in accordance with specification and best engineering practices.
  • Working towards improving long-term infrastructure availability and reliability.
  • Monitoring and handling incident response of the infrastructure, platforms and core engineering services.
  • Constructing pipelines to automate infrastructure and software deployments.
  • Troubleshooting infrastructure, network and software issues.
  • Staying up to date with recent technology trends and tools.
  • Automating repetitive manual processes and procedures.
  • Participating in the on-call rotation to support Benevolent employees in their day-to-day activities.

We are looking for:

  • Ability to code and fluency in at least one programming language (Python/Java/Go/C++ preferred).
  • Hands-on experience with Kubernetes.
  • Good understanding of, and experience in administering, cloud technologies (we work with AWS, but experience with other cloud providers is also a benefit!).
  • Comfortable working with Unix-based operating systems.
  • Good understanding of infrastructure-as-code and tools such as Ansible, Terraform, Helm.
  • Experience with cloud networking, cloud operations, automation and workload orchestration.
  • Understanding of quality of service measurement tools (SLIs, SLOs, SLAs).
  • Experience with monitoring and alerting solutions (for example InfluxDB/Grafana/Prometheus).
  • High-level understanding of database technologies (for example, relational, NoSQL, graph) and their basic use cases.