Enable job alerts via email!

Site Reliability Engineer

NatWest Group

City of Edinburgh

On-site

GBP 50,000 - 70,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A major financial institution in Edinburgh is seeking a Site Reliability Engineer to enhance operational characteristics and support service delivery. The successful candidate will oversee day-to-day health of systems, engage with stakeholders, and require strong skills in Java Spring Boot and AWS. This role offers a balance of remote work and office presence, promoting collaboration and innovation.

Qualifications

Strong knowledge of reliability systems thinking.
Experience of using a data-driven and scientific approach.
Financial services knowledge is preferable.

Responsibilities

Support the improvement of operational characteristics.
Contribute new ideas to meet service level objectives.
Be accountable for the health of production environments.

Skills

Java Spring Boot

Dev Ops tools

AWS services

Microservices

APIs

Communication skills

Data-driven approach

Tools

Gitlab

AWS EKS

Join us as a Site Reliability Engineer

In this key role, you’ll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
You'll work from home some of the time, but you'll also spend a minimum of two days per week working from the office

What you'll do

As our Site Reliability Engineer, you’ll work alongside colleagues and feature team members to meet defined service level objectives and continually improve systems and environments. You’ll proactively contribute new ideas and innovations to meet short term and longer term goals whilst at the same time balancing and managing risk.

You’ll also be accountable for the day-to-day health of both production and non-production environments, responding to incidents as required.

A typical day will involve:

Providing structure and supporting release processes, suggesting and making improvements where possible
Supporting the clear communication and frequent update of incident status to other teams and customers
Providing technical expertise and input to establish the risk tolerance of products and services
Supporting the maintenance of services once they are live by measuring and monitoring availability, latency, and overall system health

The skills you'll need

We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need experience of using a data driven and scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes.

You'll also need:

Good knowledge and experience of Java Spring Boot writing Java Micro services , APIs .
Strong knowledge of Dev Ops tools e.g. Gitlab , AWS services
Strong knowledge of deploy and release services, automation, and troubleshooting
Experience of developing and deploying on AWS EKS environments
Experience of utilising tools and technology across the software development lifecycle
Experience of using a data driven and scientific approach to fact finding
Strong communication skills with the ability to proactively engage with a wide range of stakeholders

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.