Site Reliability Engineer

JR United Kingdom

Kingston upon Hull

On-site

GBP 60,000 - 90,000

Full time

14 days ago

Job summary

A leading company in the PaaS sector is looking for a Site Reliability Engineer in Kingston upon Hull. The role involves building and maintaining high-availability systems, optimizing CI/CD pipelines, and collaborating across teams to keep critical infrastructure performant and reliable. Candidates should hold a Bachelor's degree or higher in Computer Science or a related technical field and have at least 7 years of relevant experience, especially in AWS environments.

Qualifications

7+ years of experience in Site Reliability Engineering or related fields.
Deep understanding of AWS services and Linux administration.
Strong experience in CI/CD implementation and management.

Responsibilities

Design and support EC2/ECS/EKS/Fargate environments for high availability.
Maintain and optimize existing CI/CD pipelines.
Implement and manage monitoring tools for system health.

Skills

AWS (Amazon Web Services)

Auto Scaling

Fargate

Route 53

Observability tools (New Relic, Datadog, Splunk)

Scripting (Ansible, Bash, Python, Go)

CI/CD

Education

Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field

Site Reliability Engineer, Kingston upon Hull, East Yorkshire

Client: Ranger Technical Resources

Location: Kingston upon Hull, East Yorkshire, United Kingdom

Job Category: Other

EU work permit required: Yes

Job Views: 3

Posted: 31.05.2025

Expiry Date: 15.07.2025

Job Description:

Site Reliability Engineer #2494

Position Summary:

Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. In this role, you will build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company's products. You will collaborate closely with development, DevOps, and other teams to maintain high uptime, security, and user experience standards for millions of endpoints.

Experience and Education:

Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field, or equivalent experience.
7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles.
Deep understanding of AWS and its various modules and services.
Strong background in Linux administration and troubleshooting.
Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IaC) solutions.
Proven experience with monitoring and observability tools to proactively manage system health.

Skills and Strengths:

AWS (Amazon Web Services)
Auto Scaling
Fargate
Route 53
Observability tools (New Relic, Datadog, Splunk)
Scripting (Ansible, Bash, Python, Go)
CI/CD

Primary Job Responsibilities:

Design and support EC2/ECS/EKS/Fargate environments for high availability and fault tolerance.
Implement advanced AWS features (Route 53, ALB/NLB, multi-region setups) to ensure global reliability.
Maintain and optimize existing CI/CD pipelines and deployment processes to streamline software delivery, reduce risks, and facilitate seamless integration of new features.
Collaborate with Development, QA, and DevOps teams to incorporate best practices into build and release processes.
Implement, manage, and enhance monitoring tools to proactively detect and resolve system issues.
Administer and optimize Linux-based servers and applications, ensuring stability, performance, and security.
Implement and manage containerization solutions to improve scalability and efficiency.
Apply security best practices across AWS environments, ensuring compliance with industry standards and safeguarding cloud infrastructure.
Develop automated incident response mechanisms and self-healing solutions to minimize downtime and enhance fault tolerance.
Identify, diagnose, and resolve infrastructure, networking, and application performance issues and bottlenecks to ensure operational efficiency.
Design and maintain robust backup, failover, and disaster recovery solutions to ensure business continuity.
Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends.
Work closely with development teams to optimize infrastructure for cost efficiency while maintaining high performance.
