Enable job alerts via email!

Site Reliability Engineer

JR United Kingdom

Slough

On-site

GBP 70,000 - 95,000

Full time

14 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading technology company is seeking a Site Reliability Engineer to ensure the reliability and performance of critical infrastructure. The candidate will be responsible for maintaining systems, optimizing CI/CD pipelines, and implementing security best practices in an AWS environment, ensuring high availability and fault tolerance across millions of endpoints.

Qualifications

  • 7+ years of experience in Site Reliability Engineering or similar.
  • Deep understanding of AWS services and modules.
  • Experience with Infrastructure as Code solutions.

Responsibilities

  • Design and support EC2/ECS/EKS/Fargate environments for high availability.
  • Collaborate to integrate best practices into build and release processes.
  • Ensure business continuity through robust backup and disaster recovery solutions.

Skills

AWS
Auto Scaling
Observability tools
Scripting
CI/CD

Education

Bachelor's or higher degree in Computer Science or related field

Tools

New Relic
DataDog
Splunk

Job description

Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the critical infrastructure and applications' reliability, scalability, and performance. In this role, you’ll build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company’s products. You’ll collaborate closely with development, DevOps, and other teams to maintain high uptime, security, and user experience standards for millions of endpoints.

Experience and Education:

  • Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field/experience.
  • 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles.
  • Deep understanding of AWS and its various modules and services.
  • Strong background in Linux administration and troubleshooting.
  • Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions.
  • Proven experience in monitoring and observability tools to proactively manage system health.

Skills and Strengths:

  • AWS (Amazon Web Services)
  • Auto Scaling
  • Fargate
  • Route53
  • Observability tools (New Relic, DataDog, Splunk)
  • Scripting (Ansible, Bash, Python, GO)
  • CI/CD

Primary Job Responsibilities:

  • Design and support EC2/ECS/EKS/Fargate environments for high availability and fault tolerance.
  • Implement advanced AWS features (Route53, ALB/NLB, multi-region setups) to ensure global reliability.
  • Maintain and optimize the existing CI/CD pipelines and deployment processes to streamline software delivery, reduce risks, and ensure seamless integration of new features.
  • Collaborate with Development, QA, and DevOps teams to integrate best practices into build and release processes.
  • Implement, manage, and enhance monitoring tools to proactively detect and resolve system issues.
  • Administer and optimize Linux-based servers and applications, ensuring stability, performance, and security.
  • Implement and manage containerization solutions to improve scalability and efficiency.
  • Implement security best practices across AWS environments, ensuring compliance with industry standards and safeguarding cloud infrastructure.
  • Develop automated incident response mechanisms and self-healing solutions to minimize downtime and enhance fault tolerance.
  • Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency.
  • Ensure business continuity by designing and maintaining robust backups, failover strategies, and disaster recovery solutions.
  • Identify, diagnose, and resolve infrastructure or application performance bottlenecks.
  • Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends.
  • Work closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance.
  • Ensure business continuity by designing and maintaining robust backup, failover, and disaster recovery solutions.

Please note that if you are NOT a passport holder of the country for the vacancy you might need a work permit. Check our Blog for more information.

Bank or payment details should not be provided when applying for a job. Eurojobs.com is not responsible for any external website content. All applications should be made via the 'Apply now' button.

Created on 31/05/2025 by JR United Kingdom

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

JR United Kingdom

Slough

Remote

GBP 90,000 - 90,000

10 days ago

Site Reliability Engineer (Equity only 0.5%)

JR United Kingdom

Reading

Remote

GBP 70,000 - 90,000

6 days ago
Be an early applicant

Site Reliability Engineer (Equity only 0.5%)

JR United Kingdom

London

Remote

GBP 70,000 - 110,000

6 days ago
Be an early applicant

Senior Site Reliability Engineer

JR United Kingdom

Hemel Hempstead

Remote

GBP 90,000 - 90,000

10 days ago

Senior Site Reliability Engineer

JR United Kingdom

Woking

Remote

GBP 70,000 - 90,000

10 days ago

Senior Site Reliability Engineer

JR United Kingdom

Watford

Remote

GBP 76,000 - 90,000

10 days ago

Senior Site Reliability Engineer

JR United Kingdom

Bedford

Remote

GBP 76,000 - 90,000

10 days ago

Senior Site Reliability Engineer

JR United Kingdom

Luton

Remote

GBP 70,000 - 90,000

10 days ago

Senior Site Reliability Engineer

JR United Kingdom

Stevenage

Remote

GBP 70,000 - 90,000

10 days ago