Job Search and Career Advice Platform

Enable job alerts via email!

RedHat Linux Admin

Test Triangle

Sheffield

On-site

GBP 60,000 - 80,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology consultancy is looking for an OpenShift Site Reliability Engineer (SRE) in Sheffield, UK. This position is crucial for ensuring the reliability and performance of OpenShift-based platforms while focusing on automation. The ideal candidate will work collaboratively across various teams and must have strong technical skills in cloud-native technologies, Kubernetes, scripting, and system monitoring. This role also involves establishing SRE best practices and participating in an on-call rotation.

Qualifications

  • Hands-on experience with OpenShift virtualization and Kubernetes administration.
  • Understanding of distributed systems and common failure domains.
  • Strong knowledge of Linux systems and networking.
  • Experience with monitoring, logging, alerting & Observability tools.
  • Proficiency in scripting languages like Python, Shell, Go, Terraform.
  • Familiarity with CI/CD tools like Jenkins and GitLab CI.
  • Understanding of containerization and microservices architecture.

Responsibilities

  • Develop secure service architecture using cloud-native technologies.
  • Develop systems for automatic scanning and remediation to prevent outages.
  • Establish and enforce SRE best practices through modeling.
  • Participate in an on-call rotation.

Skills

OpenShift virtualization
Kubernetes administration
Shell scripting
Python
Go Lang
Terraform
Monitoring tools (e.g., Prometheus)
CI/CD tools (e.g., Jenkins)
Docker
Ansible
Soft Skills
Job description
Overview

OpenShift Site Reliability Engineer (SRE) – Location: UK. This role ensures the reliability, availability, and performance of OpenShift-based virtual/container platforms with a focus on automation. Collaborates across Applications, Hardware, and Network teams.

Responsibilities
  • Develop secure service architecture using cloud-native technologies.
  • Develop systems, primarily in Shell scripting, YAML, Ruby, Python and Go, to prevent outages through automatic scanning and remediation.
  • Establish and enforce SRE best practices through platform constraints and high-fidelity system modeling.
  • Participate in an on-call rotation.
Qualifications
  • Hands-on experience with OpenShift virtualization and Kubernetes administration.
  • Understanding of distributed systems and common distributed system failure domains. Experience managing a production service with RedHat, Windows and ESXi.
  • Strong knowledge of Linux systems and networking.
  • Experience with monitoring, logging, alerting & Observability tools (e.g., Otel, Prometheus, Grafana, Slunk etc.).
  • Proficiency in scripting languages Python, Shell, Go Lang, Terraform etc.
  • Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI).
  • Understanding of containerization (Docker) and microservices architecture.
  • Ansible – Configuration Management and Deployment.
  • Good problem-solving and communication skills.
  • Soft Skills:
  • Has experience and affinity to improve team performance.
  • Mindsets and Behaviors/Self-mastery.
  • Proven experience in Compute, OpenShift, Kubernetes, Hypervisors, Storage, Windows, Networks and Linux.
  • Work with industry groups and vendors outside of HSBC to establish and maintain HSBC's involvement and influence.
  • Accountability for the control and compliance of the engineering process.
  • Promote innovation and adoption of cutting-edge specialist technologies and practices with the domain.
  • Promote development of engineers through coaching, and mentoring.
  • Consult as required in other areas to assist and provide a different perspective to programmed or projects that require it.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.