Site Reliability Engineer

ZipRecruiter

Bristol

On-site

GBP 50,000 - 80,000

Full time

30+ days ago

Job summary

An established industry player is looking for an Operations Site Reliability Engineer to enhance the performance and availability of their customer-facing SaaS products. This role involves managing critical infrastructure, ensuring 24/7 operational support, and driving automation to streamline processes. Candidates should have a strong background in cloud operations, Linux systems, and scripting, along with experience in high-availability environments. Join a dynamic team where your expertise will play a crucial role in delivering exceptional service to global clients.

Qualifications

  • Degree in Systems Engineering or Computer Science required.
  • Experience in large cloud operations environments is essential.

Responsibilities

  • Monitor and maintain production services for optimal availability.
  • Drive automation to minimize manual tasks and failures.

Skills

Cloud Operations
Linux Administration
Scripting (Perl, Ruby, Bash, Python)
Automation Platforms
Deployment Tools (Ansible, Jenkins)

Education

Degree in Systems Engineering
Degree in Computer Science

Tools

Amazon Web Services
Google Cloud Platform
Ansible Tower
Jenkins

Job description

Insight Global is seeking an Operations Site Reliability Engineer to provide global operational support for a leading infrastructure software company's customer-facing SaaS products. You will join a team of engineers who demonstrate exceptional technical expertise, manage mission-critical infrastructure, and ensure round-the-clock (24x7x365) availability, performance, and security.

This SRE role involves monitoring, maintaining, and enhancing the availability and performance of production services. Responsibilities include driving automation to minimize failures and manual tasks, supporting stakeholder requests within agreed SLAs, and managing maintenance activities, critical systems, and release planning for production applications.

Must haves:

  • A degree in Systems Engineering, Computer Science, or a related field
  • Professional experience in large cloud operations environments
  • Experience administering Linux systems and working with various Linux distributions
  • Operational experience with Amazon Web Services or Google Cloud Platform
  • Proficiency with automation platforms to streamline repetitive tasks
  • Strong scripting skills in Perl, Ruby, Python, or shell (e.g., Bash)
  • Familiarity with deployment tools such as Ansible Tower and Jenkins
  • Experience executing large-scale deployments to global infrastructure
  • Experience in system/application administration in high-availability, customer-facing, large-scale environments