Site Reliability Engineer

JR United Kingdom

Bristol

On-site

GBP 45,000 - 75,000

Full time

30+ days ago

Job summary

Join a dynamic and expanding organization as an Operations Site Reliability Engineer, where you will play a crucial role in supporting global operational needs for cutting-edge SaaS products. This position offers the chance to work in a high-availability environment, ensuring the performance and security of mission-critical infrastructure. You will collaborate with a talented team to implement automation strategies, enhance application reliability, and respond to stakeholder requests effectively. With a competitive salary, bonus, and equity package, this role is perfect for those looking to make a significant impact in a thriving industry.

Qualifications

  • Degree in Systems Engineering, Computer Science or a related field required.
  • Strong hands-on experience with Linux and cloud operations.

Responsibilities

  • Monitor and ensure performance of production services 24x7.
  • Drive automation to reduce manual tasks and improve performance.

Skills

Cloud Operations
Linux Distros
Amazon Web Services
Google Cloud Platform
Scripting (Perl, Shell, Ruby, Bash, Python)
Automation Platforms
Deployment Tools (Ansible Tower, Jenkins)
System/Application Administration

Education

Degree in Systems Engineering
Degree in Computer Science

Tools

Ansible Tower
Jenkins

Job description

Insight Global is looking for an Operations Site Reliability Engineer to provide global operational support for the customer-facing SaaS products of a leading infrastructure software product company. You will be part of a team of engineers that demonstrates superb technical competency, operates mission-critical infrastructure and ensures the highest levels of availability (24x7x365), performance and security. As an SRE, you will be part of the critical operations function responsible for the monitoring, availability and performance of production services. You will drive automation to reduce failures and manual tasks, thereby improving overall application performance and availability. As well as responding to stakeholder requests within agreed timescales or SLOs, you will support maintenance activities, critical systems and the planning of releases for production applications. This is an opportunity to join a rapidly expanding organization that also offers a highly competitive salary, bonus and equity package.

Must haves:
  • A degree in Systems Engineering, Computer Science or related fields
  • Professional experience working in a large cloud operations setting
  • Strong hands-on experience with a variety of Linux distributions
  • Operational experience of working with Amazon Web Services or Google Cloud Platform
  • Experience using an automation platform to automate repetitive actions and reduce manual effort
  • Experience and confidence in at least one scripting language such as Perl, shell, Ruby, Bash or Python
  • Familiarity with deployment tools such as Ansible Tower and Jenkins
  • Experience in carrying out large deployments to global infrastructure
  • Experience of system/application administration in distributed, customer-facing, high-availability, large-scale environments