Job Search and Career Advice Platform

Enable job alerts via email!

Reliability Engineer

Danaher Corporation

Hartford

On-site

GBP 50,000 - 70,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading global technology company is searching for a Reliability Engineer in Hartford, UK, to ensure the stability and performance of production systems. You will lead incident responses, monitor system health, and collaborate across teams to develop robust solutions. Candidates should possess strong scripting skills, a solid understanding of AWS cloud services, and familiarity with incident management tools. This role offers the opportunity to make a significant impact on the quality and reliability of systems, contributing to innovative technology.

Qualifications

  • Proficient in coding repeatable tasks using scripting languages like PowerShell or Python.
  • Strong knowledge of AWS services and cloud infrastructures.
  • High-level understanding of reliability solutions and troubleshooting with logs.

Responsibilities

  • Monitor system performance and proactively spot trends.
  • Lead incident responses during critical moments.
  • Collaborate with teams to implement sustainable solutions.
  • Build tools to streamline operations and reduce manual work.
  • Utilize data insights to improve system resilience.

Skills

Automation & Scripting
Cloud & Infrastructure
Reliability & Scalability
Monitoring & Incident Management
Database & Pipelines

Tools

AWS
PowerShell
Terraform
Grafana
ServiceNow
Job description

We are seeking a highly motivated Reliability Engineer to join our team. As a Reliability Engineer, you will play a crucial role in ensuring the stability, performance, and reliability of our production systems. Your responsibilities will include proactively identifying and resolving technical issues, leading major incident responses, and implementing best practices for system reliability. You will work closely with cross-functional teams to develop and maintain robust monitoring and automation solutions. This position reports directly to the Global Reliability Manager.

Responsibilities
  • Shape system reliability at scale by monitoring performance, spotting trends, and preventing issues before they impact users.
  • Take charge during critical moments, leading major incident responses and driving rapid service restoration.
  • Solve complex problems for the long term, collaborating across teams to implement robust, sustainable solutions.
  • Automate and innovate, building tools and processes that streamline operations and reduce manual work.
  • Drive continuous improvement, using data insights and post-incident learnings to make systems more resilient every day.
Qualifications
  • Automation & Scripting: Ability to code repeatable tasks using PowerShell, Bash, or Python, and familiarity with infrastructure-as-code tools such as Terraform and configuration management tools such as Puppet.
  • Cloud & Infrastructure: Strong knowledge of AWS Cloud services, networking, security, and storage solutions both on-premises and on the cloud.
  • Reliability & Scalability: High-level understanding of High Availability, Disaster Recovery, scalability solutions, and web infrastructure troubleshooting using logs.
  • Monitoring & Incident Management: Proficiency with monitoring dashboards (Grafana, Humio, CloudWatch) and incident management tools like ServiceNow and PagerDuty.
  • Database & Pipelines: Good understanding of SQL Server, Oracle, PostgreSQL (including DML), and familiarity with CI/CD pipelines such as GitLab CI.
  • It would be a plus if you also possess previous experience in:
    • EKS troubleshooting knowledge
    • Application support experience
    • Linux OS troubleshooting experience
    • Oracle Cloud Infrastructure knowledge
  • Participate in an on-call rotation to provide 24/7 support for critical systems and respond to incidents as needed.
About Abcam

For over 25 years, Abcam has been providing tools the scientific community needs to enable faster breakthroughs in critical areas like cancer, neurological disorders, infectious diseases, and metabolic disorders. We believe that to continue making progress, we need to work together, each bringing our own unique perspectives to make an impact on the world. This community needs people like you: dedicated, agile and above all audacious so we can truly drive science forward. Join our winning team today. Together, we'll accelerate the real-life impact of tomorrow's science and technology. We partner with customers across the globe to help them solve their most complex challenges, architecting solutions that bring the power of science to life.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.