Enable job alerts via email!

Observability Engineer

Sophos Plc

Oxford

Hybrid

GBP 55,000 - 75,000

Full time

Today
Be an early applicant

Job summary

A global cybersecurity firm in Oxford is seeking a skilled Observability Engineer to manage and optimize observability platforms. This role involves configuring monitoring tools, integrating with incident management systems, and collaborating with cross-functional teams to enhance system reliability. Candidates should have 3-6 years of relevant experience and expertise in Zabbix and AWS CloudWatch. The position supports remote work with potential hybrid arrangements.

Benefits

Employee-led diversity and inclusion networks
Annual charity and fundraising initiatives
Global employee sustainability initiatives

Qualifications

  • 3-6 years in observability/monitoring engineering roles.
  • Strong expertise in Zabbix (architecture, deployment, customization).
  • Hands-on with Infrastructure as Code tools.

Responsibilities

  • Manage and configure Logic Monitor for monitoring.
  • Administer AWS CloudWatch for cloud resources.
  • Integrate monitoring tools with PagerDuty for event management.

Skills

Zabbix expertise
AWS CloudWatch
Automation/scripting (Python, Bash)
Monitoring configuration
Troubleshooting skills

Education

Bachelor's/Master's degree in computer science or technology

Tools

Logic Monitor
Terraform
Ansible
Job description

We are looking for a skilled Observability Engineer to join our IT Operations team. The ideal candidate will have hands-on experience with infrastructure and application monitoring tools, incident management platforms, and cloud monitoring. This role will focus on managing and optimizing our observability platforms, ensuring proactive monitoring, and enabling faster incident detection and resolution. Observability Engineer will be a part of Operation Center team, which is dedicated in maintaining the reliability, availability, and performance of mission-critical IT systems. As part of this dynamic group, observability engineers ensure end-to-end system visibility by implementing correct observability posture. The team collaborates cross-functionally with DevOps, SRE, Application, and Security units to proactively detect, diagnose, and resolve incidents leveraging advanced analytics and automation for proactive incident management and performance optimization.

What you will do
  • Manage, configure, and optimize Logic Monitor for infrastructure and application monitoring, with a focus on migrating to Zabbix.
  • Administer AWS CloudWatch for monitoring cloud resources.
  • Integrate monitoring tools with PagerDuty for effective event management.
  • Design, implement, and fine-tune monitoring dashboards, alert rules, and notification policies.
  • Develop best practices for alert tuning to reduce noise and improve proactive alert posture.
  • Collaborate with application, DevOps, and infrastructure teams to ensure monitoring coverage.
  • Automate repetitive monitoring tasks and alert configuration using scripts (Python, Bash, PowerShell, etc.).
  • Identify and implement opportunities to improve observability posture and reduce MTTA/MTTR/MTTD.
Qualifications
  • 3-6 years in observability/monitoring engineering roles.
  • Strong expertise in Zabbix (architecture, deployment, customization, API usage).
  • Applications, Synthetic and Rum monitoring configuration.
  • Hands-on with Infrastructure as Code tools (Terraform, Ansible, or similar).
  • Strong automation/scripting skills (Python, Bash, or PowerShell).
  • Experience with Logic Monitor is a plus.
  • Experience integrating monitoring tools with PagerDuty (or similar incident platforms).
  • Solid understanding of infrastructure monitoring for Windows, Linux, networking, Applications and cloud.
  • Good knowledge of AWS CloudWatch for cloud resources monitoring.
  • Develop and maintain dashboards, alerts, and automated responses to proactively detect and resolve issues before they impact users.
  • Strong troubleshooting and problem-solving skills.
  • Excellent communication and cross-team collaboration skills.
Desirable
  • Bachelor's/Master's degree in computer science/technology with excellent communications skills.
  • Experience with Zabbix API for automation and custom integrations.
  • Knowledge of database management for Zabbix backend.
  • Familiarity with containerized monitoring setups (Docker/Kubernetes).
  • Exposure to AIOps or event correlation platforms.
  • Understanding of ITIL framework.
  • Demonstrated success in deploying and managing monitoring tools and observability solutions at scale.

Sophos is a global leader and innovator of advanced security solutions for defeating cyberattacks. The company acquired Secureworks in February 2025, bringing together two pioneers that have redefined the cybersecurity industry with their innovative, native AI-optimized services, technologies and products. Sophos is now the largest pure-play Managed Detection and Response (MDR) provider, supporting more than 28,000 organizations. In addition to MDR and other services, Sophos' complete portfolio includes industry-leading endpoint, network, email, and cloud security that interoperate and adapt to defend through the Sophos Central platform. Secureworks provides the innovative, market-leading Taegis XDR/MDR, identity threat detection and response (ITDR), next-gen SIEM capabilities, managed risk, and a comprehensive set of advisory services. Sophos sells all these solutions through reseller partners, Managed Service Providers (MSPs) and Managed Security Service Providers (MSSPs) worldwide, defending more than 600,000 organizations worldwide from phishing, ransomware, data theft, other everyday and state-sponsored cybercrimes. The solutions are powered by historical and real-time threat intelligence from Sophos X-Ops and the newly added Counter Threat Unit (CTU). Sophos is headquartered in Oxford, U.K. More information is available at ., At Sophos, we believe in the power of diverse perspectives to fuel innovation. Research shows that candidates sometimes hesitate to apply if they don't check every box in a job description. We challenge that notion. Your unique experiences and skills might be exactly what we need to enhance our team. Don't let a checklist hold you back -we encourage you to apply. What's Great About Sophos? Sophos operates a remote-first working model, making remote work the primary option for most employees. However, some roles may necessitate a hybrid approach. While we are a remote first organization, applicants must have legal authorization to work in the jurisdiction where the position is posted, without requiring employer sponsorship. Our people - we innovate and create, all of which are accompanied by a great sense of fun and team spirit Employee-led diversity and inclusion networks that build community and provide education and advocacy Annual charity and fundraising initiatives and volunteer days for employees to support local communities Global employee sustainability initiatives to reduce our environmental footprint Global fitness and trivia competitions to keep our bodies and minds sharp Global wellbeing days for employees to relax and recharge Monthly wellbeing webinars and training to support employee health and wellbeing Our Commitment To You We're proud of the diverse and inclusive environment we have at Sophos, and we're committed to ensuring equality of opportunity. We believe that diversity, combined with excellence, builds a better Sophos, so we encourage applicants who can contribute to the diversity of our team. All applicants will be treated in a fair and equal manner and in accordance with the law regardless of gender, sex, gender reassignment, marital status, race, religion or belief, color, age, military veteran status, disability, pregnancy, maternity or sexual orientation.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.