Enable job alerts via email!

Senior Site Reliability Engineering Manager, Production Engineering

ThousandEyes (part of Cisco)

London

Hybrid

GBP 60,000 - 100,000

Full time

Today
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Senior Site Reliability Engineering Manager, where you will lead a talented team focused on enhancing the reliability and security of a cutting-edge digital experience platform. This role offers the opportunity to develop strategies that drive operational excellence while collaborating with cross-functional teams. If you are passionate about fostering a culture of continuous learning and innovation, and have a strong background in SRE principles and cloud technologies, this is the perfect opportunity for you to make a significant impact in a dynamic environment.

Qualifications

  • Proven track record of leading and scaling SRE teams in a fast-paced environment.
  • Deep knowledge of site reliability principles, including incident response and SLOs.
  • Expert-level knowledge of Kubernetes and its ecosystem.

Responsibilities

  • Lead a team of Site Reliability Engineers to enhance platform reliability and performance.
  • Develop strategies for improving security and compliance in cloud-native systems.
  • Collaborate with software development teams to optimize architecture for availability.

Skills

Site Reliability Engineering
Kubernetes
AWS
Microservices Architecture
Incident Response

Tools

Prometheus
OpenTelemetry
ArgoCD

Job description

Senior Site Reliability Engineering Manager, Production Engineering

Please note that we have a hybrid approach to work and would like to find someone who can come into the office in London at least one day a week

Who We Are

Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, Internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues – before they impact end-user experiences.

ThousandEyes is deeply integrated across the entire Cisco technology portfolio and beyond, helping customers deploy at scale while also delivering AI-powered assurance insights within Cisco’s leading Networking, Security, Collaboration, and Observability portfolios.

About The Role

As the Senior Engineering Manager for our Production Engineering SRE team, you will lead a group of skilled engineers responsible for the design and management of large-scale, highly available distributed systems in the cloud, collaborating directly with application development teams to enhance the reliability, performance, and security of our platform.. You'll focus on enhancing the reliability, performance, and security of our platform while collaborating with cross-functional teams to drive operational excellence.

What You’ll Do

Team Leadership and Development:

  • Build and mentor a high-performing team of Site Reliability Engineers that embed with application development teams
  • Foster a culture of continuous learning, innovation, and best practices
  • Manage performance, set goals, and provide career development opportunities

Strategic Planning and Execution:

  • Develop and implement strategies to improve platform reliability, security, and performance
  • Collaborate with other engineering leaders to align SRE initiatives with overall business objectives
  • Establish and execute on a roadmap to build common platform solutions to reliability, security, and scale challenges engineering teams at ThousandEyes face.

Operational Excellence:

  • Oversee the design and implementation of scalable operations tooling for SREs and Developers
  • Ensure the effective management of our 24x7 incident response and on-call rotation
  • Lead efforts to automate production operations and adopt robust monitoring solutions

Security and Compliance:

  • Partner with application development teams and other platform engineering teams to enhance the security posture of our containerized and cloud-native systems
  • Ensure compliance with Cisco and industry standards for data protection, scanning, and system security

Cross-functional Collaboration:

  • Work closely with software development teams to optimize architecture and services for availability and performance
  • Collaborate with product management to align SRE initiatives with product roadmaps
  • Represent the Production Engineering SRE team in cross-functional meetings and initiatives
Minimum Qualifications
  • Proven track record of leading and scaling SRE teams in a fast-paces environment
  • Deep knowledge of site reliability principles, including incident response, change management, and SLOs
  • Expert-level knowledge of Kubernetes and its ecosystem
  • Strong understanding of cloud platforms, preferably AWS
  • Experience with microservices architecture and distributed systems
Preferred Qualifications
  • Strong communication and leadership skills, with the ability to influence cross-function stakeholders
  • Demonstrated ability in SRE, DevOps, or related fields, with at least 3 years in a management role
  • Background in security engineer, DevSecOps or a strong understanding of security best practices in cloud-native environments
  • Familiarity with CNCF tools such as Prometheus, OpenTelemetry, and ArgoCD

Cisco values the perspectives and skills that emerge from employees with diverse backgrounds. That's why Cisco is expanding the boundaries of discovering top talent by not only focusing on candidates with educational degrees and experience but also placing more emphasis on unlocking potential. We believe that everyone has something to offer and that diverse teams are better equipped to solve problems, innovate, and create a positive impact.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification. Research shows that people from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy. We urge you not to prematurely exclude yourself and to apply if you're interested in this work.

Apply for this job

*

indicates a required field

First Name *

Last Name *

Email *

Phone *

Location (City) *

Resume/CV *

Enter manually

Accepted file types: pdf, doc, docx, txt, rtf

Enter manually

Accepted file types: pdf, doc, docx, txt, rtf

Education

School Select...

Degree Select...

Select...

LinkedIn Profile

Website

How did you hear about this job?

Do you now, or will you in the future, require sponsorship for employment visa status to work legally for our Company? * Select...

Are you happy with our hybrid approach to work, and able to come into the offices in London at least one day a week? * Select...

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineering Manager, Production Engineering New London, Greater London,[...]

ThousandEyes

London

Hybrid

GBP 80,000 - 120,000

Today
Be an early applicant

Regional Project Manager - Manufacturing Facility

NextWave Partners

Greater London

On-site

GBP 60,000 - 100,000

Yesterday
Be an early applicant

Manufacturing Manager, Sr Production Manager, Head of Production, Operations Production Manager[...]

TN United Kingdom

Stevenage

On-site

GBP 40,000 - 80,000

Yesterday
Be an early applicant

Digital Production Director

Futureheads

London

Hybrid

GBP 70,000 - 80,000

Today
Be an early applicant

Consultant/ Senior Consultant- Aerospace & Defence Manufacturing

TN United Kingdom

London

On-site

GBP 50,000 - 90,000

6 days ago
Be an early applicant

Production Manager

Broadwick

London

On-site

GBP 40,000 - 70,000

6 days ago
Be an early applicant

Plant / Production Manager

Rise Technical Recruitment Limited

Crawley

On-site

GBP 67,000 - 78,000

Today
Be an early applicant

Website Production Manager

Jas Gujral

London

On-site

GBP 40,000 - 80,000

7 days ago
Be an early applicant

Production Project Manager – New Fleet

Randstad Cpe London

London

On-site

GBP 75,000 - 90,000

5 days ago
Be an early applicant