Site Reliability Engineering Manager

Be among the first applicants.
TN United Kingdom
London
USD 60,000 - 100,000
Be among the first applicants.
2 days ago
Job description

Social network you want to login/join with:

Site Reliability Engineering Manager, London

col-narrow-left

Client:

Canonical

Location:

London, United Kingdom

Job Category:

-

EU work permit required:

Yes

col-narrow-right

Job Reference:

44bf86cf0092

Job Views:

15

Posted:

01.05.2025

Expiry Date:

15.06.2025

col-wide

Job Description:

This is a world-class devops and gitops engineering management challenge, bringing together operations management, software engineering and product development, and team leadership in a single high-value role.

Our mission is to pioneer and prove new and better approaches to large-scale IS. We support Canonical and Ubuntu operations, but we also help shape Canonical's managed application service offerings, raising the bar on devops and cloud-native operations. We take infra-as-code to the next level, blazing a trail to next-generation model-driven operations. We not only aim to automate every process that underpins our business, we also share that automation as open source packages which others use to drive their own operations.

We work across the full stack, from bare metal to Kubernetes, including cloud and virtualisation. We also work across the full range of infrastructure, from public cloud to private cloud and edge.

We have fully distributed, home-based teams in EMEA, APAC and the Americas. You will lead a team in your time zone, and report to a global director who may not be in your time zone.

Your role as SRE Manager

You will need to be a Linux and operations expert, as well as a great manager capable of leading a high-performance team, to excel in this role.

The IS team at Canonical runs the services used by over 60 million Ubuntu users. We automate all of Canonical’s production services, embracing model-driven operations. We are part of Canonical's effort to raise the bar on ops technology, encapsulating real-world operational knowledge into reusable and composable software operations packages. We use our real-life operational experiences to contribute to product improvements.

From Kubernetes to the kernel and everything in-between, you will be working with the latest technologies in a fast-paced engineering environment. As an SRE Manager you will be responsible for the operations engineers in your time zone. This includes customer service management, managed services operations and consistent product improvement engineering. Collaboration with internal customers, product engineering, and development groups is critical to success.

What your day will look like

  • Lead your team in daily agile devops practices
  • Represent the IS team to stakeholders, customers, and internal teams
  • Organize, coordinate and drive internal projects
  • Mentor engineers to improve their skills
  • Identify and measure team health indicators
  • Implement structured engineering and operations processes
  • Ensure proper team focus on priorities, milestones, and deliverables
  • Work to meet service level agreements with customer deployments around the globe
  • Deliver quality managed services in a consistent, timely manner

What we are looking for in you

  • Drive and a track record of going above-and-beyond expectations
  • Proven experience of software delivery using infrastructure as code
  • Proven experience managing devops teams for SAAS or similar offerings
  • Understanding of testing methodologies and maintainable code quality
  • Technical aptitude for understanding complex distributed systems
  • Experience with cloud topologies and technologies
  • Ability to travel twice a year, for company events up to two weeks long
  • An exceptional academic track record from both high school and university

Additional skills that you might also bring

  • Experience with Ubuntu system administration
  • Experience with agile software development methodologies
  • Experience working in and managing distributed teams

What we offer you

We consider geographical location, experience, and performance in shaping compensation worldwide. We revisit compensation annually (and more often for graduates and associates) to ensure we recognise outstanding performance. In addition to base pay, we offer a performance-driven annual bonus. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2, per year
  • Annual compensation review
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programme
  • Opportunity to travel to new locations to meet colleagues
  • Priority Pass, and travel upgrades for long haul company events
Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Site Reliability Engineering Manager jobs in London