Enable job alerts via email!

Principal Site Reliability Engineer

Orgvue

London

On-site

EUR 80,000 - 120,000

Full time

3 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Orgvue, une entreprise de conception organisationnelle, recherche un Principal Site Reliability Engineer à Londres. Le candidat idéal sera un leader technique sénior, se concentrant sur la mise à l'échelle et la solidification d'infrastructures basées sur AWS et Kubernetes. Les responsabilités incluent la définition de SLOs, l'application des meilleures pratiques SRE et la collaboration avec diverses équipes pour bâtir une culture de fiabilité.

Benefits

Subsidised Gym Membership
Private Medical Insurance
25 days holiday
Summer Fridays
Employer pension contribution
Season ticket Loan
Cycle to Work Scheme
Annual Discretionary Bonus

Qualifications

  • Expérience démontrable en transformation SRE.
  • Expertise pratique approfondie avec Kubernetes en environnements de production.
  • Connaissance approfondie des services AWS.

Responsibilities

  • Définir et appliquer SLOs, SLIs, et budgets d'erreurs.
  • Mettre en œuvre une stratégie d'infrastructure cloud.
  • Guider l'équipe dans la construction de systèmes automatiques et autoréparables.

Skills

Kubernetes
AWS core services
Infrastructure as Code
Observability
Automation
Incident management

Tools

Terraform

Job description

Principal Site Reliability Engineer, London
Client:

Orgvue

Location:

London, United Kingdom

Job Category:

Other

-

EU work permit required:

Yes

Job Reference:

465704a68d8a

Job Views:

37

Posted:

22.06.2025

Expiry Date:

06.08.2025

Job Description:

Orgvue is an organisational design and planning platform that empowers your business to transform its workforce by understanding the work people do and the skills they have. Our platform connects strategy to structure, providing clarity of vision, so you can build a more adaptable, better performing organisation that thrives in a constantly changing world of work.

The world’s largest and best-known enterprises and consulting firms use Orgvue to visualise and model current and future states of the organisation and make faster, more informed decisions. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney.

As a Principal Site Reliability Engineer, you will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure. You will work across product, platform, and operations teams to ensure our systems are reliable, observable, and resilient — even at scale.

This role combines hands-on technical capability with strategic vision, helping us build a world-class reliability culture and a robust engineering foundation for growth. We're looking for someone who has technical expertise, is a great communicator and enjoys collaborating across multiple teams.

Responsibilities

  • Define and enforce SLOs, SLIs, and error budgets across critical services
  • Crafting and implementing a cloud infrastructure and tooling strategy
  • Work across our Org to level up SRE practices
  • Help implement robust observability metrics, logs & traces using our observability tool
  • Guide the team in building automated, self-healing systems
  • Own and evolve our incident response processes, including on-call practices and post-mortem culture
  • Mentor engineers across the org on best practices in reliability, operational readiness, and scalable infrastructure
  • Drive Infrastructure as Code (IaC) using Terraform, Kubernetes, CloudFormation and GitOps practices
  • Collaborate closely with security, DevOps, and software teams to ensure compliance, scalability, and operational excellence
  • Evaluate and introduce tools, patterns, and practices that improve the performance and reliability of our SaaS platform

Requirements

Desired Skills & Experience:

  • Demonstrable experience leading SRE transformations
  • Deep hands-on expertise with Kubernetes (EKS preferred) in production environments
  • Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.)
  • Expert in Infrastructure as Code using tools such as Terraform, with knowledge of GitOps workflows
  • Strong background in observability: metrics, visualization, logging, and tracing
  • Understanding of automation, SDLC, CI/CD pipelines, deployment automation, and blue/green or canary releases
  • Proven experience with incident management, disaster recovery planning, root cause analysis, and post-incident reviews
  • Hybrid working - 1+ days a week in the London office
  • Subsidised Gym Membership
  • Private Medical Insurance (including Dental and Vision) and Life Assurance
  • 25 days holiday (increasing to 30 days at a rate of 1 extra day per year)
  • Summer Fridays (half-day Fridays for the months of July and August)
  • Employer pension contribution of 5% of your gross salary, if you contribute a minimum of 3%
  • Season ticket Loan
  • Cycle to Work Scheme
  • Annual Discretionary Bonus

'Here at Orgvue we promote individualism and a diverse workforce to build on our future success'

Please note that if you are NOT a passport holder of the country for the vacancy you might need a work permit. Check our Blog for more information.

Bank or payment details should not be provided when applying for a job. Eurojobs.com is not responsible for any external website content. All applications should be made via the 'Apply now' button.

Created on 22/06/2025 by TN United Kingdom

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal Site Reliability Engineer

iwoca

Greater London null

Hybrid

Hybrid

GBP 100,000 - 140,000

Full time

14 days ago

Principal Site Reliability Engineer

iwoca

London null

Hybrid

Hybrid

GBP 100,000 - 140,000

Full time

27 days ago

Principal Site Reliability Engineer - iwoca

Jobs via eFinancialCareers

London null

Hybrid

Hybrid

GBP 100,000 - 140,000

Full time

22 days ago

Lead Site Reliability Engineer

Prism Digital

Milton Keynes null

On-site

On-site

GBP 80,000 - 110,000

Full time

8 days ago

Lead Safety Engineer

Technical Staffing Resources

Leatherhead null

Hybrid

Hybrid

GBP 80,000 - 100,000

Full time

5 days ago
Be an early applicant

Principal Site Reliability Engineer

ZipRecruiter

London null

On-site

On-site

GBP 70,000 - 120,000

Full time

26 days ago

Lead Site Reliability Engineer SRE Java - FinTech

ZipRecruiter

London null

Hybrid

Hybrid

GBP 110,000 - 130,000

Full time

28 days ago

Lead Site Reliability Engineer SRE Java - FinTech

Client Server

London null

Hybrid

Hybrid

GBP 100,000 - 130,000

Full time

29 days ago

Principal Site Reliability Engineer

Orgvue

London null

Hybrid

Hybrid

GBP 80,000 - 120,000

Full time

30+ days ago