Enable job alerts via email!

Senior Director - Operations and Reliability Engineering

ZipRecruiter

London

Hybrid

GBP 120,000 - 180,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

A leading consulting firm is seeking a Senior Director of Operations and Reliability Engineering. This role demands a technical, innovative leader with extensive experience in IT operations, SRE, and DevOps, responsible for driving automation and operational resilience across global teams. Ideal candidates will have significant experience in cloud computing and a proven track record in strategic leadership and transformation.

Qualifications

  • 15+ years in IT operations, SRE, DevOps, or platform engineering.
  • 5+ years in senior leadership managing large-scale IT environments.
  • Strong experience with AWS, Azure, GCP, and hybrid environments.

Responsibilities

  • Lead the integration of SRE, DevOps, and automation-first operational models.
  • Oversee global IT infrastructure, cloud operations, and service management.
  • Ensure compliance with security and risk management standards.

Skills

IT operations
SRE
DevOps
automation
cloud computing
leadership
stakeholder management

Education

Certifications: ITIL, AWS/Azure/GCP Solutions Architect
SRE Foundation
CISSP or equivalent

Tools

Kubernetes
Terraform
Ansible

Job description

Job Description

*Locations*: Canary Wharf | BostonWho We AreBoston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today, we help clients with total transformation-inspiring complex change, enabling organizations to grow, building competitive advantage, and driving bottom-line impact.To succeed, organizations must blend digital and human capabilities.

Our diverse, global teams bring deep industry and functional expertise and a range of perspectives to spark change. BCG delivers solutions through leading-edge management consulting along with technology and design, corporate and digital ventures—and business purpose. We work in a uniquely collaborative model across the firm and throughout all levels of the client organization, generating results that allow our clients to thrive.What You'll DoThe Senior Director – Operations and Reliability Engineering is responsible for blendingSite Reliability Engineering (SRE), DevOps, and traditional operations modelsto build a next-Reliability Engineering function.

This role ensuresend-to-end automation at scale, 24x7 operational excellence, and high availabilityacrossall of BCG, includingBCG Core, BCG X, and Consulting Team (CT) worldwide. The leader will drivestrategic planning, execution, and optimizationof global IT infrastructure, cloud operations, and service management while ensuring asecure, scalable, and efficienttechnology environment. This role is accountable for embedding and assuringIT Service Management (ITSM) processesacross all teams, ensuring compliance with standardized frameworks and operational excellence.Key Responsibilities:Strategic Leadership & Transformation:* Define and execute amodern Reliability Engineering strategy, integratingSRE, DevOps, and automation-first operational models.* Driveend-to-end automationto eliminate toil, improve efficiency, and enhance operational resilience.* Lead the transition from traditional IT operations to aproactive, AI-driven, self-healing infrastructure.* Establish a globalobservability, telemetry, and predictive analytics frameworkfor real-time insights.* Align operational strategies with business goals, ensuring IT supports digital transformation initiatives acrossBCG Core, BCG X, and CT.Infrastructure & Cloud Operations:* Overseeglobal IT infrastructure, cloud platforms, and hybrid hosting environmentsacrossall BCG business units.* Managenetwork reliability, compute platforms, and cloud- servicesacross AWS, Azure, and GCP.* ScaleInfrastructure as Code (IaC),automated provisioning, andcloud workload optimization.* Driveedge computing, containerized workloads, and high-performance computing strategies.* ImplementAI-driven monitoring, self-healing automation, and full-stack observability.IT Service Management & Operational Excellence:* Mandate and assure the adoption of IT Service Management (ITSM) processes across all teams, ensuring standardized, efficient, and effective service delivery.* EstablishSRE-based operational metrics, includingSLOs, SLIs, and error budgets.* Overseeincident response, problem resolution, and root cause analysis with AI-driven remediation.* Ensurehigh availability, performance, and security compliancefor all enterprise services.* Develop afollow-the-sun operational support model, ensuring24x7 resilience and uptime across all of BCG.* Optimizeincident, change, and capacity management, ensuring alignment withITIL best practicesand automated workflows.* LeadService Asset and Configuration Management (SACM), ensuringaccurate and real-time management of software and IT assets within the CMDB.* Drive continuousenhancements to the CMDB, improvingvisibility, compliance, and lifecycle managementof IT assets.Security, Compliance & Risk Management:* Embedsecurity and compliance into operational workflowswith automated security controls.* Ensure adherence toISO 27001, NIST, SOC 2, GDPR, and cloud security best practices.* Collaborate withcybersecurity teamsto integratezero-trust security models.* Driveresiliency planning, disaster recovery, and business continuity initiatives.Financial & Vendor Management:* Optimize IT operational budgets with acost-effective, cloud- strategy.* Negotiatevendor contracts, ensuring alignment with business needs and service reliability.* Drivecost efficiency in cloud spending, SaaS platforms, and infrastructure investments.Leadership & Talent Development:* Build and mentor a high-performingReliability Engineering team, fostering a culture of automation and innovation.* Lead a team ofSREs, DevOps engineers, and platform reliability expertsacross global squads.* Promote acollaborative, data-driven, and proactive mindset, ensuring agility and operational resilience.* Establish workforce development programs forAI-driven operations, automation, and modern reliability practices.What You'll BringRequired Qualifications:* 15+ years of experiencein IT operations, SRE, DevOps, or platform engineering.* 5+ years in a senior leadership role, managinglarge-scale IT environments.* Deep technical expertise incloud computing (AWS, Azure, GCP), on-prem infrastructure, and hybrid environments.* Proven track record inend-to-end automation, Infrastructure as Code (IaC), and large-scale observability.* Experience inAI-driven IT operations, predictive analytics, and automated remediation.* Strong understanding ofzero-trust security, regulatory compliance, and risk management.* Excellent leadership, communication, and stakeholder management skills.Preferred Qualifications:* Certifications:ITIL, AWS/Azure/GCP Solutions Architect, SRE Foundation, CISSP, or equivalent.* Experience withKubernetes, Terraform, Ansible, and AI-powered operations tools.* Strong problem-solving abilities, with a data-driven approach to operational excellence.TheSenior Director – Operations Platform Leadis a pivotal leadership role responsible forshaping the future of IT operationsby integratingSRE, DevOps, and automation-first methodologies.

If you are a highly technical, innovation-driven leader passionate aboutscaling operations through automation and AI-driven resilience, we invite you to apply.Who You'll Work WithWork Environment & Additional Information:* Hybrid or on-site work model.* May require occasional travel forbusiness meetings, data center visits, or vendor engagements.* Ability to work in afast-paced, high-availability IT environment, with a focus on automation and reliability.Boston Consulting Group is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to , , , , , , / expression, , , protected veteran status, or any other characteristic protected under , provincial, or local law, where applicable, and those with criminal histories will be considered in a manner consistent with applicable state and local laws. BCG is an E - Verify Employer.

(Click here )(https://careers.bcg.com/global/en/e-verify) for more information on E-Verify.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.