Head of Production Engineering & Site Reliability Engineering (SRE)

SS&C Technologies

London

On-site

GBP 90,000 - 140,000

Full time

3 days ago

Job summary

SS&C Technologies, a leader in financial services technology, is seeking a Head of Production Engineering & SRE in London. You will manage critical applications, lead teams, and ensure reliability in a high-compliance environment. This role requires significant experience in cloud technologies and a strong leadership background, making it an excellent opportunity for a seasoned professional ready to drive innovation.

Benefits

Diversity and inclusion initiatives
Continuous learning opportunities
Performance reviews with career development

Qualifications

  • 10+ years in engineering, 5+ years in leadership roles.
  • Experience in managing high-compliance systems.
  • Strong understanding of cloud-native technologies and observability tools.

Responsibilities

  • Define and execute the roadmap for Production Engineering and SRE.
  • Own reliability and performance KPIs for GIDS applications.
  • Lead implementation of modern observability stacks.

Skills

Leadership
Cloud Technologies
DevOps Principles
CI/CD
Kubernetes
Observability
Incident Management

Education

AWS Certified Solutions Architect
CKA/CKAD Certification

Tools

Terraform
GitHub Actions
Splunk
Prometheus
Grafana

Job description

SS&C is a leading financial services and healthcare technology company by revenue, headquartered in Windsor, Connecticut, with 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

About SS&C Technologies

SS&C is a global provider of investment and financial software and software-enabled services for the financial services and healthcare industries. The GIDS product suite powers mission-critical investor and distributor services across asset managers, insurance companies, retirement providers, and wealth management platforms.

Job Overview

As the Head of Production Engineering and Site Reliability Engineering (SRE) for the GIDS organisation, you will lead a team responsible for the scalability, resilience, performance, and reliability of cloud and hybrid infrastructure powering some of the most critical client-facing applications in financial services.

You will be the strategic and operational leader for platform reliability, observability, incident response, CI/CD modernisation, and developer productivity.

Why Join SS&C GIDS?

  • Lead mission-critical infrastructure for a globally recognised financial technology provider.
  • Influence the technical direction of a high-impact product suite.
  • Build a modern engineering organisation with a strong culture of innovation, ownership, and reliability.

Key Responsibilities

Leadership & Strategy

  • Define and execute the vision and roadmap for Production Engineering and SRE within GIDS.
  • Build and lead globally distributed, high-performance teams with a focus on talent development, SRE culture, and operational excellence.
  • Collaborate cross-functionally with Engineering, Product, Compliance, and Infrastructure teams to improve system reliability and efficiency.

Production Operations & Incident Management

  • Own reliability, uptime, and performance KPIs for GIDS applications and services.
  • Implement a comprehensive incident management lifecycle (on-call, escalation, RCA, blameless postmortems).
  • Reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) through automated observability, alerting, and playbooks.

CI/CD and Platform Engineering

  • Oversee the development and evolution of CI/CD pipelines for all GIDS products using GitHub Actions, ArgoCD, TeamCity, Octopus Deploy, and GitOps principles.
  • Integrate static and dynamic code analysis, vulnerability scanning, artifact promotion, and release gating into the SDLC.
  • Ensure pipeline scalability and governance while maintaining developer velocity.

Observability & Troubleshooting

  • Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog).
  • Establish SLOs, SLIs, and error budgets with product and engineering teams.
  • Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection.

Security, Audit & Compliance

  • Partner with security and compliance teams to embed controls into infrastructure and software delivery.
  • Automate audit evidence collection, change tracking, and access management (e.g., HashiCorp Vault, OPA, AWS IAM).
  • Ensure all systems meet internal and regulatory audit requirements (SOC2, GDPR, etc.).

Infrastructure & Automation

  • Champion infrastructure-as-code (IaC) using Terraform, Helm, and Kubernetes for scalable cloud and hybrid deployments.
  • Optimise infrastructure cost, elasticity, and resilience through autoscaling, canary deployments, and chaos testing.
  • Maintain high SLAs for critical services running on Kubernetes, AWS, and on-prem hybrid infrastructure.

Talent Management & Culture

  • Attract, retain, and mentor top engineering talent with a strong focus on diversity and continuous learning.
  • Cultivate a culture of ownership, transparency, blameless accountability, and operational excellence.
  • Drive career development through structured learning paths, performance reviews, and skills-based mentoring.

Talent Management & Global Operations

  • Build and scale a globally distributed 24/7 operations team, ensuring consistent coverage and operational resilience.
  • Establish and enforce engineering and operational standards for deployments, monitoring, and incident response across geographies.
  • Implement and continuously refine a multi-tiered support structure (L1, L2, L3) with clear escalation paths and accountability.
  • Drive hiring, onboarding, and training initiatives that support both site reliability and continuous delivery.
  • Foster a strong engineering culture rooted in transparency, autonomy, learning, and operational excellence.
  • Develop strategies to prevent burnout in around-the-clock operations, including tooling, automation, and shift rotation planning.

Qualifications

Required:

  • 10+ years of experience in engineering, with 5+ years in a leadership role in SRE, DevOps, or Production Engineering.
  • Proven track record managing reliable, scalable systems in a high-compliance environment (e.g., FinTech, HealthTech).
  • Strong understanding of modern software development lifecycle, CI/CD, IaC, and cloud-native technologies.
  • Expertise in Kubernetes, AWS (or Azure/GCP), GitOps workflows, observability tools, and automation frameworks.
  • Excellent leadership, communication, and stakeholder management skills.
  • Certifications: AWS Certified Solutions Architect, CKA/CKAD, or relevant DevOps/SRE certs.
  • Familiarity with ISO/SOC2/GDPR compliance frameworks and evidence collection automation.

We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

  • Seniority level: Executive
  • Employment type: Full-time
  • Job function: Marketing, Public Relations, and Writing/Editing
  • Industries: Software Development
