Enable job alerts via email!

Manager, Site Reliability Engineering

Pearson

Durham (NC)

On-site

USD 90,000 - 150,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Manager of Site Reliability Engineering to lead a high-performing team focused on optimizing Content APIs in a cloud environment. In this pivotal role, you will foster a culture of automation and continuous improvement while managing systems that ensure scalability, reliability, and cost-efficiency. You will drive collaboration across various teams, implement Agile methodologies, and oversee incident management to enhance operational excellence. This is a fantastic opportunity to influence cloud operations and make a significant impact in a dynamic, 24/7 environment.

Qualifications

  • Strong communication and leadership skills; ability to engage cross-functional stakeholders.
  • Deep familiarity with DevOps/SRE tools, practices, and mindset.

Responsibilities

  • Lead and grow a high-performance SRE team delivering resilient and scalable systems.
  • Monitor and optimize cloud spend while designing cost-efficient and scalable architecture.

Skills

Leadership Skills
DevOps/SRE Tools
Infrastructure as Code (Terraform)
Configuration Management (Puppet, Chef, Ansible)
Scripting (Python, Go, Java)
Performance Optimization
Observability Tools (PagerDuty, Grafana)
Documentation Skills

Education

Bachelor's Degree in Computer Science or related field

Tools

Terraform
AWS
Puppet
Chef
Ansible
SaltStack
PagerDuty
Grafana

Job description

Manager, Site Reliability Engineering (Cloud Engineering Focus) Location: Raleigh/Durham, NC (Hybrid) Summary

Pearson’s Digital and Technology division is seeking a Manager of Site Reliability Engineering to lead a high-performing SRE team focused on building, maintaining, and optimizing Content APIs in AWS. This team ensures the scalability, reliability, security, and cost-efficiency of core services supporting Pearson’s global education platform.

You’ll manage an established team of engineers and help mature our cloud operations by fostering a culture of automation, continuous improvement, and operational excellence in a high-uptime, 24/7 environment.

What You’ll Be Doing

  • Lead and grow a high-performance SRE team delivering resilient and scalable systems.
  • Manage and continuously improve customer-facing systems with high availability expectations.
  • Apply Agile/Scrum methodologies to drive execution and iterative delivery.
  • Facilitate sprint planning, retrospectives, and other agile ceremonies.
  • Own the incident management lifecycle, including tooling, documentation, postmortems, and team training.
  • Oversee the on-call strategy and support readiness across engineering teams.
  • Drive collaboration across infrastructure, product, architecture, and security teams.
  • Monitor and optimize cloud spend while designing cost-efficient and scalable architecture.
  • Promote an “automate everything” mindset to reduce toil and improve system reliability.
  • Champion clear, actionable, and accessible documentation for systems and processes.

What We’re Looking For

  • Strong communication and leadership skills; ability to engage cross-functional stakeholders.
  • Deep familiarity with DevOps/SRE tools, practices, and mindset.
  • Hands-on experience with Infrastructure as Code tools (Terraform preferred, CloudFormation acceptable).
  • Configuration management experience using Puppet, Chef, Ansible, or SaltStack.
  • Proficient in scripting or development with Python, Go, Java, or similar languages.
  • Proven expertise in diagnosing and resolving complex performance issues across the stack.
  • Experience with observability and incident response platforms (e.g., PagerDuty, Grafana).
  • Strong documentation habits and attention to operational details.
  • Track record of optimizing cloud environments for both performance and cost.

Nice to Have

  • AWS Certifications (Solutions Architect, DevOps Engineer, Developer Associate)
  • Agile or Scrum certifications
  • Security background or experience working with secure architectures
  • Experience managing global or distributed engineering teams
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Manager, Site Reliability Engineering (IaC)

Out in Science, Technology, Engineering, and Mathematics

Boston

Remote

USD 142,000 - 228,000

3 days ago
Be an early applicant

Manager, Site Reliability Engineering (Observability)

Out in Science, Technology, Engineering, and Mathematics

New York

Remote

USD 135,000 - 216,000

12 days ago

Manager, Site Reliability Engineering

Dayforce

Remote

USD 90,000 - 150,000

12 days ago

Manager - Site Reliability Engineering

UKG

Lowell

On-site

USD 126,000 - 182,000

4 days ago
Be an early applicant

Manager, Site Reliability Engineering

Dayforce US, Inc.

Minnesota

Remote

USD 90,000 - 150,000

28 days ago

Manager, Site Reliability Engineering

Dayforce US, Inc.

Minnesota

Remote

USD 80,000 - 130,000

29 days ago

Manager, Site Reliability Engineering

Axon

Seattle

Remote

USD 135,000 - 216,000

30+ days ago

Manager, Site Reliability Engineering

Jetty

Remote

USD 90,000 - 150,000

30+ days ago

Manager, Site Reliability Engineering

Pearson

Durham

Hybrid

USD 90,000 - 150,000

30+ days ago