Sr. Manager, SRE

Backblaze

United States

Remote

USD 175,000 - 215,000

Full time

Job summary

A leading cloud storage company is seeking a Sr. Manager for Site Reliability Engineering to lead a global team. This role involves ensuring the reliability and availability of distributed services while driving strategy and operational excellence. Ideal candidates should have extensive experience in SRE, strong technical skills in cloud platforms, and proven leadership abilities. Join a collaborative culture focused on innovation and career development.

Benefits

Competitive compensation and benefits

Opportunity to shape global reliability engineering

Collaborative culture

Qualifications

10+ years in infrastructure, reliability, or operations engineering roles.
5+ years in people leadership with experience managing global teams.
Master's degree preferred.

Responsibilities

Build, lead, and mentor a global team of SREs.
Define the long-term vision and roadmap for SRE.
Own the end-to-end reliability of critical services.

Skills

Linux administration

Distributed systems knowledge

Cloud platforms expertise

Automation proficiency

Observability frameworks

Communication skills

Education

Bachelor’s degree in Computer Science or related field

Tools

Kubernetes

Docker

Terraform

CI/CD pipelines

Grafana

Datadog

Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we’re helping customers break free from the restrictive, overpriced legacy solutions that hold them back and blaze forward with the full power of the open cloud in their hands.

Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we completed a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $125 million in revenue and is the leading specialized storage cloud, managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals.

We are seeking a seasoned Sr. Manager, Site Reliability Engineering (SRE) to lead a global team of engineers responsible for the performance, availability, and reliability of our distributed services and infrastructure. This leader will drive SRE strategy, implement operational excellence frameworks, and partner with engineering and product teams to ensure customer-facing platforms meet or exceed SLAs.

The Sr. Manager, SRE will balance hands-on technical leadership with strategic management, guiding the team in automation, observability, incident management, and service scalability while mentoring future leaders.

Key Responsibilities:

Build, lead, and mentor a team of SREs across multiple regions and time zones.
Define the long-term vision and roadmap for SRE, aligning with organizational objectives.
Partner with product and engineering to ensure reliability is embedded in design, development, and operations.

Operational Excellence

Own the end-to-end reliability of critical customer-facing services.
Establish and maintain SLOs, SLIs, and error budgets to measure and enforce service quality.
Drive root cause analysis and problem management for major incidents, ensuring long-term fixes are prioritized.
Champion adoption of ITIL/OSS processes (incident, change, problem, and capacity management).

Automation & Tooling

Expand automation in deployment, monitoring, testing, and incident response to reduce toil.
Oversee observability platforms (e.g., Catchpoint, Grafana, Moogsoft/BigPanda, Prometheus, Datadog).
Ensure robust configuration, capacity, and change management practices.

Cross-Functional Collaboration

Partner with Network Engineering, DevOps, NOC, and Product Engineering on scalable, resilient architecture.
Support business continuity, disaster recovery, and compliance requirements.
Engage with vendors and service providers to manage SLAs and performance outcomes.

People Development

Hire, coach, and develop engineers and managers, creating strong career paths within SRE.
Foster a culture of reliability, accountability, and continuous improvement.
Lead succession planning and leadership pipeline development.

Qualifications:

Education & Experience

Bachelor’s degree in Computer Science, Engineering, or related field (Master’s preferred).
10+ years in infrastructure, reliability, or operations engineering roles.
5+ years in people leadership with experience managing managers and global teams.

Technical Skills

Deep expertise in Linux operating systems (administration, performance tuning, troubleshooting, security hardening).
Strong knowledge of distributed systems, cloud platforms (AWS, GCP, Azure, private cloud), and networking fundamentals.
Solid background in observability, monitoring, logging, and alerting frameworks.
Proficiency with automation (Python, Go, Ansible, Terraform, CI/CD pipelines).
Familiarity with containers (Kubernetes, Docker) and microservices architectures.
Strong understanding of ITIL/OSS frameworks, SLO/error budget practices, and incident management at scale.

Leadership Skills

Proven ability to manage large-scale, high-availability environments.
Strong communication skills with executive presence; able to translate technical topics into business outcomes.
Demonstrated success in building and maturing high-performing SRE/operations teams.

Preferred Attributes

Experience in a service provider, CDN, or large-scale SaaS environment.
Familiarity with compliance and regulatory frameworks (SOC 2, ISO 27001, GDPR).
Track record of driving cultural transformation toward reliability-first principles.

What We Offer

Competitive compensation and benefits package.
Opportunity to shape the future of global reliability engineering at scale.
Collaborative culture with strong support for innovation and career growth.

We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, or education. We believe that our dedication to cultivating a diverse workplace not only allows us to better serve our customers in over 175 countries, but also reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.

The base pay range for this position is $175,000 - $215,000.
