Enable job alerts via email!

Sr. Site Reliability Engineer

Addison Group

Austin (TX)

On-site

USD 99,000 - 200,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm seeks a Senior Site Reliability Engineer to enhance system resiliency and drive automation. This role requires leadership in incident management and collaboration with cross-functional teams to ensure operational excellence. You'll architect and maintain observability systems and lead initiatives to improve efficiency. If you're passionate about technology and eager to make a significant impact, this is the perfect opportunity to join a forward-thinking team committed to excellence and continuous improvement.

Qualifications

  • 4+ years in site reliability engineering or related field.
  • Strong leadership in incident management and operational excellence.

Responsibilities

  • Drive efficiency and automate manual processes.
  • Lead high-stakes production incidents as Senior Incident Commander.

Skills

Incident Management
Leadership Skills
Problem Solving
Collaboration
Accountability

Education

Bachelor's Degree in a Related Field

Tools

Git
Terraform
Ansible
CI/CD Pipelines
Monitoring Solutions

Job description

Join to apply for the Sr. Site Reliability Engineer role at Addison Group

Join to apply for the Sr. Site Reliability Engineer role at Addison Group

Get AI-powered advice on this job and more exclusive features.

n this position, you will be a vital member of our Site Reliability Engineering (SRE) team, responsible for improving incident response, advancing problem management, identifying automation opportunities, and managing observability tools. You'll work closely with Platform and Value Stream teams to strengthen system resiliency, champion a culture of Site Reliability Engineering, and support our transition from on-premise to cloud infrastructure.

Responsibilities & Qualifications

Ideal candidates will:

  • Lead positive change with clear, collaborative leadership and measurable project outcomes.
  • Solve challenges independently while offering solutions-focused guidance to peers.
  • Empower team growth by sharing knowledge transparently and providing constructive feedback.
  • Foster a culture of diversity of thought, mutual trust, and accountability.

What You’ll Do

  • Take ownership of key projects, driving efforts to improve efficiency, enable self-service, and automate manual processes.
  • Manage initiatives from discovery through planning, scheduling, and execution using Agile Scrum methodologies.
  • Lead high-stakes production incidents as a Senior Incident Commander, ensuring rapid resolution, clear communication, and poise under pressure.
  • Facilitate post-incident retrospectives, transforming technical learnings into actionable improvements.
  • Architect, implement, and maintain cutting-edge observability systems to ensure proactive incident detection and resolution.
  • Build and manage integrations across systems to streamline monitoring, alerting, and health reporting.
  • Define and execute strategies for system availability, performance, and reliability, aligning with organizational goals.
  • Collaborate with stakeholders to establish Service Level Objectives (SLOs) and design strategies for managing breaches.
  • Mentor and guide team members, setting high standards for technical excellence and operational discipline.
  • Offer candid, constructive feedback to improve processes, systems, and team performance.
  • Serve as a trusted advisor, advocating for best practices in reliability engineering and driving cultural change across the organization.

It Is Required That You Have

  • Bachelor’s degree in a related field or equivalent education, training, or experience.
  • At least 4 years of experience in site reliability engineering, DevOps, or related engineering discipline (or equivalent education, training or experience).
  • Strong leadership skills in incident management and operational excellence.
  • Demonstrated initiative, independent work, and results-driven success
  • Expertise in building and optimizing complex systems

It Would Be Great To Also Have

  • Expertise in ITIL practices and their application in modern IT environments.
  • Extensive experience in operations and engineering with distributed systems.
  • Proficiency with Git and modern CI/CD pipelines.
  • Advanced skills in programming (Java, C#) and scripting (Python, PowerShell, Bash).
  • Hands-on experience with automation tools (Terraform, Ansible) and infrastructure as code.
  • Proven success in implementing monitoring, logging, and alerting solutions.
  • Exceptional collaboration, negotiation, and presentation skills, with the ability to inspire and influence.
  • Experience providing constructive feedback and fostering continuous improvement.
  • A passion for achieving results, with a strong sense of accountability and teamwork.

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Other
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Staffing and Recruiting

Referrals increase your chances of interviewing at Addison Group by 2x

Sign in to set job alerts for “Senior Site Reliability Engineer” roles.

Austin, TX $99,500.00-$200,000.00 2 weeks ago

Austin, TX $80,000.00-$90,000.00 2 weeks ago

Austin, TX $140,000.00-$170,000.00 1 month ago

Austin, TX $117,000.00-$173,000.00 2 weeks ago

Austin, TX $90,000.00-$170,000.00 2 months ago

Backend Software Engineer, Cloudkitchens - Infrastructure
Software Engineer Autonomy - Intern 2025

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

FlightAware- Sr. Site Reliability Engineer (Remote)

Lensa

Austin

Remote

USD 101,000 - 203,000

2 days ago
Be an early applicant

Sr. Site Reliability Engineer

Dayforce US, Inc.

Minnesota

Remote

USD 80,000 - 130,000

6 days ago
Be an early applicant

Sr. Site Reliability Engineer

Dayforce

Remote

USD 80,000 - 120,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

Yelosoftware

Remote

USD 90,000 - 150,000

Today
Be an early applicant

Senior Site Reliability Engineer

Censys, Inc.

Ann Arbor

Remote

USD 145,000 - 195,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer - Azure - Remote

Optum

Eden Prairie

Remote

USD 89,000 - 177,000

4 days ago
Be an early applicant

FlightAware- Sr. Site Reliability Engineer (Remote)

Pratt & Whitney

Remote

USD 101,000 - 203,000

5 days ago
Be an early applicant

Sr. Site Reliability Engineer (OpenTelemetry)

Optomi

Remote

USD 80,000 - 120,000

6 days ago
Be an early applicant

Senior Site Reliability Engineer

Rackspace Technology

Remote

USD 80,000 - 130,000

4 days ago
Be an early applicant