Enable job alerts via email!

Senior Site Reliability Engineer

Optimize Search Group

Irving (TX)

Hybrid

USD 120,000 - 150,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a Support/Site Reliability Engineering Lead to oversee the stability and performance of digital platforms. This role involves leading support teams, ensuring operational excellence, and collaborating across various departments to enhance user experiences. Ideal candidates will have a strong technical background and experience in managing high-performing teams.

Qualifications

  • 8+ years in application support or site reliability engineering.
  • Minimum 3 years in a leadership role managing support or SRE teams.

Responsibilities

  • Lead day-to-day operations for support and reliability across platforms.
  • Monitor application health and performance using observability practices.
  • Define and enforce best-in-class support practices.

Skills

Incident Response
Root Cause Analysis
Monitoring Tools
CI/CD
DevOps Practices

Tools

Azure DevOps
ServiceNow
Datadog
Cloudflare

Job description

Get AI-powered advice on this job and more exclusive features.

Job Title: Support / Site Reliability Engineering (SRE) Lead – Digital Platforms (Web & Mobile)

Location: Irving, TX (Hybrid Monday - Wednesday On-Site)

Duration: Contract with option to hire

Position Summary:

We’re looking for a dynamic and experienced Support / SRE Lead to oversee the stability, performance, and operational excellence of our digital platforms—spanning both web and mobile applications. This role is ideal for a hands-on leader with a deep technical foundation, a passion for reliable systems, and a track record of building high-performing support teams. You’ll collaborate across engineering, product, and operations to ensure exceptional user experiences and robust, scalable platforms.

Key Responsibilities:

  • Lead day-to-day operations for support and reliability across web and mobile platforms.
  • Serve as the senior escalation point for major incidents, driving rapid and effective resolution.
  • Monitor application health, performance, and availability using modern observability practices.
  • Partner with development, QA, and product teams to support seamless deployments and feature rollouts.
  • Define and enforce best-in-class support practices, including incident response, problem management, and post-incident reviews.
  • Establish, track, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Design and implement scalable monitoring, alerting, and logging frameworks.
  • Develop and maintain automation scripts, CI/CD pipelines, and recovery tools to support system resilience.

Qualifications:

  • 8+ years of experience in application support, technical operations, or site reliability engineering.
  • Minimum 3 years in a leadership role managing support or SRE teams.
  • Deep understanding of web/mobile app architectures, APIs, cloud services (AWS, Azure, or GCP), and databases.
  • Strong experience with incident response, root cause analysis, and ITIL processes.
  • Proficiency with monitoring and alerting tools (e.g., Datadog, Cloudflare), and log analysis.
  • Hands-on experience with ticketing platforms like Azure DevOps, ServiceNow, or Freshservice.
  • Solid grasp of CI/CD pipelines, DevOps practices, and automation tooling.

Nice to Have:

  • Industry experience in e-commerce, fintech, healthcare, or media.
  • Exposure to mobile frameworks (Flutter, React Native, Kotlin, Swift).
  • Experience with CMS platforms (WordPress, Drupal, Crownpeak, AEM).
  • Relevant certifications (e.g., ITIL, AWS, Azure, SRE).
Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Contract
Job function
  • Job function
    Information Technology
  • Industries
    Consumer Services

Referrals increase your chances of interviewing at Optimize Search Group by 2x

Get notified about new Site Reliability Engineer jobs in Irving, TX.

Site Reliability Engineering - Systems Engineer - Vice President - Dallas
DevOps Engineer with Ariba or SAC experience - 100% Remote

Plano, TX $200,000.00-$205,000.00 3 days ago

Principal Software Engineer - Workflow Tools

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineers

Centene Corporation

St. Louis

Remote

USD 112,000 - 159,000

Today
Be an early applicant

Senior Site Reliability Engineer (Azure)

Ignitec Inc

Remote

USD 140,000 - 160,000

2 days ago
Be an early applicant

[Hiring] Senior Site Reliability Engineer @Wisp

Wisp

Remote

USD 120,000 - 150,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

Censys

Ann Arbor

Remote

USD 145,000 - 195,000

2 days ago
Be an early applicant

Senior Site Reliability Engineer

Xilis, Inc.

Durham

Remote

USD 120,000 - 150,000

Today
Be an early applicant

Senior Site Reliability Engineer

General Motors

Remote

USD 90,000 - 130,000

Today
Be an early applicant

Senior Site Reliability Engineer

Exygy Inc

Remote

USD 120,000 - 125,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

Firsthand

Remote

USD 100,000 - 130,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

General Motors

Austin

Remote

USD 100,000 - 130,000

Yesterday
Be an early applicant