Enable job alerts via email!

Telecoms: Senior Manager, Technology Operations & Reliability

American Workforce Solutions

United States

Remote

USD 130,000 - 175,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a Senior Manager for Technology Operations & Reliability. This remote role requires expertise in telecom platform management, incident response, and team leadership. The candidate will drive operational excellence and implement advanced SRE practices to ensure high availability and performance of services. Join a dynamic team focused on innovative communication solutions and enjoy the flexibility of a fully remote position with growth opportunities.

Benefits

Medical insurance
Vision insurance
401(k)
Child care support

Qualifications

  • 8+ years in Technology Operations, DevOps, or SRE.
  • Expertise in Asterisk, Kamailio, FreeSWITCH, and AWS.
  • Strong coding skills in Python, Go, or Bash.

Responsibilities

  • Monitor and maintain the telecom platform's availability and performance.
  • Lead incident management and root cause analysis.
  • Develop automation scripts and manage AWS infrastructure.

Skills

Leadership
Problem-Solving
Communication

Tools

AWS
Terraform
Kubernetes
Python
Grafana
Prometheus
Asterisk
Kamailio
FreeSWITCH
Jenkins

Job description

Telecoms: Senior Manager, Technology Operations & Reliability
Telecoms: Senior Manager, Technology Operations & Reliability

Senior Manager, Technology Operations & Reliability

Location: Remote (US)

Department: IT Operations

Reports To: Director Technology Operations

Position Overview

The Senior Manager, Technology Operations & Reliability is a critical leadership role responsible for ensuring the stability, performance, and resilience of our customer-focused telecom platform. This position combines hands-on technical expertise with strategic oversight, acting as an Incident Commander during high-pressure situations and driving long-term operational excellence. You will lead a talented team, foster collaboration, and implement advanced Site Reliability Engineering (SRE) and DevOps practices to maintain a highly available platform powered by open-source telecom software and AWS infrastructure.

Key Responsibilities

  • Platform Reliability: Proactively monitor and maintain the availability, performance, and resilience of our telecom platform, built on open-source telecom software (Asterisk, Kamailio, FreeSWITCH) and OpenStack in an AWS environment.
  • Incident Management: Serve as the Incident Commander, leading rapid response and resolution for outages, performance issues, and security incidents to minimize customer impact.
  • Root Cause Analysis: Conduct thorough root cause analyses (RCAs) and implement corrective actions to prevent recurrence, refining runbooks and escalation protocols.
  • Automation & Observability: Develop automation scripts (Python, Go, Bash) and deploy observability tools (Prometheus, Grafana, Splunk, Datadog) to enhance monitoring, reduce downtime, and enable proactive issue detection.
  • Cloud Infrastructure: Manage and optimize AWS-based infrastructure (EC2, EKS, CloudWatch) and Kubernetes clusters, ensuring scalability and fault tolerance for telecom workloads.
  • Telecom Expertise: Configure and troubleshoot open-source telecom software (Asterisk, Kamailio, FreeSWITCH) and VoIP protocols (SIP, RTP, WebRTC) to deliver high-quality call services.
  • DevOps & SRE Practices: Enhance CI/CD pipelines (Jenkins, GitLab CI), implement Infrastructure as Code (Terraform, CloudFormation), and champion SRE principles like SLOs, SLIs, and error budgets.
  • Team Leadership: Mentor and develop a diverse, high-performing team of engineers, fostering collaboration and upskilling through training and succession planning.
  • Cross-Functional Collaboration: Partner with engineering, product, and support teams to address reliability gaps, optimize system performance, and align with business goals.
  • Innovation: Stay ahead of industry trends (e.g., WebRTC advancements, 5G integration) to drive continuous improvement in system architecture and operational processes.

Qualifications

Technical Skills

  • Experience: 8+ years in Technology Operations, DevOps, or Site Reliability Engineering, with a focus on telecom or real-time communication systems.
  • Telecom Software: Hands-on expertise with Asterisk, Kamailio, FreeSWITCH, and OpenStack, including configuration, troubleshooting, and optimization.
  • Cloud & Containers: Proficiency in AWS (EC2, EKS, RDS, CloudWatch) and Kubernetes, with experience in high-availability architectures.
  • Observability Tools: Deep knowledge of Prometheus, Grafana, Splunk, Datadog, or ELK Stack for monitoring and log analysis.
  • Automation: Strong coding skills in Python, Go, or Bash for scripting and automation of operational tasks.
  • DevOps Tools: Experience with Terraform, CloudFormation, Jenkins, or GitLab CI for Infrastructure as Code and CI/CD pipelines.
  • Networking: Expertise in VoIP protocols (SIP, RTP, WebRTC), network troubleshooting (Wireshark), and QoS optimization.
  • SRE & ITIL: Familiarity with SRE principles (SLOs, SLIs, blameless postmortems) and ITIL frameworks for incident and change management.

Leadership & Soft Skills

  • Leadership: Proven ability to lead and mentor technical teams in a remote, results-driven environment.
  • Talent Management: Experience in onboarding, upskilling, and succession planning to build a high-performing team.
  • Communication: Excellent verbal and written skills for coordinating with cross-functional teams and presenting to non-technical stakeholders.
  • Problem-Solving: Strong analytical skills for diagnosing complex issues and implementing effective solutions under pressure.
  • Time Management: Ability to prioritize tasks and manage multiple high-stakes priorities in a fast-paced setting.
  • Collaboration: Adept at fostering a team-oriented culture using virtual tools (MS Teams, MS Office).
  • Confidentiality: Demonstrated ability to exercise discretion and make sound decisions.
  • 100% Remote: Work from home, with flexibility to collaborate across US time zones.

Why Join Us?

  • Impactful Mission: Contribute to a platform that delivers innovative communication solutions.
  • Innovative Environment: Work with cutting-edge open-source telecom technologies and AWS and Azure cloud infrastructure.
  • Remote Flexibility: Enjoy the freedom of a fully remote role with a collaborative, inclusive team culture.
  • Growth Opportunities: Lead and develop a high-performing team while staying at the forefront of telecom and SRE innovations.
Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering, Information Technology, and Management
  • Industries
    Telecommunications

Referrals increase your chances of interviewing at American Workforce Solutions by 2x

Inferred from the description for this job

Medical insurance

Vision insurance

401(k)

Child care support

Get notified when a new job is posted.

Sign in to set job alerts for “Technology Operations Manager” roles.
Business Operations Manager, One Medical Operations

United States $130,000.00-$175,000.00 1 week ago

United States $142,000.00-$202,000.00 2 days ago

Learning and Development Technology & Operations Manager
Director, Organizational Change Management

United States $69,750.00-$114,750.00 2 weeks ago

Operations Manager, Software Implementation
Strategy & Business Operations Manager (Fintech)
Revenue Operations Manager - Customer Success Ops

Texas, United States $172,400.00-$199,000.00 1 month ago

Technology Business Partner, Senior Manager

Chicago, IL $130,000.00-$150,000.00 3 weeks ago

Senior Technology Manager Lead, Global Trade (Remote)
Director and AMS Regional Leader - Contracts

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.