Site Reliability Engineering Manager

Daxko

Birmingham (AL)

On-site

USD 163,000 - 211,000

Full time

Yesterday

Job summary

A leading company in health and wellness solutions is seeking a Site Reliability Engineering Manager to oversee production assets and lead a globally distributed team. The role involves managing uptime, compliance, and performance monitoring while ensuring quality service delivery. The ideal candidate will have strong technical skills, leadership experience, and a commitment to operational excellence. This position offers a competitive salary and comprehensive benefits.

Benefits

Comprehensive benefits package
Performance-based incentives
Opportunities for growth

Qualifications

  • 3-5 years managing globally distributed teams.
  • 3-5 years in site reliability engineering.
  • Strong security mindset and experience implementing controls.

Responsibilities

  • Manage production assets and team workload.
  • Ensure uptime, data accuracy, and integrity.
  • Provide weekly reports on system availability.

Skills

Organizational Skills
Time Management
Analytical Skills
Problem-Solving Skills
Supervisory Skills
Leadership Skills
Observability

Tools

Linux
NGINX
PHP
AWS
Docker
GitLab CI
Terraform

Job description

Daxko powers wellness to improve lives. Every day our team members focus their passion and expertise on helping health & wellness facilities operate efficiently and engage their members.

Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC, or any type of organization in between, we build solutions that make every aspect of running and being a member of a health and wellness organization easier and more delightful.

As a Site Reliability Engineering Manager, you will manage all production assets for each product. Your responsibilities include patching, upgrading, and deploying new servers; organizing the team's workload; supporting engineering efforts; and overseeing compliance, uptime, and performance monitoring. You'll be responsible for prioritizing, organizing, and leading your team's execution of all work. You'll assess operational capabilities and performance to ensure the on-time delivery of quality products and services to all customers, both internal and external.

As a leader, you will:

  • Set and help the team understand performance targets and goals
  • Evaluate and provide real-time feedback on performance
  • Train and/or ensure that the team is properly trained for their specific roles
  • Coordinate on-call rotation
  • Coordinate training for staff
  • Assist in resolving emergencies, such as infrastructure or software outages
  • Manage headcount and make staffing decisions related to new hires and terminations

In your day-to-day, you will:

  • Oversee progress in achieving operational/production goals and objectives, especially with respect to quality, cost, and customer service.
  • Take responsibility for uptime, data accuracy, and integrity.
  • Interact with Engineering Leads to ensure alignment between teams.
  • Maintain business continuity for all production assets.
  • Ensure proper planning and prioritization using agile practices.
  • Ensure operations are in full compliance with all company and regulatory requirements.
  • Be a technical escalation point for your team.
  • Provide weekly reports on system availability, response, and capacity.
  • Manage on-call rotation among team members.
  • Have budget responsibilities, including ensuring fiscal responsibility for hosting and software licensing.

Qualifications

  • Three (3) to five (5) years of experience managing globally distributed team members
  • Three (3) to five (5) years of experience in a site reliability engineering capacity
  • Solid foundation in the following technologies:
    • Linux
    • Web Servers (NGINX / PHP / Traefik / F5)
    • Virtualization Technologies (VMware)
    • Cloud Platforms (AWS, Azure)
    • Containerization Systems (Docker, Kubernetes, Dynos)
    • Caching technology (Redis / RabbitMQ)
  • Strong security mindset and experience implementing security controls
  • Excellent organizational skills and attention to detail.
  • Excellent time management skills with a proven ability to meet deadlines.
  • Strong analytical and problem-solving skills.
  • Strong supervisory and leadership skills.
  • Ability to prioritize tasks and to delegate them when appropriate.
  • Strong observability experience with monitoring technologies (OpenTelemetry, Instana, LogicMonitor, PagerDuty, OpsGenie), including creating custom checks and managing alert profiles and escalation policies
  • Experience with tooling (GitLab CI, Jenkins, Chef, Terraform, Elasticsearch, Kubernetes, Rancher)
  • Scripting experience with the following languages: Ruby, Python, Bash
  • Experience with SOC, PCI, GDPR standards and regulations
  • Experience working tickets and managing priorities within issue tracking systems (Atlassian Suite, etc.)
  • Experience developing or supporting Java, PHP, or Node.js applications
  • Experience automating repetitive tasks

Additional Information

The salary range for this role is $163,000 - $211,000 per year. Where you fall within the compensation range is based on how you demonstrate the attributes and competencies required for the role. We mostly reserve the upper half of our compensation bands for internal growth. In addition to base salary, we offer a comprehensive benefits package, performance-based incentives, and opportunities for growth.

Daxko is dedicated to pursuing and hiring a diverse workforce. We are committed to diversity in the broadest sense, including thought and perspective, age, ability, nationality, ethnicity, orientation, and gender. The skills, perspectives, ideas, and experiences of all of our team members contribute to the vitality and success of our purpose and values.

We truly care for our team members, and this is reflected in our offices, benefits, and great perks. These perks are only for our full-time team members. Some of our favorites include:

All your information will be kept confidential according to EEO guidelines.
