Enable job alerts via email!

Site Reliability Engineering Manager

Jobot

Erie (Erie County)

Remote

USD 165,000 - 190,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

An innovative mid-sized SaaS organization is seeking a dynamic Site Reliability Engineering Manager to lead a talented team. In this pivotal role, you will ensure the reliability and scalability of systems supporting health and wellness platforms. You will collaborate with cross-functional teams to enhance infrastructure and applications while implementing best practices in monitoring and incident management. This company values its employees and offers a flexible work environment, comprehensive benefits, and opportunities for professional growth. Join a mission-driven team that is dedicated to making a positive impact in the health and wellness industry.

Benefits

Flexible paid time off
Affordable health, dental, and vision insurance
Monthly fitness reimbursement
401(k) matching
New-Parent Paid Leave
1-month paid sabbatical every 5 years
Up to 100% telecommute or hybrid work

Qualifications

  • 3+ years experience in Site Reliability Engineering or related field.
  • Strong problem-solving skills and ability to work in fast-paced environments.

Responsibilities

  • Lead a team of SREs to ensure system reliability and scalability.
  • Design, implement, and maintain infrastructure and applications.

Skills

Linux
AWS
Azure
Docker
Kubernetes
Python
Bash
GitLab CI
Jenkins
Terraform

Education

Bachelor's degree in Computer Science
Bachelor's degree in Engineering

Tools

VMware
Redis
RabbitMQ
ElasticSearch
Rancher

Job description

This range is provided by Jobot. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$165,000.00/yr - $190,000.00/yr

Manage a small team of SREs supporting commercial SaaS platforms in the health and wellness space - 100% Remote

Salary $165,000 - $190,000 per year

A Bit About Us

This mid-sized SaaS organization powers health & wellness throughout the world. Every day their members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members.

Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC--and every type of organization in between--we build solutions that make every aspect of running and being a member of a health and wellness organization easier and delightful.

Why join us?

We truly care for our team members, and this is reflected through our offices, benefits, and great perks.

  • Flexible paid time off
  • Affordable health, dental, and vision insurance options
  • Monthly fitness reimbursement
  • 401(k) matching
  • New-Parent Paid Leave
  • 1-month paid sabbatical every 5 years
  • Up to 100% telecommute or hybrid work in one of the offices
Job Details

We are seeking a dynamic and experienced Site Reliability Engineering Manager to join our team in the Technology industry. As the SRE Manager, you will be responsible for ensuring the reliability, availability, and scalability of our systems and infrastructure. You will work closely with cross-functional teams to design, implement, and maintain our infrastructure and applications. The successful candidate will have a strong background working in environments built on technologies like Linux, VMware, AWS, Azure, Docker, Kubernetes, Redis, RabbitMQ, monitoring, GitLab CI, Jenkins, Terraform, ElasticSearch, Rancher, Python, Bash, and Lambdas.

Responsibilities
  • Lead a team of SREs to ensure the reliability, availability, and scalability of our systems and infrastructure
  • Design, implement, and maintain our infrastructure and applications
  • Develop and implement monitoring and alerting systems to ensure the health of our systems and infrastructure
  • Collaborate with cross-functional teams to optimize our systems and infrastructure
  • Manage incident response and resolution processes
  • Develop and maintain disaster recovery plans
  • Ensure compliance with security and regulatory requirements
  • Continuously improve our processes and infrastructure to increase efficiency and reduce downtime
Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 3+ years of experience in Site Reliability Engineering or related field
  • Strong background in Linux, VMware, AWS, Azure, Docker, Kubernetes, Redis, RabbitMQ, monitoring, GitLab CI, Jenkins, Terraform, ElasticSearch, Rancher, Python, Bash, and Lambdas.
  • Experience leading a team of SREs
  • Strong problem-solving skills and ability to work in a fast-paced environment
  • Excellent communication and collaboration skills
  • Experience with agile methodologies and DevOps practices
  • Knowledge of security and regulatory requirements and best practices
  • Ability to manage incident response and resolution processes
  • Experience developing and maintaining disaster recovery plans
  • Strong commitment to continuous improvement and learning.

Jobot is an Equal Opportunity Employer. We provide an inclusive work environment that celebrates diversity and all qualified candidates receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.