Enable job alerts via email!

Site Reliability Engineer III

Sinch

Atlanta (GA)

Remote

USD 142,000 - 181,000

Full time

4 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering company in the communication sector as a Senior Site Reliability Engineer. In this role, you'll be instrumental in shaping and optimizing the infrastructure that supports a leading email platform. Collaborate with product engineering teams to enhance system reliability and performance while solving complex challenges in distributed systems. This innovative firm values creativity and teamwork, offering a dynamic environment where your contributions will drive real-world impact. If you're passionate about engineering and eager to make a difference, this opportunity is perfect for you.

Benefits

Comprehensive medical, dental, and vision plans
Free virtual counseling resources
401(k) options with employer match
Generous paid time off program
Paid parental leave
Flexible remote work offerings
Paid time off for volunteering

Qualifications

  • Strong background in infrastructure and reliability engineering.
  • Proficiency in cloud platforms and configuration management tools.
  • Expertise in monitoring tools and distributed databases.

Responsibilities

  • Collaborate to define and implement system requirements.
  • Design and maintain cloud-based microservices infrastructure.
  • Automate operational tasks to improve efficiency.

Skills

Infrastructure Engineering
Reliability Engineering
Cloud Platforms (GCP/AWS)
Configuration Management (Terraform, Ansible)
Monitoring Tools (Prometheus, Grafana)
Distributed Databases (Cassandra, Elasticsearch)
Linux Systems Administration
Programming (Python, Go)

Tools

Terraform
Ansible
Prometheus
Grafana

Job description

Description

Sinch is pioneering the way the world communicates. More than 150,000 businesses - including Google, Uber, Paypal, Visa, Tinder, and many others - rely on Sinch's Customer Communications Cloud to power engaging customer experiences through mobile messaging, voice, and email.

Whether you need to verify users or craft omnichannel campaigns, Sinch makes it easy. Our AI-infused Super Network, APIs, and applications ensure you can connect with your customers reliably and securely, at every step of their journey.

At Sinch we "Dream Big", "Win Together", "Keep it simple", and "Make it Happen". These values are our foundation!

At Sinch Mailgun, we're building the infrastructure that powers communication at internet scale. As one of the largest email providers in the world, our platform delivers billions of emails every day for developers, startups, and global enterprises alike.

We're looking for a Senior Site Reliability Engineer to join our SRE team, the group responsible for keeping our systems fast, reliable, and secure. In this role, you will assist in shaping, scaling, and optimizing the critically important infrastructure that underpins each Mailgun service. You'll work closely with product engineering teams to drive improvements, automate workflows, and ensure our systems meet the highest reliability standards.

This is more than just keeping the lights on. You'll be engineering the future of a platform trusted by developers and companies around the globe, solving complex distributed systems challenges, and driving real-world innovation in how email infrastructure is built and operated.

Responsibilities
  • Collaborate with other teams to define and implement system requirements.
  • Design, build, and maintain cloud-based microservices infrastructure.
  • Automate routine operational tasks and remediation processes to improve efficiency and reliability.
  • Proactively fix and resolve issues, collaborating with support teams, other engineering teams, and using monitoring tools to ensure system health.
  • Ensure that datastores operate efficiently and meet performance and availability goals.
  • Contribute to the team's growth by mentoring junior engineers and sharing standard methodologies.
  • Plan and execute strategies for scaling systems and infrastructure as needs grow.
Requirements
  • Strong background in infrastructure, operations, or software engineering with a focus on reliability.
  • Extensive experience working with cloud platforms such as Google Cloud Platform (GCP) or Amazon Web Services (AWS).
  • Proficiency in using configuration management tools like Terraform and Ansible to manage infrastructure.
  • Hands-on experience with modern monitoring and observability tools such as Prometheus, Grafana, and similar technologies.
  • Proven experience with distributed databases (e.g. Cassandra, Elasticsearch) and maintaining their health at scale.
  • Familiarity with distributed event stores and stream-processing platforms.
  • Strong coding skills in at least one modern programming language (Python, Go, etc.).
  • Expertise in running and maintaining production systems in a Linux environment and public cloud infrastructure.
  • Demonstrated expertise in architecting solutions for complex technical challenges, and the ability to lead initiatives from conception through to execution.
  • Strong interpersonal and communication skills, with a history of building effective relationships with cross-functional teams.
  • Ability to mentor and guide junior engineers, fostering a collaborative and inclusive team culture.
Preferred
  • Experience with container orchestration platforms.
  • Expertise in CI/CD pipeline automation and infrastructure as code practices.
  • Knowledge of network architecture and security best practices in cloud environments.
  • Experience with containerization and microservices architectures.
  • Advanced problem solving skills, particularly in highly sophisticated and distributed systems.
Our Hiring Process

At Sinch, we are committed to following a recruitment process that is fair, objective, consistent, and non-discriminatory. We use pre-employment assessment to create an inclusive application experience to help foster diverse and high performing teams.

Even if you do not meet all job requirements, don't let that stop you from considering Sinch for the next step in your career. We are always looking for people that could help us pioneer the way the world communicates.

Benefits
  • STAY HEALTHY: We offer comprehensive market competitive medical, dental, and vision plans. A variety of supplemental plans are also provided to meet your individual needs including access to telehealth for all participants.
  • CARE FOR YOURSELF: Take advantage of our free virtual counseling resources through our global Employee Assistance Program. Your mental health is as important as your physical health.
  • SECURE YOUR FUTURE: Plan for your future with our Roth and Pre-tax 401(k) options including an employer match for all participants.
  • TAKE A BREAK: Enjoy a generous paid time off program. We value balance and understand that performance at work requires time to rest at home and/or rejuvenate on vacation.
  • PUT FAMILY FIRST: We know that families can be built in a variety of ways; therefore, we offer paid parental leave and family planning support.
  • WORK WHEREVER: Our flexible remote work offerings allow you to work wherever you are the most productive and successful. It is what you do, not where you work, that matters.
  • MAKE AN IMPACT: Support betterment in your community and beyond by taking paid time off to support a volunteer program of your choice.

We're proud to be an equal opportunity employer, and all qualified applicants will be considered to join our team regardless of race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.

The annual starting salary for this position is $142,768.00 - $180,960.00. Factors which may affect starting pay within this range may include geography/market, skills, education, experience, and other qualifications. This role will be accepting applications until 4/28/25 at a minimum. Please note that the application timeline may be flexible to accommodate a comprehensive candidate evaluation.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer III

ZipRecruiter

Atlanta

Remote

USD 142,000 - 181,000

8 days ago

Senior Site Reliability Engineer

Circle

Atlanta

On-site

USD 120,000 - 195,000

10 days ago

Engineer III - Data Reliability Engineer (Remote)

CrowdStrike Holdings, Inc.

Austin

Remote

USD 110,000 - 180,000

7 days ago
Be an early applicant

Database Reliability Engineer III - Data Services (Remote, CAN)

CrowdStrike

Remote

CAD 110,000 - 180,000

30+ days ago