Enable job alerts via email!

Senior Site Reliability Engineer, Atlas

MongoDB

United States

Remote

USD 120,000 - 160,000

Full time

7 days ago

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a talented Site Reliability Engineer (SRE) to enhance their Atlas platform. This role demands a strong infrastructure background and a customer-first mindset to ensure high availability and reliability. The ideal candidate will have extensive experience in managing critical systems, a firm grasp of cloud environments, and proficiency in modern programming languages. Join this dynamic team to tackle technical challenges and contribute to the development of a resilient multi-cloud platform that supports a wide range of customer applications. Embrace a culture that values employee well-being and professional growth.

Benefits

Fertility Assistance

Generous Parental Leave

Employee Affinity Groups

Disability Accommodations

Qualifications

5+ years of experience running critical systems at scale.
Strong understanding of large scale Linux environments.
Familiarity with major cloud providers and multi-cloud systems.

Responsibilities

Design and build complex systems for the Atlas platform.
Participate in a 24/7 on-call rotation for customer support.
Collaborate with teams to solve technical challenges.

Skills

Linux Administration

Cloud Infrastructure (AWS, Azure, GCP)

Automation

Programming (Go, Ruby, Python)

Web and Network Protocols (HTTP, TLS, DNS)

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere—on premises, or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it’s no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.

The Team

We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the Atlas platform. As a senior SRE, you will be expected to be able to design & build complex systems, operate with autonomy and act as owner for everything you do.

The SRE Atlas team works alongside the various Atlas software engineering teams to provide expertise about running systems at scale, build new tooling and automation and perform essential maintenance of the Atlas fleet.

This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large scale solutions that have the ability to impact our customer’s most crucial workloads.

Role Overview

We are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background. This role requires engineers to have a customer-first mindset to ensure that everything we do results in a stronger product and a better experience for all Atlas customers.

The ideal candidate should

Have 5+ years of experience running critical systems at scale
Value efficiency in processes and operations, and display a preference for automation over manual processes (“allergic to ops work”)
Be familiar with a major cloud provider (AWS, Azure, or GCP) and possess the ability to build and operate systems in a multi-cloud environment
A strong understanding of how to run a large scale Linux environment, including low level fundamentals
Firm grasp of at least one modern programming language, beyond basic scripting (Go, Ruby, Python)
Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc)
Participate in the development of a reliable and resilient multi-cloud platform that hosts business critical applications for a wide & varied range of customer applications
Collaborate with service-owning teams to provide internal support, solve technical challenges and adapt or build tooling to solve novel use cases in a generic fashion
Participate in a 24/7 on-call rotation to swiftly resolve issues related to any disruption of our customer facing Atlas fleet, ensuring minimal disruption and high availability

To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB , and help us make an impact on the world!

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type and makes all hiring decisions without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

REQ ID: 1263097752

MongoDB’s base salary range for this role in the U.S. is:

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.