Enable job alerts via email!

Site Reliability Engineer, Fleet Automation

Dropbox

Canada

Remote

CAD 134,000 - 182,000

Full time

Yesterday

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Site Reliability Engineer to join their Fleet Automation team. This role is pivotal in managing large-scale infrastructure and enhancing reliability through automation. You'll work on critical systems that support millions of users, ensuring seamless deployment and maintenance of services. The ideal candidate will have a strong background in software design, Linux systems, and distributed systems, with a passion for improving processes. If you're ready to make a significant impact in a dynamic environment, this opportunity is perfect for you.

Benefits

Medical, dental, and vision coverage

Retirement savings plans

Flexible PTO

Life and disability insurance

Travel medical and accident insurance

Perks allowance for wellness

Parental leave and support

Mental health benefits

Qualifications

2+ years of industry experience in software design and systems.
Proficiency in Linux administration and distributed systems.

Responsibilities

Build scalable infrastructure for massive data and connections.
Automate server provisioning to improve efficiency.
Monitor system health and implement remediation.

Skills

Software Design

Systems Troubleshooting

Linux Administration

Distributed Systems Development

Monitoring Tools

Debugging

Automation

Education

Bachelor's Degree in Computer Science

Tools

Ansible

Chef

Puppet

AWS

Google Cloud

Azure

Site Reliability Engineer, Fleet Automation

Dropbox is a Virtual First company. For this role, we are currently only authorized to hire candidates from the following provinces: Alberta, British Columbia, Ontario, and Saskatchewan.

Company Description

Dropbox isn’t just a workplace—it’s a living lab for more enlightened ways of working. We're a global community of bold visionaries and resourceful doers who are shaping the future of Dropbox—and with it the future of work. Our Virtual First model combines the autonomy of a distributed workplace with the power of human connection, making space for both meaningful work and meaningful relationships. With our start-up mindset and enterprise-level opportunities, you can be who you are and grow into who you’re meant to be. Here, you can own your impact to make work more intuitive, joyful, and human—for you as a Dropboxer and for hundreds of millions of people worldwide. If you're ready to push boundaries—and yourself— Dropbox is ready for you.

Team Description

Role Description

Site Reliability Engineers on the Fleet Automation team are mission-critical for Dropbox success. The SRE team has major impact inside of Dropbox engineering—from testing our disaster readiness and building our in-house multi-exabyte storage system, Magic Pocket. Check out the Dropbox Tech Blog to learn more!

The Site Reliability Team consists of hybrid systems and software engineers who are responsible for managing large-scale infrastructure while improving reliability and automation. SREs are integrated within the Platform team, and we're looking for engineers interested in developing infrastructure software, maintaining it, and scaling it. You will join a small, impactful team within Dropbox that significantly influences the world.

Build scalable infrastructure to manage metadata for hundreds of billions of files, hundreds of petabytes of user data, and millions of concurrent connections
Design systems and processes for Dropbox engineers to manage and deploy their software into production
Automate server provisioning to reduce manual labor for networking and datacenter teams, enabling servers to self-provision and join the fleet automatically
Own foundational services such as DHCP, DNS, NTP, PXE
Build, test, and keep the fleet updated with the latest OS and Kernel
Monitor fleet health and implement host remediation services

Many Dropbox teams operate with on-call rotations, requiring engineers to be available during core and non-core hours. Applicants are encouraged to inquire about specific rotation schedules.

Requirements

BS degree in Computer Science or a related technical field involving coding, or equivalent experience
2+ years of industry experience
Proficiency in software design and systems, including OS, networks, or hardware troubleshooting
Experience with monitoring tools to ensure reliability of production services
Experience working with Linux in a production environment
Ability to diagnose technical issues, debug code, and automate routine tasks
Experience developing distributed systems
Familiarity with fundamental services like DHCP, DNS, NTP, PXE
Familiarity with configuration management tools such as Ansible, Chef, or Puppet

Preferred Qualifications

Experience with cloud services like AWS, Google Cloud, or Azure
Enthusiasm for new initiatives, contributing ideas, experimenting, and sharing outcomes

Compensation

Canada Pay Range: $134,300—$181,700 CAD

The listed range is the expected annual salary/OTE, subject to change. Salary/OTE is part of Dropbox’s total rewards, including potential bonuses, sales incentives, and RSUs.

Benefits

Dropbox offers comprehensive benefits, including:

Medical, dental, and vision coverage*
Retirement savings plans**
Flexible PTO and statutory holidays
Life and disability insurance*
Travel medical and accident insurance*
Perks allowance for wellness, learning, food, and more
Parental leave, fertility, adoption, surrogacy, and lactation support
Mental health benefits

Additional benefits details are available upon request. *Allowances may be provided where group plans are unavailable. **Dependent on location and law.

Dropbox is a Virtual First company and currently only hiring from Alberta, British Columbia, Ontario, and Saskatchewan.

Company Description

Dropbox is a global community shaping the future of work through innovative technology and a start-up mindset, serving hundreds of millions of users worldwide.

Team Description

Our engineering teams develop platforms like Dropbox Dash and Dropbox Sign, handling over a billion files daily. We utilize technologies like Python, React, Node.js, MongoDB, PostgreSQL, and Android development. Join us to turn complex challenges into scalable, intuitive solutions.

Role Description

As a Site Reliability Engineer on the Fleet Automation team, you will manage large-scale infrastructure, improve reliability, and automate processes. Our team plays a crucial role in Dropbox’s infrastructure and scalability.

Responsibilities

Build scalable infrastructure for massive data and connections
Design deployment and management systems for Dropbox engineers
Automate server provisioning for efficiency
Manage core services like DHCP, DNS, NTP, PXE
Maintain and update fleet OS and Kernel
Monitor system health and implement remediation

Participation in on-call rotations is expected. Please inquire about specifics during application.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead, Site Reliability Engineer, Fabric

MongoDB

Old Toronto

Remote

CAD 159,000 - 221,000

Yesterday

Be an early applicant

Senior Site Reliability Engineer

MongoDB

Remote

CAD 144,000 - 200,000

8 days ago

Intermediate Site Reliability Engineer, Foundations

GitLab

Remote

USD 103,000 - 222,000

9 days ago

Staff Infrastructure Site Reliability Engineer

Netlify Inc.

Remote

CAD 163,000 - 221,000

13 days ago

Staff Infrastructure Site Reliability Engineer

Remoteworldwide

Remote

CAD 90,000 - 150,000

3 days ago

Be an early applicant

Software Engineer, Site Reliability (Senior or Staff)

BioRender

Remote

CAD 80,000 - 150,000

7 days ago

Be an early applicant

Software Platform Engineering Manager - Ubuntu for Next-Gen Silicon

Canonical

Toronto

Remote

USD 90,000 - 150,000

9 days ago

Observability Engineer - Platform Reliability (Junior to Mid-Level)

Releady

Calgary

Remote

CAD 125,000 - 150,000

6 days ago

Be an early applicant

Service Reliability Engineer

Scotiabank

Toronto

Hybrid

CAD 120,000 - 165,000

4 days ago

Be an early applicant

Site Reliability Engineer, Fleet Automation

Dropbox

Canada

Remote

CAD 134,000 - 182,000

Full time

Job summary

Benefits

Qualifications

Responsibilities

Skills

Education

Tools

Job description

Similar jobs

Lead, Site Reliability Engineer, Fabric

Old Toronto

Remote

CAD 159,000 - 221,000

Senior Site Reliability Engineer

Remote

CAD 144,000 - 200,000

Intermediate Site Reliability Engineer, Foundations

Remote

USD 103,000 - 222,000

Staff Infrastructure Site Reliability Engineer

Remote

CAD 163,000 - 221,000

Staff Infrastructure Site Reliability Engineer

Remote

CAD 90,000 - 150,000

Software Engineer, Site Reliability (Senior or Staff)

Remote

CAD 80,000 - 150,000

Software Platform Engineering Manager - Ubuntu for Next-Gen Silicon

Toronto

Remote

USD 90,000 - 150,000

Observability Engineer - Platform Reliability (Junior to Mid-Level)

Calgary

Remote

CAD 125,000 - 150,000

Service Reliability Engineer

Toronto

Hybrid

CAD 120,000 - 165,000