Enable job alerts via email!

Site Reliability Engineer, Fleet Automation

Dropbox

Canada

Remote

CAD 134,000 - 182,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Site Reliability Engineer to join their Fleet Automation team. This role is pivotal in managing large-scale infrastructure and enhancing reliability through automation. You'll work on critical systems that support millions of users, ensuring seamless deployment and maintenance of services. The ideal candidate will have a strong background in software design, Linux systems, and distributed systems, with a passion for improving processes. If you're ready to make a significant impact in a dynamic environment, this opportunity is perfect for you.

Benefits

Medical, dental, and vision coverage
Retirement savings plans
Flexible PTO
Life and disability insurance
Travel medical and accident insurance
Perks allowance for wellness
Parental leave and support
Mental health benefits

Qualifications

  • 2+ years of industry experience in software design and systems.
  • Proficiency in Linux administration and distributed systems.

Responsibilities

  • Build scalable infrastructure for massive data and connections.
  • Automate server provisioning to improve efficiency.
  • Monitor system health and implement remediation.

Skills

Software Design
Systems Troubleshooting
Linux Administration
Distributed Systems Development
Monitoring Tools
Debugging
Automation

Education

Bachelor's Degree in Computer Science

Tools

Ansible
Chef
Puppet
AWS
Google Cloud
Azure

Job description

Site Reliability Engineer, Fleet Automation

Dropbox is a Virtual First company. For this role, we are currently only authorized to hire candidates from the following provinces: Alberta, British Columbia, Ontario, and Saskatchewan.

Company Description

Dropbox isn’t just a workplace—it’s a living lab for more enlightened ways of working. We're a global community of bold visionaries and resourceful doers who are shaping the future of Dropbox—and with it the future of work. Our Virtual First model combines the autonomy of a distributed workplace with the power of human connection, making space for both meaningful work and meaningful relationships. With our start-up mindset and enterprise-level opportunities, you can be who you are and grow into who you’re meant to be. Here, you can own your impact to make work more intuitive, joyful, and human—for you as a Dropboxer and for hundreds of millions of people worldwide. If you're ready to push boundaries—and yourself— Dropbox is ready for you.

Team Description
Role Description

Site Reliability Engineers on the Fleet Automation team are mission-critical for Dropbox success. The SRE team has major impact inside of Dropbox engineering—from testing our disaster readiness and building our in-house multi-exabyte storage system, Magic Pocket. Check out the Dropbox Tech Blog to learn more!

The Site Reliability Team consists of hybrid systems and software engineers who are responsible for managing large-scale infrastructure while improving reliability and automation. SREs are integrated within the Platform team, and we're looking for engineers interested in developing infrastructure software, maintaining it, and scaling it. You will join a small, impactful team within Dropbox that significantly influences the world.

  • Build scalable infrastructure to manage metadata for hundreds of billions of files, hundreds of petabytes of user data, and millions of concurrent connections
  • Design systems and processes for Dropbox engineers to manage and deploy their software into production
  • Automate server provisioning to reduce manual labor for networking and datacenter teams, enabling servers to self-provision and join the fleet automatically
  • Own foundational services such as DHCP, DNS, NTP, PXE
  • Build, test, and keep the fleet updated with the latest OS and Kernel
  • Monitor fleet health and implement host remediation services

Many Dropbox teams operate with on-call rotations, requiring engineers to be available during core and non-core hours. Applicants are encouraged to inquire about specific rotation schedules.

Requirements
  • BS degree in Computer Science or a related technical field involving coding, or equivalent experience
  • 2+ years of industry experience
  • Proficiency in software design and systems, including OS, networks, or hardware troubleshooting
  • Experience with monitoring tools to ensure reliability of production services
  • Experience working with Linux in a production environment
  • Ability to diagnose technical issues, debug code, and automate routine tasks
  • Experience developing distributed systems
  • Familiarity with fundamental services like DHCP, DNS, NTP, PXE
  • Familiarity with configuration management tools such as Ansible, Chef, or Puppet
Preferred Qualifications
  • Experience with cloud services like AWS, Google Cloud, or Azure
  • Enthusiasm for new initiatives, contributing ideas, experimenting, and sharing outcomes
Compensation

Canada Pay Range: $134,300—$181,700 CAD

The listed range is the expected annual salary/OTE, subject to change. Salary/OTE is part of Dropbox’s total rewards, including potential bonuses, sales incentives, and RSUs.

Benefits

Dropbox offers comprehensive benefits, including:

  • Medical, dental, and vision coverage*
  • Retirement savings plans**
  • Flexible PTO and statutory holidays
  • Life and disability insurance*
  • Travel medical and accident insurance*
  • Perks allowance for wellness, learning, food, and more
  • Parental leave, fertility, adoption, surrogacy, and lactation support
  • Mental health benefits

Additional benefits details are available upon request. *Allowances may be provided where group plans are unavailable. **Dependent on location and law.

Dropbox is a Virtual First company and currently only hiring from Alberta, British Columbia, Ontario, and Saskatchewan.

Company Description

Dropbox is a global community shaping the future of work through innovative technology and a start-up mindset, serving hundreds of millions of users worldwide.

Team Description

Our engineering teams develop platforms like Dropbox Dash and Dropbox Sign, handling over a billion files daily. We utilize technologies like Python, React, Node.js, MongoDB, PostgreSQL, and Android development. Join us to turn complex challenges into scalable, intuitive solutions.

Role Description

As a Site Reliability Engineer on the Fleet Automation team, you will manage large-scale infrastructure, improve reliability, and automate processes. Our team plays a crucial role in Dropbox’s infrastructure and scalability.

Responsibilities
  • Build scalable infrastructure for massive data and connections
  • Design deployment and management systems for Dropbox engineers
  • Automate server provisioning for efficiency
  • Manage core services like DHCP, DNS, NTP, PXE
  • Maintain and update fleet OS and Kernel
  • Monitor system health and implement remediation

Participation in on-call rotations is expected. Please inquire about specifics during application.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead, Site Reliability Engineer, Fabric

MongoDB

Old Toronto

Remote

CAD 159,000 - 221,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

MongoDB

Remote

CAD 144,000 - 200,000

8 days ago

Intermediate Site Reliability Engineer, Foundations

GitLab

Remote

USD 103,000 - 222,000

9 days ago

Staff Infrastructure Site Reliability Engineer

Netlify Inc.

Remote

CAD 163,000 - 221,000

13 days ago

Staff Infrastructure Site Reliability Engineer

Remoteworldwide

Remote

CAD 90,000 - 150,000

3 days ago
Be an early applicant

Software Engineer, Site Reliability (Senior or Staff)

BioRender

Remote

CAD 80,000 - 150,000

7 days ago
Be an early applicant

Software Platform Engineering Manager - Ubuntu for Next-Gen Silicon

Canonical

Toronto

Remote

USD 90,000 - 150,000

9 days ago

Observability Engineer - Platform Reliability (Junior to Mid-Level)

Releady

Calgary

Remote

CAD 125,000 - 150,000

6 days ago
Be an early applicant

Service Reliability Engineer

Scotiabank

Toronto

Hybrid

CAD 120,000 - 165,000

4 days ago
Be an early applicant