Senior Site Reliability Engineer

ZayZoon

Edmonton

Remote

CAD 80,000 - 120,000

Full time

30+ days ago

Job summary

An innovative financial technology company is seeking a Senior Site Reliability Engineer to elevate their cloud infrastructure. This role involves working with cutting-edge AWS technologies and collaborating with a diverse team of engineers to ensure the reliability and scalability of their products. As part of a rapidly growing organization recognized for its achievements, you will play a crucial role in enhancing employee financial wellness through advanced technical solutions. If you thrive in a remote environment and are passionate about infrastructure and reliability, this opportunity is perfect for you.

Qualifications

  • 5+ years of infrastructure experience required.
  • 2+ years of AWS experience, including certification, required.

Responsibilities

  • Develop and maintain infrastructure-as-code CloudFormation templates.
  • Manage deployment pipelines and monitor performance metrics.

Skills

AWS
Infrastructure-as-Code (IaC)
CloudFormation
Containerization (Docker, ECS, ECR)
Observability Platforms (DataDog, NewRelic, OTel)
SQL
Data Analysis

Job description

Who We Are

Our goal is to save ten million hard-working employees ten billion dollars. We are a values-driven, well-funded, and fast-growing Financial Technology and HR company. We want to empower small and midsize businesses with financial tools that make them the place where people want to work.

We’ve created a financial empowerment platform that helps small but mighty HR teams make a big impact on employee financial wellness. ZayZoon is quickly becoming the employee financial wellness super-app that employees can’t live without, and employers are clamoring to offer to help attract and retain talent.

We are growing fast and have been recognized for rapid growth in the 2023 Deloitte Technology Fast 500 and Canadian Technology Fast 50 programs!

About the Role

We are looking for a Senior Site Reliability Engineer to take ZayZoon’s cloud infrastructure to the next level with complex AWS builds, infrastructure-as-code, and observability/logging/APM solutions. You'll work in an embedded reliability team, alongside app and data engineers, to monitor, benchmark, and scale ZayZoon’s products. You will work with first-class technologies and staff to leverage all the goodies AWS has to offer, and to create a bridge between our bare metal infrastructure and our Ruby on Rails production app. Predictability, reliability, and scalability are your three favourite words.

Responsibilities:

  • Develop and maintain infrastructure-as-code CloudFormation templates, emphasizing serverless resources (ECS, Fargate, Lambda).
  • Instrument and analyze daily metrics for both infrastructure performance and our Ruby on Rails applications, using AWS tooling (Athena, CloudTrail, etc.) and third-party observability platforms (DataDog, OTel).
  • Manage deployment pipelines, including blue/green and intelligent auto-scaling.
  • Maintain and stay ahead of resource dependencies, particularly databases (RDS, ElastiCache/Redis), including updates, playbooks, and downtime planning.
  • Project costs and implement AWS cost savings programs and reserved instances.
  • Work alongside our risk and security teams to ensure ongoing SOC-2 and cybersecurity compliance.
  • Collaborate extensively with app developers on shared metrics, database performance, and load testing.
  • Collaborate extensively with data engineers to facilitate data warehouse development, ELT, and ETL.
  • Participate in our agile development process: sprint planning, story grooming, and stand-ups.
  • Adhere to our SDLC and secure coding practices and environment.

Minimum Requirements:

  • 5+ years infrastructure experience.
  • 2+ years AWS experience including certification and deployment of production applications.
  • Proficiency with IaC, specifically CloudFormation.
  • Experience with containerization (Docker, ECS, ECR).
  • Experience analyzing and acting on performance issues using observability platforms (DataDog, NewRelic, OTel).
  • Ability to build quickly when we need to experiment and build cleanly when an MVP becomes core functionality.
  • Strong SQL and data analysis skills and an eagerness to dig into data as part of problem-solving.

Location Requirement: Candidates must be located in Canada to be considered.

We are organized as a remote team, so we are looking for candidates who can work effectively remotely. You must have access to a secure high-speed internet connection and a secure workspace to ensure the security of private information. This role is available on a permanently remote basis.

Additional Information:

Please be aware that as part of our final hiring process, we will conduct reference calls with previous managers and possibly other individuals. Additionally, due to the nature of our business, a criminal record check and a basic security clearance will also be required.

We wish to thank all qualified applicants for their interest in joining our team!
