Enable job alerts via email!

Site Reliability Engineer

Air-tek

Toronto

Hybrid

CAD 80,000 - 110,000

Full time

Today
Be an early applicant

Job summary

A dynamic software company in Toronto is seeking an experienced Site Reliability Engineer. You will ensure system reliability, collaborate across teams, and work with modern technologies like AWS and Docker. Candidates should have a bachelor's degree in computer science and 5+ years in relevant roles. Join us to make a tangible impact in a supportive and innovative environment.

Benefits

Collaborative team environment
Hands-on experience with modern technologies
Opportunities for professional growth

Qualifications

  • 5+ years of relevant experience.
  • Strong analytical and problem-solving skills.
  • Good written and oral communication skills.

Responsibilities

  • Ensure the uptime and reliability of the platform.
  • Deploy new code and services into the hosted platform.
  • Analyze and tune systems for maximum efficiency.
  • Automate manual work and create new tooling.
  • Collaborate with engineering teams on reliability.
  • Participate in on-call rotation to resolve critical issues.

Skills

Production monitoring and logging tools
System administration
Cloud technologies
Databases
CI/CD tools
Data streaming platforms
Programming languages
Analytical skills
Communication skills

Education

Bachelor’s degree in computer science, software engineering, or equivalent

Tools

AWS CloudWatch
DataDog
Docker
Linux
Amazon Web Services
Mongo Atlas
PostgreSQL
AWS Aurora
GitHub Actions
ArgoCD
Pulumi
Terraform
Kubernetes
Kafka
RabbitMQ
C# .NET
Node.js
PowerShell
Bash
Job description

About us

Air-tek is a Canadian-based software company with a powerful suite of unique products that have already achieved a significant share of a huge global market. The product market fit is excellent, and customers are lining up to buy. Although our global customers know us, we intentionally operate in stealth mode during this growth phase.

Our diverse team shares a collective passion for solving complex problems with a drive to innovate and a desire to create the passenger-centric travel industry

Based in Toronto, our inclusive culture is built on trust, collaboration, delivering a great product, and continuous personal development. We love what we do, and we support the team around us.

About the team

The SRE Team is dedicated to ensuring the reliability, scalability, and performance of Air-tek’s critical systems and services. We bridge the gap between development and operations by applying software engineering principles to operational challenges, fostering a culture of reliability, automation, and continuous improvement.

As a member of the team, you will work with a multitude of modern technologies, contribute to the vision of the SRE team, partner with other engineering teams to tackle new ideas and challenges, and have a direct impact on Air-Tek’s ability to sustain our rapid growth.

In this role you will

  • Ensure the uptime and reliability of Air-Tek’s platform in accordance with company SLOs.
  • Ensure the successful deployment of new code and services into our hosted platform.
  • Analyze and tune systems to operate at maximum efficiency.
  • Reduce the toil of manual work through automation and creating new tooling.
  • Collaborate with other engineering teams to integrate reliability into the software development lifecycle.
  • Be a member of the team’s on-call rotation – responding to and resolving critical issues.

Skills and Experience

  • Bachelor’s degree in computer science, software engineering, or equivalent.
  • [5+] years of relevant experience
  • Experience with production monitoring and logging tools such as AWS CloudWatch and DataDog.
  • Experience with some or all of the following tools we leverage:
  • System administration: Docker, Linux
  • Cloud: Amazon Web Services
  • Databases: Mongo Atlas, PostgreSQL, AWS Aurora
  • CI/CD: GitHub Actions, ArgoCD
  • Environment management: Pulumi, Terraform, Kubernetes
  • Data streaming platforms: Kafka, RabbitMQ
  • Experience with programming and scripting languages such as C# .NET, Node.js, PowerShell, and Bash.
  • Possess strong analytical and problem-solving skills and have the confidence to tackle difficult problems.
  • Good written and oral communication skills with the ability to explain technical concepts and designs clearly and succinctly.

Why join us?

  • Be part of a collaborative, inclusive team that values innovation and creativity.
  • Work with exciting, modern technologies and gain hands-on experience across a diverse range of projects.
  • Contribute to solutions that make a tangible impact.
  • Enjoy opportunities for professional growth and development in a supportive environment

Hybrid Policy

  • Please note that this role requires three days a week in our Toronto office which is located near Union Station
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.