Job Search and Career Advice Platform

Enable job alerts via email!

Staff Site Reliability Engineer

Achievers

Remote

CAD 124,000 - 170,000

Full time

12 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading tech firm in Toronto seeks a Staff Site Reliability Engineer to enhance global infrastructure using Google Cloud Platform and Kubernetes. In this role, you will focus on AI-driven automation, high-availability architecture, and mentoring team members. The ideal candidate has substantial experience in systems engineering and cloud architecture, enjoys tackling complex challenges, and thrives in a collaborative, high-growth environment. Join us to transform infrastructure reliability and empower millions of users worldwide.

Benefits

Health Benefits
Life Insurance
Flexible Vacation
Parental Leave Top-up
Employer matched RRSP contributions
Employee Assistance Program
Professional Development
Diversity Celebrations
Hybrid Work Flexibility

Qualifications

  • 15 years of experience in systems engineering.
  • Extensive knowledge of cloud-native architecture.
  • Experience in architecting production workloads on GCP.

Responsibilities

  • Lead design and evolution of global, high-availability infrastructure.
  • Implement AI-integrated workflows for operational tasks.
  • Collaborate with cross-functional teams to define reliability roadmap.
  • Guide design reviews and promote best practices.

Skills

Linux kernels
network protocols (TCP/IP, BGP, DNS)
GCP expertise
AI integration
advanced Python or Go
observability frameworks
database management
communication skills

Tools

GCP
GKE
Terraform
CI/CD pipelines
Service Mesh (Istio)
New Relic
Prometheus
Grafana
Job description

Our Site Reliability Engineering team sits at the intersection of software engineering and operations, building reliable, scalable cloud systems that our teams and customers can trust.

As Staff Site Reliability Engineer , you'll play a critical role in the management and advancement of our global infrastructure. You'll leverage approximately 15 years of technical expertise - specifically focusing on the evolution of high-concurrency, distributed systems , and the orchestration of hyper-scale cloud environments . In this position, you will leverage your expertise to architect our GCP/GKE environment and lead the integration of AI-driven workflows . This includes utilizing bots, automated PR remediation, and intelligent alerting to ensure our platform can scale efficiently and reliably.


Why you'll love this role:
  • Lead high-impact initiatives that shape how millions of people experience work around the world.
  • Bring your unique perspective to complex and challenging projects - apply your expertisein architecture, influence technical direction, and mentor fellow team members.
  • Join a close-knit, no-ego, high-performing teamthat solves meaningful problems and celebrates successes together.
  • Work alongside an experienced leadership teamwho is genuinely invested in your career growth.
  • Thrive in afast-paced, high-growth environmentwhereinnovationis encouraged andyour voice truly matters.
How you’ll shape our cloud infrastructure:
  • Architectural Leadership: Lead the design and ongoing evolution of our global, high-availability infrastructure, focusing on Google Cloud Platform (GCP) and Kubernetes (GKE).
  • AI & Automation Strategy: Identify repetitive operational tasks and implement AI-integrated workflows, such as Slack or Teams bots for incident triage, AI-augmented alerting, and automated PR generation to address infrastructure drift.
  • Cross-Functional Influence: Collaborate with Product, Engineering, and Leadership teams to identify systemic risks, manage complex changes, and define the long-term reliability roadmap.
  • Infrastructure-as-Code (IaC): Establish and exemplify best practices for Terraform and CI/CD pipelines, empowering development teams to deploy code rapidly and securely.
  • System Resiliency: Lead high-level initiatives in disaster recovery, multi-region networking, and the design of zero-trust security architectures.
  • Technical Mentorship: Guide design reviews and promote best practices, enhancing the technical skills and capabilities of the entire SRE organization.
Experience we feel will set you up for success:
  • The 15-Year Lens: Possess extensive systems engineering experience, with in-depth knowledge of Linux kernels, network protocols (TCP/IP, BGP, DNS), and cloud-native architecture.
  • GCP Expertise: Demonstrated, hands-on experience in architecting and managing production workloads on Google Cloud Platform and GKE.
  • AI/Workflow Automation: Practical experience or a strong vision for integrating AI tools and LLMs to automate SRE tasks, documentation, or incident response.
  • Code Proficiency: Advanced skills in Python or Go, with the ability to develop sophisticated internal tools and automation frameworks.
  • Observability Mastery: Expert understanding of observability frameworks (such as New Relic, Prometheus, Grafana) to enable data-driven decision-making.
  • Database Foundations: Deep knowledge of managing relational databases (MySQL, MongoDB) at scale.
  • Communication: Exceptional ability to clearly convey complex technical infrastructure challenges as actionable business insights to non-technical stakeholders.
The Achievers Mindset
  • Disruptive Innovator: Set industry trends by applying emerging technologies like AI to address longstanding infrastructure challenges.
  • Self-Starter: Maintain a mindset of continuous improvement, always seeking opportunities to automate processes.
  • Culture of Success: Believe that platform reliability is fundamental to both employee success and customer trust.
Bonus Points
  • Hands-on experience with Service Mesh (Istio) and advanced GCP Networking features, such as Interconnect and Shared VPC.
  • A proven history of migrating legacy automation systems to modern, AI-augmented CI/CD workflows.

Why Achieversis a Great Place to Work

At Achievers, we believe recognition is a powerful driver of connection. With more than 4.3 million users across 190 countries, our employee recognition and rewards platform empowers organizations to build cultures where people feel seen and valued, everyday. We’re a team of passionate, thoughtful builders who care deeply about our product, our customers, and each other. Visit achievers.com to see how we’re inspiring recognition everywhere.

Our Approach to Total Rewards

$124,000 - $170,000 reflects the salary range for this role, depending on experience, skills, and market data. We’re committed to providing a fair and competitive offer based on what you bring to the team. Each A-Players' compensation is reviewed at least annually against performance and impact in role. We want you to see your path to growth, understand your impact, and feel valued every step of the way.

Benefits and Perk s for permanent full-time employees:

✨Rewardsfor your impact through our Recognition and Rewards program

HealthBenefits and Life Insurance Coverage beginning on your first day

ParentalLeave Top-up

Employer matched RRSPcontributions

️ FlexibleVacation to recharge, so you can bring your best

Employeeand Family Assistance Programoffering mental health, legal, and financial counselling

Supportedprofessional development and career growth(Linkedin Learning, mentorship)

Employee-Led Employee Resource Groupsthat celebrate our diversity

♀️ Regular events designed to build connection, belonging, and well-being

Hybridflexibility, with time in our beautiful Liberty Village, Toronto office

This posting is for a currently vacancy on our team.

Achievers is proud to be an equal opportunity employer committed to building a diverse, inclusive workplace where everyone can do their best work. We encourage qualified candidates from all backgrounds and experiences to apply.

Achievers is committed to ensuring an inclusive and accessible recruitment process for all candidates. If you require any accommodations for your interview, such as assistive technology, wheelchair accessibility, or alternative formats of materials, please let us know. We are happy to make necessary arrangements to support your needs.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.