Enable job alerts via email!

Site Reliability Engineer

Equifax

Toronto

On-site

CAD 80,000 - 120,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Site Reliability Engineer to enhance system reliability and performance. This role involves managing uptime across cloud-native and hybrid architectures, developing infrastructure as code, and creating CI/CD pipelines. The ideal candidate will have a strong background in software engineering and cloud environments, along with proficiency in programming languages such as Python and Java. Join a dynamic team that values diversity, problem-solving, and collaboration in a supportive environment, where your contributions will significantly impact operational excellence and system resilience.

Qualifications

5-7 Jahre Erfahrung in Software Engineering oder Systemadministration.
Kenntnisse in Cloud-Umgebungen und CI/CD-Praktiken.

Responsibilities

Verwalten der Systemverfügbarkeit in Cloud- und Hybridarchitekturen.
Entwickeln von CI/CD-Pipelines für Build, Test und Deployment.

Skills

Python

Bash

Java

JavaScript

Node.js

Terraform

Docker

Kubernetes

CI/CD practices

Education

BSc in Computer Science

Tools

Terraform

Jenkins

Chef

Ansible

Join to apply for the Site Reliability Engineer role at Equifax.

Synopsis of the role

Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

SRE is also an engineering approach to building and running production systems – we engineer solutions to operational problems. Our SREs are responsible for overall system operation and we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages are integral to our culture.

Our SRE culture emphasizes diversity, intellectual curiosity, problem solving, and openness. Equifax brings together people with varied backgrounds, experiences, and perspectives. We encourage collaboration, big thinking, and risk-taking in a blame-free environment. We promote self-direction on meaningful projects and provide support and mentorship for growth and pride in work.

What You’ll Do

Manage system uptime across cloud-native (AWS, GCP) and hybrid architectures.
Build infrastructure as code (IaC) patterns that meet security and engineering standards using technologies like Terraform, scripting, and cloud SDKs.
Develop CI/CD pipelines for build, test, and deployment using platform tools like Jenkins and cloud-native toolchains.
Create automated tools for deploying service requests and detailed runbooks for managing, detecting, remediating, and restoring services.
Troubleshoot complex distributed architecture service maps and participate in on-call for high-severity incidents, improving runbooks to reduce MTTR.
Lead blameless postmortems and own actions to prevent recurrences.

What Experience You Need

BSc in Computer Science or related field involving coding, or equivalent experience.
5-7 years in software engineering, systems administration, database administration, or networking.
2+ years developing and/or managing software in public cloud environments.
Experience monitoring infrastructure and application uptime and performance.
Proficiency in languages such as Python, Bash, Java, Go, JavaScript, or Node.js.
Knowledge of systems, storage, networking, security, and databases.
Skills in system administration, automation, and orchestration using Terraform, Chef, Ansible, Docker, Kubernetes, etc.
Experience with CI/CD practices and tools.
Cloud Certification is strongly preferred.

What Could Set You Apart

Demonstrate skills in:

DevSecOps practices and engineering resilience.
Operational excellence and system monitoring.
Systems thinking and technology trend awareness.
Technical communication and presentation skills.
Troubleshooting and problem resolution.

Additional Details

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology

Referrals can increase your chances of interviewing at Equifax. Get notified about new SRE jobs in Toronto, Ontario, Canada.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Toronto

Remote

CAD 80,000 - 110,000

30+ days ago

Site Reliability Engineer

Equifax

Toronto

On-site

CAD 80,000 - 120,000

Full time

Job summary

Qualifications

Responsibilities

Skills

Education

Tools

Job description

Similar jobs

Senior Site Reliability Engineer

Toronto

Remote

CAD 100,000 - 150,000