Enable job alerts via email!

Site Reliability Engineer

Equifax

Toronto

On-site

CAD 80,000 - 120,000

Full time

22 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Site Reliability Engineer to enhance system reliability and performance. This role involves managing uptime across cloud-native and hybrid architectures, developing infrastructure as code, and creating CI/CD pipelines. The ideal candidate will have a strong background in software engineering and cloud environments, along with proficiency in programming languages such as Python and Java. Join a dynamic team that values diversity, problem-solving, and collaboration in a supportive environment, where your contributions will significantly impact operational excellence and system resilience.

Qualifications

  • 5-7 Jahre Erfahrung in Software Engineering oder Systemadministration.
  • Kenntnisse in Cloud-Umgebungen und CI/CD-Praktiken.

Responsibilities

  • Verwalten der Systemverfügbarkeit in Cloud- und Hybridarchitekturen.
  • Entwickeln von CI/CD-Pipelines für Build, Test und Deployment.

Skills

Python
Bash
Java
Go
JavaScript
Node.js
Terraform
Docker
Kubernetes
CI/CD practices

Education

BSc in Computer Science

Tools

Terraform
Jenkins
Chef
Ansible

Job description

Join to apply for the Site Reliability Engineer role at Equifax.

Synopsis of the role

Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

SRE is also an engineering approach to building and running production systems – we engineer solutions to operational problems. Our SREs are responsible for overall system operation and we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages are integral to our culture.

Our SRE culture emphasizes diversity, intellectual curiosity, problem solving, and openness. Equifax brings together people with varied backgrounds, experiences, and perspectives. We encourage collaboration, big thinking, and risk-taking in a blame-free environment. We promote self-direction on meaningful projects and provide support and mentorship for growth and pride in work.

What You’ll Do

  • Manage system uptime across cloud-native (AWS, GCP) and hybrid architectures.
  • Build infrastructure as code (IaC) patterns that meet security and engineering standards using technologies like Terraform, scripting, and cloud SDKs.
  • Develop CI/CD pipelines for build, test, and deployment using platform tools like Jenkins and cloud-native toolchains.
  • Create automated tools for deploying service requests and detailed runbooks for managing, detecting, remediating, and restoring services.
  • Troubleshoot complex distributed architecture service maps and participate in on-call for high-severity incidents, improving runbooks to reduce MTTR.
  • Lead blameless postmortems and own actions to prevent recurrences.

What Experience You Need

  • BSc in Computer Science or related field involving coding, or equivalent experience.
  • 5-7 years in software engineering, systems administration, database administration, or networking.
  • 2+ years developing and/or managing software in public cloud environments.
  • Experience monitoring infrastructure and application uptime and performance.
  • Proficiency in languages such as Python, Bash, Java, Go, JavaScript, or Node.js.
  • Knowledge of systems, storage, networking, security, and databases.
  • Skills in system administration, automation, and orchestration using Terraform, Chef, Ansible, Docker, Kubernetes, etc.
  • Experience with CI/CD practices and tools.
  • Cloud Certification is strongly preferred.

What Could Set You Apart

Demonstrate skills in:

  • DevSecOps practices and engineering resilience.
  • Operational excellence and system monitoring.
  • Systems thinking and technology trend awareness.
  • Technical communication and presentation skills.
  • Troubleshooting and problem resolution.
Additional Details
  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology

Referrals can increase your chances of interviewing at Equifax. Get notified about new SRE jobs in Toronto, Ontario, Canada.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer - Remote

Kablamo Pty Ltd

Toronto

Remote

CAD 100,000 - 130,000

2 days ago
Be an early applicant

Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - Canada)

Hopper

Toronto

Remote

CAD 100,000 - 130,000

3 days ago
Be an early applicant

Site Reliability Engineer

Wave Mobile Money

Ontario

Remote

USD 100,000 - 153,000

5 days ago
Be an early applicant

Senior Site Reliability Engineer II

Tbwa Chiat / Day Inc

Ontario

Remote

CAD 100,000 - 130,000

5 days ago
Be an early applicant

Senior Site Reliability Engineer

GoDaddy

British Columbia

Remote

CAD 90,000 - 130,000

Yesterday
Be an early applicant

Sr Lead Site Reliability Engineer

Lumen Argentina

Sault Ste. Marie

Remote

CAD 90,000 - 120,000

Today
Be an early applicant

Site Reliability Engineer | North America | Canada | Europe | Fully Remote

Escape Velocity Entertainment

Remote

CAD 100,000 - 130,000

4 days ago
Be an early applicant

Site Reliability Engineer, Customer Security

Coalition, Inc.

Remote

CAD 90,000 - 120,000

4 days ago
Be an early applicant

Site Reliability Engineer 3 New

Behavox Limited.

Remote

CAD 90,000 - 120,000

5 days ago
Be an early applicant