Enable job alerts via email!

Senior Site Reliability Engineer (SRE)

CGI

Montreal

On-site

CAD 80,000 - 110,000

Full time

11 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Senior Site Reliability Engineer to lead the design and maintenance of robust, scalable infrastructure. In this pivotal role, you will collaborate with development teams to enhance system reliability and performance while driving automation and operational excellence. Your expertise in cloud platforms and container orchestration will be key as you tackle complex technical challenges. Join a dynamic team that values ownership, teamwork, and innovation, and make a significant impact in a company committed to diversity and inclusion.

Qualifications

  • 5+ years in site reliability engineering or systems engineering.
  • Proficient in cloud platforms and container orchestration.
  • Strong problem-solving skills and excellent communication.

Responsibilities

  • Lead design and operation of fault-tolerant systems.
  • Develop automation for deployment and incident response.
  • Mentor junior SREs and ensure 24/7 system reliability.

Skills

Kubernetes
Linux
Python
English
French

Education

Bachelor’s degree in Computer Science

Tools

AWS
Azure
Google Cloud
Terraform
Ansible
CloudFormation
Prometheus
Grafana
ELK stack

Job description

Position Description:

As a Senior Site Reliability Engineer, you will lead the design, implementation, and maintenance of highly reliable, scalable, and efficient infrastructure and services. You will collaborate closely with development teams to ensure system reliability, performance, and availability while driving automation and operational excellence across the platform.

Primary Responsibilities
-Lead the design, deployment, and operation of large-scale, fault-tolerant systems to ensure high availability and performance.
-Develop and implement automation to streamline deployment, monitoring, and incident response processes.
-Monitor system health, analyze metrics, and proactively identify and resolve reliability, scalability, and performance issues.
-Collaborate with software engineering teams to improve system design, deployment pipelines, and operational practices.
-Manage incident response, conduct root cause analysis, and implement corrective actions to prevent recurrence.
-Drive continuous improvement in infrastructure efficiency, reliability, and scalability through innovative solutions.
-Document system architecture, operational procedures, and best practices to support knowledge sharing and operational consistency.
-Mentor and provide technical leadership to junior SREs and cross-functional teams.
-Participate in on-call rotations to ensure 24/7 system reliability and rapid incident resolution.
-Engage with stakeholders to align SRE practices with business goals and technical strategies.

Key Skills and Qualifications
-Extensive experience in site reliability engineering, systems engineering, or related roles, typically 5+ years.
-Strong proficiency with cloud platforms (AWS, Azure, Google Cloud) and container orchestration tools (Kubernetes, Docker).
-Expertise in Linux system administration, networking, and security best practices.
-Proficient in programming and scripting languages such as Python, Go, Bash, or similar for automation.
-Experience with infrastructure as code (Terraform, Ansible, CloudFormation) and CI/CD pipelines.
-Deep understanding of monitoring, logging, and alerting tools (Prometheus, Grafana, ELK stack).
-Proven ability to design and maintain scalable, distributed systems and fault-tolerant architectures.
-Strong problem-solving skills and ability to handle complex technical challenges independently.
-Excellent communication skills to collaborate effectively across teams and with external vendors.
-Familiarity with incident management frameworks and service-level objectives (SLOs), service-level agreements (SLAs).

Preferred Qualifications
-Bachelor’s degree in Computer Science, Engineering, or a related technical field.
-Certifications in cloud technologies (AWS Certified Solutions Architect, Google Professional Cloud Architect, etc.).
-Experience with financial services, large-scale SaaS platforms, or enterprise IT environments.
-Knowledge of security compliance and regulatory requirements relevant to infrastructure.

Challenges and Impact
-Balancing rapid feature delivery with system reliability and operational stability.
-Managing complex, multi-platform, geographically distributed environments.
-Driving automation and efficiency in a constantly evolving technical landscape.
-Building strong relationships with stakeholders to ensure alignment and seamless service delivery.

LANGUAGE: French, English
Ability to communicate in English, both orally and in writing, is a requirement as the person in this position will need to collaborate regularly with colleagues and partners in the United States.

Use of the term ‘engineering’ in this job posting refers to the technical sense related to Information Technology (IT) and does not imply that the individual practices engineering or possesses the requisite license as prescribed by the applicable provincial or territorial engineering regulator. We are seeking individuals with expertise in IT engineering-related functions, but licensure from an engineering regulator is not a prerequisite for this position. Engineering is a regulated profession in Canada which is restricted in terms of use of titles and designation.

Skills:
  • English
  • French
  • Kubernetes
  • Linux
  • Python
What you can expect from us:

Together, as owners, let’s turn meaningful insights into action.

Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…

You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.

Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.

You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.

At CGI, we recognize the richness that diversity brings. We strive to create a work culture where all belong and collaborate with clients in building more inclusive communities. As an equal-opportunity employer, we want to empower all our members to succeed and grow. If you require an accommodation at any point during the recruitment process, please let us know. We will be happy to assist.

Come join our team—one of the largest IT and business consulting services firms in the world.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

ITjobs.ca

Montreal

On-site

CAD 80,000 - 85,000

6 days ago
Be an early applicant

Senior Site Reliability Engineer

Coalition Inc

Remote

CAD 90,000 - 130,000

Today
Be an early applicant

Senior Site Reliability Engineer

VTR Global Com

British Columbia

Remote

CAD 80,000 - 120,000

Yesterday
Be an early applicant

Site Reliability Engineer

Dayforce

Remote

CAD 70,000 - 110,000

4 days ago
Be an early applicant

Staff Infrastructure Site Reliability Engineer

Remoteworldwide

Remote

CAD 90,000 - 150,000

4 days ago
Be an early applicant

Senior Site Reliability Engineer - (Remote - Canada)

Jobgether

Remote

CAD 80,000 - 120,000

21 days ago

Software Engineer, Site Reliability (Senior or Staff)

BioRender

Remote

CAD 80,000 - 150,000

8 days ago

Sr. Site Reliability Engineer

Diversis Capital LLC

Moose Jaw

Remote

CAD 70,000 - 110,000

15 days ago

Senior Site Reliability Engineer

Infotek Consulting Inc.

Montreal

On-site

CAD 80,000 - 100,000

30 days ago