Enable job alerts via email!

Lead Site Reliability Engineer (SRE)

RBC

Toronto

Hybrid

CAD 100,000 - 140,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

RBC is seeking a highly skilled Site Reliability Engineer to enhance the reliability and scalability of QEX tools and cloud platforms. This role requires expertise in cloud technologies like Azure and AWS, strong automation skills, and a solid foundation in Site Reliability Engineering practices. Join a dynamic team focused on improving developer productivity and ensuring seamless operations.

Benefits

A comprehensive Total Rewards Program including bonuses and flexible benefits
Work in a dynamic, collaborative, progressive, and high-performing team
Flexible work/life balance options
World-class training program in financial services

Qualifications

  • 5+ years’ experience with full stack development including JavaScript, React, Node.js, Python.
  • Thorough understanding of Site Reliability Engineering and best practices for critical systems.
  • Strong analytical skills and excellent communication skills.

Responsibilities

  • Manage and enhance the reliability, performance, and scalability of QEX tools.
  • Implement cloud-native solutions and participate in cloud migration to Azure and AWS.
  • Develop automation scripts to improve operational efficiency.

Skills

Agile Methodology
Automation
DevOps
Cloud Technology
Site Reliability Engineering

Education

Bachelor’s degree in Computer Science, Engineering, or relevant field

Tools

Atlassian JIRA
PagerDuty
Dynatrace
AWS
Azure

Job description

Job Summary

Job Description

WHAT IS THE OPPORTUNITY?

The DevOps/QEx Operations team is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to manage and enhance the reliability, performance, and scalability of QEX tools, developer tools (Jira, Confluence, qTest), and cloud platforms. The ideal candidate will have a strong background in Site Reliability Engineering practices, cloud technologies (Azure and AWS), automation, and monitoring, with a focus on ensuring seamless operations for Quality Engineering (QE) tools, developer productivity platforms, and cloud migration initiatives.

WHAT WILL YOU DO?

  • Work closely with Quality Engineering, DevOps, Development, IT, and Cloud teams to align SRE practices with organizational goals.

  • QEX Tools Management:

    • Ensure the availability, reliability, and performance of tools like, Jira, Confluence, and qTest and other QEx tools.

    • Manage integrations between Jira and qTest, ensuring seamless synchronization of data and workflows.

  • Cloud Platform Management:

    • Implement cloud-native solutions and best practices to enhance infrastructure and application reliability.

    • Participate in the migration of on-premises applications to Azure and AWS, ensuring minimal disruption to operations.

  • Automation and Efficiency:

    • Develop and implement automation scripts to streamline repetitive tasks and improve operational efficiency.

    • Set up and maintain robust monitoring systems to proactively detect and resolve issues.

  • Monitoring and Incident Management:

    • Utilize tools like Dynatrace, Catchpoint, and PagerDuty for real-time monitoring, incident management, and escalation.

    • Act as the primary point of contact for incidents, ensuring swift resolution within defined SLAs.

  • Root Cause Analysis and Prevention:

    • Conduct root cause analysis (RCA) for incidents and implement preventive measures to avoid recurrence.

  • Collaboration and Stakeholder Engagement:

    • Partner with Quality Engineering, Development, IT, and Cloud teams to align SRE practices with organizational goals.

    • Provide technical support and guidance to teams using QEX tools, developer tools, and cloud platforms.

  • Onboarding and Support:

    • Support onboarding of new projects and users to qTest, including creating integration filters and configuring service IDs for Jira projects.

  • Documentation and Knowledge Sharing:

    • Maintain comprehensive documentation for tool configurations, integrations, cloud migration processes, and SRE practices.

    • Share knowledge and best practices with team members to foster a culture of continuous improvement.

  • Proactive Issue Resolution:

    • Monitor and address issues proactively using real-time insights and alerts from tools like Dynatrace and Catchpoint.

WHAT DO YOU NEED TO SUCCEED?

Must have:

  • 5+ years’ experience with full stack development, including experience with frameworks and languages such as JavaScript, React, Node.js, Python, or similar.

  • Thorough understanding of Site Reliability Engineering and best practices for running and maintaining critical systems.

  • Proficient with cloud-based services (e.g., AWS, Azure, Google Cloud Platform) and a strong grasp of developing cloud-native applications.

  • Knowledge of Continuous Integration/Continuous Delivery (CI/CD) methodologies and associated tools.

  • Experience with architecting, implementing, and deploying systems into integrated environments.

  • Strong analytical skills, problem-solving abilities, and excellent communication skills.

Nice-to-have:

  • Salesforce DevOps: Familiarity with Salesforce & Flosum for managing source-driven development and CI/CD workflows.

  • Bachelor’s degree in Computer Science, Engineering, or in a field relevant to the role.

  • Strategic thinker with excellent interpersonal skills to work across functions and businesses

What’s in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.

  • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable

  • Leaders who support your development through coaching and managing opportunities

  • Ability to make a difference and lasting impact

  • Work in a dynamic, collaborative, progressive, and high-performing team

  • A world-class training program in financial services

  • Flexible work/life balance options

  • Opportunities to do challenging work

#LI-Hybrid

#LI-POST

#TECHPJ

Job Skills

Agile Methodology, Application Infrastructure, Atlassian JIRA, Automation, Cloud Platform, Cloud Technology, DevOps, Group Problem Solving, IT Automation, IT Monitoring, Operations Support, PagerDuty, Production Support, Site Reliability Engineering, Software Development Life Cycle (SDLC), Software Engineering, Software Product Technical Knowledge, System Applications, Systems Software, Teamwork

Additional Job Details

Address:

RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO

City:

TORONTO

Country:

Canada

Work hours/week:

37.5

Employment Type:

Full time

Platform:

TECHNOLOGY AND OPERATIONS

Job Type:

Regular

Pay Type:

Salaried

Posted Date:

2025-06-06

Application Deadline:

2025-06-30

Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above

Inclusion and Equal Opportunity Employment

At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.

Join our Talent Community

Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.

Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Site Reliability Engineer

Apex Systems

Toronto

On-site

CAD 120,000 - 160,000

Today
Be an early applicant

Principal Platform Engineer, Materia AI

TRSS

Toronto

Hybrid

CAD 120,000 - 160,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer

Apex Systems

Old Toronto

Hybrid

CAD 90,000 - 130,000

30+ days ago