Enable job alerts via email!

SRE & Automation Engineer

Bank of Montreal

Toronto

On-site

CAD 60,000 - 112,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a skilled SRE & Automation Engineer to join their dynamic team. This role involves building and managing platform infrastructure while ensuring high availability and stellar performance. You'll be at the forefront of delivering insights from massive-scale data, working with cutting-edge technologies in both On-Prem and Cloud environments. If you are passionate about problem-solving and eager to contribute to innovative solutions, this is the perfect opportunity to grow your career in a collaborative and inclusive workplace.

Qualifications

  • 7-10 years of experience in site reliability engineering.
  • Proficiency in multiple programming languages and automation tools.

Responsibilities

  • Build and manage platform infrastructure and applications.
  • Monitor system performance and enhance reliability through automation.

Skills

Python
Java
GO
C/C++
Ruby
JavaScript
UNIX
Bash scripting
Proactive problem-solving
Strong communication skills

Education

Masters degree in Computer Science

Tools

Ansible
JIRA
GitHub
Kubernetes
AWS
Azure

Job description

SRE & Automation Engineer page is loaded

SRE & Automation Engineer

Apply locations: Toronto, ON, CAN | Time type: Full time | Posted on: Posted 2 Days Ago | Job requisition id: R250011735

Application Deadline: 05/22/2025

Address: 4100 Gordon Baker Road

Job Family Group: Technology

At BMO, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availability, and stellar performance to pursue their missions. As we expand customer deployments, we’re seeking an experienced SRE to deliver insights from massive-scale data in real time. We are looking for a candidate who is eager to learn, contribute, and bring fresh ideas, with a focus on positive user experiences, growth within our Premium IT Teams, and leading SRE initiatives in large-scale On-Prem & Cloud environments.

Responsibilities include:

  1. Build software and systems to manage platform infrastructure and applications.
  2. Improve reliability, quality, and time-to-market of software solutions.
  3. Measure and optimize system performance, innovate to meet customer needs.
  4. Provide operational support for large-scale distributed applications.
  5. Monitor availability and system health in production.
  6. Develop operational support for full-stack applications.
  7. Analyze metrics for performance tuning and fault finding.
  8. Participate in system design, platform management, and capacity planning.
  9. Create sustainable systems through automation.
  10. Balance feature development speed with reliability and service-level objectives.
  11. Collaborate with operations teams for system troubleshooting and monitoring.
  12. Enhance system resilience and scalability with coding and change management skills.
  13. Increase automation and self-healing capabilities.
  14. Report performance metrics to stakeholders.
  15. Act as subject matter expert for internal and external stakeholders.
  16. Analyze data to provide insights and strategic recommendations.
  17. Implement changes based on industry and internal trends.
  18. Engage with various areas across the bank and provide strategic input.
  19. Stay updated on industry trends through professional development.
  20. Operate at a group/enterprise level as a resource to senior leaders.

Required skills and qualifications:

  • Masters degree (or equivalent) in computer science or related field with 7-10 years of experience.
  • Proficiency in programming languages such as Python, Java, GO, C/C++, Ruby, JavaScript.
  • Experience with UNIX, Bash scripting.
  • Knowledge of CI/CD and automation tools like Ansible, JIRA.
  • Experience with source code management (GitHub).
  • Knowledge of distributed storage technologies and resource management frameworks (e.g., Kubernetes).
  • Experience with Cloud platforms like AWS & Azure.
  • Understanding of REST APIs.
  • Proactive problem-solving approach.
  • Understanding of IT operating processes, monitoring, logging, and alerting.
  • Strong communication, analytical, and collaboration skills.
  • Ability to manage ambiguity and make data-driven decisions.

Preferred skills and qualifications:

  • Prior success in site reliability engineering.
  • Advanced coding experience beyond scripting.

Salary: $60,000.00 - $111,700.00

Pay Type: Salaried

Salaries vary based on location, skills, experience, education, and qualifications, and may include bonuses and other perks. For more details on our benefits, visit: Total Rewards

About Us

At BMO, our purpose is to create lasting, positive change. We foster an inclusive, respectful workplace that values diversity and provides accommodations upon request. Learn more at our careers page.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

SRE & Automation Engineer

BMO Financial Group

Toronto

On-site

CAD 60,000 - 112,000

Yesterday
Be an early applicant

SENIOR SDET (Senior Test Automation Engineer)

COMPASS GROUP CANADA

Mississauga

On-site

CAD 80,000 - 120,000

30+ days ago

Platform Engineer (Kubernetes)

TMX Group

Toronto

On-site

CAD 80,000 - 120,000

22 days ago

Platform Engineer II, Cloud Operations

WeAreTechWomen

Toronto

On-site

CAD 70,000 - 110,000

30+ days ago