Job Search and Career Advice Platform

Enable job alerts via email!

Machine Learning Platform Engineer (DevOps)

TD

Canada

On-site

CAD 120,000 - 154,000

Full time

30 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading financial institution in Canada seeks a Machine Learning Infrastructure Engineer to deploy and modernize ML systems. You will work primarily with tools like Kubernetes, Docker, and Terraform on Azure cloud. Ideal candidates possess experience in building automated infrastructures, strong scripting skills, and a background in software engineering. This role offers competitive compensation and growth opportunities within a collaborative environment.

Benefits

Comprehensive health benefits
Career development opportunities
Employee discounts

Qualifications

  • 2+ years of experience building sophisticated and automated production infrastructure.
  • Experience with Kubernetes, Docker and container orchestration.
  • Experience with Terraform.

Responsibilities

  • Deploy and modernise Machine Learning infrastructure.
  • Ensure deployments comply with enterprise security standards.
  • Design, deliver and upgrade scalable systems for data processing and model serving.

Skills

Automated production infrastructure
Kubernetes
Docker
Terraform
Scripting skills (Bash, Python, Groovy)
CI/CD tools and pipelines
Linux systems administration
Git
Jenkins
Verbal and written communication

Education

BA/BS degree in Computer Science or equivalent experience

Tools

Azure cloud
Job description
Work Location

Toronto, Ontario, Canada

Hours

37.5

Line of Business

Analytics, Insights, & Artificial Intelligence

Pay Details

$120,000 - $153,500 CAD

The pay details posted reflect a temporary market premium specific to this role that is reassessed annually.

TD is committed to providing fair and equitable compensation opportunities to all colleagues. Growth opportunities and skill development are defining features of the colleague experience at TD. Our compensation policies and practices have been designed to allow colleagues to progress through the salary range over time as they progress in their role. The base pay actually offered may vary based upon the candidate's skills and experience, job-related knowledge, geographic location, and other specific business and organizational needs.

As a candidate, you are encouraged to ask compensation related questions and have an open dialogue with your recruiter who can provide you more specific details for this role.

Job Description

In this role you will deploy and modernise Machine Learning infrastructure. Ensuring deployments comply with enterprise security standards. The day-to-day tasks involve design, deployment, delivery and upgrading of scalable systems designed for data ingestion, processing, validation, model training, large‑scale computation, monitoring and model serving. Our stack includes Kubernetes, Docker, Databricks, Blobfuse, Terraform, Helm, Github Actions, Saltstack and AzureML, with a majority of our infrastructure running on Azure cloud.

Key Responsibilities
Customer
  • Provide expertise on fundamental engineering practices for the broader AI/ML engineering team and inspire the adoption of ML engineering practice across the organisation.
  • Interpret the meaning of new strategic directions and set objectives and measurements.
Shareholder
  • Adhere to enterprise frameworks or methodologies that relate to activities for our business area.
  • Ensure respective programmes/policies/practices are well managed, meet business needs, comply with internal and external requirements and align with business priorities.
  • Consistently exercise discretion in managing correspondence, information and all matters of confidentiality; escalated issues where appropriate.
  • Ensure business operations are in compliance with applicable internal and external requirements (e.g. financial controls, segregation of duties, transaction approvals and physical control of assets).
  • Participate in cross‑functional / enterprise initiatives as a subject‑matter expert helping to identify risk and provide guidance for complex situations.
Employee / Team
  • Participate fully as a member of the team, support a positive work environment that promotes service to the business, quality, innovation and teamwork and ensure timely communication of issues / points of interest.
  • Provide thought leadership and / or industry knowledge for own area of expertise in own area and participate in knowledge transfer within the team and business unit.
  • Keep current on emerging trends / developments and grow knowledge of the business, related tools and techniques.
  • Participate in personal performance management and development activities, including cross‑training within own team.
Experience and / or Education
  • 2+ years of experience building sophisticated and automated production infrastructure.
  • Experience with Kubernetes, Docker and container orchestration.
  • Experience with Terraform.
  • A background in software engineering, working within a software development team.
  • Solid cloud experience (preferably Azure or AWS).
  • Strong scripting skills, i.e. Bash, Python, Groovy, etc.
  • Experience with managing CI/CD tools and pipelines.
  • Experience with Linux systems administration skills in a Cloud environment, Redhat and Ubuntu.
  • Experience with Git, and Jenkins.
  • Strong verbal and written communication skills, with the ability to work effectively across teams and produce engineering documentation.
  • BA/BS degree or equivalent experience; Computer Science background preferred.
Nice‑to‑Have
  • Knowledge of IP networking, VPNs, DNS, load balancing and firewalls.
  • Familiarity with cloud monitoring tools.
  • Experience with automated testing tools.
  • Experience troubleshooting and tuning systems performance.
  • Experience with Saltstack or other configuration management.
  • Experience resolving and triaging docker image problems.
  • Experience optimizing system‑level design and architecture.
  • Experience deploying and maintaining ML systems.
  • Experience with vulnerability management and hardening of systems, platforms and applications.
Who We Are

TD is one of the world's leading global financial institutions and is the fifth largest bank in North America by branches/stores. Every day, we deliver legendary customer experiences to over 27 million households and businesses in Canada, the United States and around the world. More than 95,000 TD colleagues bring their skills, talent and creativity to the Bank, those we serve and the economies we support. We are guided by our vision to Be the Better Bank and our purpose to enrich the lives of our customers, communities and colleagues.

Our Total Rewards Package

Our Total Rewards package reflects the investments we make in our colleagues to help them and their families achieve their financial, physical and mental well‑being goals. Total Rewards at TD includes a base salary, variable compensation and several other key plans such as health and well‑being benefits, savings and retirement programmes, paid time off, banking benefits and discounts, career development and reward and recognition programmes. Learn more.

Additional Information

We're delighted that you are considering building a career with TD. Through regular development conversations, training programmes, and a competitive benefits plan, we are committed to providing the support our colleagues need to thrive both at work and at home.

Please be advised that this job opportunity is subject to provincial regulation for employment purposes. It is imperative to acknowledge that each province or territory within the jurisdiction of Canada may have its own set of regulations, requirements.

Colleague Development

If you're interested in a specific career path or are looking to build certain skills, we want to help you succeed. You'll have regular career, development, and performance conversations with your manager, as well as an online learning platform and a variety of mentoring programmes to help you unlock future opportunities. Whether you have a passion for helping customers and want to expand your experience, or you want to coach and inspire your colleagues, there are many different career paths within our organisation at TD – and we are committed to helping you identify opportunities that support your goals.

Training & Onboarding

We will provide training and onboarding sessions to ensure that you have everything you need to succeed in your new role.

Interview Process

We’ll reach out to candidates of interest to schedule an interview. We do our best to communicate outcomes to all applicants by email or phone call.

Accommodation

Your accessibility is important to us. Please let us know if you’d like accommodations (including accessible meeting rooms, captioning for virtual interviews, etc.) to help us remove barriers so that you can participate throughout the interview process.

Language Requirement (Quebec only)

Sans Objet

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.