Data Engineer - AI/ML

Roche

Mississauga

On-site

CAD 70,000 - 85,000

Full time

Yesterday

Job summary

Roche is seeking a Data Engineer to join their AI solutions development squad in Mississauga. The role involves designing and maintaining robust data infrastructure to support innovative AI applications, collaborating with cross-functional teams, and optimizing data processing strategies. Candidates should have extensive experience in data engineering, particularly with cloud platforms and AI technologies.

Qualifications

  • 5-7+ years in data engineering, preferably supporting AI/ML applications.
  • Proficiency in Python and SQL required.
  • Experience with AWS or Azure cloud platforms.

Responsibilities

  • Design and maintain data infrastructure for AI applications.
  • Develop ETL/ELT pipelines for real-time and batch data processing.
  • Collaborate with AI engineers and data scientists.

Skills

Python
SQL
Data Security & Governance
ETL/ELT Pipelines
APIs & Microservices

Education

B.Sc. in Computer Science
B.Eng. in Data Engineering

Tools

Snowflake
AWS
Azure
Docker
Kubernetes

Job description

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

A healthier future. That’s what drives us.

Galileo is a strategic Roche Informatics program aiming to enable high-value AI (initial focus: Generative AI - GenAI) use cases at Roche through fit-for-purpose platforms and services, establishing a foundation for a Center of Excellence in AI. The recently formed Use Case Delivery (UCD) Team, consisting of a number of delivery squads, is tasked with building innovative GenAI applications.

We are looking for a highly skilled and dedicated Data Engineer to join a new AI solutions development squad that will be building cutting-edge applications leveraging Large Language Models (LLMs). We will be building AI solutions end-to-end: from concept, through prototyping, productization, to operations. The Data Engineer will be responsible for designing, building, and maintaining robust data infrastructure to support AI applications. The ideal candidate will have expertise in handling structured and unstructured data, vector databases, real-time data processing, and cloud-based AI solutions (AWS or Azure).

The Opportunity:

  • Generative AI Application Co-creation: Collaborate with AI engineers, data scientists, product owners, and other developers in Agile teams to integrate LLMs into scalable, robust, fair and ethical end-user applications, focusing on user experience, relevance, and real-time performance
  • Data Infrastructure Development and Data Integration: Design and implement scalable, high-performance data pipelines for AI/GenAI applications, ensuring efficient data ingestion, transformation, storage and retrieval; integrate different databases, which requires an understanding of data architectures and the domain data ecosystem
  • Vector Database Management: Work with vector databases (e.g., AWS OpenSearch or Azure AI Search) to store and retrieve high-dimensional data for Generative AI workloads
  • Cloud-Based Data Engineering: Build and maintain cloud-based data solutions using AWS (OpenSearch, S3) or Azure (Azure AI Search, Azure Blob Storage)
  • Snowflake Implementation: Design and optimize data storage and processing using Snowflake for scalable, cloud-native analytics solutions
  • Data Processing & Transformation: Develop ETL/ELT pipelines to enable real-time and batch data processing
  • Support AI Model Workflows: Collaborate with AI/ML Engineers and Data Scientists to ensure seamless integration of data pipelines with AI fine-tuning, inference and training workflows
  • Performance Optimization: Optimize data storage, retrieval, and processing strategies for efficiency, scalability, and cost-effectiveness

Who you are:

  • Experience: 5-7+ years in data engineering, preferably supporting AI/ML applications, and a B.Sc., B.Eng., or higher degree (or equivalent) in Computer Science, Data Engineering or a related field
  • Programming: Proficiency in Python, SQL and vector database native languages
  • Databases: Experience with relational, NoSQL, vector databases, and Snowflake in particular
  • Cloud Platforms: Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (Azure AI Search, Azure Blob Storage, Azure Automation)
  • ETL/ELT Pipelines: Experience building scalable ETL/ELT workflows using dbt, Apache Airflow, or similar
  • APIs & Microservices: Ability to design and integrate RESTful APIs for data exchange
  • Data Security & Governance: Understanding of encryption and role-based access controls
  • Version Control & DevOps: Familiarity with Git, CI/CD, containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, CloudFormation)
  • Generative AI Support: Experience working with AI-specific data needs, such as embeddings, RAG (Retrieval Augmented Generation), and LLM fine-tuning data preparation

Relocation benefits are not available for this job posting.

Who we are

A healthier future drives us to innovate. Together, more than 100,000 employees across the globe are dedicated to advancing science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.

Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.

Seniority level
  • Associate
Employment type
  • Full-time
Job function
  • Information Technology
Industries
  • Pharmaceutical Manufacturing, Biotechnology Research, and Medical Equipment Manufacturing
