Enable job alerts via email!

Senior Data Scientist, Generative AI

Roche

Mississauga

On-site

CAD 100,000 - 120,000

Full time

Today
Be an early applicant

Job summary

A global healthcare leader is seeking a Senior Data Scientist in Mississauga. The ideal candidate will lead innovative efforts in generative AI, requiring expertise in machine learning and Python. This full-time role involves optimizing AI models and collaborating with diverse teams. Strong qualifications in data science and a Master's degree are essential. Relocation benefits are not available. Join us to help shape the future of healthcare.

Qualifications

  • 5+ years of experience in Computer Science, ML, or related field.
  • Strong knowledge of generative modeling techniques.
  • Experience with large-scale datasets, including text or scientific data.

Responsibilities

  • Develop and optimize generative AI models for data generation.
  • Architect scalable pipelines for data preprocessing and model training.
  • Collaborate with cross-functional teams to implement solutions.

Skills

Generative AI expertise
Machine Learning
Natural Language Processing
Python proficiency
Deep learning frameworks (PyTorch, TensorFlow)

Education

Master's degree in Computer Science or related field

Tools

PyTorch
TensorFlow
SQL/NoSQL
Job description

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche

We are seeking a highly skilled and motivated Senior Data Scientist with deep expertise in Generative AI to join our team. The successful candidate will lead efforts to design, build, and deploy generative models across diverse domains, driving innovation in how we understand and synthesize complex data. While the primary focus of this role is on advancing generative modeling and large-scale AI systems, prior experience working with DNA, protein, or other biological datasets will be considered a strong advantage.

This position requires a strong foundation in machine learning, natural language processing, or multimodal AI, coupled with proven proficiency in Python and modern deep learning frameworks.

The opportunity
  • Develop, fine-tune, and optimize generative AI models (e.g., LLMs, diffusion models, VAEs, transformers) for text, multimodal, and structured data generation.

  • Architect scalable pipelines for data preprocessing, model training, and evaluation across large and complex datasets.

  • Apply state-of-the-art research in generative modeling to real-world problems and drive novel applications across multiple domains.

  • Collaborate cross-functionally with engineers, scientists, and product teams to translate generative AI capabilities into impactful solutions.

  • Design experiments to evaluate generative models for quality, robustness, and interpretability.

  • Communicate technical findings clearly to both technical and non-technical stakeholders.

  • Stay up to date with the latest advancements in generative AI, foundation models, and multimodal learning, and identify opportunities for adoption.

Who you are
  • Master degree with 5+ years of experience in Computer Science, Machine Learning, Electrical Engineering, Applied Mathematics, or a closely related field.

  • Demonstrated proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.

  • Strong knowledge of generative modeling techniques (e.g., transformers, diffusion, GANs, VAEs).

  • Proven ability to solve complex modeling challenges and innovate beyond standard approaches.

  • Experience working with large-scale datasets, including text, multimodal, or scientific data.

  • Experience with efficient training techniques, such as LoRA, quantization, distillation, and model pruning.

  • Background or hands-on experience with DNA, protein, or other biological data.

  • Familiarity with scalable computing environments (cloud platforms, GPU/TPU acceleration, distributed training).

  • Excellent communication and interpersonal skills for effective collaboration in a multidisciplinary team.

Preferred
  • Experience with evaluation frameworks for generative AI, including human-in-the-loop assessment.

  • Hands-on experience with Retrieval-Augmented Generation (RAG) systems, including vector databases (FAISS, Milvus, Pinecone, Weaviate), indexing strategies, and hybrid retrieval pipelines.

  • Experience with databases and querying (SQL/NoSQL), including data modeling, optimization, and efficient retrieval for large-scale AI workflows.

  • Experience with Protein/DNA language models (e.g., DNABert, Evo, ProtBERT, ESM, AlphaFold-like approaches).

Relocation benefits are not available for this posting.

About Roche

A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.

Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.