We are seeking a talented Machine Learning (ML) Scientist to join a groundbreaking initiative to develop Digital Twins for rare diseases. You will work within a multidisciplinary project team, across the Open Targets, Molecular Systems and Petsalaki research groups at the EMBL European Bioinformatics Institute (EMBL-EBI). This project is funded through the Chan Zuckerberg Initiative with a strong emphasis on making datasets and models open source where possible.
Rare diseases collectively impact approximately 300 million individuals worldwide, but their study is hindered by limited patient-level data. To address this challenge, this project aims to develop ‘Digital Twins’ of rare disease patients by combining mechanistic, GenAI and other machine learning framework models to integrate patient-level multi-omics and clinical data to provide insights into rare diseases. The models will utilize extensive public datasets of single-cell multiomics including transcriptomics from diverse disease conditions, and simulations from mechanistic models. This will be applied to the challenge of limited multi-omics data for rare disease, with the aim of developing rare disease Digital Twins to provide new insights into disease mechanisms and potential treatments.
The role involves designing and implementing ML models that integrate multi-omics data and clinically relevant endpoints, contributing to the creation of virtual patient models to simulate disease trajectories and therapeutic responses.
This is a unique opportunity to develop and apply advanced ML methodologies and significantly contribute to understanding rare disease biology, enabling applications such as diagnosis, drug repurposing, and new treatment development.
The ML Modeller’s primary tasks include developing and applying advanced ML and GenAI frameworks to integrate and analyze multi-omics datasets. The role involves working collaboratively with biocurators, bioinformaticians, and mechanistic modellers to ensure seamless integration of data into Digital Twin models. Responsibilities include:
• PhD (or equivalent experience) in Computer Science, Computational Biology, Bioinformatics, or a related field.
• Proven track record in developing and deploying ML models for large datasets.
• Experience with advanced ML, VAE, GenAI frameworks and large-scale data modelling.
• Proficiency in Python, R, or similar programming languages.
• Experience with ML frameworks such as TensorFlow, PyTorch, or Scikit-learn.
• Strong knowledge of advanced statistical techniques and modern deep learning methods.
• Expertise in pipeline workflow management tools like Nextflow or Snakemake.
• Excellent communication skills, both written and verbal, for collaborative teamwork and reporting.
• Self-motivated and capable of working independently and within multidisciplinary teams.
• Enthusiasm to advance research in disease modelling and patient care.
• Demonstrated capacity to prioritize and manage multiple independent projects in a dynamic environment.
• Experience in developing ML models for biological or clinical datasets.
• Hands-on experience with multi-omics data integration and analysis.
• Experience publishing in high-impact journals and presenting at international conferences.
• Familiarity with single-cell transcriptomics, bulk omics data, and genomics.
• Strong knowledge of FAIR principles and open data standards.
• Experience with cloud computing platforms and high-performance computing environments.
• Strong ability to convey complex ML concepts to non-technical stakeholders.
Contract length: 2 years fixed-term grant-limited, to work on the CZI Digital Twin grant.
Salary: Grade 5 or 6 depending on qualifications and experience, monthly salary at £3,229 or £3,612 after tax but excluding pension and insurance contributions. Plus generous benefits.
Do something meaningful at EMBL-EBI where you can apply your talent and passion to accelerate science and tackle some of humankind’s greatest challenges. EMBL-EBI, part of the European Molecular Biology Laboratory, is a worldwide leader in the storage, analysis and dissemination of large biological datasets. We provide the global research community with access to publicly available databases and tools which are crucial for the advancement of healthcare, food security, and biodiversity.
Join a culture of innovation in a highly collaborative and inclusive community where our employees enjoy a relaxed atmosphere. We are committed to ensuring our employees feel valued, supported and empowered to reach their professional potential.
For detailed information please visit our employee benefits page.
Closing Date: 16/03/2025
European Molecular Biology Laboratory (EMBL)
Senior (5+ years of experience)
Tagged as: Academia, Machine Learning, NLP, United Kingdom