
Activez les alertes d’offres d’emploi par e-mail !
Générez un CV personnalisé en quelques minutes
Décrochez un entretien et gagnez plus. En savoir plus
Un laboratoire de recherche en bioinformatique recherche un ingénieur pour participer au développement de la suite logicielle PPanGGOLiN pour l'analyse des pangenomes au niveau du genre. Le candidat idéal a un Master en bioinformatique ou informatique, avec une expertise en bioinformatique et développement logiciel. Ce poste est basé à Évry, France, et propose un contrat de 2 ans débutant en avril 2026.
Prokaryotes—bacteria and archaea—are diverse, ubiquitous organisms with vast impacts on health, soil, and ocean ecosystems. Large-scale genome sequencing and pangenomics have revealed their molecular diversity, especially the role of Mobile Genetic Elements (MGEs). Pangenomics analyzes genetic variability across all genomes of a group, distinguishing between core genes (shared by all) and accessory genes (variable, linked to phenotypic traits). These methods address the challenge of big data in biology [1], advancing our understanding of microbial evolution in epidemiological and environmental contexts.
For years, the LABGeM team has developed a pangenome graph model at the gene family level, compressing data from thousands of genomes while preserving gene order. The PPanGGOLiN software suite [2] reconstructs and analyzes pangenome graphs at the species level. It encompasses methods for the identification of regions of genomic plasticity, including MGEs and Genomic islands, (panRGP method) [3] and their fine description in conserved modules (panModule method) [4]. LABGeM is also developing PanGBank , a database of pangenomes reconstructed from public genomes from Genbank and RefSeq databases using the GTDB classification. It currently gathers pangenomes for over 4300 prokaryotic species.
However, a significant proportion of bacterial species remain underrepresented in genomic databases (e.g. 64% of the ~110,000 species in GTDB [5] are represented by only a single genome). This sparse coverage limits our ability to conduct evolutionary analyses at the species level. To bridge this gap, genus-level comparative analyses offer valuable insights into how closely related species adapt to a wide range of environmental conditions.
In the framework of ANR PanGAIMiX project (2025-2029), which aims to leverage large language models to enhance large scale computational microbiology using pangenome graphs, we are recruiting an engineer to join our team and contribute to the development of PPanGGOLiN to allow analyses at the genus level.
Your mission:
You will help enhance the PPanGGOLiN model and methods to enable genus-level pangenome analyses, while preserving species-level detail.
The main objectives of this work will be to:
Candidate profile
Duration: 2 years contract starting in April 2026.
Remuneration: According to qualifications and experience (CEA salary scale)
Location: Genoscope, Evry, in the Bioinformatics Analysis Laboratory for Genomics and Metabolism (LABGeM).
References
[1] Computational Pan-Genomics Consortium. Computational pan-genomics: status, promises and challenges. Brief Bioinform. 2016. doi:10.1093/bib/bbw089
[2] Gautreau G, et al. PPanGGOLiN: Depicting microbial diversity via a partitioned pangenome graph. PLoS Comput Biol. 2020;16: e1007732. doi:10.1371/journal.pcbi.1007732
[3] Bazin A, et al. panRGP MESSAGE: a pangenome-based method to predict genomic islands and explore their diversity. Bioinformatics. 2020;36: i651– actuación… . doi:10.1093/bioinformatics/btaa792
Candidature
Procédure : To apply, please send your resume and cover letter to the following addresses: acalteau-ayur...
Contacts
David Vallenet
vallenet@genoscope.cns.fr