Job Search and Career Advice Platform

Activez les alertes d’offres d’emploi par e-mail !

engineer for the development of new methods in pangenomics

SFBI

Arrondissement d'Évry

Sur place

EUR 40 000 - 60 000

Plein temps

Hier
Soyez parmi les premiers à postuler

Générez un CV personnalisé en quelques minutes

Décrochez un entretien et gagnez plus. En savoir plus

Résumé du poste

Un laboratoire de recherche en bioinformatique recherche un ingénieur pour participer au développement de la suite logicielle PPanGGOLiN pour l'analyse des pangenomes au niveau du genre. Le candidat idéal a un Master en bioinformatique ou informatique, avec une expertise en bioinformatique et développement logiciel. Ce poste est basé à Évry, France, et propose un contrat de 2 ans débutant en avril 2026.

Qualifications

  • Expertise en algorithmes bioinformatiques (graphes).
  • Maîtrise du développement logiciel.
  • Capacités de programmation avancées en Python.

Responsabilités

  • Améliorer le modèle et les méthodes PPanGGOLiN pour des analyses de pangenome au niveau du genre.
  • Adapter le modèle de données PPanGGOLiN aux différents niveaux taxonomiques.
  • Mettre à jour les méthodes panRGP et panModule avec de nouveaux ajustements de paramètres.

Connaissances

Compétences en bioinformatique
Développement logiciel
Programmation en Python
Connaissances en génomique microbienne
Statistiques

Formation

Master en bioinformatique ou informatique
Description du poste
engineer for the development of new methods in pangenomics
Description

Prokaryotes—bacteria and archaea—are diverse, ubiquitous organisms with vast impacts on health, soil, and ocean ecosystems. Large-scale genome sequencing and pangenomics have revealed their molecular diversity, especially the role of Mobile Genetic Elements (MGEs). Pangenomics analyzes genetic variability across all genomes of a group, distinguishing between core genes (shared by all) and accessory genes (variable, linked to phenotypic traits). These methods address the challenge of big data in biology [1], advancing our understanding of microbial evolution in epidemiological and environmental contexts.

For years, the LABGeM team has developed a pangenome graph model at the gene family level, compressing data from thousands of genomes while preserving gene order. The PPanGGOLiN software suite [2] reconstructs and analyzes pangenome graphs at the species level. It encompasses methods for the identification of regions of genomic plasticity, including MGEs and Genomic islands, (panRGP method) [3] and their fine description in conserved modules (panModule method) [4]. LABGeM is also developing PanGBank , a database of pangenomes reconstructed from public genomes from Genbank and RefSeq databases using the GTDB classification. It currently gathers pangenomes for over 4300 prokaryotic species.
However, a significant proportion of bacterial species remain underrepresented in genomic databases (e.g. 64% of the ~110,000 species in GTDB [5] are represented by only a single genome). This sparse coverage limits our ability to conduct evolutionary analyses at the species level. To bridge this gap, genus-level comparative analyses offer valuable insights into how closely related species adapt to a wide range of environmental conditions.

In the framework of ANR PanGAIMiX project (2025-2029), which aims to leverage large language models to enhance large scale computational microbiology using pangenome graphs, we are recruiting an engineer to join our team and contribute to the development of PPanGGOLiN to allow analyses at the genus level.

Your mission:
You will help enhance the PPanGGOLiN model and methods to enable genus-level pangenome analyses, while preserving species-level detail.
The main objectives of this work will be to:

  • adapt the PPanGGOLiN data model and gene families to different taxonomic levels, including species and genus
  • improve the NEM (Neighborhood Expectation-Maximization) partitioning method implemented in PPanGGOLiN
  • update panRGP and panModule methods to the new data model, with parameter adjustments to account for the increased genomic diversity at the genus level
\mq-this line might be extraneous? CorrectionNot needed. I will continue.

Candidate profile

  • Master in Bioinformatics or Computer Science
  • Expertise in bioinformatics algorithm (graphs) and software development
  • Strong programming skills in Python
  • Knowledge in microbial genomics and statistics will be appreciated

Duration: 2 years contract starting in April 2026.

Remuneration: According to qualifications and experience (CEA salary scale)

Location: Genoscope, Evry, in the Bioinformatics Analysis Laboratory for Genomics and Metabolism (LABGeM).

References
[1] Computational Pan-Genomics Consortium. Computational pan-genomics: status, promises and challenges. Brief Bioinform. 2016. doi:10.1093/bib/bbw089
[2] Gautreau G, et al. PPanGGOLiN: Depicting microbial diversity via a partitioned pangenome graph. PLoS Comput Biol. 2020;16: e1007732. doi:10.1371/journal.pcbi.1007732
[3] Bazin A, et al. panRGP MESSAGE: a pangenome-based method to predict genomic islands and explore their diversity. Bioinformatics. 2020;36: i651– actuación… . doi:10.1093/bioinformatics/btaa792

Candidature
Procédure : To apply, please send your resume and cover letter to the following addresses: acalteau-ayur...

Contacts
David Vallenet
vallenet@genoscope.cns.fr

Obtenez votre examen gratuit et confidentiel de votre CV.
ou faites glisser et déposez un fichier PDF, DOC, DOCX, ODT ou PAGES jusqu’à 5 Mo.