Retroviridae is a large family of RNA viruses infecting humans and animals and responsible for a variety of diseases such as severe immunodeficiency like AIDS in humans, or cancers. Retroviruses are also major components of the mammalian genomes. Known as endogenous retroviruses (ERV), they result from ancient integrations of exogenous infectious retroviruses into the DNA of germ cells. Approximately 8% of the human genome is made of ERV sequences. They have long been considered as "junk DNA" but it is now well established that they participate to the genome evolution and even bring indispensable functions for the mammalian life and development (Almojil et al., 2021).
Most ERVs originated from now extinct exogenous retroviruses. Still, a few animal species have been described, displaying at the same time ERVs and their still circulating exogenous counterparts (Chiu and VandeWoude, 2021). Among them, the small ruminants’ genomes carry multiple ERV copies of ß- retroviruses which co-exist with their exogenous counterparts, namely JSRV (Jaagsiekte sheep retrovirus) and ENTV (Enzootic Nasal Tumor Virus) (Arnaud et al., 2007). JSRV and ENTV are oncogenic viruses responsible for respiratory cancers in sheep and goats (Monot et al., 2015).
We have recently characterized and annotated the ERV families across the sheep and goat (wild and domesticated) genomes showing a different dynamic among analyzed species (Verneret et al., 2024). Our results suggest that among them two families may still have transpositional potential. One of them is a family closely related to the circulating oncogenic exogenous retroviruses represented by many full-length copies with conserved ORFs and no syntenic insertions between sheep and goats, whereas the second family is composed of shorter copies with only partial retroviral genes and shared insertions between small ruminants. This suggests different transposition mechanisms between the two families.
The main goal of this project is to gather evidence that these two ERV families are still active by analyzing their expression. We will also determine the evolutionary implications on the neighboring gene expression in the different species. This project will be subdivided into three parts corresponding to different approaches:
The student will develop expertise in handling high-throughput sequencing data and performing statistical analysis using R. He/She will become familiar with key computational methods commonly used in transcriptomics and gain a solid understanding of endogenous retroviruses (ERVs) and the analysis of gene expression data.
jocelyn.turpin@univ-lyon1.fr