Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster

Grandchamp Anna , Kühl Lucas , Lebherz Marie , Brüggemann Kathrin , Parsch John , Bornberg-Bauer Erich

Research article (journal) | Peer reviewed

Abstract

Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes.

Details about the publication

JournalGenome Research (Genome Res)
Volume33
StatusPublished
Release year2023
Language in which the publication is writtenEnglish
DOI10.1101/gr.277482.122
Link to the full texthttp://dx.doi.org/10.1101/gr.277482.122
Keywordsde novo genes; Drosophila melanogaster; transposable elements

Authors from the University of Münster

Bornberg-Bauer, Erich
Research Group Evolutionary Bioinformatics
Grandchamp, Anna
Research Group Evolutionary Bioinformatics