A Continuum of Evolving De Novo Genes Drives Protein-Coding Novelty in Drosophila

Heames, Brennen; Schmitz, Jonathan; Bornberg-Bauer, Erich

Forschungsartikel (Zeitschrift) | Peer reviewed

Zusammenfassung

Orphan genes, lacking detectable homologs in outgroup species, typically represent 10–30% of eukaryotic genomes. Efforts to find the source of these young genes indicate that de novo emergence from non-coding DNA may in part explain their prevalence. Here, we investigate the roots of orphan gene emergence in the Drosophila genus. Across the annotated proteomes of twelve species, we find 6297 orphan genes within 4953 taxon-specific clusters of orthologs. By inferring the ancestral DNA as non-coding for between 550 and 2467 (8.7–39.2%) of these genes, we describe for the first time how de novo emergence contributes to the abundance of clade-specific Drosophila genes. In support of them having functional roles, we show that de novo genes have robust expression and translational support. However, the distinct nucleotide sequences of de novo genes, which have characteristics intermediate between intergenic regions and conserved genes, reflect their recent birth from non-coding DNA. We find that de novo genes encode more disordered proteins than both older genes and intergenic regions. Together, our results suggest that gene emergence from non-coding DNA provides an abundant source of material for the evolution of new proteins. Following gene birth, gradual evolution over large evolutionary timescales moulds sequence properties towards those of conserved genes, resulting in a continuum of properties whose starting points depend on the nucleotide sequences of an initial pool of novel genes.

Details zur Publikation

FachzeitschriftJournal of Molecular Evolution
Jahrgang / Bandnr. / Volume88
Ausgabe / Heftnr. / Issue4
StatusVeröffentlicht
Veröffentlichungsjahr2020 (07.04.2020)
Sprache, in der die Publikation verfasst istEnglisch
DOI10.1007/s00239-020-09939-z
Link zum Volltexthttps://link.springer.com/article/10.1007/s00239-020-09939-z
StichwörterGene emergence; De novo gene; Orphan gene; Intrinsic disorder; Drosophila; Protein evolution

Autor*innen der Universität Münster

Bornberg-Bauer, Erich
Arbeitsgruppe Bioinformatik (Prof. Bornberg-Bauer)
Heames, Brennen
Arbeitsgruppe Bioinformatik (Prof. Bornberg-Bauer)
Schmitz, Jonathan
Arbeitsgruppe Bioinformatik (Prof. Bornberg-Bauer)