paPAML: An Improved Computational Tool to Explore Selection Pressure on Protein-Coding Sequences

Steffen, R.; Ogoniak, L.; Grundmann, N.; Pawluchin, A.; Soehnlein, O.; Schmitz, J.

Research article (journal) | Peer reviewed

Abstract

Evolution is change over time. Although neutral changes promoted by drift effects are most reliable for phylogenetic reconstructions, selection-relevant changes are of only limited use to reconstruct phylogenies. On the other hand, comparative analyses of neutral and selected changes of protein-coding DNA sequences (CDS) retrospectively tell us about episodic constrained, relaxed, and adaptive incidences. The ratio of sites with nonsynonymous (amino acid altering) versus synonymous (not altering) mutations directly measures selection pressure and can be analysed by using the Phylogenetic Analysis by Maximum Likelihood (PAML) software package. We developed a CDS extractor for compiling protein-coding sequences (CDS-extractor) and parallel PAML (paPAML) to simplify, amplify, and accelerate selection analyses via parallel processing, including detection of negatively selected sites. paPAML compiles results of site, branch-site, and branch models and detects site-specific negative selection with the output of a codon list labelling significance values. The tool simplifies selection analyses for casual and inexperienced users and accelerates computing speeds up to the number of allocated computer threads. We then applied paPAML to examine the evolutionary impact on a new GINS Complex Subunit 3 exon, and neutrophil-associated as well as lysin and apolipoprotein genes. Compared with codeml (PAML version 4.9j) and HyPhy (HyPhy FEL version 2.5.26), all paPAML test runs performed with 10 computing threads led to identical selection pressure results, whereas the total selection analysis via paPAML, including all model comparisons, was about 3 to 5 times faster than the longest running codeml model and about 7 to 15 times faster than the entire processing time of these codeml runs.
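The model comparisons mentioned in the abstract are conventionally evaluated in PAML workflows with a likelihood ratio test (LRT) between a null and an alternative codon model. The following is a minimal sketch of that test, not paPAML's own code: the function names and the log-likelihood values are hypothetical, and the chi-square p-value uses closed forms that hold only for 1 or 2 degrees of freedom (for example, the site-model pair M7 vs. M8 differs by two parameters).

```python
import math


def chi2_sf(stat: float, df: int) -> float:
    """Upper-tail probability of a chi-square distribution (df = 1 or 2 only)."""
    if stat <= 0.0:
        return 1.0
    if df == 1:
        # For df = 1 the survival function reduces to erfc(sqrt(x / 2)).
        return math.erfc(math.sqrt(stat / 2.0))
    if df == 2:
        # For df = 2 the survival function reduces to exp(-x / 2).
        return math.exp(-stat / 2.0)
    raise ValueError("this sketch only supports df = 1 or df = 2")


def likelihood_ratio_test(lnl_null: float, lnl_alt: float, df: int):
    """Return (test statistic, p-value) for two nested models."""
    # The statistic is twice the log-likelihood difference, floored at zero.
    stat = max(0.0, 2.0 * (lnl_alt - lnl_null))
    return stat, chi2_sf(stat, df)


# Hypothetical log-likelihoods, chosen only for illustration.
stat, p_value = likelihood_ratio_test(lnl_null=-2345.67, lnl_alt=-2340.12, df=2)
print(f"2*delta(lnL) = {stat:.2f}, p = {p_value:.4f}")
```

A small p-value favours the alternative model, for instance one that allows a class of sites with a nonsynonymous/synonymous ratio above 1. For other degrees of freedom, a general chi-square implementation from a statistics library would be needed.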

Publication details

Journal: Genes
Year / Volume no.: 2022
Status: Published
Publication year: 2022
DOI: 10.3390/genes13061090
Link to full text: https://www.mdpi.com/2073-4425/13/6/1090#cite
Keywords: codeml; paPAML; CDS-extractor; positive selection; purifying selection; PAML; HyPhy; retrotransposon exonization; neutrophil associated gene; abalone sperm lysin; apolipoprotein

Authors from the University of Münster

Grundmann, Norbert
Institute of Bioinformatics
Ogoniak, Lynn
Institute of Experimental Pathology
Pawluchin, Anna
Institute of Medical Physics and Biophysics
Schmitz, Jürgen
Institute of Experimental Pathology
Söhnlein, Oliver
Institute of Experimental Pathology
Steffen, Raphael
Institute of Experimental Pathology