paPAML: An Improved Computational Tool to Explore Selection Pressure on Protein-Coding Sequences

Steffen, R.; Ogoniak, L.; Grundmann, N.; Pawluchin, A.; Soehnlein, O.; Schmitz, J.

Research article (journal) | Peer reviewed

Abstract

Evolution is change over time. Although neutral changes promoted by drift effects are most reliable for phylogenetic reconstructions, selection-relevant changes are of only limited use to reconstruct phylogenies. On the other hand, comparative analyses of neutral and selected changes of protein-coding DNA sequences (CDS) retrospectively tell us about episodic constrained, relaxed, and adaptive incidences. The ratio of sites with nonsynonymous (amino acid altering) versus synonymous (not altering) mutations directly measures selection pressure and can be analysed by using the Phylogenetic Analysis by Maximum Likelihood (PAML) software package. We developed a CDS extractor for compiling protein-coding sequences (CDS-extractor) and parallel PAML (paPAML) to simplify, amplify, and accelerate selection analyses via parallel processing, including detection of negatively selected sites. paPAML compiles results of site, branch-site, and branch models and detects site-specific negative selection with the output of a codon list labelling significance values. The tool simplifies selection analyses for casual and inexperienced users and accelerates computing speeds up to the number of allocated computer threads. We then applied paPAML to examine the evolutionary impact on a new GINS Complex Subunit 3 exon, and neutrophil-associated as well as lysin and apolipoprotein genes. Compared with codeml (PAML version 4.9j) and HyPhy (HyPhy FEL version 2.5.26), all paPAML test runs performed with 10 computing threads led to identical selection pressure results, whereas the total selection analysis via paPAML, including all model comparisons, was about 3 to 5 times faster than the longest running codeml model and about 7 to 15 times faster than the entire processing time of these codeml runs.
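The model comparisons mentioned in the abstract are conventionally carried out in PAML as likelihood ratio tests between nested codon models. The following is a minimal sketch of such a test, not the paPAML implementation: the function name `lrt_p_value` and the log-likelihood values are hypothetical, and the degrees of freedom depend on the model pair being compared (for example, 2 for the site models M1a versus M2a).

```python
import math


def lrt_p_value(lnl_null, lnl_alt, df):
    """Likelihood ratio test for two nested models.

    lnl_null / lnl_alt are the maximised log-likelihoods of the simpler
    and the more complex model; df is the difference in free parameters.
    Returns the test statistic and its chi-square p-value.
    """
    # Test statistic: twice the log-likelihood difference (clamped at 0,
    # since optimiser noise can make it slightly negative).
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)
    if df == 1:
        # Chi-square survival function for 1 degree of freedom.
        p = math.erfc(math.sqrt(stat / 2.0))
    elif df == 2:
        # Chi-square survival function for 2 degrees of freedom.
        p = math.exp(-stat / 2.0)
    else:
        raise ValueError("this sketch only covers df = 1 or df = 2")
    return stat, p


# Hypothetical log-likelihoods, chosen only for illustration.
stat, p = lrt_p_value(lnl_null=-2310.45, lnl_alt=-2304.12, df=2)
print(f"2*delta(lnL) = {stat:.2f}, p = {p:.4g}")
```

A small p-value favours the more complex model, which in a site-model comparison is the one that allows a class of codons with a nonsynonymous/synonymous ratio above 1.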

Details about the publication

Journal: Genes
Volume: 2022
Status: Published
Release year: 2022
DOI: 10.3390/genes13061090
Link to the full text: https://www.mdpi.com/2073-4425/13/6/1090#cite
Keywords: codeml; paPAML; CDS-extractor; positive selection; purifying selection; PAML; HyPhy; retrotransposon exonization; neutrophil associated gene; abalone sperm lysin; apolipoprotein

Authors from the University of Münster

Grundmann, Norbert
Institute of Bioinformatics
Ogoniak, Lynn
Institute of Experimental Pathology
Pawluchin, Anna
Institute of Medical Physics and Biophysics
Schmitz, Jürgen
Institute of Experimental Pathology
Söhnlein, Oliver
Institute of Experimental Pathology
Steffen, Raphael
Institute of Experimental Pathology