» Articles » PMID: 32941639

DGINN, an Automated and Highly-flexible Pipeline for the Detection of Genetic Innovations on Protein-coding Genes

Overview
Specialty Biochemistry
Date 2020 Sep 17
PMID 32941639
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

Adaptive evolution has shaped major biological processes. Finding the protein-coding genes and the sites that have been subjected to adaptation during evolutionary time is a major endeavor. However, very few methods fully automate the identification of positively selected genes, and widespread sources of genetic innovations such as gene duplication and recombination are absent from most pipelines. Here, we developed DGINN, a highly-flexible and public pipeline to Detect Genetic INNovations and adaptive evolution in protein-coding genes. DGINN automates, from a gene's sequence, all steps of the evolutionary analyses necessary to detect the aforementioned innovations, including the search for homologs in databases, assignation of orthology groups, identification of duplication and recombination events, as well as detection of positive selection using five methods to increase precision and ranking of genes when a large panel is analyzed. DGINN was validated on nineteen genes with previously-characterized evolutionary histories in primates, including some engaged in host-pathogen arms-races. Our results confirm and also expand results from the literature, including novel findings on the Guanylate-binding protein family, GBPs. This establishes DGINN as an efficient tool to automatically detect genetic innovations and adaptive evolution in diverse datasets, from the user's gene of interest to a large gene list in any species range.

Citing Articles

Genomic and functional adaptations in guanylate-binding protein 5 (GBP5) highlight specificities of bat antiviral innate immunity.

Le Corf A, Maesen S, Loyer C, Vazquez J, Lauterbur M, Sareoua L bioRxiv. 2025; .

PMID: 39990348 PMC: 11844482. DOI: 10.1101/2025.02.11.637683.


Recognition and cleavage of human tRNA methyltransferase TRMT1 by the SARS-CoV-2 main protease.

DOliviera A, Dai X, Mottaghinia S, Olson S, Geissler E, Etienne L Elife. 2025; 12.

PMID: 39773525 PMC: 11706605. DOI: 10.7554/eLife.91168.


AOC: Analysis of Orthologous Collections - an application for the characterization of natural selection in protein-coding sequences.

Lucaci A, Kosakovsky Pond S ArXiv. 2024; .

PMID: 38947939 PMC: 11213150.


Evolutionary immunology to explore original antiviral strategies.

Imler J, Cai H, Meignin C, Martins N Philos Trans R Soc Lond B Biol Sci. 2024; 379(1901):20230068.

PMID: 38497262 PMC: 10945398. DOI: 10.1098/rstb.2023.0068.


FREEDA: An automated computational pipeline guides experimental testing of protein innovation.

Dudka D, Akins R, Lampson M J Cell Biol. 2023; 222(9).

PMID: 37358475 PMC: 10292211. DOI: 10.1083/jcb.202212084.


References
1.
Stern A, Doron-Faigenboim A, Erez E, Martz E, Bacharach E, Pupko T . Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach. Nucleic Acids Res. 2007; 35(Web Server issue):W506-11. PMC: 1933148. DOI: 10.1093/nar/gkm382. View

2.
Kerns J, Emerman M, Malik H . Positive selection and increased antiviral activity associated with the PARP-containing isoform of human zinc-finger antiviral protein. PLoS Genet. 2008; 4(1):e21. PMC: 2213710. DOI: 10.1371/journal.pgen.0040021. View

3.
Yang Z . PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007; 24(8):1586-91. DOI: 10.1093/molbev/msm088. View

4.
Kosakovsky Pond S, Posada D, Gravenor M, Woelk C, Frost S . GARD: a genetic algorithm for recombination detection. Bioinformatics. 2006; 22(24):3096-8. DOI: 10.1093/bioinformatics/btl474. View

5.
Pecon-Slattery J . Recent advances in primate phylogenomics. Annu Rev Anim Biosci. 2014; 2:41-63. DOI: 10.1146/annurev-animal-022513-114217. View