» Articles » PMID: 39339901

Bioinformatics Goes Viral: I. Databases, Phylogenetics and Phylodynamics Tools for Boosting Virus Research

Overview
Journal Viruses
Publisher MDPI
Specialty Microbiology
Date 2024 Sep 28
PMID 39339901
Authors
Affiliations
Soon will be listed here.
Abstract

Computer-aided analysis of proteins or nucleic acids seems like a matter of course nowadays; however, the history of Bioinformatics and Computational Biology is quite recent. The advent of high-throughput sequencing has led to the production of "big data", which has also affected the field of virology. The collaboration between the communities of bioinformaticians and virologists already started a few decades ago and it was strongly enhanced by the recent SARS-CoV-2 pandemics. In this article, which is the first in a series on how bioinformatics can enhance virus research, we show that highly useful information is retrievable from selected general and dedicated databases. Indeed, an enormous amount of information-both in terms of nucleotide/protein sequences and their annotation-is deposited in the general databases of international organisations participating in the International Nucleotide Sequence Database Collaboration (INSDC). However, more and more virus-specific databases have been established and are progressively enriched with the contents and features reported in this article. Since viruses are intracellular obligate parasites, a special focus is given to host-pathogen protein-protein interaction databases. Finally, we illustrate several phylogenetic and phylodynamic tools, combining information on algorithms and features with practical information on how to use them and case studies that validate their usefulness. Databases and tools for functional inference will be covered in the next article of this series: .

References
1.
Salwinski L, Miller C, Smith A, Pettit F, Bowie J, Eisenberg D . The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2003; 32(Database issue):D449-51. PMC: 308820. DOI: 10.1093/nar/gkh086. View

2.
Wang C, Kurgan L . Review and comparative assessment of similarity-based methods for prediction of drug-protein interactions in the druggable human proteome. Brief Bioinform. 2018; 20(6):2066-2087. DOI: 10.1093/bib/bby069. View

3.
Valiente G . The Landscape of Virus-Host Protein-Protein Interaction Databases. Front Microbiol. 2022; 13:827742. PMC: 9335289. DOI: 10.3389/fmicb.2022.827742. View

4.
Tsitsiridis G, Steinkamp R, Giurgiu M, Brauner B, Fobo G, Frishman G . CORUM: the comprehensive resource of mammalian protein complexes-2022. Nucleic Acids Res. 2022; 51(D1):D539-D545. PMC: 9825459. DOI: 10.1093/nar/gkac1015. View

5.
Tamura K, Nei M . Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993; 10(3):512-26. DOI: 10.1093/oxfordjournals.molbev.a040023. View