Identification and Characterization of Species-Specific Severe Acute Respiratory Syndrome Coronavirus 2 Physicochemical Properties
Overview
There is an urgent need to elucidate the mechanisms underlying coronavirus disease 2019 (COVID-19) so that vaccines and treatments can be devised. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is genetically similar to bat and pangolin coronaviruses, but a comprehensive understanding of the functions of its proteins at the amino acid sequence level is lacking. A total of 4320 sequences of human and nonhuman coronaviruses were retrieved from the Global Initiative on Sharing All Influenza Data (GISAID) and the National Center for Biotechnology Information (NCBI). This work proposes an optimization method, COVID-Pred, with an efficient feature selection algorithm to classify species-specific coronaviruses based on the physicochemical properties (PCPs) of their sequences. Using a support vector machine, COVID-Pred identified a set of 11 PCPs and achieved 10-fold cross-validation and test accuracies of 99.53% and 97.80%, respectively. These findings could provide key insights into the driving forces during the course of infection and assist in developing effective therapies.
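The abstract does not say how a sequence is turned into PCP features. As a minimal sketch, assuming each PCP is summarized by averaging a per-residue property value over the sequence, the encoding step could look like the following. The Kyte-Doolittle hydropathy scale is used only as a familiar illustration and is not stated to be one of the 11 selected PCPs; the function names `mean_property` and `encode` are hypothetical.

```python
# Kyte-Doolittle hydropathy index per amino acid (illustrative scale only;
# the abstract does not name the PCPs that COVID-Pred selected).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_property(sequence, scale):
    """Average a per-residue property over a protein sequence.

    Residues missing from the scale (for example the ambiguity code X)
    are skipped. Averaging is an assumption of this sketch.
    """
    values = [scale[res] for res in sequence.upper() if res in scale]
    if not values:
        raise ValueError("sequence contains no residues covered by the scale")
    return sum(values) / len(values)


def encode(sequence, scales):
    """Build a feature vector with one mean value per property scale.

    `scales` maps a property name to a per-residue dict; features are
    ordered by property name so the vector layout is reproducible.
    """
    return [mean_property(sequence, scales[name]) for name in sorted(scales)]


if __name__ == "__main__":
    toy_peptide = "MFVFLVLLPLVSSQ"  # short illustrative peptide
    print(encode(toy_peptide, {"hydropathy": KYTE_DOOLITTLE}))
```

Vectors built this way, one entry per candidate PCP, are the kind of input on which a feature selection step and a support vector machine could then operate.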