
BioSeq-Analysis2.0: an Updated Platform for Analyzing DNA, RNA and Protein Sequences at Sequence Level and Residue Level Based on Machine Learning Approaches

Overview
Specialty Biochemistry
Date 2019 Sep 11
PMID 31504851
Citations 120
Abstract

BioSeq-Analysis was the first web server to analyze various biological sequences at the sequence level based on machine learning approaches, and many powerful predictors in computational biology have been developed with its assistance. However, BioSeq-Analysis can only be applied to sequence-level analysis tasks, not residue-level ones, and an intelligent tool that can automatically generate predictors for biological sequence analysis at both the residue level and the sequence level is highly desired. We therefore present an updated server, BioSeq-Analysis2.0 (http://bliulab.net/BioSeq-Analysis2.0/), covering a total of 26 features at the residue level and 90 features at the sequence level. Users only need to upload a benchmark dataset, and BioSeq-Analysis2.0 generates the predictors for both residue-level and sequence-level analysis tasks. A corresponding stand-alone tool is also provided and can be downloaded from http://bliulab.net/BioSeq-Analysis2.0/download/. To the best of our knowledge, BioSeq-Analysis2.0 is the first tool for generating predictors for biological sequence analysis tasks at the residue level. The experimental results indicated that the predictors developed by BioSeq-Analysis2.0 can achieve comparable or even better performance than existing state-of-the-art predictors.
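The distinction between sequence-level and residue-level analysis can be illustrated with a minimal Python sketch. This is not the BioSeq-Analysis2.0 API or the authors' implementation. The function names, the choice of amino-acid composition and a one-hot sliding window as example features, and the padding symbol `X` are all assumptions made for illustration. A sequence-level feature yields one fixed-length vector per sequence, whereas a residue-level feature yields one vector per residue.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order used for every feature vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def sequence_level_composition(seq):
    """Hypothetical sequence-level feature: one vector for the whole sequence.

    Returns the frequency of each standard amino acid in `seq`.
    """
    if not seq:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def residue_level_windows(seq, flank=2, pad="X"):
    """Hypothetical residue-level feature: one vector per residue.

    Each residue is described by a one-hot encoding of the window that
    extends `flank` positions to either side of it. Positions beyond the
    sequence ends are filled with `pad`, which encodes as all zeros.
    """
    if not seq:
        raise ValueError("sequence must not be empty")
    padded = pad * flank + seq + pad * flank
    vectors = []
    for i in range(len(seq)):
        window = padded[i:i + 2 * flank + 1]
        vec = []
        for ch in window:
            vec.extend(1 if ch == aa else 0 for aa in AMINO_ACIDS)
        vectors.append(vec)
    return vectors


if __name__ == "__main__":
    example = "ACDA"  # toy sequence, not taken from any benchmark dataset
    print(len(sequence_level_composition(example)))        # vector length
    print(len(residue_level_windows(example, flank=1)))    # number of residues
```

In this sketch a sequence-level predictor would be trained on one label per sequence, while a residue-level predictor would be trained on one label per window, which is why the two kinds of task need different feature extractors.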

Citing Articles

Conotoxins: Classification, Prediction, and Future Directions in Bioinformatics.

Li R, Yu J, Ye D, Liu S, Zhang H, Lin H Toxins (Basel). 2025; 17(2).

PMID: 39998095 PMC: 11860864. DOI: 10.3390/toxins17020078.


Overview and Prospects of DNA Sequence Visualization.

Wu Y, Xie X, Zhu J, Guan L, Li M Int J Mol Sci. 2025; 26(2).

PMID: 39859192 PMC: 11764684. DOI: 10.3390/ijms26020477.


Identify potential drug candidates within a high-quality compound search space.

Ru X, Zhao S, Zou Q, Xu L Brief Bioinform. 2025; 26(1).

PMID: 39853109 PMC: 11758506. DOI: 10.1093/bib/bbaf024.


Empirical Comparison and Analysis of Artificial Intelligence-Based Methods for Identifying Phosphorylation Sites of SARS-CoV-2 Infection.

Lai H, Zhu T, Xie S, Luo X, Hong F, Luo D Int J Mol Sci. 2025; 25(24).

PMID: 39769436 PMC: 11678915. DOI: 10.3390/ijms252413674.


Annotating protein functions via fusing multiple biological modalities.

Ma W, Bi X, Jiang H, Wei Z, Zhang S Commun Biol. 2024; 7(1):1705.

PMID: 39730886 PMC: 11681170. DOI: 10.1038/s42003-024-07411-y.

