
RepRNA: a Web Server for Generating Various Feature Vectors of RNA Sequences

Overview
Specialty Genetics
Date 2015 Jun 19
PMID 26085220
Citations 61
Abstract

With the rapid growth of RNA sequences generated in the postgenomic age, a flexible method is needed that can generate various kinds of vectors to represent these sequences by focusing on their different features. This is because nearly all existing machine-learning methods, such as SVM (support vector machine) and KNN (k-nearest neighbor), can handle only vectors, not sequences. To meet this growing demand and speed up genome analyses, we have developed a new web server called "representations of RNA sequences" (repRNA). Compared with existing methods, repRNA is much more comprehensive, flexible and powerful, as reflected by the following: (1) it can generate 11 different modes of feature vectors, from which users can choose according to their research purposes; (2) it allows users to select features from 22 built-in physicochemical properties or from properties they define themselves; (3) the resulting feature vectors and the secondary structures of the corresponding RNA sequences can be visualized. The repRNA web server is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repRNA/.
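
To make concrete what "representing a sequence as a vector" means, the sketch below computes a k-mer composition vector, one of the simplest and most widely used sequence representations. It is an illustration only and not repRNA's implementation: the abstract does not list the 11 modes, and the function name `kmer_composition`, the `ACGU` alphabet ordering, and the handling of ambiguous bases are all assumptions made for this example.

```python
from itertools import product

ALPHABET = "ACGU"  # assumed ordering; any fixed ordering works


def kmer_composition(sequence, k=2):
    """Return the normalized k-mer frequency vector of an RNA sequence.

    Hypothetical helper for illustration; not part of repRNA.
    The vector has len(ALPHABET) ** k entries, ordered like
    itertools.product(ALPHABET, repeat=k).
    """
    # Normalize case and accept DNA-style input by mapping T to U.
    seq = sequence.upper().replace("T", "U")

    # Enumerate every possible k-mer once so the vector length is fixed.
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)

    # Slide a window of width k; windows with ambiguous bases are skipped.
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:
            counts[window] += 1

    total = sum(counts.values())
    return [counts[m] / total if total else 0.0 for m in kmers]


# A 6-nt sequence yields five dinucleotide windows: GG, GA, AU, UC, CC.
vec = kmer_composition("GGAUCC", k=2)
```

Because every sequence maps to a vector of the same length (16 for k = 2), the output can be passed directly to vector-based learners such as SVM or KNN, which is the motivation the abstract gives.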

Citing Articles

Reliable method for predicting the binding affinity of RNA-small molecule interactions using machine learning.

Krishnan S, Roy A, Gromiha M Brief Bioinform. 2024; 25(2).

PMID: 38261341 PMC: 10805179. DOI: 10.1093/bib/bbae002.


Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features.

Perveen G, Alturise F, Alkhalifah T, Khan Y Digit Health. 2023; 9:20552076231180739.

PMID: 37434723 PMC: 10331097. DOI: 10.1177/20552076231180739.


iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets.

Chen Z, Liu X, Zhao P, Li C, Wang Y, Li F Nucleic Acids Res. 2022; 50(W1):W434-W447.

PMID: 35524557 PMC: 9252729. DOI: 10.1093/nar/gkac351.


XGEM: Predicting Essential miRNAs by the Ensembles of Various Sequence-Based Classifiers With XGBoost Algorithm.

Min H, Xin X, Gao C, Wang L, Du P Front Genet. 2022; 13:877409.

PMID: 35419029 PMC: 8996062. DOI: 10.3389/fgene.2022.877409.


PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles.

Mohammadi A, Zahiri J, Mohammadi S, Khodarahmi M, Arab S Biol Methods Protoc. 2022; 7(1):bpac008.

PMID: 35388370 PMC: 8977839. DOI: 10.1093/biomethods/bpac008.

