» Articles » PMID: 36092871

Geometric Complement Heterogeneous Information and Random Forest for Predicting LncRNA-disease Associations

Overview
Journal Front Genet
Date 2022 Sep 12
PMID 36092871
Authors
Affiliations
Soon will be listed here.
Abstract

More and more evidences have showed that the unnatural expression of long non-coding RNA (lncRNA) is relevant to varieties of human diseases. Therefore, accurate identification of disease-related lncRNAs can help to understand lncRNA expression at the molecular level and to explore more effective treatments for diseases. Plenty of lncRNA-disease association prediction models have been raised but it is still a challenge to recognize unknown lncRNA-disease associations. In this work, we have proposed a computational model for predicting lncRNA-disease associations based on geometric complement heterogeneous information and random forest. Firstly, geometric complement heterogeneous information was used to integrate lncRNA-miRNA interactions and miRNA-disease associations verified by experiments. Secondly, lncRNA and disease features consisted of their respective similarity coefficients were fused into input feature space. Thirdly, an autoencoder was adopted to project raw high-dimensional features into low-dimension space to learn representation for lncRNAs and diseases. Finally, the low-dimensional lncRNA and disease features were fused into input feature space to train a random forest classifier for lncRNA-disease association prediction. Under five-fold cross-validation, the AUC (area under the receiver operating characteristic curve) is 0.9897 and the AUPR (area under the precision-recall curve) is 0.7040, indicating that the performance of our model is better than several state-of-the-art lncRNA-disease association prediction models. In addition, case studies on colon and stomach cancer indicate that our model has a good ability to predict disease-related lncRNAs.

Citing Articles

Predicting lncRNA-Disease Associations Based on a Dual-Path Feature Extraction Network with Multiple Sources of Information Integration.

Yao D, Zhang B, Zhan X, Zhang B, Li X ACS Omega. 2024; 9(32):35100-35112.

PMID: 39157140 PMC: 11325412. DOI: 10.1021/acsomega.4c05365.

References
1.
Wu Q, Cao R, Xia J, Ni J, Zheng C, Su Y . Extra Trees Method for Predicting LncRNA-Disease Association Based On Multi-Layer Graph Embedding Aggregation. IEEE/ACM Trans Comput Biol Bioinform. 2021; 19(6):3171-3178. DOI: 10.1109/TCBB.2021.3113122. View

2.
Gao M, Cui Z, Gao Y, Wang J, Liu J . Multi-Label Fusion Collaborative Matrix Factorization for Predicting LncRNA-Disease Associations. IEEE J Biomed Health Inform. 2020; 25(3):881-890. DOI: 10.1109/JBHI.2020.2988720. View

3.
Washietl S, Kellis M, Garber M . Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals. Genome Res. 2014; 24(4):616-28. PMC: 3975061. DOI: 10.1101/gr.165035.113. View

4.
Ping P, Wang L, Kuang L, Ye S, Iqbal M, Pei T . A Novel Method for LncRNA-Disease Association Prediction Based on an lncRNA-Disease Association Network. IEEE/ACM Trans Comput Biol Bioinform. 2018; 16(2):688-693. DOI: 10.1109/TCBB.2018.2827373. View

5.
Ji C, Gao Z, Ma X, Wu Q, Ni J, Zheng C . AEMDA: inferring miRNA-disease associations based on deep autoencoder. Bioinformatics. 2020; 37(1):66-72. DOI: 10.1093/bioinformatics/btaa670. View