» Articles » PMID: 38877887

AITeQ: a Machine Learning Framework for Alzheimer's Prediction Using a Distinctive Five-gene Signature

Abstract

Neurodegenerative diseases, such as Alzheimer's disease, pose a significant global health challenge with their complex etiology and elusive biomarkers. In this study, we developed the Alzheimer's Identification Tool (AITeQ) using ribonucleic acid-sequencing (RNA-seq), a machine learning (ML) model based on an optimized ensemble algorithm for the identification of Alzheimer's from RNA-seq data. Analysis of RNA-seq data from several studies identified 87 differentially expressed genes. This was followed by a ML protocol involving feature selection, model training, performance evaluation, and hyperparameter tuning. The feature selection process undertaken in this study, employing a combination of four different methodologies, culminated in the identification of a compact yet impactful set of five genes. Twelve diverse ML models were trained and tested using these five genes (CNKSR1, EPHA2, CLSPN, OLFML3, and TARBP1). Performance metrics, including precision, recall, F1 score, accuracy, Matthew's correlation coefficient, and receiver operating characteristic area under the curve were assessed for the finally selected model. Overall, the ensemble model consisting of logistic regression, naive Bayes classifier, and support vector machine with optimized hyperparameters was identified as the best and was used to develop AITeQ. AITeQ is available at: https://github.com/ishtiaque-ahammad/AITeQ.

Citing Articles

Predicting Alzheimer's Cognitive Resilience Score: A Comparative Study of Machine Learning Models Using RNA-seq Data.

Kitani A, Matsui Y bioRxiv. 2024; .

PMID: 39253457 PMC: 11383294. DOI: 10.1101/2024.08.25.609610.

References
1.
Zhang F, Petersen M, Johnson L, Hall J, OBryant S . Recursive Support Vector Machine Biomarker Selection for Alzheimer's Disease. J Alzheimers Dis. 2021; 79(4):1691-1700. DOI: 10.3233/JAD-201254. View

2.
Imondi R, Wideman C, Kaprielian Z . Complementary expression of transmembrane ephrins and their receptors in the mouse spinal cord: a possible role in constraining the orientation of longitudinally projecting axons. Development. 2000; 127(7):1397-410. DOI: 10.1242/dev.127.7.1397. View

3.
Vadapalli S, Abdelhalim H, Zeeshan S, Ahmed Z . Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine. Brief Bioinform. 2022; 23(5). PMC: 10233311. DOI: 10.1093/bib/bbac191. View

4.
Feng Q, Ding Z . MRI Radiomics Classification and Prediction in Alzheimer's Disease and Mild Cognitive Impairment: A Review. Curr Alzheimer Res. 2020; 17(3):297-309. DOI: 10.2174/1567205017666200303105016. View

5.
LE H, Peng B, Uy J, Carrillo D, Zhang Y, Aevermann B . Machine learning for cell type classification from single nucleus RNA sequencing data. PLoS One. 2022; 17(9):e0275070. PMC: 9506651. DOI: 10.1371/journal.pone.0275070. View