» Articles » PMID: 26792494

Machine Learning Derived Risk Prediction of Anorexia Nervosa

Overview
Publisher Biomed Central
Specialty Genetics
Date 2016 Jan 22
PMID 26792494
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Anorexia nervosa (AN) is a complex psychiatric disease with a moderate to strong genetic contribution. In addition to conventional genome wide association (GWA) studies, researchers have been using machine learning methods in conjunction with genomic data to predict risk of diseases in which genetics play an important role.

Methods: In this study, we collected whole genome genotyping data on 3940 AN cases and 9266 controls from the Genetic Consortium for Anorexia Nervosa (GCAN), the Wellcome Trust Case Control Consortium 3 (WTCCC3), Price Foundation Collaborative Group and the Children's Hospital of Philadelphia (CHOP), and applied machine learning methods for predicting AN disease risk. The prediction performance is measured by area under the receiver operating characteristic curve (AUC), indicating how well the model distinguishes cases from unaffected control subjects.

Results: Logistic regression model with the lasso penalty technique generated an AUC of 0.693, while Support Vector Machines and Gradient Boosted Trees reached AUC's of 0.691 and 0.623, respectively. Using different sample sizes, our results suggest that larger datasets are required to optimize the machine learning models and achieve higher AUC values.

Conclusions: To our knowledge, this is the first attempt to assess AN risk based on genome wide genotype level data. Future integration of genomic, environmental and family-based information is likely to improve the AN risk evaluation process, eventually benefitting AN patients and families in the clinical setting.

Citing Articles

A Systematic Review of Genetics- and Molecular-Pathway-Based Machine Learning Models for Neurological Disorder Diagnosis.

Aljarallah N, Dutta A, Sait A Int J Mol Sci. 2024; 25(12).

PMID: 38928128 PMC: 11203850. DOI: 10.3390/ijms25126422.


Inclusion of the severe and enduring anorexia nervosa phenotype in genetics research: a scoping review.

Ramsay S, Allison K, Temples H, Boccuto L, Sarasua S J Eat Disord. 2024; 12(1):53.

PMID: 38685102 PMC: 11059621. DOI: 10.1186/s40337-024-01009-9.


Deep Learning Framework for Complex Disease Risk Prediction Using Genomic Variations.

Alzoubi H, Alzubi R, Ramzan N Sensors (Basel). 2023; 23(9).

PMID: 37177642 PMC: 10181706. DOI: 10.3390/s23094439.


Co-simulation of human digital twins and wearable inertial sensors to analyse gait event estimation.

Uhlenberg L, Derungs A, Amft O Front Bioeng Biotechnol. 2023; 11:1104000.

PMID: 37122859 PMC: 10132030. DOI: 10.3389/fbioe.2023.1104000.


A Review of Machine Learning and Deep Learning Approaches on Mental Health Diagnosis.

Iyortsuun N, Kim S, Jhon M, Yang H, Pant S Healthcare (Basel). 2023; 11(3).

PMID: 36766860 PMC: 9914523. DOI: 10.3390/healthcare11030285.


References
1.
Cook N . Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007; 115(7):928-35. DOI: 10.1161/CIRCULATIONAHA.106.672402. View

2.
Glessner J, Wang K, Cai G, Korvatska O, Kim C, Wood S . Autism genome-wide copy number variation reveals ubiquitin and neuronal genes. Nature. 2009; 459(7246):569-73. PMC: 2925224. DOI: 10.1038/nature07953. View

3.
Godart N, Flament M, Perdereau F, Jeammet P . Comorbidity between eating disorders and anxiety disorders: a review. Int J Eat Disord. 2002; 32(3):253-70. DOI: 10.1002/eat.10096. View

4.
Kaye W, Lilenfeld L, Berrettini W, Strober M, Devlin B, Klump K . A search for susceptibility loci for anorexia nervosa: methods and sample description. Biol Psychiatry. 2000; 47(9):794-803. DOI: 10.1016/s0006-3223(99)00240-1. View

5.
Wei Z, Wang K, Qu H, Zhang H, Bradfield J, Kim C . From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes. PLoS Genet. 2009; 5(10):e1000678. PMC: 2748686. DOI: 10.1371/journal.pgen.1000678. View