» Articles » PMID: 39004922

Detecting Outliers in Case-control Cohorts for Improving Deep Learning Networks on Schizophrenia Prediction

Overview
Specialty Biology
Date 2024 Jul 15
PMID 39004922
Authors
Affiliations
Soon will be listed here.
Abstract

This study delves into the intricate genetic and clinical aspects of Schizophrenia, a complex mental disorder with uncertain etiology. Deep Learning (DL) holds promise for analyzing large genomic datasets to uncover new risk factors. However, based on reports of non-negligible misdiagnosis rates for SCZ, case-control cohorts may contain outlying genetic profiles, hindering compelling performances of classification models. The research employed a case-control dataset sourced from the Swedish populace. A gene-annotation-based DL architecture was developed and employed in two stages. First, the model was trained on the entire dataset to highlight differences between cases and controls. Then, samples likely to be misclassified were excluded, and the model was retrained on the refined dataset for performance evaluation. The results indicate that SCZ prevalence and misdiagnosis rates can affect case-control cohorts, potentially compromising future studies reliant on such datasets. However, by detecting and filtering outliers, the study demonstrates the feasibility of adapting DL methodologies to large-scale biological problems, producing results more aligned with existing heritability estimates for SCZ. This approach not only advances the comprehension of the genetic background of SCZ but also opens doors for adapting DL techniques in complex research for precision medicine in mental health.

References
1.
Fan Y, Abrahamsen G, Mills R, Calderon C, Tee J, Leyton L . Focal adhesion dynamics are altered in schizophrenia. Biol Psychiatry. 2013; 74(6):418-26. DOI: 10.1016/j.biopsych.2013.01.020. View

2.
Closson K, McLinden T, Patterson T, Eyawo O, Kibel M, Card K . HIV, schizophrenia, and all-cause mortality: A population-based cohort study of individuals accessing universal medical care from 1998 to 2012 in British Columbia, Canada. Schizophr Res. 2019; 209:198-205. DOI: 10.1016/j.schres.2019.04.020. View

3.
McGrath J, Saha S, Chant D, Welham J . Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev. 2008; 30:67-76. DOI: 10.1093/epirev/mxn001. View

4.
Liao Y, Wang J, Jaehnig E, Shi Z, Zhang B . WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019; 47(W1):W199-W205. PMC: 6602449. DOI: 10.1093/nar/gkz401. View

5.
Mi H, Muruganujan A, Casagrande J, Thomas P . Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 2013; 8(8):1551-66. PMC: 6519453. DOI: 10.1038/nprot.2013.092. View