» Articles » PMID: 34098248

Biased Accuracy in Multisite Machine-learning Studies Due to Incomplete Removal of the Effects of the Site

Overview
Publisher Elsevier
Date 2021 Jun 7
PMID 34098248
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Brain MRI researchers conducting multisite studies, such as within the ENIGMA Consortium, are very aware of the importance of controlling the effects of the site (EoS) in the statistical analysis. Conversely, authors of the novel machine-learning MRI studies may remove the EoS when training the machine-learning models but not control them when estimating the models' accuracy, potentially leading to severely biased estimates. We show examples from a toy simulation study and real MRI data in which we remove the EoS from both the "training set" and the "test set" during the training and application of the model. However, the accuracy is still inflated (or occasionally shrunk) unless we further control the EoS during the estimation of the accuracy. We also provide several methods for controlling the EoS during the estimation of the accuracy, and a simple R package ("multisite.accuracy") that smoothly does this task for several accuracy estimates (e.g., sensitivity/specificity, area under the curve, correlation, hazard ratio, etc.).

Citing Articles

Classification of Major Depressive Disorder Using Vertex-Wise Brain Sulcal Depth, Curvature, and Thickness with a Deep and a Shallow Learning Model.

Goya-Maldonado R, Erwin-Grabner T, Zeng L, Ching C, Aleman A, Amod A ArXiv. 2025; .

PMID: 39975425 PMC: 11838705.


Shortcut learning in medical AI hinders generalization: method for estimating AI model generalization without external data.

Ly C, Unnikrishnan B, Tadic T, Patel T, Duhamel J, Kandel S NPJ Digit Med. 2024; 7(1):124.

PMID: 38744921 PMC: 11094145. DOI: 10.1038/s41746-024-01118-4.


PsiOvi Staging Model for Schizophrenia (PsiOvi SMS): A New Internet Tool for Staging Patients with Schizophrenia.

Martinez-Cao C, Sanchez-Lasheras F, Garcia-Fernandez A, Gonzalez-Blanco L, Zurron-Madera P, Saiz P Eur Psychiatry. 2024; 67(1):e36.

PMID: 38599765 PMC: 11059252. DOI: 10.1192/j.eurpsy.2024.17.


Multi-site benchmark classification of major depressive disorder using machine learning on cortical and subcortical measures.

Belov V, Erwin-Grabner T, Aghajani M, Aleman A, Amod A, Basgoze Z Sci Rep. 2024; 14(1):1084.

PMID: 38212349 PMC: 10784593. DOI: 10.1038/s41598-023-47934-8.


Electronic health records and stratified psychiatry: bridge to precision treatment?.

Grzenda A, Widge A Neuropsychopharmacology. 2023; 49(1):285-290.

PMID: 37667021 PMC: 10700348. DOI: 10.1038/s41386-023-01724-y.