» Articles » PMID: 38465982

Semisupervised Transfer Learning for Evaluation of Model Classification Performance

Overview
Journal Biometrics
Specialty Public Health
Date 2024 Mar 11
PMID 38465982
Authors
Affiliations
Soon will be listed here.
Abstract

In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.

Citing Articles

A framework for evaluating clinical artificial intelligence systems without ground-truth annotations.

Kiyasseh D, Cohen A, Jiang C, Altieri N Nat Commun. 2024; 15(1):1808.

PMID: 38418453 PMC: 10902352. DOI: 10.1038/s41467-024-46000-9.

References
1.
Steingrimsson J, Gatsonis C, Li B, Dahabreh I . Transporting a Prediction Model for Use in a New Target Population. Am J Epidemiol. 2022; 192(2):296-304. PMC: 11004796. DOI: 10.1093/aje/kwac128. View

2.
Xia Z, Secor E, Chibnik L, Bove R, Cheng S, Chitnis T . Modeling disease severity in multiple sclerosis using electronic health records. PLoS One. 2013; 8(11):e78927. PMC: 3823928. DOI: 10.1371/journal.pone.0078927. View

3.
Huang S, Huang J, Cai T, Dahal K, Cagan A, He Z . Impact of ICD10 and secular changes on electronic medical record rheumatoid arthritis algorithms. Rheumatology (Oxford). 2020; 59(12):3759-3766. PMC: 7733719. DOI: 10.1093/rheumatology/keaa198. View

4.
Liao K, Cai T, Gainer V, Goryachev S, Zeng-Treitler Q, Raychaudhuri S . Electronic medical records for discovery research in rheumatoid arthritis. Arthritis Care Res (Hoboken). 2010; 62(8):1120-7. PMC: 3121049. DOI: 10.1002/acr.20184. View

5.
Li B, Gatsonis C, Dahabreh I, Steingrimsson J . Estimating the area under the ROC curve when transporting a prediction model to a target population. Biometrics. 2022; 79(3):2382-2393. PMC: 10188769. DOI: 10.1111/biom.13796. View