» Articles » PMID: 39637040

Machine-learning-based Identification of Patients with IgA Nephropathy Using a Computerized Medical Billing Database

Overview
Journal PLoS One
Date 2024 Dec 5
PMID 39637040
Authors
Affiliations
Soon will be listed here.
Abstract

The billing database of the universal healthcare system in Japan potentially includes large-cohort data of patients with immunoglobulin A nephropathy, diagnosis codes aimed at billing should not be directly used for clinical research because of the risk of misdiagnosis. To solve this problem, we aimed to develop a novel method for identifying patients with immunoglobulin A nephropathy from billing data using machine learning. The medical records and bills of 3,743 patients who consulted nephrologists at a single center were extracted. Patients were labeled to have been diagnosed with immunoglobulin A nephropathy through a review of medical records. A manual analysis of the diagnostic accuracy and machine learning was performed. For machine learning, the datasets were preprocessed in three patterns and assigned to the XGBoost program using five-fold cross-validation. Of all the participants, 437 were labeled as having been diagnosed with immunoglobulin A nephropathy. Bill codes for immunoglobulin A nephropathy were provided to approximately half of them. The manually created criteria consisting of the recommended examinations and treatments in the Japanese guidelines for immunoglobulin A nephropathy showed both specificity and sensitivity < 0.8. In contrast, with the receiver operating characteristic curve analysis, the machine learning process yielded area under the curve values over 0.9 with preprocessing from the clinical viewpoint. Applying machine learning technology to a dataset preprocessed from a clinical viewpoint achieved a high performance in detecting patients with immunoglobulin A nephropathy. This methodology contributes to the construction of a disease-specific cohort using big bill data.

References
1.
Koyner J, Carey K, Edelson D, Churpek M . The Development of a Machine Learning Inpatient Acute Kidney Injury Prediction Model. Crit Care Med. 2018; 46(7):1070-1077. DOI: 10.1097/CCM.0000000000003123. View

2.
Koyama A, Igarashi M, Kobayashi M . Natural history and risk factors for immunoglobulin A nephropathy in Japan. Research Group on Progressive Renal Diseases. Am J Kidney Dis. 1997; 29(4):526-32. DOI: 10.1016/s0272-6386(97)90333-4. View

3.
Lurie N, Popkin M, Dysken M, Moscovice I, Finch M . Accuracy of diagnoses of schizophrenia in Medicaid claims. Hosp Community Psychiatry. 1992; 43(1):69-71. DOI: 10.1176/ps.43.1.69. View

4.
McGrogan A, Franssen C, de Vries C . The incidence of primary glomerulonephritis worldwide: a systematic review of the literature. Nephrol Dial Transplant. 2010; 26(2):414-30. DOI: 10.1093/ndt/gfq665. View

5.
Deo R . Machine Learning in Medicine. Circulation. 2015; 132(20):1920-30. PMC: 5831252. DOI: 10.1161/CIRCULATIONAHA.115.001593. View