
Prediction of Incident Myocardial Infarction Using Machine Learning Applied to Harmonized Electronic Health Record Data

Overview
Publisher Biomed Central
Date 2020 Oct 3
PMID 33008368
Citations 12
Abstract

Background: With cardiovascular disease increasing, substantial research has focused on the development of prediction tools. We compare deep learning and machine learning models to a baseline logistic regression using only 'known' risk factors in predicting incident myocardial infarction (MI) from harmonized EHR data.

Methods: This was a large-scale case-control study with an outcome of 6-month incident MI, performed on 2.27 million patients within the UCHealth system. The models used the top 800 of an initial 52 k procedures, diagnoses, and medications, harmonized to the Observational Medical Outcomes Partnership common data model. We compared several over- and under-sampling techniques to address the class imbalance in the dataset. We compared regularized logistic regression, random forest, gradient boosting machines, and shallow and deep neural networks. A baseline model for comparison was a logistic regression using a limited set of 'known' risk factors for MI. Hyper-parameters were identified using 10-fold cross-validation.
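As a rough illustration of two steps named in the Methods, the sketch below shows random undersampling of the majority class followed by a 10-fold split. It is not the study's code: the 1:1 class ratio, the toy labels, and the helper names `random_undersample` and `k_fold_indices` are assumptions made for the example, and the abstract does not say whether resampling was applied inside or outside the cross-validation folds.

```python
import random

def random_undersample(labels, seed=0):
    """Keep every positive row and an equal-size random sample of negatives.

    The 1:1 target ratio is an assumption; the abstract does not state one.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    kept_neg = rng.sample(neg, k=min(len(pos), len(neg)))
    return sorted(pos + kept_neg)

def k_fold_indices(n, k=10, seed=0):
    """Shuffle positions 0..n-1 and deal them into k disjoint folds."""
    rng = random.Random(seed)
    order = list(range(n))
    rng.shuffle(order)
    return [order[i::k] for i in range(k)]

# Toy data (hypothetical): 20 cases among 2000 patients.
labels = [1] * 20 + [0] * 1980
kept = random_undersample(labels)
folds = k_fold_indices(len(kept), k=10)

# Each fold serves once as the validation set; the rest form the training set.
for held_out in folds:
    train = [p for fold in folds if fold is not held_out for p in fold]
```

Here `kept` holds row indices into the original data, while each fold holds positions within `kept`. A hyper-parameter setting would be scored by averaging a metric over the ten held-out folds.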

Results: A total of 20,591 patients were diagnosed with MI, compared with 2.25 million who were not. A deep neural network with random undersampling provided superior classification compared with other methods. However, its benefit over a logistic regression model using only 'known' risk factors was only moderate; the deep neural network showed an F1 score of 0.092 and an AUC of 0.835. Calibration for all models was poor despite adequate discrimination, due to overfitting from the low frequency of the event of interest.
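A low F1 score alongside an adequate AUC is typical when the outcome is rare, because precision depends on prevalence while sensitivity and specificity do not. The sketch below works through that arithmetic. The operating point (sensitivity and specificity of 0.75) is a hypothetical choice for illustration, not a value reported by the study; only the case count and the approximate cohort size come from the abstract.

```python
def f1_from_rates(sensitivity, specificity, prevalence):
    """F1 score implied by a classifier's sensitivity and specificity
    when applied to a population with the given outcome prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    precision = true_pos / (true_pos + false_pos)
    return 2 * precision * sensitivity / (precision + sensitivity)

# Counts from the abstract: 20,591 MI cases among roughly 2.27 million patients.
prevalence = 20591 / 2_270_000

# Hypothetical operating point, assumed for illustration only.
f1_rare = f1_from_rates(0.75, 0.75, prevalence)
f1_balanced = f1_from_rates(0.75, 0.75, 0.5)

print(f"prevalence: {prevalence:.4f}")
print(f"F1 at study prevalence: {f1_rare:.3f}")
print(f"F1 at 50% prevalence:   {f1_balanced:.3f}")
```

With the same sensitivity and specificity, F1 falls sharply as prevalence drops from 50% to under 1%, since false positives from the large negative class swamp the true positives.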

Conclusions: Our study suggests that a deep neural network (DNN) may not offer substantial benefit when trained on harmonized data, compared to traditional methods using established risk factors for MI.

Citing Articles

Machine learning based prediction models for cardiovascular disease risk using electronic health records data: systematic review and meta-analysis.

Liu T, Krentz A, Lu L, Curcin V. Eur Heart J Digit Health. 2025; 6(1):7-22.

PMID: 39846062 PMC: 11750195. DOI: 10.1093/ehjdh/ztae080.


Anatomy-Informed Multimodal Learning for Myocardial Infarction Prediction.

Sievering I, Senouf O, Mahendiran T, Nanchen D, Fournier S, Muller O. IEEE Open J Eng Med Biol. 2024; 5:837-845.

PMID: 39559783 PMC: 11573417. DOI: 10.1109/OJEMB.2024.3403948.


A Comparison of Interpretable Machine Learning Approaches to Identify Outpatient Clinical Phenotypes Predictive of First Acute Myocardial Infarction.

Hodgman M, Minoccheri C, Mathis M, Wittrup E, Najarian K. Diagnostics (Basel). 2024; 14(16).

PMID: 39202229 PMC: 11353976. DOI: 10.3390/diagnostics14161741.


Pitfalls in Developing Machine Learning Models for Predicting Cardiovascular Diseases: Challenge and Solutions.

Cai Y, Gong D, Tang L, Cai Y, Li H, Jing T. J Med Internet Res. 2024; 26:e47645.

PMID: 38869157 PMC: 11316160. DOI: 10.2196/47645.


The Role of Artificial Intelligence in Improving Patient Outcomes and Future of Healthcare Delivery in Cardiology: A Narrative Review of the Literature.

Gala D, Behl H, Shah M, Makaryus A. Healthcare (Basel). 2024; 12(4).

PMID: 38391856 PMC: 10887513. DOI: 10.3390/healthcare12040481.

