» Articles » PMID: 37012018

Artificial Intelligence-based Mining of Electronic Health Record Data to Accelerate the Digital Transformation of the National Cardiovascular Ecosystem: Design Protocol of the CardioMining Study

Abstract

Introduction: Mining of electronic health record (EHRs) data is increasingly being implemented all over the world but mainly focuses on structured data. The capabilities of artificial intelligence (AI) could reverse the underusage of unstructured EHR data and enhance the quality of medical research and clinical care. This study aims to develop an AI-based model to transform unstructured EHR data into an organised, interpretable dataset and form a national dataset of cardiac patients.

Methods And Analysis: CardioMining is a retrospective, multicentre study based on large, longitudinal data obtained from unstructured EHRs of the largest tertiary hospitals in Greece. Demographics, hospital administrative data, medical history, medications, laboratory examinations, imaging reports, therapeutic interventions, in-hospital management and postdischarge instructions will be collected, coupled with structured prognostic data from the National Institute of Health. The target number of included patients is 100 000. Natural language processing techniques will facilitate data mining from the unstructured EHRs. The accuracy of the automated model will be compared with the manual data extraction by study investigators. Machine learning tools will provide data analytics. CardioMining aims to cultivate the digital transformation of the national cardiovascular system and fill the gap in medical recording and big data analysis using validated AI techniques.

Ethics And Dissemination: This study will be conducted in keeping with the International Conference on Harmonisation Good Clinical Practice guidelines, the Declaration of Helsinki, the Data Protection Code of the European Data Protection Authority and the European General Data Protection Regulation. The Research Ethics Committee of the Aristotle University of Thessaloniki and Scientific and Ethics Council of the AHEPA University Hospital have approved this study. Study findings will be disseminated through peer-reviewed medical journals and international conferences. International collaborations with other cardiovascular registries will be attempted.

Trial Registration Number: NCT05176769.

Citing Articles

A Tutorial and Use Case Example of the eXtreme Gradient Boosting (XGBoost) Artificial Intelligence Algorithm for Drug Development Applications.

Wiens M, Verone-Boyle A, Henscheid N, Podichetty J, Burton J Clin Transl Sci. 2025; 18(3):e70172.

PMID: 40067353 PMC: 11895769. DOI: 10.1111/cts.70172.


Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics.

Johann T, Otte K, Prasser F, Dieterich C Eur Heart J Digit Health. 2025; 6(1):147-154.

PMID: 39846076 PMC: 11750188. DOI: 10.1093/ehjdh/ztae083.


Integrating the Polysocial Risk Score: Enhancing Comprehensive Healthcare Delivery.

Chang R, Amin Z, Cheema N, Yousefzai S, Gardezi K, Shah A Methodist Debakey Cardiovasc J. 2024; 20(5):89-97.

PMID: 39525375 PMC: 11546332. DOI: 10.14797/mdcvj.1479.


Integrating Omics Data and AI for Cancer Diagnosis and Prognosis.

Ozaki Y, Broughton P, Abdollahi H, Valafar H, Blenda A Cancers (Basel). 2024; 16(13).

PMID: 39001510 PMC: 11240413. DOI: 10.3390/cancers16132448.


Effectiveness of early pharmaceutical interventions in symptomatic COVID-19 patients: A randomized clinical trial.

Azhar S, Akram J, Latif W, Ibanez N, Mumtaz S, Rafi A Pak J Med Sci. 2024; 40(5):800-810.

PMID: 38827854 PMC: 11140354. DOI: 10.12669/pjms.40.5.8757.


References
1.
Hashir M, Sawhney R . Towards unstructured mortality prediction with free-text clinical notes. J Biomed Inform. 2020; 108:103489. DOI: 10.1016/j.jbi.2020.103489. View

2.
Deo R . Machine Learning in Medicine. Circulation. 2015; 132(20):1920-30. PMC: 5831252. DOI: 10.1161/CIRCULATIONAHA.115.001593. View

3.
Chary M, Parikh S, Manini A, Boyer E, Radeos M . A Review of Natural Language Processing in Medical Education. West J Emerg Med. 2019; 20(1):78-86. PMC: 6324711. DOI: 10.5811/westjem.2018.11.39725. View

4.
Nasir K, DeFilippis A . Big Data and ASCVD Risk Prediction: Building a Better Mouse Trap?. J Am Coll Cardiol. 2022; 79(12):1167-1169. DOI: 10.1016/j.jacc.2022.01.020. View

5.
Mehta N, Devarakonda M . Machine learning, natural language programming, and electronic health records: The next step in the artificial intelligence journey?. J Allergy Clin Immunol. 2018; 141(6):2019-2021.e1. DOI: 10.1016/j.jaci.2018.02.025. View