Artificial Intelligence-based Mining of Electronic Health Record Data to Accelerate the Digital Transformation of the National Cardiovascular Ecosystem: Design Protocol of the CardioMining Study
Introduction: Mining of electronic health record (EHR) data is increasingly being implemented worldwide but focuses mainly on structured data. The capabilities of artificial intelligence (AI) could reverse the underuse of unstructured EHR data and enhance the quality of medical research and clinical care. This study aims to develop an AI-based model to transform unstructured EHR data into an organised, interpretable dataset and to form a national dataset of cardiac patients.
Methods and analysis: CardioMining is a retrospective, multicentre study based on large, longitudinal data obtained from unstructured EHRs of the largest tertiary hospitals in Greece. Demographics, hospital administrative data, medical history, medications, laboratory examinations, imaging reports, therapeutic interventions, in-hospital management and postdischarge instructions will be collected, coupled with structured prognostic data from the National Institute of Health. The target number of included patients is 100 000. Natural language processing techniques will facilitate data mining from the unstructured EHRs. The accuracy of the automated model will be compared with manual data extraction by study investigators. Machine learning tools will provide data analytics. CardioMining aims to cultivate the digital transformation of the national cardiovascular system and fill the gap in medical recording and big data analysis using validated AI techniques.
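As an illustration of how automated extraction might be compared with manual extraction, the following is a minimal sketch only. The protocol does not specify the comparison metric; per-field exact-match agreement is an assumption made here, and the record identifiers, field names and values are hypothetical.

```python
from collections import defaultdict


def field_agreement(manual, automated):
    """Return, per field, the fraction of records where the automated
    value exactly equals the manually extracted value.

    Both arguments map a record id to a dict of field -> value.
    Manual extraction is treated as the reference (an assumption).
    """
    matches = defaultdict(int)
    totals = defaultdict(int)
    for record_id, manual_fields in manual.items():
        auto_fields = automated.get(record_id, {})
        for field, manual_value in manual_fields.items():
            totals[field] += 1
            # A missing automated value counts as a disagreement.
            if field in auto_fields and auto_fields[field] == manual_value:
                matches[field] += 1
    return {field: matches[field] / totals[field] for field in totals}


# Hypothetical example records, not study data.
manual = {
    "p1": {"diabetes": True, "lvef_percent": 35},
    "p2": {"diabetes": False, "lvef_percent": 55},
}
automated = {
    "p1": {"diabetes": True, "lvef_percent": 35},
    "p2": {"diabetes": True, "lvef_percent": 55},
}

agreement = field_agreement(manual, automated)
print(agreement)
```

In practice, free-text fields would likely need a more tolerant comparison than exact equality, and measures such as precision and recall may be more informative for rare findings.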
Ethics and dissemination: This study will be conducted in keeping with the International Conference on Harmonisation Good Clinical Practice guidelines, the Declaration of Helsinki, the Data Protection Code of the European Data Protection Authority and the European General Data Protection Regulation. The Research Ethics Committee of the Aristotle University of Thessaloniki and the Scientific and Ethics Council of the AHEPA University Hospital have approved this study. Study findings will be disseminated through peer-reviewed medical journals and international conferences. International collaborations with other cardiovascular registries will be attempted.
Trial registration number: NCT05176769.