
Healthcare Data Quality Assessment for Improving the Quality of the Korea Biobank Network

Overview
Journal PLoS One
Date 2023 Nov 20
PMID 37983215
Abstract

Numerous studies make extensive use of healthcare data, including human materials and clinical information, and acknowledge its significance. However, limitations in data collection methods can impair the quality of healthcare data obtained from multiple institutions. Securing high-quality data related to human materials therefore requires research focused on data quality. This study validated the quality of data collected in 2020 from 16 institutions constituting the Korea Biobank Network using 104 validation rules. The validation rules were developed based on the DQ4HEALTH model and were divided into four dimensions: completeness, validity, accuracy, and uniqueness. The Korea Biobank Network collects and manages human materials and clinical information from multiple biobanks and is developing a common data model for data integration. The data quality verification revealed an error rate of 0.74%. Furthermore, the data from each institution were analyzed to examine the relationship between the institution's characteristics and its error count. A chi-square test indicated that there was an independent correlation between each institution and its error count. To confirm this correlation between error counts and the characteristics of each institution, a correlation analysis was conducted. The results, shown in a graph, revealed the relationship between the error count and the factors that had high correlation coefficients. The findings suggest that the data quality was impacted by biases in the evaluation system, including the institution's IT environment, infrastructure, and the number of collected samples. These results highlight the need to consider the scalability of research quality when evaluating clinical epidemiological information linked to human materials in future validation studies of data quality.
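As a rough illustration of how rule-based validation across the four dimensions can yield an error rate, the following minimal Python sketch applies one made-up rule per dimension to a few made-up records. The field names (`sample_id`, `sex`, `birth_date`, `collection_date`), the rules themselves, and the definition of error rate as failed checks divided by total checks are all assumptions for illustration. The abstract does not list the study's 104 rules or state how its 0.74% figure was computed.

```python
from collections import Counter


def build_rules(records):
    """Return (dimension, check) pairs; each check is True when a record passes.

    All rules and field names here are hypothetical examples, one per dimension.
    """
    id_counts = Counter(r["sample_id"] for r in records)
    return [
        # Completeness: a required field must not be empty.
        ("completeness", lambda r: r["sample_id"] != ""),
        # Validity: a coded field must use an allowed value.
        ("validity", lambda r: r["sex"] in {"M", "F"}),
        # Accuracy: dates must be plausible (ISO strings compare chronologically).
        ("accuracy", lambda r: r["birth_date"] <= r["collection_date"]),
        # Uniqueness: an identifier must not be repeated.
        ("uniqueness", lambda r: id_counts[r["sample_id"]] == 1),
    ]


def validate(records):
    """Apply every rule to every record; return (errors per dimension, total checks)."""
    rules = build_rules(records)
    errors = Counter()
    total = 0
    for record in records:
        for dimension, passes in rules:
            total += 1
            if not passes(record):
                errors[dimension] += 1
    return dict(errors), total


# Made-up records, not data from the Korea Biobank Network.
records = [
    {"sample_id": "S1", "sex": "M", "birth_date": "1970-01-01", "collection_date": "2020-03-01"},
    {"sample_id": "S2", "sex": "X", "birth_date": "1980-05-05", "collection_date": "2020-04-01"},
    {"sample_id": "S2", "sex": "F", "birth_date": "2021-01-01", "collection_date": "2020-06-01"},
    {"sample_id": "", "sex": "F", "birth_date": "1990-01-01", "collection_date": "2020-07-01"},
]

errors_by_dimension, total_checks = validate(records)
# Assumed definition: failed checks divided by all checks performed.
error_rate = sum(errors_by_dimension.values()) / total_checks

print(errors_by_dimension)
print(f"error rate: {error_rate:.2%}")
```

Counting errors per dimension as well as overall makes it possible to compare institutions on the type of error, not only on the total, which is the kind of breakdown a per-institution analysis would need.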
