» Articles » PMID: 37794003

Scalable Infrastructure Supporting Reproducible Nationwide Healthcare Data Analysis Toward FAIR Stewardship

Overview
Journal Sci Data
Specialty Science
Date 2023 Oct 4
PMID 37794003
Authors
Affiliations
Soon will be listed here.
Abstract

Transparent and FAIR disclosure of meta-information about healthcare data and infrastructure is essential but has not been well publicized. In this paper, we provide a transparent disclosure of the process of standardizing a common data model and developing a national data infrastructure using national claims data. We established an Observational Medical Outcome Partnership (OMOP) common data model database for national claims data of the Health Insurance Review and Assessment Service of South Korea. To introduce a data openness policy, we built a distributed data analysis environment and released metadata based on the FAIR principle. A total of 10,098,730,241 claims and 56,579,726 patients' data were converted as OMOP common data model. We also built an analytics environment for distributed research and made the metadata publicly available. Disclosure of this infrastructure to researchers will help to eliminate information inequality and contribute to the generation of high-quality medical evidence.

Citing Articles

Semantic enrichment of Pomeranian health study data using LOINC and WHO-FIC terminology mapping principles.

Inau E, Radke D, Bird L, Westphal S, Ittermann T, Schafer C JAMIA Open. 2025; 8(2):ooaf010.

PMID: 40059976 PMC: 11884810. DOI: 10.1093/jamiaopen/ooaf010.


Long-term use of proton pump inhibitors was associated with rapid progression to end stage kidney disease in a Korean nationwide study.

Lee Y, Kim J, Yu D, Je N, Rhee H Sci Rep. 2024; 14(1):31477.

PMID: 39733180 PMC: 11682415. DOI: 10.1038/s41598-024-83321-7.


Hospital Length of Stay Prediction for Planned Admissions Using Observational Medical Outcomes Partnership Common Data Model: Retrospective Study.

Lee H, Kim S, Moon H, Lee H, Kim K, Jung S J Med Internet Res. 2024; 26:e59260.

PMID: 39576284 PMC: 11624451. DOI: 10.2196/59260.


One-year post-acute COVID-19 syndrome and mortality in South Korea: a nationwide matched cohort study using claims data.

Won J, Hong Y, Kim S, Lee H Front Public Health. 2024; 12:1403153.

PMID: 39050601 PMC: 11266063. DOI: 10.3389/fpubh.2024.1403153.


Integrated Real-World Data Warehouses Across 7 Evolving Asian Health Care Systems: Scoping Review.

Shau W, Santoso H, Jip V, Setia S J Med Internet Res. 2024; 26:e56686.

PMID: 38749399 PMC: 11200047. DOI: 10.2196/56686.


References
1.
Schneeweiss S . Learning from big health care data. N Engl J Med. 2014; 370(23):2161-3. DOI: 10.1056/NEJMp1401111. View

2.
Chan You S, Krumholz H . The Evolution of Evidence-Based Medicine: When the Magic of the Randomized Clinical Trial Meets Real-World Data. Circulation. 2022; 145(2):107-109. DOI: 10.1161/CIRCULATIONAHA.121.057931. View

3.
Lo-Ciganic W, Donohue J, Yang Q, Huang J, Chang C, Weiss J . Developing and validating a machine-learning algorithm to predict opioid overdose in Medicaid beneficiaries in two US states: a prognostic modelling study. Lancet Digit Health. 2022; 4(6):e455-e465. PMC: 9236281. DOI: 10.1016/S2589-7500(22)00062-0. View

4.
Nosrati E . Harnessing administrative data to study health inequality. Lancet Public Health. 2022; 7(9):e726-e727. DOI: 10.1016/S2468-2667(22)00172-4. View

5.
Portuondo J, Harris A, Massarweh N . Using Administrative Codes to Measure Health Care Quality. JAMA. 2022; 328(9):825-826. DOI: 10.1001/jama.2022.12823. View