PMID: 27014709

Data Extraction and Management in Networks of Observational Health Care Databases for Scientific Research: A Comparison of EU-ADR, OMOP, Mini-Sentinel and MATRICE Strategies

Abstract

Introduction: We see increased use of existing observational data in order to achieve fast and transparent production of empirical evidence in health care research. Multiple databases are often used to increase power, to assess rare exposures or outcomes, or to study diverse populations. For privacy and sociological reasons, original data on individual subjects cannot be shared, which requires a distributed network approach in which data processing is performed locally before any data are shared.

Case Descriptions And Variation Among Sites: We created a conceptual framework distinguishing three steps in local data processing: (1) data reorganization into a data structure common across the network; (2) derivation of study variables not present in original data; and (3) application of study design to transform longitudinal data into aggregated data sets for statistical analysis. We applied this framework to four case studies to identify similarities and differences in the United States and Europe: Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge (EU-ADR), Observational Medical Outcomes Partnership (OMOP), the Food and Drug Administration's (FDA's) Mini-Sentinel, and the Italian network, the Integration of Content Management Information on the Territory of Patients with Complex Diseases or with Chronic Conditions (MATRICE).
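As an illustration only, the three local processing steps could be sketched in Python as follows. Nothing here comes from the networks themselves: the record layout, the field names, the `DrugEvent` structure, and the toy "NSAID user" variable are all assumptions made for the example, and none of the four networks is claimed to implement its procedures this way.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date

# Hypothetical local records; the field names and values are illustrative assumptions.
LOCAL_ROWS = [
    {"pid": "a1", "atc": "M01AE01", "disp_date": "2012-01-10"},
    {"pid": "a1", "atc": "M01AE01", "disp_date": "2012-03-02"},
    {"pid": "b2", "atc": "C10AA05", "disp_date": "2012-02-15"},
]


@dataclass(frozen=True)
class DrugEvent:
    """Hypothetical common structure shared by all sites in the network."""
    person_id: str
    drug_code: str
    event_date: date


def reorganize(rows):
    """Step 1: map local rows into the structure common across the network."""
    return [
        DrugEvent(r["pid"], r["atc"], date.fromisoformat(r["disp_date"]))
        for r in rows
    ]


def derive_nsaid_users(events):
    """Step 2: derive a study variable that is not present in the original data.

    Assumption: a person counts as an NSAID user if any drug code starts
    with the ATC prefix M01A.
    """
    return {e.person_id for e in events if e.drug_code.startswith("M01A")}


def aggregate(events, nsaid_users):
    """Step 3: apply a toy study design, reducing longitudinal events to counts."""
    persons = {e.person_id for e in events}
    return Counter(
        "nsaid_user" if p in nsaid_users else "non_user" for p in persons
    )


events = reorganize(LOCAL_ROWS)
summary = aggregate(events, derive_nsaid_users(events))
print(dict(summary))
```

In this sketch only `summary`, the aggregated counts, would leave the site; the person-level `events` stay local, which mirrors the distributed approach described in the Introduction.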

Findings: National networks (OMOP, Mini-Sentinel, MATRICE) all adopted shared procedures for local data reorganization. The multinational EU-ADR network needed locally defined procedures to reorganize its heterogeneous data into a common structure. Derivation of new data elements was centrally defined in all networks but the procedure was not shared in EU-ADR. Application of study design was a common and shared procedure in all the case studies. Computer procedures were embodied in different programming languages, including SAS, R, SQL, Java, and C++.

Conclusion: Using our conceptual framework we found several areas that would benefit from research to identify optimal standards for production of empirical knowledge from existing databases.

Citing Articles

Cancer chemotherapy in pregnancy and adverse pediatric outcomes: a population-based cohort study.

Metcalfe A, Cairncross Z, McMorris C, Friedenreich C, Nelson G, Bhatti P. J Natl Cancer Inst. 2024; 117(3):554-561.

PMID: 39475425 PMC: 11884850. DOI: 10.1093/jnci/djae273.


MENDS-on-FHIR: leveraging the OMOP common data model and FHIR standards for national chronic disease surveillance.

Essaid S, Andre J, Brooks I, Hohman K, Hull M, Jackson S. JAMIA Open. 2024; 7(2):ooae045.

PMID: 38818114 PMC: 11137321. DOI: 10.1093/jamiaopen/ooae045.


MENDS-on-FHIR: Leveraging the OMOP common data model and FHIR standards for national chronic disease surveillance.

Essaid S, Andre J, Brooks I, Hohman K, Hull M, Jackson S. medRxiv. 2023.

PMID: 38045364 PMC: 10690355. DOI: 10.1101/2023.08.09.23293900.


Impact of 2018 EU Risk Minimisation Measures and Revised Pregnancy Prevention Programme on Utilisation and Prescribing Trends of Medicinal Products Containing Valproate: An Interrupted Time Series Study.

Abtahi S, Pajouheshnia R, Duran C, Riera-Arnau J, Gamba M, Alsina E. Drug Saf. 2023; 46(7):689-702.

PMID: 37294532 PMC: 10252161. DOI: 10.1007/s40264-023-01314-3.


Long-term Mortality in Individuals Diagnosed With Cancer During Pregnancy or Postpartum.

Cairncross Z, Shack L, Nelson G, Friedenreich C, Ray J, Fell D. JAMA Oncol. 2023; 9(6):791-799.

PMID: 37022714 PMC: 10080404. DOI: 10.1001/jamaoncol.2023.0339.

