» Articles » PMID: 39281737

A Common Longitudinal Intensive Care Unit Data Format (CLIF) to Enable Multi-institutional Federated Critical Illness Research

Abstract

Background: Critical illness, or acute organ failure requiring life support, threatens over five million American lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness. However, data management, security, and standardization are barriers to large-scale critical illness EHR studies.

Methods: A consortium of critical care physicians and data scientists from eight US healthcare systems developed the Common Longitudinal Intensive Care Unit (ICU) data Format (CLIF), an open-source database format that harmonizes a minimum set of ICU Data Elements for use in critical illness research. We created a pipeline to process adult ICU EHR data at each site. After development and iteration, we conducted two proof-of-concept studies with a federated research architecture: 1) an external validation of an in-hospital mortality prediction model for critically ill patients and 2) an assessment of 72-hour temperature trajectories and their association with mechanical ventilation and in-hospital mortality using group-based trajectory models.

Results: We converted longitudinal data from 94,356 critically ill patients treated in 2020-2021 (mean age 60.6 years [standard deviation 17.2], 30% Black, 7% Hispanic, 45% female) across 8 health systems and 33 hospitals into the CLIF format, The in-hospital mortality prediction model performed well in the health system where it was derived (0.81 AUC, 0.06 Brier score). Performance across CLIF consortium sites varied (AUCs: 0.74-0.83, Brier scores: 0.06-0.01), and demonstrated some degradation in predictive capability. Temperature trajectories were similar across health systems. Hypothermic and hyperthermic-slow-resolver patients consistently had the highest mortality.

Conclusions: CLIF facilitates efficient, rigorous, and reproducible critical care research. Our federated case studies showcase CLIF's potential for disease sub-phenotyping and clinical decision-support evaluation. Future applications include pragmatic EHR-based trials, target trial emulations, foundational multi-modal AI models of critical illness, and real-time critical care quality dashboards.

References
1.
Benzoni N, Carey K, Bewley A, Klaus J, Fuller B, Edelson D . Temperature Trajectory Subphenotypes in Oncology Patients with Neutropenia and Suspected Infection. Am J Respir Crit Care Med. 2022; 207(10):1300-1309. PMC: 10595453. DOI: 10.1164/rccm.202205-0920OC. View

2.
Miller W, Han X, Peek M, Ashana D, Parker W . Accuracy of the Sequential Organ Failure Assessment Score for In-Hospital Mortality by Race and Relevance to Crisis Standards of Care. JAMA Netw Open. 2021; 4(6):e2113891. PMC: 8214156. DOI: 10.1001/jamanetworkopen.2021.13891. View

3.
Pollard T, Johnson A, Raffa J, Celi L, Mark R, Badawi O . The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Sci Data. 2018; 5:180178. PMC: 6132188. DOI: 10.1038/sdata.2018.178. View

4.
Lyons P, Hofford M, Yu S, Michelson A, Payne P, Hough C . Factors Associated With Variability in the Performance of a Proprietary Sepsis Prediction Model Across 9 Networked Hospitals in the US. JAMA Intern Med. 2023; 183(6):611-612. PMC: 10071393. DOI: 10.1001/jamainternmed.2022.7182. View

5.
Rojas J, Carey K, Edelson D, Venable L, Howell M, Churpek M . Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data. Ann Am Thorac Soc. 2018; 15(7):846-853. PMC: 6207111. DOI: 10.1513/AnnalsATS.201710-787OC. View