» Articles » PMID: 31831740

MIMIC-CXR, a De-identified Publicly Available Database of Chest Radiographs with Free-text Reports

Overview
Journal Sci Data
Specialty Science
Date 2019 Dec 14
PMID 31831740
Citations 275
Authors
Affiliations
Soon will be listed here.
Abstract

Chest radiography is an extremely powerful imaging modality, allowing for a detailed inspection of a patient's chest, but requires specialized training for proper interpretation. With the advent of high performance general purpose computer vision algorithms, the accurate automated analysis of chest radiographs is becoming increasingly of interest to researchers. Here we describe MIMIC-CXR, a large dataset of 227,835 imaging studies for 65,379 patients presenting to the Beth Israel Deaconess Medical Center Emergency Department between 2011-2016. Each imaging study can contain one or more images, usually a frontal view and a lateral view. A total of 377,110 images are available in the dataset. Studies are made available with a semi-structured free-text radiology report that describes the radiological findings of the images, written by a practicing radiologist contemporaneously during routine clinical care. All images and reports have been de-identified to protect patient privacy. The dataset is made freely available to facilitate and encourage a wide range of research in computer vision, natural language processing, and clinical data mining.

Citing Articles

Towards a holistic framework for multimodal LLM in 3D brain CT radiology report generation.

Li C, Chang K, Yang C, Wu H, Chen W, Bansal H Nat Commun. 2025; 16(1):2258.

PMID: 40050277 PMC: 11885477. DOI: 10.1038/s41467-025-57426-0.


Medical foundation large language models for comprehensive text analysis and beyond.

Xie Q, Chen Q, Chen A, Peng C, Hu Y, Lin F NPJ Digit Med. 2025; 8(1):141.

PMID: 40044845 PMC: 11882967. DOI: 10.1038/s41746-025-01533-1.


Multi-Branch CNN-LSTM Fusion Network-Driven System With BERT Semantic Evaluator for Radiology Reporting in Emergency Head CTs.

Tomassini S, Duranti D, Zeggada A, Cosimo Quattrocchi C, Melgani F, Giorgini P IEEE J Transl Eng Health Med. 2025; 13:61-74.

PMID: 40035027 PMC: 11875635. DOI: 10.1109/JTEHM.2025.3535676.


Cross-Modal Augmented Transformer for Automated Medical Report Generation.

Tang Y, Yuan Y, Tao F, Tang M IEEE J Transl Eng Health Med. 2025; 13:33-48.

PMID: 40035024 PMC: 11875640. DOI: 10.1109/JTEHM.2025.3536441.


Interpretability of AI race detection model in medical imaging with saliency methods.

Konate S, Lebrat L, Cruz R, Wawira Gichoya J, Price B, Seyyed-Kalantari L Comput Struct Biotechnol J. 2025; 28:63-70.

PMID: 40026803 PMC: 11871413. DOI: 10.1016/j.csbj.2025.01.007.


References
1.
Candemir S, Jaeger S, Palaniappan K, Musco J, Singh R, Xue Z . Lung segmentation in chest radiographs using anatomical atlases with nonrigid registration. IEEE Trans Med Imaging. 2013; 33(2):577-90. DOI: 10.1109/TMI.2013.2290491. View

2.
Braunschweiger P, Goodman K . The CITI program: an international online resource for education in human subjects protection and the responsible conduct of research. Acad Med. 2007; 82(9):861-4. DOI: 10.1097/ACM.0b013e31812f7770. View

3.
Rosenkrantz A, Wang W, Hughes D, Duszak Jr R . A County-Level Analysis of the US Radiologist Workforce: Physician Supply and Subspecialty Characteristics. J Am Coll Radiol. 2018; 15(4):601-606. DOI: 10.1016/j.jacr.2017.11.007. View

4.
Pollard T, Johnson A, Raffa J, Celi L, Mark R, Badawi O . The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Sci Data. 2018; 5:180178. PMC: 6132188. DOI: 10.1038/sdata.2018.178. View

5.
Shiraishi J, Katsuragawa S, Ikezoe J, Matsumoto T, Kobayashi T, Komatsu K . Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists' detection of pulmonary nodules. AJR Am J Roentgenol. 2000; 174(1):71-4. DOI: 10.2214/ajr.174.1.1740071. View