Resources
Database
Credentialed Access
Mengliang Zhang,
Xinyue Hu,
Lin Gu,
Tatsuya Harada,
Kazuma Kobayashi,
Ronald Summers,
Yingying Zhu
The CAD-Chest dataset provides comprehensive annotations of disease, including disease severity, uncertainty, and location based on the MIMIC-CXR radiologist reports.
chesr x-ray
disease label
Published: Dec. 8, 2023.
Version: 1.0
Database
Credentialed Access
Joy Wu,
Nkechinyere Agu,
Ismini Lourentzou,
Arjun Sharma,
Joseph Paguio,
Jasper Seth Yao,
Edward Christopher Dee,
William Mitchell,
Satyananda Kashyap,
Andrea Giovannini,
Leo Anthony Celi,
Tanveer Syeda-Mahmood,
Mehdi Moradi
The Chest ImaGenome dataset is a scene graph dataset with additional chronological comparison relations for chest X-rays. It is automatically derived from the MIMIC-CXR dataset. A manually annotated gold standard is also available for 500 patients.
scene graph
visual dialogue
object detection
semantic reasoning
bounding box
knowledge graph
explainability
reasoning
relation extraction
chest
disease progression
cxr
machine learning
chest x-ray
radiology
multimodal
deep learning
visual question answering
Published: July 13, 2021.
Version: 1.0.0
Database
Restricted Access
Ruizhi Liao,
Geeticka Chauhan,
Polina Golland,
Seth Berkowitz,
Steven Horng
Pulmonary edema metadata and labels for MIMIC-CXR
Published: Feb. 9, 2021.
Version: 1.0.1
Database
Credentialed Access
Shruthi Bannur,
Stephanie Hyland,
Qianchu Liu,
Fernando Pérez-García,
Max Ilse,
Daniel Coelho de Castro,
Benedikt Boecking,
Harshita Sharma,
Kenza Bouzid,
Anton Schwaighofer,
Maria Teodora Wetscherek,
Hannah Richardson,
Tristan Naumann,
Javier Alvarez Valle,
Ozan Oktay
The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.
disease progression
cxr
vision-language processing
chest x-ray
radiology
multimodal
Published: March 17, 2023.
Version: 1.0.0
Database
Credentialed Access
Jean-Benoit Delbrouck
RadGraph-XL is a large, expert-annotated dataset of 2,300 radiology reports covering multiple modalities and anatomies. It enables accurate extraction of clinical entities and relations for downstream medical AI tasks.
Published: Sept. 12, 2025.
Version: 1.0.0
Database
Credentialed Access
Hanbin Ko
CXR-Align is a benchmark dataset created to evaluate vision-language models' capability to interpret negations in chest X-ray (CXR) reports, featuring systematically modified reports from MIMIC-CXR.
Published: Aug. 21, 2025.
Version: 1.0.0
Database
Restricted Access
Kendall Park,
Rory Sayres,
Andrew Sellergren,
Tom Pollard,
Fayaz Jamil,
Timo Kohlberger,
Charles Lau,
Atilla Kiraly
This work further refines the labels associated with CheXpert in MIMIC-CXR-JPG 2.0.0 by filtering with Med-PaLM 2 followed by verification by manual review by three US board-certified radiologists.
mimic-cxr labels
Published: Feb. 4, 2025.
Version: 1.0.0
Database
Open Access
Alba Martin-Yebra,
Juan Pablo Martínez,
Pablo Laguna
The MUSIC study is a prospective, multicentre, longitudinal study designed to assess risk predictors of cardiac mortality and sudden cardiac death in ambulatory patients with chronic heart failure.
Published: Jan. 24, 2025.
Version: 1.0.1
Visualize waveforms
Database
Credentialed Access
Mingjie Li,
Wenjia Cai,
Rui Liu,
Yuetian Weng,
Tengfei Liu,
Cong Wang,
xin chen,
zhong liu,
Caineng Pan,
Mengke Li,
yingfeng zheng,
Yizhi Liu,
Flora Salim,
Karin Verspoor,
Xiaodan Liang,
Xiaojun Chang
Benchmark dataset for report generation based on fundus fluorescein angiography images and reports.
fundus fluorescein angiography
medical report generation
vision and language
explainable and reliable evaluation
Published: Jan. 21, 2025.
Version: 1.1.0
Database
Credentialed Access
Benedikt Boecking,
Naoto Usuyama,
Shruthi Bannur,
Daniel Coelho de Castro,
Anton Schwaighofer,
Stephanie Hyland,
Harshita Sharma,
Maria Teodora Wetscherek,
Tristan Naumann,
Aditya Nori,
Javier Alvarez Valle,
Hoifung Poon,
Ozan Oktay
MS-CXR is a new dataset containing 1162 chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.
vision-language processing
chest x-ray
phrase grounding
localization
Published: Nov. 15, 2024.
Version: 1.1.0