Resources
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi
question answering electronic health records evaluation chest x-ray multi-modal question answering ehr question answering semantic parsing benchmark machine learning deep learning visual question answering
Published: July 23, 2024. Version: 1.0.0
Database Restricted Access
EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs
Pierre Elias, Joshua Finer
heart failure clinical decision support artificial intelligence health equity ecg machine learning deep learning electrocardiogram aortic stenosis cardiovascular screening valvular heart disease digital health ai model deployment left ventricular dysfunction ai in healthcare population health transthoracic echocardiogram structural heart disease
Published: Sept. 16, 2025. Version: 1.1.0
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Jean-Benoit Delbrouck
Published: Sept. 12, 2025. Version: 1.0.0
Database Open Access
MIMIC-IV Clinical Database Demo on FHIR
Alex Bennett, Hannes Ulrich, Joshua Wiedekopf, Piotr Szul, John Grimes, Alistair Johnson
fhir electronic health records mimic
Published: Aug. 27, 2025. Version: 2.1.0
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
Daniel Reichenpfader, Sandro von Däniken, Harald Marcel Bonel
radiology mammography structured reporting bi-rads
Published: June 24, 2025. Version: 1.0.1
Database Open Access
Hillel Yaffe Glaucoma Dataset (HYGD): A Gold-Standard Annotated Fundus Dataset for Glaucoma Detection
Or Abramovich, Hadas Pizem, Jonathan Fhima, Eran Berkowitz, Ben Gofrit, Jan Van Eijgen, Eytan Blumenthal, Joachim Behar
ophthalmology retina dfi gold-standard gon fundus glaucoma
Published: June 3, 2025. Version: 1.0.0
Database Credentialed Access
Medical-CXR-VQA dataset: A Large-Scale LLM-Enhanced Medical Dataset for Visual Question Answering on Chest X-Ray Images
Xinyue Hu, Lin Gu, Kazuma Kobayashi, liangchen liu, Mengliang Zhang, Tatsuya Harada, Ronald Summers, Yingying Zhu
Published: Jan. 21, 2025. Version: 1.0.0
Database Credentialed Access
ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation
Oishi Banerjee, Hong-Yu Zhou, Subathra Adithan, Stephen Kwak, Kay Wu, Pranav Rajpurkar
chest x-rays reinforcement learning hallucination
Published: Aug. 14, 2024. Version: 1.0.0
Database Credentialed Access
RadGraph2: Tracking Findings Over Time in Radiology Reports
Adam Dejl, Sameer Khanna, Patricia Therese Pile, Kibo Yoon, Steven QH Truong, Hanh Duong, Agustina Saenz, Pranav Rajpurkar
chest x-rays relation extraction disease progression information extraction radiology reports named entity recognition
Published: Aug. 8, 2024. Version: 1.0.0
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi
question answering electronic health records evaluation chest x-ray multi-modal question answering ehr question answering semantic parsing benchmark machine learning deep learning visual question answering
Published: July 23, 2024. Version: 1.0.0