Resources
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
question answering machine learning electronic health records evaluation chest x-ray multi-modal question answering ehr question answering semantic parsing benchmark deep learning visual question answering
Published: July 23, 2024. Version: 1.0.0
Database Credentialed Access
CXR-Align: A Benchmark for CXR-Report Alignment with Negations
Published: Aug. 21, 2025. Version: 1.0.0
Database Credentialed Access
EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems
mimic-iv clinical question-answering medical discharge summaries large language models
Published: Jan. 11, 2024. Version: 1.0.0
Database Credentialed Access
MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark
Published: Nov. 14, 2025. Version: 1.0.1
Database Credentialed Access
Antibiotic Resistance Microbiology Dataset Mass General Brigham (ARMD-MGB)
medical informatics antimicrobial resistance electronic health records
Published: Dec. 5, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext Clinical Decision Making: A MIMIC-IV Derived Dataset for Evaluation of Large Language Models on the Task of Clinical Decision Making for Abdominal Pathologies
clinical decision making abdominal pathologies treatment plan emergency room diagnosis large language models
Published: July 8, 2024. Version: 1.1
Database Credentialed Access
MIMIC-III Clinical Database
MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The databas…
clinical intensive care critical care natural language processing machine learning
Published: Sept. 4, 2016. Version: 1.4
Database Credentialed Access
Paediatric Intensive Care database
intensive care pediatrics critical care natural language processing
Published: Nov. 12, 2020. Version: 1.1.0
Database Credentialed Access
NCH Sleep DataBank: A Large Collection of Real-world Pediatric Sleep Studies with Longitudinal Clinical Data
eeg ehr pediatrics polysomnography clinical decision support sleep study ecg electronic health records sleep disorders
Published: Oct. 27, 2021. Version: 3.1.0
Database Credentialed Access
MIMIC-CXR Database
computer vision chest x-rays natural language processing machine learning radiology mimic
Published: July 23, 2024. Version: 2.1.0