Resources
Database Credentialed Access
EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems
Konstantin Kotschenreuther
mimic-iv clinical question-answering medical discharge summaries large language models
Published: Jan. 11, 2024. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext Clinical Decision Making: A MIMIC-IV Derived Dataset for Evaluation of Large Language Models on the Task of Clinical Decision Making for Abdominal Pathologies
Paul Hager, Friederike Jungmann, Daniel Rueckert
clinical decision making abdominal pathologies treatment plan emergency room diagnosis large language models
Published: July 8, 2024. Version: 1.1
Database Credentialed Access
MIMIC-III Clinical Database
Alistair Johnson, Tom Pollard, Roger Mark
MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The databas…
clinical intensive care critical care natural language processing machine learning
Published: Sept. 4, 2016. Version: 1.4
Database Credentialed Access
Paediatric Intensive Care database
Haomin Li, Xian Zeng, Gang Yu
intensive care pediatrics critical care natural language processing
Published: Nov. 12, 2020. Version: 1.1.0
Database Credentialed Access
NCH Sleep DataBank: A Large Collection of Real-world Pediatric Sleep Studies with Longitudinal Clinical Data
Harlin Lee, Boyue Li, Yungui Huang, Yuejie Chi, Simon Lin
eeg ehr pediatrics polysomnography clinical decision support sleep study ecg sleep disorders electronic health records
Published: Oct. 27, 2021. Version: 3.1.0
Database Credentialed Access
MIMIC-CXR Database
Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng
computer vision chest x-rays natural language processing radiology mimic machine learning
Published: July 23, 2024. Version: 2.1.0
Database Credentialed Access
National Institutes of Health Stroke Scale (NIHSS) Annotations for the MIMIC-III Database
Jiayang Wang, Xiaoshuo Huang, Lin Yang, Jiao Li
Published: Jan. 25, 2021. Version: 1.0.0
Database Contributor Review
COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales
Álvaro Ritoré, Andreea M Oprescu, Alberto Estirado Bronchalo, Miguel Ángel Armengol de la Hoz
covid-19 multimodal database radiological images open data healthcare data machine learning and ai
Published: Oct. 25, 2024. Version: 1.0.0
Database Restricted Access
EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs
Pierre Elias, Joshua Finer
heart failure clinical decision support artificial intelligence health equity ecg deep learning machine learning electrocardiogram aortic stenosis cardiovascular screening valvular heart disease digital health ai model deployment left ventricular dysfunction ai in healthcare population health transthoracic echocardiogram structural heart disease
Published: Sept. 16, 2025. Version: 1.1.0