Resources
Database
Credentialed Access
Nils Strodthoff,
Juan Miguel Lopez Alcaraz,
Wilhelm Haverkamp
Dataset that links ECG records from MIMIC-IV-ECG to ED discharge and hospital discharge diagnoses, which enables to train general ECG prediction models based on clinical labels and facilitates the retrieval of further clinical metadata from MIMIC-IV.
machine learning
electrocardiography
mimic
Published: Aug. 30, 2024.
Version: 1.0.1
Database
Credentialed Access
Asad Aali,
Vasiliki Bikia,
Maya Varma,
Nicole Chiou,
Sophie Ostmeier,
Arnav Singhvi,
Magdalini Paschali,
Ashwin Kumar,
Andrew Johnston,
Karimar Amador Martinez,
Eduardo Perez Guerrero,
Paola Cruz Rivera,
Sergios Gatidis,
Christian Bluethgen,
Eduardo Pontes Reis,
Eddy Zandee van Rilland,
Poonam Hosamani,
Kevin Keet,
Minjoung Go,
Evelyn Ling,
David Larson,
Curtis Langlotz,
Roxana Daneshjou,
Jason Hom,
Sanmi Koyejo,
Emily Alsentzer,
Akshay Chaudhari
MedVAL-Bench is the first large-scale physician-validated benchmark for medical text validation, spanning 6 diverse medical tasks and containing 840 language model-generated outputs annotated by 12 physicians with error assessments and risk grades.
Published: Nov. 14, 2025.
Version: 1.0.1
Database
Credentialed Access
Hanbin Ko
CXR-Align is a benchmark dataset created to evaluate vision-language models' capability to interpret negations in chest X-ray (CXR) reports, featuring systematically modified reports from MIMIC-CXR.
Published: Aug. 21, 2025.
Version: 1.0.0
Database
Open Access
Patrick Wagner,
Nils Strodthoff,
Ralf-Dieter Bousseljot,
Wojciech Samek,
Tobias Schaeffter
The PTB-XL ECG dataset is a large dataset of 21801 clinical 12-lead ECGs from 18869 patients of 10 second length. The raw signal data has been annotated by up to two cardiologists with 71 different ECG statements and is supplemented by rich metadata.
ptb-xl
ptb
ecg
electrocardiography
Published: Nov. 9, 2022.
Version: 1.0.3
Visualize waveforms
Database
Credentialed Access
Jayetri Bardhan,
Anthony Colas,
Kirk Roberts,
Daisy Zhe Wang
DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.
question-answer
qa
Published: April 12, 2022.
Version: 1.0.0
Database
Open Access
Alex Bennett,
Hannes Ulrich,
Joshua Wiedekopf,
Piotr Szul,
John Grimes,
Alistair Johnson
The MIMIC-IV Clinical Database Demo on FHIR is a 100 patient subset of the MIMIC-IV v2.2 and MIMIC-IV-ED v2.2 clinical databases converted into the Fast Healthcare Interoperability Resources (FHIR) format.
fhir
electronic health records
mimic
Published: Aug. 27, 2025.
Version: 2.1.0
Database
Credentialed Access
Benedikt Boecking,
Naoto Usuyama,
Shruthi Bannur,
Daniel Coelho de Castro,
Anton Schwaighofer,
Stephanie Hyland,
Harshita Sharma,
Maria Teodora Wetscherek,
Tristan Naumann,
Aditya Nori,
Javier Alvarez Valle,
Hoifung Poon,
Ozan Oktay
MS-CXR is a new dataset containing 1162 chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.
vision-language processing
chest x-ray
phrase grounding
localization
Published: Nov. 15, 2024.
Version: 1.1.0
Database
Credentialed Access
Shruthi Bannur,
Stephanie Hyland,
Qianchu Liu,
Fernando Pérez-García,
Max Ilse,
Daniel Coelho de Castro,
Benedikt Boecking,
Harshita Sharma,
Kenza Bouzid,
Anton Schwaighofer,
Maria Teodora Wetscherek,
Hannah Richardson,
Tristan Naumann,
Javier Alvarez Valle,
Ozan Oktay
The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.
disease progression
cxr
vision-language processing
chest x-ray
radiology
multimodal
Published: March 17, 2023.
Version: 1.0.0
Database
Credentialed Access
Chenlong Yin,
Weijia Zhang
The processed MIMIC-III dataset for the benchmark of Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks Approach.
Published: April 9, 2025.
Version: 1.0.0
Model
Credentialed Access
Shekoofeh Azizi,
Jan Freyberg,
Laura Culp,
Patricia MacWilliams,
Sara Mahdavi,
Vivek Natarajan,
Alan Karthikesalingam
Medical AI Research Foundations is a repository of medical foundation models.
Published: April 25, 2023.
Version: 1.0.0