Resources


Database Open Access

MIMIC-IV Clinical Database Demo on FHIR

Alex Bennett, Hannes Ulrich, Joshua Wiedekopf, Piotr Szul, John Grimes, Alistair Johnson

The MIMIC-IV Clinical Database Demo on FHIR is a 100-patient subset of the MIMIC-IV v2.2 and MIMIC-IV-ED v2.2 clinical databases converted into the Fast Healthcare Interoperability Resources (FHIR) format.

fhir electronic health records mimic

Published: Aug. 27, 2025. Version: 2.1.0


Database Restricted Access

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Pierre Elias, Joshua Finer

EchoNext is a curated dataset of electrocardiograms (ECGs) paired with echocardiogram-confirmed structural heart disease labels, designed to support the development and validation of machine learning models.

clinical decision support heart failure artificial intelligence ecg health equity machine learning electrocardiogram deep learning ai model deployment population health transthoracic echocardiogram left ventricular dysfunction structural heart disease aortic stenosis cardiovascular screening digital health ai in healthcare valvular heart disease

Published: Aug. 5, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV on FHIR

Alex Bennett, Joshua Wiedekopf, Hannes Ulrich, Philip van Damme, Piotr Szul, John Grimes, Alistair Johnson

MIMIC-IV and MIMIC-IV-ED data mapped into FHIR resources.

mimic-iv fhir electronic health record us core fast healthcare interoperability resources mimic

Published: Nov. 12, 2024. Version: 2.1


Database Credentialed Access

MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images

Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi

We introduce MIMIC-Ext-MIMIC-CXR-VQA, a complex, diverse, and large-scale dataset designed for Visual Question Answering (VQA) tasks within the medical domain, focusing primarily on chest radiographs.

question answering chest x-ray benchmark evaluation radiology machine learning electronic health records deep learning multimodal visual question answering

Published: July 19, 2024. Version: 1.0.0


Database Credentialed Access

CORAL: expert-Curated medical Oncology Reports to Advance Language model inference

Madhumita Sushil, Vanessa Kennedy, Divneet Mandair, Brenda Miao, Travis Zack, Atul Butte

Medical oncology progress notes annotated with advanced, comprehensive oncology-relevant concepts and relationships.

artificial intelligence information extraction oncology natural language processing large language models electronic health records

Published: Feb. 7, 2024. Version: 1.0


Challenge Credentialed Access

MIT Critical Datathon 2023: a MIMIC-IV Derived Dataset for Pulse Oximetry Correction Models

João Matos, Tristan Struja, David S Restrepo, Luis Filipe Nakayama, Jack Gallifant, Luca Weishaupt, Nikita Mullangi, Maria Loureiro, Skyler Shapiro, Adrien Carrel, Leo Anthony Celi

A SaO2-SpO2 pairs dataset derived from MIMIC-IV.

pulse oximetry health equity machine learning

Published: May 8, 2023. Version: 1.0.0


Database Credentialed Access

RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports

Sarvesh Soni, Kirk Roberts

RadQA is an electronic health record question answering dataset containing clinical questions that can be answered using the Findings and Impressions sections of radiology reports.

machine reading comprehension radiology reports clinical notes question answering electronic health records

Published: Dec. 9, 2022. Version: 1.0.0


Database Open Access

Multilevel Monitoring of Activity and Sleep in Healthy People

Alessio Rossi, Eleonora Da Pozzo, Dario Menicagli, Chiara Tremolanti, Corrado Priami, Alina Sirbu, David Clifton, Claudia Martini, David Morelli

The Multilevel Monitoring of Activity and Sleep in Healthy People (MMASH) dataset provides 24 hours of continuous beat-to-beat heart data, triaxial accelerometer data, sleep quality, physical activity, psychological characteristics, and salivary samples.

sleep physiological response melatonin cortisol circadian rhythm psychological response saliva health

Published: June 19, 2020. Version: 1.0.0


Database Open Access

MIMIC Database

The MIMIC Database includes data recorded from over 90 ICU patients. The data in each case include signals and periodic measurements obtained from a bedside monitor as well as clinical data obtained from the patient's medical record. The recordi…

health record ehr icu critical care mimic

Published: March 15, 2000. Version: 1.0.0
