Resources


Database Credentialed Access

MIMIC-III and eICU-CRD: Feature Representation by FIDDLE Preprocessing

Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, et al.

Features and labels from MIMIC-III and eICU-CRD produced by FIDDLE, an EHR preprocessing pipeline.

preprocessing electronic health record machine learning

Published: April 28, 2021. Version: 1.0.0


Database Restricted Access

Flatten: COVID-19 Survey Data on Symptoms, Demographics and Mental Health in Canada

Shrey Jain, Marie Charpignon, Mathew Samuel, et al.

Freely accessible COVID-19 symptom dataset surveying Canadians and gathered from March to July of 2020 by the global humanitarian aid non-profit Flatten. This dataset of 294,106 surveys gathered from March 23rd to July 30th in 2020.

public health population statistics covid-19

Published: March 8, 2021. Version: 1.0


Model Credentialed Access

Transformer models trained on MIMIC-III to generate synthetic patient notes

Ali Amin-Nejad, Julia Ive, Sumithra Velupillai

Machine learning models that have been trained using MIMIC-III to enable the creation of synthetic discharge summaries.

Published: May 27, 2020. Version: 1.0.0


Database Open Access

MIMIC-III Waveform Database Matched Subset

Benjamin Moody, George Moody, Mauricio Villarroel, et al.

Physiological signals (including continuous ECG, PPG, ABP, and other signals) that are associated with patients in the MIMIC-III Clinical Database.

Published: April 7, 2020. Version: 1.0

Visualize waveforms

Software Open Access

Cerebral Haemodynamic Autoregulatory Information System GUI

Acute Brain injury (ABI) is a devastating event requiring intensive acute treatment and post-injury rehabilitation, both delivered for indeterminate periods of time. For severe ABIs, acute treatment is aimed at stabilizing the patient to prevent sec…

brain injury intracranial pressure

Published: Dec. 16, 2016. Version: 1.0.0


Database Open Access

ECG Effects of Ranolazine, Dofetilide, Verapamil, and Quinidine

ECGs of 22 subjects for a study aimed at comparing the effects of QT prolonging drugs versus placebo on electrophysiological parameters.

medication ecg

Published: July 26, 2016. Version: 1.0.0

Visualize waveforms

Challenge Open Access

QT Interval Measurement: The PhysioNet/Computing in Cardiology Challenge 2006

The seventh annual PhysioNet/Computers in Cardiology Challenge addresses a question of high clinical interest: Can the QT interval be measured by fully automated methods with an accuracy acceptable for clinical evaluations?

challenge ecg

Published: Nov. 1, 2006. Version: 1.0.0


Challenge Credentialed Access

SNOMED CT Entity Linking Challenge

Will Hardman, Mark Banks, Rory Davidson, et al.

272 discharge notes from the MIMIC-IV-Note dataset annotated with SNOMED CT concepts.

snomed clinical annotation entity linking

Published: Jan. 12, 2026. Version: 1.2.0


Database Credentialed Access

Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset

Ziming Wei, Luke Sagers, Caroline McKenna, et al.

NPA-CP is a freely accessible dataset derived from electronic health record (EHR) information at MGB between 2015 and 2024. The dataset includes 11 different pathogens and can be used to predict hospital-onset infections for these pathogens.

electronic health records infection control clinical machine learning infectious diseases hospital onset infection colonization pressure

Published: Nov. 4, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-Ext-DrugDetection

Fabrice Harel-Canada, Nanyun Peng, David Goodman, et al.

This project offers a multilabel annotated dataset of clinical note sentences from MIMIC-III/IV for substance use detection. It supports NLP research for identifying various co-occurring drug use mentions in patient records.

ehr mimic-iv substance use clinical notes methamphetamine multi-label cocaine drug detection polysubstance use prescription opioid misuse cannabis benzodiazepine misuse injection drug use heroin mimic-iii

Published: Sept. 25, 2025. Version: 1.0.0