Featured Resources


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

mimic machine learning natural language processing chest x-rays computer vision radiology

Published: Sept. 19, 2019. Version: 2.0.0


Database Credentialed Access

eICU Collaborative Research Database

Tom Pollard, Alistair Johnson, Jesse Raffa, Leo Anthony Celi, Omar Badawi, Roger Mark

The eICU Collaborative Research Database is a multi-center database comprising deidentified health data associated with over 200,000 admissions to ICUs across the United States between 2014-2015. The database includes vital sign measurements, care p…

critical care icu telemedicine

Published: April 15, 2019. Version: 2.0


Database Credentialed Access

MIMIC-III Clinical Database

Alistair Johnson, Tom Pollard, Roger Mark

MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012.

The databa…

critical care intensive care machine learning natural language processing clinical

Published: Sept. 4, 2016. Version: 1.4


Database Open Access

MIT-BIH Arrhythmia Database

The MIT-BIH Arrhythmia Database contains 48 half-hour excerpts of two-channel ambulatory ECG recordings, obtained from 47 subjects studied by the BIH Arrhythmia Laboratory between 1975 and 1979. Twenty-three recordings were chosen at random from a s…

ecg arrhythmia

Published: Feb. 24, 2005. Version: 1.0.0

Visualize waveforms

Database Open Access

MIT-BIH Atrial Fibrillation Database

This database includes 25 long-term ECG recordings of human subjects with atrial fibrillation (mostly paroxysmal).

ecg atrial fibrillation

Published: Nov. 4, 2000. Version: 1.0.0

Visualize waveforms

Software Open Access

Generalized Multiscale Entropy Analysis

The method of generalized multiscale entropy (GMSE) analysis is useful for investigating complexity in physiologic signals and other series that have correlations at multiple (time) scales. It represents a generalization of the original method of mu…

entropy complexity

Published: Feb. 2, 2019. Version: 1.0.0


Latest Resources


Challenge Open Access

WiDS (Women in Data Science) Datathon 2020: ICU Mortality Prediction

Meredith Lee, Jesse Raffa, Marzyeh Ghassemi, Tom Pollard, Sharada Kalanidhi, Omar Badawi, Karen Matthys, Leo Anthony Celi

WiDS (Women in Data Science) Datathon 2020: ICU Mortality Prediction focuses on patient health. Join a team, explore the data, and share your insights: http://bit.ly/WiDSdatathon2020

icu mortality risk challenge data science kaggle predictive analytics women in data science

Published: Jan. 22, 2020. Version: 1.0.0


Database Open Access

Computed Tomography Images for Intracranial Hemorrhage Detection and Segmentation

Murtadha Hssayeni

Head computed tomography (CT) scans with intracranial hemorrhage (ICH) segmentation, ICH subtypes and skull fracture.

ich segmentation intracranial hemmorhage nifti computed tomography skull fracture

Published: Dec. 20, 2019. Version: 1.3.0


Database Credentialed Access

Paediatric Intensive Care database

Haomin Li, Xian Zeng, Gang Yu

PIC (Paediatric Intensive Care) is a large paediatric-specific, single-centre, bilingual database comprising information relating to children admitted to critical care units at a large children’s hospital in China.

critical care intensive care natural language processing pediatrics

Published: Dec. 1, 2019. Version: 1.0.0


Database Credentialed Access

MedNLI for Shared Task at ACL BioNLP 2019

Chaitanya Shivade

Data for the MedNLI Shared Task at the 2019 ACL BioNLP 2019 Workshop on Biomedical Language Processing

recognizing textual entailment mimic natural language inference

Published: Nov. 28, 2019. Version: 1.0.1


Database Credentialed Access

MIMIC-CXR-JPG - chest radiographs with structured labels

Alistair Johnson, matt lungren, Yifan Peng, Zhiyong Lu, Roger Mark, Seth Berkowitz, Steven Horng

Chest x-rays in JPG format with structured labels derived from the associated radiology report.

mimic computer vision radiology deep learning chest x-ray

Published: Nov. 14, 2019. Version: 2.0.0


Database Open Access

Wilson Central Terminal ECG Database

Hossein Moeinzadeh, Gaetano Gargiulo

This dataset contains the true unipolar leads associated with three limb leads and six precordial leads. The true unipolar leads include the potential of Einthoven limbs, six electrodes on the chest, and the Wilson Central Terminal (WCT).

wilson central terminal limb potential ecg electrocardiography unipolar lead

Published: Nov. 13, 2019. Version: 1.0.1

Visualize waveforms