Resources


Database Credentialed Access

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

Xuhai Xu, Han Zhang, Yasaman Sefidgar, Yiyi Ren, Xin Liu, Woosuk Seo, Jennifer Brown, Kevin Kuehn, Mike Merrill, Paula Nurius, Shwetak Patel, Tim Althoff, Margaret Morris, Eve Riskin, Jennifer Mankoff, Anind Dey

GLOBEM datasets contain the first released multi-year mobile and wearable sensing datasets from 2018 to 2021, containing 705 person-years and 497 unique participants.

health ubiquitous computing well-being passive mobile sensing human behavior modeling

Published: March 14, 2023. Version: 1.1


Database Restricted Access

Flatten: COVID-19 Survey Data on Symptoms, Demographics and Mental Health in Canada

Shrey Jain, Marie Charpignon, Mathew Samuel, Jaydeep Mistry, Nicholas Frosst, Leo Anthony Celi, Marzyeh Ghassemi

Freely accessible COVID-19 symptom dataset surveying Canadians and gathered from March to July of 2020 by the global humanitarian aid non-profit Flatten. This dataset of 294,106 surveys gathered from March 23rd to July 30th in 2020.

public health population statistics covid-19

Published: March 8, 2021. Version: 1.0


Database Restricted Access

In-hospital physical activity measured with a new Bosch accelerometer sensor system

Severin Schricker, Nico Schmid, Moritz Schanz, Martin Kimmel, Mark Dominik Alscher

Measurements of physical activity with wrist-worn Bosch sensor platform to test predictive performance for the duration of hospitalization and readmission in 58 patients with acute illnesses in internal medicine

prediction acute illness hospitalization readmission accelerometry accelerometer

Published: Dec. 3, 2020. Version: 1.0


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

mimic computer vision machine learning chest x-rays radiology natural language processing

Published: Sept. 19, 2019. Version: 2.0.0


Database Open Access

Sleep Heart Health Study PSG Database

Data collected for a prospective cohort study designed to investigate the relationship between sleep disordered breathing and cardiovascular disease.

polysomnogram sleep multiparameter

Published: Oct. 23, 2003. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

CXR-PRO: MIMIC-CXR with Prior References Omitted

Vignav Ramesh, Nathan Chi, Pranav Rajpurkar

CXR-PRO is an adaptation of the MIMIC-CXR dataset (consisting of chest radiographs and their associated free-text radiology reports) with references to non-existent priors removed.

generation large language models free-text radiology reports references to priors retrieval

Published: Nov. 23, 2022. Version: 1.0.0


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions exams tertiary care clinical notes natural language processing

Published: July 14, 2022. Version: 1.1


Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

entity and relation extraction graph multi-modal radiology natural language processing

Published: June 3, 2021. Version: 1.0.0


Database Credentialed Access

MIMIC-CXR-JPG - chest radiographs with structured labels

Alistair Johnson, Matt Lungren, Yifan Peng, Zhiyong Lu, Roger Mark, Seth Berkowitz, Steven Horng

Chest x-rays in JPG format with structured labels derived from the associated radiology report.

mimic computer vision radiology chest x-ray deep learning

Published: Nov. 14, 2019. Version: 2.0.0


Database Credentialed Access

MIMIC-III Clinical Database

Alistair Johnson, Tom Pollard, Roger Mark

MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The databas…

clinical intensive care machine learning critical care natural language processing

Published: Sept. 4, 2016. Version: 1.4