Resources


Database Credentialed Access

CXR-PRO: MIMIC-CXR with Prior References Omitted

Vignav Ramesh, Nathan Chi, Pranav Rajpurkar

CXR-PRO is an adaptation of the MIMIC-CXR dataset (consisting of chest radiographs and their associated free-text radiology reports) with references to non-existent priors removed.

generation large language models free-text radiology reports references to priors retrieval

Published: Nov. 23, 2022. Version: 1.0.0


Database Credentialed Access

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

Xuhai Xu, Han Zhang, Yasaman Sefidgar, Yiyi Ren, Xin Liu, Woosuk Seo, Jennifer Brown, Kevin Kuehn, Mike Merrill, Paula Nurius, Shwetak Patel, Tim Althoff, Margaret Morris, Eve Riskin, Jennifer Mankoff, Anind Dey

The GLOBEM datasets are the first released multi-year mobile and wearable sensing datasets, collected from 2018 to 2021 and covering 705 person-years from 497 unique participants.

health ubiquitous computing passive mobile sensing human behavior modeling well-being

Published: Nov. 4, 2022. Version: 1.0


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions exams clinical notes tertiary care natural language processing

Published: July 14, 2022. Version: 1.1


Database Open Access

Surface electromyographic signals collected during long-lasting ground walking of young able-bodied subjects

Francesco Di Nardo, Christian Morbidoni, Sandro Fioretti

The dataset is composed of long-lasting surface electromyographic (sEMG) signals recorded from ten muscles during ground walking of 31 young able-bodied subjects in the Movement Analysis Lab, Università Politecnica delle Marche, Ancona, Italy.

surface emg signal walking biomedical signals gait analysis muscle recruitment

Published: March 31, 2022. Version: 1.0.0


Database Open Access

Auditory evoked potential EEG-Biometric dataset

Nibras Abo Alzahab, Angelo Di Iorio, Luca Apollonio, Muaaz Alshalak, Alessandro Gravina, Luca Antognoli, Marco Baldi, Lorenzo Scalise, Bilal Alchalabi

Recordings of electroencephalogram (EEG) signals collected with the aim of developing an EEG-based biometric. The data include resting-state and auditory-stimuli experiments.

eeg biometric electroencephalogram auditory stimuli resting-state

Published: Dec. 1, 2021. Version: 1.0.0


Database Credentialed Access

VinDr-CXR: An open dataset of chest X-rays with radiologist annotations

Ha Quy Nguyen, Hieu Huy Pham, Le Tuan Linh, Minh Dao, Lam Khanh

VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

computer vision lesion detection disease classification chest x-ray interpretation deep learning

Published: June 22, 2021. Version: 1.0.0


Database Credentialed Access

MIMIC-CXR-JPG - chest radiographs with structured labels

Alistair Johnson, Matt Lungren, Yifan Peng, Zhiyong Lu, Roger Mark, Seth Berkowitz, Steven Horng

Chest x-rays in JPG format with structured labels derived from the associated radiology report.

mimic computer vision radiology chest x-ray deep learning

Published: Nov. 14, 2019. Version: 2.0.0


Database Open Access

Wilson Central Terminal ECG Database

Hossein Moeinzadeh, Gaetano Gargiulo

Wilson Central Terminal ECG signals recorded from 92 patients.

wilson central terminal limb potential unipolar lead electrocardiography ecg

Published: Nov. 13, 2019. Version: 1.0.1


Database Open Access

Non-EEG Dataset for Assessment of Neurological Status

Non-EEG physiological signals collected using non-invasive wrist-worn biosensors, consisting of electrodermal activity, temperature, acceleration, heart rate, and arterial oxygen level.

acceleration multiparameter electrodermal activity heart rate temperature

Published: July 19, 2017. Version: 1.0.0


Database Credentialed Access

MIMIC-III Clinical Database

Alistair Johnson, Tom Pollard, Roger Mark

MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The databas…

intensive care clinical critical care machine learning natural language processing

Published: Sept. 4, 2016. Version: 1.4