Resources


Database Credentialed Access

NCH Sleep DataBank: A Large Collection of Real-world Pediatric Sleep Studies with Longitudinal Clinical Data

Harlin Lee, Boyue Li, Yungui Huang, Yuejie Chi, Simon Lin

The NCH Sleep DataBank includes 3,984 pediatric sleep studies on 3,673 unique patients conducted at Nationwide Children's Hospital between 2017 and 2019. It contains polysomnography (PSG), clinical annotations, and longitudinal clinical data.

eeg ehr polysomnography pediatrics clinical decision support sleep study electronic health records ecg sleep disorders

Published: Oct. 27, 2021. Version: 3.1.0


Database Restricted Access

BigIdeasLab_STEP: Heart rate measurements captured by smartwatches for differing skin tones

Brinnae Bent, Jessilyn Dunn

Comparison of HR values reported by ECG (Bittium Faros) and wearables including Biovotion Everion, Empatica E4, Apple Watch, Garmin, Fitbit, and Xiaomi Miband.

wearables heart rate

Published: Feb. 10, 2021. Version: 1.0


Database Credentialed Access

Eye Gaze Data for Chest X-rays

Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Matthew Tong, Arjun Sharma, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth Krupinski, Mehdi Moradi

This dataset was collected using an eye-tracking system while a radiologist interpreted and read 1,083 public CXR images. The dataset contains the following aligned modalities: image, transcribed report text, dictation audio, and eye gaze data.

audio convolutional network heatmap eye tracking multimodal machine learning chest x-ray radiology explainability chest cxr deep learning

Published: Sept. 12, 2020. Version: 1.0.0


Database Open Access

Multilevel Monitoring of Activity and Sleep in Healthy People

Alessio Rossi, Eleonora Da Pozzo, Dario Menicagli, Chiara Tremolanti, Corrado Priami, Alina Sirbu, David Clifton, Claudia Martini, David Morelli

The Multilevel Monitoring of Activity and Sleep in Healthy People (MMASH) dataset provides 24 hours of continuous beat-to-beat heart data, triaxial accelerometer data, sleep quality, physical activity, psychological characteristics and salivary samples.

sleep physiological response melatonin cortisol circadian rhythm psychological response saliva health

Published: June 19, 2020. Version: 1.0.0


Database Credentialed Access

Phenotype Annotations for Patient Notes in the MIMIC-III Database

Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt

Clinical notes, annotated by at least two expert annotators for over ten patient phenotypes, including advanced cancer, substance abuse, and treatment non-adherence.

patient classification natural language processing

Published: March 5, 2020. Version: 1.20.03


Database Open Access

Tappy Keystroke Data

This is the keystroke dataset for the study titled 'High-accuracy detection of early Parkinson's Disease using multiple characteristics of finger movement while typing'. This research report is currently under review for publication by P…

parkinsons neuroelectric movement

Published: Oct. 20, 2017. Version: 1.0.0


Database Open Access

Clinical data from the MIMIC-II database for a case study on indwelling arterial catheters

Jesse Raffa

Dataset extracted from MIMIC-II for a tutorial on the effectiveness of indwelling arterial catheters, with respect to mortality outcomes, in hemodynamically stable patients with respiratory failure.

Published: Oct. 28, 2016. Version: 1.0


Challenge Credentialed Access

ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care

Danielle Mowery

2013 ShARe/CLEF eHealth Evaluation Lab: Natural Language Processing and Information Retrieval for Clinical Care (Tasks 1 and 2).

natural language processing

Published: Feb. 15, 2013. Version: 1.0


Database Credentialed Access

Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries

Stefan Hegselmann, Shannon Shen, Florian Gierse, Monica Agrawal, David Sontag, Xiaoyi Jiang

Annotations for unsupported facts in 100 original MIMIC patient summaries (discharge instructions) and hallucinations in 100 Large Language Model (LLM)-generated patient summaries, labeled by two medical experts.

Published: April 28, 2024. Version: 1.0.0


Database Open Access

ScientISST MOVE: Annotated Wearable Multimodal Biosignals recorded during Everyday Life Activities in Naturalistic Environments

João Areias Saraiva, Mariana Abreu, Ana Sofia Carmo, Hugo Plácido da Silva, Ana Fred

Multimodal (ECG, EMG, EDA, PPG, TEMP, ACC) biosignal dataset of everyday activities. Created with 3 wearable devices based on ScientISST Sense and Empatica E4.

multimodal greet lift uncontrolled environments run jump gesticulate walk wearable

Published: March 25, 2024. Version: 1.0.1