Resources


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Open Access

Multimodal Synchronized Motion Capture, Force Plate, and Radar Dataset of the One-Legged Stand Test for Fall-Risk Assessment

Daniel Copeland, Evan Linton, Xiang Zhang, et al.

A multimodal dataset of 32 participants performing the One-Legged Stand Test (OLST), with synchronized motion capture, force plate, and 24 GHz radar data. Each of 1,241 trials is labeled with foot-lift, stability phases, and foot-touchdown.

motion capture human pose estimation human movement fall risk assessment non-contact sensing one-legged stand test force plate analysis digital biomarkers human balance testing geriatrics radar signal processing postural control multimodal sensing biomechanics aging and mobility

Published: Jan. 25, 2026. Version: 1.0


Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext Triage Instruction Corpus

Qingyang Shen, Quan Guo

MIMIC-IV-Ext Triage Instruction Corpus includes 9,629 ED triage cases organized by the five-level ESI, enabling LLMs to improve triage accuracy. It provides CSV data, generation prompts, expert validation samples, and SQL QC scripts.

nlp clinical decision support machine learning large language models emergency severity index emergency triage

Published: March 4, 2025. Version: 1.0.0


Database Open Access

BIG IDEAs Lab Glycemic Variability and Wearable Device Data

Peter Cho, Juseong Kim, Brinnae Bent, et al.

Glucose measurements and wrist-worn wearable sensor data from high-normoglycemic participants.

biomedical engineering pre-diabetes biomarkers

Published: April 13, 2026. Version: 1.1.3


Database Credentialed Access

Clinical Time Series Datasets for Trajectory Flow Matching Evaluation: ICU Sepsis, ICU Cardiac Arrest, and ICU GIB Cohorts

Yuan Pu, Dennis Shung, Alexander Tong, et al.

This resource comprises three clinical time series datasets used in the paper "Trajectory Flow Matching with Applications to Clinical Time Series Modeling" to evaluate models that handle irregularly sampled data in critical care settings.

clinical time series

Published: March 23, 2026. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext-PE: Pulmonary Embolism Labels for CT Pulmonary Angiography Radiology Reports

Barbara Lam, Omid Jafari, Peiqi Wang, et al.

CTPA (computed tomography pulmonary angiogram) radiology reports from MIMIC-IV with pulmonary embolism (PE) adjudication.

Published: March 23, 2026. Version: 1.0.0


Database Credentialed Access

Bridge2AI-Voice Pediatric Dataset

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, et al.

A dataset of questionnaire responses, spectrograms, and other information for pediatric participants collected for the Bridge2AI voice as a biomarker of health project.

voice bridge2ai

Published: Dec. 17, 2025. Version: 1.0.0


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, et al.

A dataset of features from voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: Dec. 16, 2025. Version: 3.0.0


Database Credentialed Access

MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark

Asad Aali, Vasiliki Bikia, Maya Varma, et al.

MedVAL-Bench is the first large-scale physician-validated benchmark for medical text validation, spanning 6 diverse medical tasks and containing 840 language model-generated outputs annotated by 12 physicians with error assessments and risk grades.

Published: Nov. 14, 2025. Version: 1.0.1