Resources


Database Credentialed Access

MIMIC-III-Ext-PPG: A PPG Benchmark Dataset for Cardiorespiratory Analysis

Mohammad Moulaeifard, Peter H Charlton, Nils Strodthoff

Large-Scale, Quality-Assessed PPG-based Benchmark Dataset for Cardiovascular and Respiratory Signal Analysis based on MIMIC-III

blood pressure critical care photoplethysmography electrocardiogram heart rhythm signal quality respiratory rate

Published: Feb. 4, 2026. Version: 1.0.0


Database Open Access

Brugada-HUCA: 12-Lead ECG Recordings for the Study of Brugada Syndrome

Nahuel Costa Cortez, Daniel Garcia Iglesias

Brugada syndrome is a rare but potentially life-threatening cardiac arrhythmia disorder, with an elevated risk of sudden cardiac death. This dataset introduces 12-lead ECG recordings gather to support the study of this rare disease.

Published: Feb. 2, 2026. Version: 1.0.0

Visualize waveforms

Database Open Access

tOLIet: Single-lead Thigh-based Electrocardiography Using Polimeric Dry Electrodes

Aline Santos Silva, Hugo Plácido da Silva, Miguel Correia, et al.

We present tOLIet, the first thigh ECG dataset with real signals captured by a toilet seat with electrodes. There are 149 recordings from 86 people, useful for research into cardiovascular assessment using "invisible" ECG.

Published: Feb. 2, 2026. Version: 1.0.1


Database Open Access

Multimodal Synchronized Motion Capture, Force Plate, and Radar Dataset of the One-Legged Stand Test for Fall-Risk Assessment

Daniel Copeland, Evan Linton, Xiang Zhang, et al.

A multimodal dataset of 32 participants performing the One-Legged Stand Test (OLST), with synchronized motion capture, force plate, and 24 GHz radar data. Each of 1,241 trials is labeled with foot-lift, stability phases, and foot-touchdown.

motion capture human pose estimation human movement fall risk assessment non-contact sensing one-legged stand test force plate analysis digital biomarkers human balance testing geriatrics radar signal processing postural control multimodal sensing biomechanics aging and mobility

Published: Jan. 25, 2026. Version: 1.0


Model Credentialed Access

Fine-tuning foundational models to code diagnoses from veterinary health records

Adam Kiehl, Nadia Saklou, G Joseph Strecker, et al.

Fine-tuned GatorTron LLM for veterinary diagnosis coding to 7,739 SNOMED-CT codes based on clinical summary text from the Colorado State University Veterinary Teaching Hospital.

transformers natural language processing large language models foundational models one health diagnoses snomed-ct veterinary medicine omop cdm veterinary medical records clinical coding

Published: Jan. 25, 2026. Version: 1.0.0


Challenge Credentialed Access

SNOMED CT Entity Linking Challenge

Will Hardman, Mark Banks, Rory Davidson, et al.

272 discharge notes from the MIMIC-IV-Note dataset annotated with SNOMED CT concepts.

snomed clinical annotation entity linking

Published: Jan. 12, 2026. Version: 1.2.0


Database Credentialed Access

Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation

Jong Hak Moon, Geon Choi, Paloma Rabaey, et al.

A radiologist-annotated benchmark of structured chest X-ray reports at single and sequential levels, comprising 1,473 reports across 18 relation types and 80 longitudinal cases.

fine-grained structured reports attribute-level clinical reasoning medical text structuring longitudinal clinical reasoning chest x-ray report parsing medical information structuring benchmark dataset for radiology report medical information extraction structured radiology reports temporal relation extraction radiology report benchmarking longitudinal clinical understanding

Published: Jan. 11, 2026. Version: 1.0.0


Database Open Access

PSG-IPA: A PolySomnoGraphic Inter-scorer Performance Assessment database

Diego Alvarez-Estevez

The HMC-IPA dataset comprises 20 PSG recordings, each with manual and computer-assisted scorings by 12 sleep technologists, for studying inter-scorer variability and evaluating automated sleep analysis algorithms

Published: Jan. 8, 2026. Version: 1.0.0

Visualize waveforms

Challenge Credentialed Access

ArchEHR-QA: A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization

Sarvesh Soni, Dina Demner-Fushman

A dataset for grounded question answering (QA) from electronic health records (EHRs).

question answering electronic health record patient portals clinicians

Published: Jan. 1, 2026. Version: 1.3


Database Credentialed Access

Bridge2AI-Voice Pediatric Dataset

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, et al.

A dataset of questionnaire responses, spectrograms, and other information for pediatric participants collected for the Bridge2AI voice as a biomarker of health project.

voice bridge2ai

Published: Dec. 17, 2025. Version: 1.0.0