Resources


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0

Visualize waveforms

Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MIMIC-IV-Ext Triage Instruction Corpus

Qingyang Shen, Quan Guo

MIMIC-IV-Ext Triage Instruction Corpus includes 9,629 ED triage cases organized by the five-level ESI, enabling LLMs to improve triage accuracy. It provides CSV data, generation prompts, expert validation samples, and SQL QC scripts.

nlp clinical decision support machine learning large language models emergency severity index emergency triage

Published: March 4, 2025. Version: 1.0.0


Database Open Access

Long Term Movement Monitoring Database

The LTMM database contains 3-day 3D accelerometer recordings of 71 elder community residents, used to study gait, stability, and fall risk.

risk stability accelerometer gait

Published: June 20, 2016. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark

Asad Aali, Vasiliki Bikia, Maya Varma, Nicole Chiou, Sophie Ostmeier, Arnav Singhvi, Magdalini Paschali, Ashwin Kumar, Andrew Johnston, Karimar Amador Martinez, Eduardo Perez Guerrero, Paola Cruz Rivera, Sergios Gatidis, Christian Bluethgen, Eduardo Pontes Reis, Eddy Zandee van Rilland, Poonam Hosamani, Kevin Keet, Minjoung Go, Evelyn Ling, David Larson, Curtis Langlotz, Roxana Daneshjou, Jason Hom, Sanmi Koyejo, Emily Alsentzer, Akshay Chaudhari

MedVAL-Bench is the first large-scale physician-validated benchmark for medical text validation, spanning 6 diverse medical tasks and containing 840 language model-generated outputs annotated by 12 physicians with error assessments and risk grades.

Published: Nov. 14, 2025. Version: 1.0.1


Database Credentialed Access

MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp

Jing Wang, Xing Niu, Tong Zhang, Jie Shen, Juyong Kim, Jeremy Weiss

It is a time series clinical events dataset with concrete temporal information. The dataset consists of 22,588,586 clinical events and related timestamps from 267,284 discharge summaries of the MIMIC-IV-Note.

mimic clinical event annotation time series temporal annotation

Published: Sept. 29, 2025. Version: 1.0.0


Database Restricted Access

Community-Acquired Pneumonia, Endotypes and Phenotypes (NACef): Prospective, observational cohort study of Translational Medicine

Natalia Sanabria-Herrera, Esteban Garcia Gallo, Luis Felipe Reyes

Community-Acquired Pneumonia (CAP) poses a significant health risk, linked to high in-hospital morbidity and mortality rates. The dataset includes clinical details of 768 CAP patients at Clinica Universidad de La Sabana, Colombia.

Published: Aug. 21, 2025. Version: 2.0.1


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, Olivier Elemento, Maria Powell, David Dorr, Philip Payne, Vardit Ravitsky, Jean-Christophe Bélisle-Pipon, Alistair Johnson, Ruth Bahr, Stephanie Watts, Donald Bolser, Jennifer Siu, Jordan Lerner-Ellis, Frank Rudzicz, Micah Boyer, Samantha Salvi Cruz, Yassmeen Abdel-Aty, Toufeeq Ahmed Syed, James Anibal, Stephen Aradi, Ana Sophia Martinez, Shaheen Awan, Steven Bedrick, Alexander Bernier, Isaac Bevers, Rahul Brito, Selina Casalino, John Costello, Iris De Santiago, Enrique Diaz-Ocampo, Mohamed Ebraheem, Ellie Eiseman, Mahmoud Elmahdy, Emily Evangelista, Kenneth Fletcher, Hortense Gallois, Alexander Gelbard, Anna Goldenberg, Karim Hanna, William Hersh, Lochana Jayachandran, Kaley Jenney, Kathy Jenkins, Stacy Jo, Ayush Kalia, Andrea Krussel, Elisa Lapadula, Chloe Loewith, Radhika Mahajan, Vrishni Maharaj, Siyu Miao, Matthew Mifsud, Marian Mikhael, Elijah Moothedan, Yosef Nafii, Tempestt Neal, Karlee Newberry, Evan Ng, Christopher Nickel, Megan Urbano, Trevor Pharr, Matthew Pontell, Claire Premi-Bortolotto, JM Rahman, Sarah Rohde, Laurie Russell, Suketu Shah, Ahmed Shawkat, Elizabeth Silberholz, Duncan Sutherland, Venkata Swarna Mukhi, Jeffrey Tang, Jamie Toghranegar, Kimberly Vinson, Claire Wilson, Madeleine Zanin, Xijie Zeng, Theresa Zesiewicz, Robin Zhao, Pantelis Zisimopoulos, Satrajit Ghosh

A dataset of features from voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: Aug. 18, 2025. Version: 2.0.1