Resources


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0

Visualize waveforms

Database Open Access

Smart Health for Assessing the Risk of Events via ECG Database

Holter recordings of 139 hypertensive patients recruited at the Centre of Hypertension of the University Hospital of Naples Federico II.

risk hypertension holter hrv ecg

Published: May 19, 2015. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MIMIC-IV-Ext Triage Instruction Corpus

Qingyang Shen, Quan Guo

MIMIC-IV-Ext Triage Instruction Corpus includes 9,629 ED triage cases organized by the five-level ESI, enabling LLMs to improve triage accuracy. It provides CSV data, generation prompts, expert validation samples, and SQL QC scripts.

nlp clinical decision support large language models machine learning emergency severity index emergency triage

Published: March 4, 2025. Version: 1.0.0


Database Open Access

Long Term Movement Monitoring Database

The LTMM database contains 3-day 3D accelerometer recordings of 71 elder community residents, used to study gait, stability, and fall risk.

risk stability accelerometer gait

Published: June 20, 2016. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp

Jing Wang, Xing Niu, Tong Zhang, Jie Shen, Juyong Kim, Jeremy Weiss

It is a time series clinical events dataset with concrete temporal information. The dataset consists of 22,588,586 clinical events and related timestamps from 267,284 discharge summaries of the MIMIC-IV-Note.

mimic clinical event annotation time series temporal annotation

Published: Sept. 29, 2025. Version: 1.0.0


Database Restricted Access

Community-Acquired Pneumonia, Endotypes and Phenotypes (NACef): Prospective, observational cohort study of Translational Medicine

Natalia Sanabria-Herrera, Esteban Garcia Gallo, Luis Felipe Reyes

Community-Acquired Pneumonia (CAP) poses a significant health risk, linked to high in-hospital morbidity and mortality rates. The dataset includes clinical details of 768 CAP patients at Clinica Universidad de La Sabana, Colombia.

Published: Aug. 21, 2025. Version: 2.0.1


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, Olivier Elemento, Maria Powell, David Dorr, Philip Payne, Vardit Ravitsky, Jean-Christophe Bélisle-Pipon, Alistair Johnson, Ruth Bahr, Stephanie Watts, Donald Bolser, Jennifer Siu, Jordan Lerner-Ellis, Frank Rudzicz, Micah Boyer, Samantha Salvi Cruz, Yassmeen Abdel-Aty, Toufeeq Ahmed Syed, James Anibal, Stephen Aradi, Ana Sophia Martinez, Shaheen Awan, Steven Bedrick, Alexander Bernier, Isaac Bevers, Rahul Brito, Selina Casalino, John Costello, Iris De Santiago, Enrique Diaz-Ocampo, Mohamed Ebraheem, Ellie Eiseman, Mahmoud Elmahdy, Emily Evangelista, Kenneth Fletcher, Hortense Gallois, Alexander Gelbard, Anna Goldenberg, Karim Hanna, William Hersh, Lochana Jayachandran, Kaley Jenney, Kathy Jenkins, Stacy Jo, Ayush Kalia, Andrea Krussel, Elisa Lapadula, Chloe Loewith, Radhika Mahajan, Vrishni Maharaj, Siyu Miao, Matthew Mifsud, Marian Mikhael, Elijah Moothedan, Yosef Nafii, Tempestt Neal, Karlee Newberry, Evan Ng, Christopher Nickel, Megan Urbano, Trevor Pharr, Matthew Pontell, Claire Premi-Bortolotto, JM Rahman, Sarah Rohde, Laurie Russell, Suketu Shah, Ahmed Shawkat, Elizabeth Silberholz, Duncan Sutherland, Venkata Swarna Mukhi, Jeffrey Tang, Jamie Toghranegar, Kimberly Vinson, Claire Wilson, Madeleine Zanin, Xijie Zeng, Theresa Zesiewicz, Robin Zhao, Pantelis Zisimopoulos, Satrajit Ghosh

A dataset of features from voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: Aug. 18, 2025. Version: 2.0.1


Database Credentialed Access

Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database

Vijeeth Guggilla, Melissa Bak, Mengjia Kang, Theresa Walunas, Catherine A Gao

This database contains 200 MIMIC-III admission notes with adjudicated labels for histories of various immunosuppressive conditions and usage of various immunosuppressive medications.

Published: Aug. 4, 2025. Version: 1.0.0