Featured Resources


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Yael Bensoussan, Alexandros Sigaras, Anais Rameau, Olivier Elemento, Maria Powell, David Dorr, Philip Payne, Vardit Ravitsky, Jean-Christophe Bélisle-Pipon, Alistair Johnson, Ruth Bahr, Stephanie Watts, Donald Bolser, Jennifer Siu, Jordan Lerner-Ellis, Frank Rudzicz, Micah Boyer, Samantha Salvi Cruz, Yassmeen Abdel-Aty, Toufeeq Ahmed Syed, James Anibal, Stephen Aradi, Ana Sophia Martinez, Shaheen Awan, Steven Bedrick, Isaac Bevers, Rahul Brito, Selina Casalino, John Costello, Iris De Santiago, Enrique Diaz-Ocampo, Mohamed Ebraheem, Ellie Eiseman, Mahmoud Elmahdy, Emily Evangelista, Kenneth Fletcher, Alexander Gelbard, Anna Goldenberg, Karim Hanna, William Hersh, Lochana Jayachandran, Kaley Jenney, Kathy Jenkins, Stacy Jo, Ayush Kalia, Andrea Krussel, Elisa Lapadula, Chloe Loewith, Radhika Mahajan, Vrishni Maharaj, Siyu Miao, Matthew Mifsud, Marian Mikhael, Elijah Moothedan, Yosef Nafii, Tempestt Neal, Karlee Newberry, Evan Ng, Christopher Nickel, Trevor Pharr, Claire Premi-Bortolotto, JM Rahman, Sarah Rohde, Laurie Russell, Suketu Shah, Ahmed Shawkat, Elizabeth Silberholz, Duncan Sutherland, Venkata Swarna Mukhi, Jeffrey Tang, Jamie Toghranegar, Kimberly Vinson, Claire Wilson, Madeleine Zanin, Xijie Zeng, Theresa Zesiewicz, Robin Zhao, Pantelis Zisimopoulos, Satrajit Ghosh

A dataset of features from voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai

Published: April 16, 2025. Version: 2.0.0


Database Open Access

VitalDB, a high-fidelity multi-parameter vital signs database in surgical patients

Hyung-Chul Lee, Chul-Woo Jung

VitalDB, a high-fidelity multi-parameter vital signs database in surgical patients

waveform anesthesia vitaldb intraoperative biosignal ecg

Published: Sept. 21, 2022. Version: 1.0.0


Database Credentialed Access

Northwestern ICU (NWICU) database

Dana Moukheiber, William Temps, Bhadrappa Molgi, Yikuan Li, Alice Lu, Prasanth Nannapaneni, Abdulrahman Chahin, Sicheng Hao, Felipe Torres Fabregas, Leo Anthony Celi, Adrian Wong, Maxwell Lloyd, Xavier Borrat Frigola, Hyung-Chul Lee, Daniel Schneider, Tom Pollard, Yuan Luo, Abel Kho, Roger Mark

A freely available COVID-rich ICU database comprising de-identified health-related data from Northwestern Memorial Health Center (NHMC).

Published: Nov. 19, 2024. Version: 0.1.0


Database Credentialed Access

MIMIC-IV

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

Large database of de-identified health information from patients admitted to Beth Israel Deaconess Medical Center

critical care intensive care unit mimic machine learning

Published: Jan. 6, 2023. Version: 2.2


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

computer vision chest x-rays natural language processing mimic machine learning radiology

Published: Sept. 19, 2019. Version: 2.0.0


Database Credentialed Access

BRAX, a Brazilian labeled chest X-ray dataset

Eduardo Pontes Reis, Joselisa Paiva, Maria Carolina Bueno da Silva, Guilherme Alberto Sousa Ribeiro, Victor Fornasiero Paiva, Lucas Bulgarelli, Henrique Lee, Paulo Victor dos Santos, vanessa brito, Lucas Amaral, Gabriel Beraldo, Jorge Nebhan Haidar Filho, Gustavo Teles, Gilberto Szarf, Tom Pollard, Alistair Johnson, Leo Anthony Celi, Edson Amaro

BRAX contains 24,959 chest radiography exams and 40,967 images acquired in a large general Brazilian hospital. All images have been read by trained radiologists and 14 labels were derived from Brazilian Portuguese reports using NLP.

chest x-ray dataset artificial intelligence

Published: June 17, 2022. Version: 1.1.0


Latest Resources


Database Open Access

bigP3BCI: An Open, Diverse and Machine Learning Ready P300-based Brain-Computer Interface Dataset

Boyla Mainsah, Chance Fleeting, Thomas Balmat, Eric Sellers, Leslie Collins

A collection of data from P300-based brain-computer interface studies.

brain-computer interface electroencephalography ieee p2731 working group standard amyotrophic lateral sclerosis p300 speller p300 event related potential oddball paradigm error-related potential

Published: May 19, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext Cardiac Disease

Jiawei Cao, Sendong Zhao

The subset of the MIMIC-IV dataset includes the examination results and diagnostic information of 4,761 cardiac disease patients. The examination results for each patient are listed separately as evidence for the final diagnosis.

Published: May 6, 2025. Version: 1.0.0


Database Credentialed Access

FDTooth: Intraoral Photographs and Cone-Beam Computed Tomography Images for Fenestration and Dehiscence Detection

Yanqi Yang, Xiaomeng LI, Keyuan Liu, Marawan Elbatel

FDTooth is a dataset containing intraoral photographs and cone-beam computed tomography (CBCT) images with annotations for automated detection of fenestration and dehiscence in anterior teeth.

Published: May 5, 2025. Version: 1.0.0


Database Credentialed Access

MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters

Amin Dada, Osman Alperen Koras, Marie Bauer, Amanda Butler, Kaleb Smith, Jens Kleesiek, Julian Friedrich

MeDiSumQA is a dataset of patient-oriented QA pairs from MIMIC-IV discharge summaries, designed to evaluate LLMs in generating safe, patient-friendly medical responses for clinical QA and healthcare communication.

Published: May 5, 2025. Version: 1.0.0


Database Open Access

Minute level step counts and physical activity data from the National Health and Nutrition Examination Survey (NHANES) 2011-2014

Lily Koffman, John Muschelli

Minute level step counts obtained from five step counting algorithms for raw accelerometry data, and minute level Activity Counts, MIMS, wear predictions, and wear flags for all participants who wore accelerometers in NHANES 2011-2014.

accelerometry physical activity steps nhanes

Published: May 5, 2025. Version: 1.0.1


Database Restricted Access

DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology

Ke Wang, Jiamu Yang, Ayush Shetty, Jessilyn Dunn

We present high resolution wearable device multichannel data along with clinical labeled and recorded sleep stage and polysomnography (PSG) data from 100 sleep abnormal patients with sleep apnea.

sleep disorders wearable biomedical time series classification

Published: April 30, 2025. Version: 2.1.0