Resources
Database Contributor Review
CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools
Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger
de-identification clinical ner anonymization
Published: April 20, 2024. Version: 1.0.1
Database Open Access
Smart Health for Assessing the Risk of Events via ECG Database
risk hypertension holter hrv ecg
Published: May 19, 2015. Version: 1.0.0
Visualize waveformsChallenge Open Access
Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022
Matthew Reyna, Yashar Kiarashi, Andoni Elola, Jorge Oliveira, Francesco Renna, Annie Gu, Erick Andres Perez Alday, Nadi Sadr, Sandra Mattos, Miguel Coimbra, Reza Sameni, Ali Bahrami Rad, Zuzana Koscova, Gari Clifford
challenge competition cardiac auscultation congenital heart diseases
Published: Sept. 28, 2023. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence clinical notes natural language processing large language models brief hospital course electronic health records long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
BOLD, a blood-gas and oximetry linked dataset
João Matos, Tristan Struja, Jack Gallifant, Luis Filipe Nakayama, Marie Charpignon, Xiaoli Liu, Jaime dos Santos Cardoso, Leo Anthony Celi, An Kwok Wong
pulse oximetry intensive care unit health equity electronic health records
Published: Nov. 8, 2023. Version: 1.0
Database Credentialed Access
GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization
Xuhai Xu, Han Zhang, Yasaman Sefidgar, Yiyi Ren, Xin Liu, Woosuk Seo, Jennifer Brown, Kevin Kuehn, Mike Merrill, Paula Nurius, Shwetak Patel, Tim Althoff, Margaret Morris, Eve Riskin, Jennifer Mankoff, Anind Dey
health ubiquitous computing well-being passive mobile sensing human behavior modeling
Published: March 14, 2023. Version: 1.1
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark
deidentification critical care clinical notes natural language processing electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Credentialed Access
NCH Sleep DataBank: A Large Collection of Real-world Pediatric Sleep Studies with Longitudinal Clinical Data
Harlin Lee, Boyue Li, Yungui Huang, Yuejie Chi, Simon Lin
eeg ehr pediatrics clinical decision support polysomnography sleep study ecg sleep disorders electronic health records
Published: Oct. 27, 2021. Version: 3.1.0
Database Credentialed Access
Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database
Vijeeth Guggilla, Melissa Bak, Mengjia Kang, Theresa Walunas, Catherine A Gao
Published: Aug. 4, 2025. Version: 1.0.0
Database Restricted Access
DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology
Ke Wang, Jiamu Yang, Ayush Shetty, Jessilyn Dunn
wearable sleep disorders biomedical time series classification
Published: April 30, 2025. Version: 2.1.0