Resources
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark
deidentification critical care clinical notes natural language processing electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
Henrique Dias, Ana Helena Dias Pereira dos Ulbrich
prescriptions exams tertiary care clinical notes natural language processing
Published: July 14, 2022. Version: 1.1
Database Credentialed Access
Annotated Question-Answer Pairs for Clinical Notes in the MIMIC-III Database
Xiang Yue, Xinliang Frederick Zhang, Huan Sun
clinical question answering clinical nlp clinical reading comprehension
Published: Jan. 15, 2021. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence clinical notes natural language processing large language models brief hospital course electronic health records long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia
Nikolay Markov, Catherine A Gao, Thomas Stoeger, Anna Pawlowski, Mengjia Kang, Prasanth Nannapaneni, Rogan Grant, Luke Rasmussen, Daniel Schneider, Justin Starren, Richard Wunderink, GR Scott Budinger, Alexander Misharin, Benjamin Singer, NU SCRIPT Study Investigators
Published: March 13, 2023. Version: 1.1.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark
deidentification critical care clinical notes natural language processing electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
Henrique Dias, Ana Helena Dias Pereira dos Ulbrich
prescriptions exams tertiary care clinical notes natural language processing
Published: July 14, 2022. Version: 1.1
Challenge Credentialed Access
ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care
Danielle Mowery
Published: Feb. 15, 2013. Version: 1.0
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0