Resources
Challenge Credentialed Access
ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care
Danielle Mowery
Published: Feb. 15, 2013. Version: 1.0
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
Henrique Dias, Ana Helena Dias Pereira dos Ulbrich
prescriptions exams tertiary care clinical notes natural language processing
Published: July 14, 2022. Version: 1.1
Database Credentialed Access
Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries
Stefan Hegselmann, Shannon Shen, Florian Gierse, Monica Agrawal, David Sontag, Xiaoyi Jiang
Published: April 30, 2025. Version: 1.0.1
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0
Challenge Credentialed Access
ShAReCLEF eHealth Evaluation Lab 2014 (Task 2): Disorder Attributes in Clinical Reports
Danielle Mowery
Published: Nov. 1, 2013. Version: 1.0
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence clinical notes natural language processing large language models brief hospital course electronic health records long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence clinical notes natural language processing large language models brief hospital course electronic health records long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
Phenotype Annotations for Patient Notes in the MIMIC-III Database
Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt
patient classification natural language processing
Published: March 5, 2020. Version: 1.20.03
Database Credentialed Access
C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text
Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad
clinical notes patient country information race and ethnicity patient language information
Published: Oct. 21, 2024. Version: 1.0.0