Resources
Database Credentialed Access
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries
Sunjun Kweon, Jiyoun Kim, Heeyoung Kwak, Dongchul Cha, Hangyul Yoon, Kwang Hyun Kim, Jeewon Yang, Seunghyun Won, Edward Choi
Published: June 26, 2024. Version: 1.0.1
Database Credentialed Access
ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection
Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee Sung, Joel Reisman, Wenjun Li, Robert Kerns, William Becker, Hong Yu
substance use natural language processing opioid related aberrant behavior
Published: Jan. 11, 2024. Version: 1.0.0
Database Credentialed Access
Annotation dataset of problematic opioid use and related contexts from MIMIC-III Critical Care Database discharge summaries
Melissa Poulsen, Vanessa Troiani, Philip Freda, Danielle Mowery, Anahita Davoudi
opioid use disorder substance use natural language processing clinical notes
Published: Feb. 8, 2023. Version: 1.0.0
Database Credentialed Access
Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization
Yanjun Gao, John Caskey, Timothy Miller, Brihat Sharma, Matthew Churpek, Dmitriy Dligach, Majid Afshar
Published: Sept. 30, 2022. Version: 1.0.0
Database Credentialed Access
Learning to Ask Like a Physician: a Discharge Summary Clinical Questions (DiSCQ) Dataset
Eric Lehman
question generation question answering machine learning
Published: July 28, 2022. Version: 1.0
Database Credentialed Access
MIMIC-III and eICU-CRD: Feature Representation by FIDDLE Preprocessing
Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, Michael Sjoding, Jenna Wiens
preprocessing electronic health record machine learning
Published: April 28, 2021. Version: 1.0.0
Database Credentialed Access
National Institutes of Health Stroke Scale (NIHSS) Annotations for the MIMIC-III Database
Jiayang Wang, Xiaoshuo Huang, Lin Yang, Jiao Li
Published: Jan. 25, 2021. Version: 1.0.0
Database Credentialed Access
Phenotype Annotations for Patient Notes in the MIMIC-III Database
Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt
patient classification natural language processing
Published: March 5, 2020. Version: 1.20.03
Database Open Access
MIMIC-IV demo data in the Medical Event Data Standard (MEDS)
Robin Philippus van de Water, Ethan Steinberg, Michael Wornow, Patrick Rockenschaub, Matthew McDermott
ehr critical care electronic health record mimic machine learning meds medical event data standard
Published: Sept. 29, 2025. Version: 0.0.1
Database Credentialed Access
MIMIC-Ext-DrugDetection
Fabrice Harel-Canada, Nanyun Peng, David Goodman, Ruby Romero, Allan Nguyen, Brandon Moghanian, Anabel Salimian
ehr mimic-iv substance use clinical notes methamphetamine multi-label cocaine drug detection polysubstance use prescription opioid misuse cannabis benzodiazepine misuse injection drug use heroin mimic-iii
Published: Sept. 25, 2025. Version: 1.0.0