Resources
Database Credentialed Access
MIMIC-Ext-DrugDetection
ehr mimic-iv substance use clinical notes mimic-iii methamphetamine multi-label cocaine drug detection polysubstance use prescription opioid misuse cannabis benzodiazepine misuse injection drug use heroin
Published: Sept. 25, 2025. Version: 1.0.0
Database Credentialed Access
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
electronic health records multi-turn dialogue llm simulation doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0
Database Credentialed Access
RaDialog Instruct Dataset
medical image understaning radiology chatbot radiology report generation radiology assistant large vision-language models
Published: July 12, 2024. Version: 1.1.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
artificial intelligence natural language processing clinical notes electronic health records brief hospital course large language models long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Challenge Credentialed Access
ArchEHR-QA: A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization
question answering electronic health record patient portals clinicians
Published: Jan. 1, 2026. Version: 1.3
Database Credentialed Access
EchoGraph-annotated ECHO-NOTE2NUM examples
Published: Dec. 3, 2025. Version: 1.0.0
Database Credentialed Access
Nosocomial Risk Datasets from MIMIC-III
pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning
Published: Sept. 15, 2022. Version: 1.0
Database Restricted Access
Multimodal Physiological Monitoring During Virtual Reality Piloting Tasks
Published: Aug. 25, 2022. Version: 1.0.0
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Published: April 12, 2022. Version: 1.0.0
Database Credentialed Access
RadNLI: A natural language inference dataset for the radiology domain
Published: June 29, 2021. Version: 1.0.0