Resources
Database Credentialed Access
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
Henrique Dias, Ana Helena Dias Pereira dos Ulbrich
exams natural language processing tertiary care prescriptions clinical notes
Published: May 13, 2022. Version: 1.0
Database Credentialed Access
Deidentified Medical Text
Margaret Douglass, Bill Long, George Moody, Peter Szolovits, Li-wei Lehman, Roger Mark, Gari D. Clifford
medical text nursing notes de-identification hipaa
Published: Dec. 18, 2007. Version: 1.0
Database Credentialed Access
Annotated Question-Answer Pairs for Clinical Notes in the MIMIC-III Database
Xiang Yue, Xinliang Frederick Zhang, Huan Sun
clinical question answering clinical nlp clinical reading comprehension
Published: Jan. 15, 2021. Version: 1.0.0
Database Credentialed Access
MIMIC-III Clinical Database
Alistair Johnson, Tom Pollard, Roger Mark
MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The databas…
intensive care clinical critical care machine learning natural language processing
Published: Sept. 4, 2016. Version: 1.4
Database Credentialed Access
Phenotype Annotations for Patient Notes in the MIMIC-III Database
Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt
patient classification natural language processing
Published: March 5, 2020. Version: 1.20.03
Database Open Access
MIMIC-III Clinical Database Demo
Alistair Johnson, Tom Pollard, Roger Mark
mimic critical care electronic health records
Published: April 24, 2019. Version: 1.4
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang
Published: April 12, 2022. Version: 1.0.0
Database Credentialed Access
MedNLI - A Natural Language Inference Dataset For The Clinical Domain
Chaitanya Shivade
natural language inference recognizing textual entailment
Published: Oct. 1, 2019. Version: 1.0.0
Database Credentialed Access
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes
James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T Greg McKelvey, Yi Yang, David Sontag
Published: June 21, 2021. Version: 1.0.0
Model Credentialed Access
Clinical BERT Models Trained on Pseudo Re-identified MIMIC-III Notes
Eric Lehman, Sarthak Jain, Karl Pichotta, Yoav Goldberg, Byron Wallace
Published: April 28, 2021. Version: 1.0.0