Resources
Database Contributor Review
CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools
de-identification, clinical ner, anonymization
Published: April 20, 2024. Version: 1.0.1
Database Credentialed Access
ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection
substance use, natural language processing, opioid related aberrant behavior
Published: Jan. 11, 2024. Version: 1.0.0
Database Credentialed Access
Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization
Published: Sept. 30, 2022. Version: 1.0.0
Challenge Credentialed Access
Analysis of Clinical Text: Task 14 of SemEval 2015
Published: Dec. 28, 2014. Version: 2.0
Database Restricted Access
Application of Med-PaLM 2 in the refinement of MIMIC-CXR labels
Published: Feb. 4, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification, critical care, natural language processing, clinical notes, electronic health record, mimic
Published: Jan. 6, 2023. Version: 2.2
Model Credentialed Access
EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain
Published: March 17, 2022. Version: 1.0.1
Database Credentialed Access
MIMIC-IV-Ext Triage Instruction Corpus
nlp, clinical decision support, machine learning, large language models, emergency severity index, emergency triage
Published: March 4, 2025. Version: 1.0.0
Model Credentialed Access
Fine-tuning foundational models to code diagnoses from veterinary health records
transformers, natural language processing, large language models, foundational models, one health, diagnoses, snomed-ct, veterinary medicine, omop cdm, veterinary medical records, clinical coding
Published: Jan. 25, 2026. Version: 1.0.0