Resources
Software Open Access
Transformer-DeID: Deidentification of free-text clinical notes with transformers
deidentification neural networks transformers
Published: Nov. 2, 2023. Version: 1.0.0
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
prescriptions exams tertiary care natural language processing clinical notes
Published: July 14, 2022. Version: 1.1
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
prescriptions exams tertiary care natural language processing clinical notes
Published: July 14, 2022. Version: 1.1
Database Credentialed Access
Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information
Published: Dec. 16, 2025. Version: 3.0.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification critical care natural language processing clinical notes electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
deidentification critical care natural language processing clinical notes electronic health record mimic
Published: Jan. 6, 2023. Version: 2.2
Database Restricted Access
OpenOximetry Repository
Published: Feb. 28, 2025. Version: 1.1.1
Database Contributor Review
Chest Computed Tomography for patients with sepsis in the Emergency Department
Published: Oct. 28, 2024. Version: 1.0.0
Database Credentialed Access
Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset
electronic health records infection control clinical machine learning infectious diseases hospital onset infection colonization pressure
Published: Nov. 4, 2025. Version: 1.0.0
Database Credentialed Access
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Published: April 12, 2022. Version: 1.0.0