Resources


Database Credentialed Access

SCRIPT CarpeDiem Dataset: demographics, outcomes, and per-day clinical parameters for critically ill patients with suspected pneumonia

Nikolay Markov, Catherine A Gao, Thomas Stoeger, Anna Pawlowski, Mengjia Kang, Prasanth Nannapaneni, Rogan Grant, Luke Rasmussen, Daniel Schneider, Justin Starren, Richard Wunderink, GR Scott Budinger, Alexander Misharin, Benjamin Singer, NU SCRIPT Study Investigators

SCRIPT seeks to delineate the host/pathogen interactions during pneumonia using multiomic analysis of bronchoalveolar lavage fluid joined with clinical data and physician adjudication.

Published: March 13, 2023. Version: 1.1.0


Database Credentialed Access

MIMIC-IV-Note: Deidentified free-text clinical notes

Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

Deidentified free-text clinical notes for patients in the MIMIC-IV Clinical Database.

deidentification, mimic, critical care, natural language processing, clinical notes, electronic health record

Published: Jan. 6, 2023. Version: 2.2


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions, exams, tertiary care, natural language processing, clinical notes

Published: July 14, 2022. Version: 1.1


Challenge Credentialed Access

ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care

Danielle Mowery

2013 ShARe/CLEF eHealth Evaluation Lab: Natural Language Processing and Information Retrieval for Clinical Care (Tasks 1 and 2).

natural language processing

Published: Feb. 15, 2013. Version: 1.0


Model Credentialed Access

EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain

Chen Lin, Steven Bethard, Guergana Savova, Timothy Miller, Dmitriy Dligach

Pretraining of models with a broad representation of biomedical terminology (PubMedBERT) on the MIMIC-III corpus, with or without a novel entity-centric masking strategy.

Published: March 17, 2022. Version: 1.0.1


Database Credentialed Access

MIMIC-II Clinical Database

Mohammed Saeed, Mauricio Villarroel, Andrew Reisner, Gari Clifford, Li-wei Lehman, George Moody, Thomas Heldt, Tin Kyaw, Benjamin Moody, Roger Mark

Electronic health record data collected from >30,000 patients admitted to ICUs at a single tertiary care hospital.

ehr, icu, mimic-ii, bidmc

Published: April 24, 2011. Version: 2.6.0


Software Open Access

Transformer-DeID: Deidentification of free-text clinical notes with transformers

Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson

Fine-tune transformer-based neural networks to deidentify clinical text data.

deidentification, neural networks, transformers

Published: Nov. 2, 2023. Version: 1.0.0


Database Open Access

MIMIC-IV Clinical Database Demo on FHIR

Alex Bennett, Joshua Wiedekopf, Hannes Ulrich, Alistair Johnson

MIMIC-IV-on-FHIR is a 100-patient demo of MIMIC-IV v2.0 in the Fast Healthcare Interoperability Resources (FHIR) format. It provides implementers with a real-world FHIR datastore to aid FHIR research and development.

mimic, electronic health records, fhir

Published: June 7, 2022. Version: 2.0


Model Credentialed Access

What's in a Note? Unpacking Predictive Value in Clinical Note Representations

Tristan Naumann, William Boag

Word vectors corresponding to the AMIA 2018 Informatics Summit paper of the same name.

Published: Jan. 7, 2018. Version: 0.1


Challenge Credentialed Access

Analysis of Clinical Text: Task 14 of SemEval 2015

Guergana Savova

This is the dataset for SemEval 2014 and 2015, Analysis of Clinical Text.

semeval, nlp

Published: Dec. 28, 2014. Version: 2.0