Resources


Model Credentialed Access

Clinical BERT Models Trained on Pseudo Re-identified MIMIC-III Notes

Eric Lehman, Sarthak Jain, Karl Pichotta, Yoav Goldberg, Byron Wallace

We explore recovering sensitive information from BERT models trained on non-deidentified EHR data. We make our models and data available to facilitate further research.

Published: April 28, 2021. Version: 1.0.0


Database Restricted Access

Pulmonary Edema Severity Grades Based on MIMIC-CXR

Ruizhi Liao, Geeticka Chauhan, Polina Golland, Seth Berkowitz, Steven Horng

Pulmonary edema metadata and labels for MIMIC-CXR

Published: Feb. 9, 2021. Version: 1.0.1


Database Credentialed Access

National Institutes of Health Stroke Scale (NIHSS) Annotations for the MIMIC-III Database

Jiayang Wang, Xiaoshuo Huang, Lin Yang, Jiao Li

A dataset of annotated NIHSS scale items and corresponding scores from stroke patients' discharge summaries in MIMIC-III.

Published: Jan. 25, 2021. Version: 1.0.0


Model Credentialed Access

Transformer models trained on MIMIC-III to generate synthetic patient notes

Ali Amin-Nejad, Julia Ive, Sumithra Velupillai

Machine learning models that have been trained using MIMIC-III to enable the creation of synthetic discharge summaries.

Published: May 27, 2020. Version: 1.0.0


Database Open Access

MIMIC-III Waveform Database Matched Subset

Benjamin Moody, George Moody, Mauricio Villarroel, Gari Clifford, Ikaro Silva

Physiological signals (including continuous ECG, PPG, ABP, and other signals) that are associated with patients in the MIMIC-III Clinical Database.

Published: April 7, 2020. Version: 1.0

Database Open Access

MIMIC-III Waveform Database

Benjamin Moody, George Moody, Mauricio Villarroel, Gari Clifford, Ikaro Silva

The MIMIC-III Waveform Database contains numerous physiological signals (including continuous ECG, PPG, ABP, and other signals) and periodic measurements, recorded by bedside patient monitors from about 30,000 patients in intensive care units.

Published: April 7, 2020. Version: 1.0

Database Open Access

Clinical data from the MIMIC-II database for a case study on indwelling arterial catheters

Jesse Raffa

Dataset extracted from MIMIC-II for a tutorial on the effect of indwelling arterial catheters on mortality in hemodynamically stable patients with respiratory failure.

Published: Oct. 28, 2016. Version: 1.0


Database Credentialed Access

RadNLI: A natural language inference dataset for the radiology domain

Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz, Dan Jurafsky

A radiology NLI dataset introduced in the paper "Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation".

Published: June 29, 2021. Version: 1.0.0


Database Credentialed Access

CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T Greg McKelvey, Yi Yang, David Sontag

Clinical action items annotated over MIMIC-III. 718 discharge summaries are labeled at the sentence and character level with multiple action labels, including Appointment, Lab, Procedure, Medication, Imaging, Patient Instructions, and Other.

Published: June 21, 2021. Version: 1.0.0


Challenge Credentialed Access

ShAReCLEF eHealth 2013: Natural Language Processing and Information Retrieval for Clinical Care

Danielle Mowery

2013 ShARe/CLEF eHealth Evaluation Lab: Natural Language Processing and Information Retrieval for Clinical Care (Tasks 1 and 2).

Published: Feb. 15, 2013. Version: 1.0