Resources


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

mimic, chest x-rays, computer vision, radiology, natural language processing, machine learning

Published: Sept. 19, 2019. Version: 2.0.0


Database Credentialed Access

MIMIC-CXR-JPG - chest radiographs with structured labels

Alistair Johnson, Matt Lungren, Yifan Peng, Zhiyong Lu, Roger Mark, Seth Berkowitz, Steven Horng

Chest x-rays in JPG format with structured labels derived from the associated radiology report.

mimic, computer vision, radiology, chest x-ray, deep learning

Published: Nov. 14, 2019. Version: 2.0.0


Software Open Access

De-Identification Software Package

The deid software package includes code and dictionaries for automated location and removal of protected health information (PHI) in free text from medical records.

anonymization, deidentification, phi

Published: Dec. 18, 2007. Version: 1.1


Database Credentialed Access

Deidentified Medical Text

Margaret Douglass, Bill Long, George Moody, Peter Szolovits, Li-wei Lehman, Roger Mark, Gari Clifford

Gold standard corpus of 2,434 deidentified nursing notes.

medical text, nursing notes, de-identification, hipaa

Published: Dec. 18, 2007. Version: 1.0


Software Open Access

edf-anonymize

edf-anonymize reads an EDF or EDF+ file as input and writes an anonymized copy of it as output.

anonymization, deidentification, phi

Published: May 17, 2010. Version: 1.0.0


Model Credentialed Access

Clinical BERT Models Trained on Pseudo Re-identified MIMIC-III Notes

Eric Lehman, Sarthak Jain, Karl Pichotta, Yoav Goldberg, Byron Wallace

We explore recovering sensitive information from BERT models trained on non-deidentified EHR data. We make our models and data available to facilitate further research.

Published: April 28, 2021. Version: 1.0.0