Resources


Database Credentialed Access

Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset

Ziming Wei, Luke Sagers, Caroline McKenna, et al.

NPA-CP is a freely accessible dataset derived from electronic health record (EHR) information at MGB between 2015 and 2024. The dataset includes 11 different pathogens and can be used to predict hospital-onset infections for these pathogens.

electronic health records infection control clinical machine learning infectious diseases hospital onset infection colonization pressure

Published: Nov. 4, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-III - SequenceExamples for TensorFlow modeling

Jonas Kemp, Kun Zhang, Andrew Dai

MIMIC-III data converted into TensorFlow SequenceExample format, for use in modeling pipelines.

tensorflow sequence modeling deep learning machine learning

Published: Sept. 29, 2020. Version: 1.0.0


Database Open Access

MIT-BIH Arrhythmia Database

Two-channel ambulatory ECG recordings, obtained from 47 subjects studied by the BIH Arrhythmia Laboratory between 1975 and 1979.

arrhythmia ecg

Published: Feb. 24, 2005. Version: 1.0.0

Visualize waveforms

Database Contributor Review

Medical Information Mart for Intensive Care Brazil (MIMIC-BR): a Brazilian Dataset of Anonymized Hospital and ICU Clinical Data

Gabriela Steil, Adhara Brandão Lima Vanhoz, Mateus de Lima Freitas, et al.

Medical Information Mart for Intensive Care Brazil (MIMIC-BR) is a Brazilian dataset of hospital and ICU adult patients anonymized data. It includes 31,789 admissions to the Einstein Hospital Israelita during a period of 3 years in the last 10 years

critical care dataset artificial intelligence intensive care unit machine learning tertiary heatlhcare data anonymization inpatients

Published: May 21, 2026. Version: 1.0.0