Featured Resources


Database Credentialed Access

MIMIC-IV

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

Database of patients admitted to the Beth Israel Deaconess Medical Center

mimic critical care intensive care unit machine learning

Published: March 16, 2021. Version: 1.0


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

mimic computer vision chest x-rays radiology machine learning natural language processing

Published: Sept. 19, 2019. Version: 2.0.0


Database Credentialed Access

eICU Collaborative Research Database

Tom Pollard, Alistair Johnson, Jesse Raffa, Leo Anthony Celi, Omar Badawi, Roger Mark

Multi-center database comprising deidentified health data associated with over 200,000 admissions to ICUs across the United States between 2014-2015.

telemedicine icu critical care

Published: April 15, 2019. Version: 2.0


Database Open Access

MIT-BIH Arrhythmia Database

Two-channel ambulatory ECG recordings, obtained from 47 subjects studied by the BIH Arrhythmia Laboratory between 1975 and 1979.

arrhythmia ecg

Published: Feb. 24, 2005. Version: 1.0.0

Visualize waveforms

Database Open Access

MIT-BIH Atrial Fibrillation Database

This database includes 25 long-term ECG recordings of human subjects with atrial fibrillation (mostly paroxysmal).

atrial fibrillation ecg

Published: Nov. 4, 2000. Version: 1.0.0

Visualize waveforms

Latest Resources


Database Restricted Access

KINECAL

Sean Maudsley-Barton, Moi Hoon Yap

A dataset for balance falls-risk assessment and balance impairment analysis

posturography postural sway balance age-related changes clinical tests falls-risk

Published: May 16, 2022. Version: 1.0.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 Chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Database Credentialed Access

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

exams natural language processing tertiary care prescriptions clinical notes

Published: May 13, 2022. Version: 1.0


Database Open Access

The CirCor DigiScope Phonocardiogram Dataset

Jorge Oliveira, Francesco Renna, Paulo Costa, Marcelo Nogueira, Ana Cristina Oliveira, Andoni Elola, Carlos Ferreira, Alipio Jorge, Ali Bahrami Rad, Matthew Reyna, Reza Sameni, Gari Clifford, Miguel Coimbra

A large collection of multi-location heart sound signals, with 5272 records collected from 1568 subjects. Heart murmurs have been annotated by a human annotator based on their time, shape, pitch, grading, quality, location and location intensity.

signal processing murmur pitch george b moody physionet challenge 2022 murmur grading murmur location murmur timing phonocardiogram pregnant murmur shape pediatric murmur detection murmur intensity murmur quality

Published: May 10, 2022. Version: 1.0.3

Visualize waveforms

Software Open Access

Model for Simulating ECG and PPG Signals with Arrhythmia Episodes

Andrius Sološenko, Andrius Petrėnas, Birutė Paliakaitė, Vaidotas Marozas, Leif Sörnmo

A model is capable of simulating sinus rhythm, atrial fibrillation and ectopic beats in ECGs and PPGs as well as extreme bradycardia and ventricular tachycardia in PPGs. Different types of noises and artifacts can also be added to the waveforms.

arrhythmia atrial fibrillation noise tachycardia detection motion artifacts ppg simulation bradycardia ecg

Published: May 2, 2022. Version: 1.3.1


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0