Resources


Database Credentialed Access

AMR-UTI: Antimicrobial Resistance in Urinary Tract Infections

Michael Oberst, Soorajnath Boominathan, Helen Zhou, Sanjat Kanjilal, David Sontag

AMR-UTI is a freely accessible dataset, derived from electronic health record (EHR) information on over 100,000 urinary tract infections (UTI) treated at Massachusetts General Hospital and Brigham & Women's Hospital in Boston, MA, USA.

antibiotic resistance causal inference policy learning antimicrobial resistance urinary tract infection clinical decision support machine learning

Published: Nov. 4, 2020. Version: 1.0.0


Database Open Access

Surface electromyographic signals collected during long-lasting ground walking of young able-bodied subjects

Francesco Di Nardo, Christian Morbidoni, Sandro Fioretti

The dataset is composed of long-lasting surface electromyographic (sEMG) signals recorded from ten muscles during ground walking of 31 young able-bodied subjects in Movement Analysis Lab, UniversitĂ  Politecnica delle Marche, Ancona, Italy.

surface emg signal walking biomedical signals gait analysis muscle recruitment

Published: March 31, 2022. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

radiology entity and relation extraction graph multi-modal natural language processing

Published: June 3, 2021. Version: 1.0.0


Database Open Access

Apnea-ECG Database

Seventy ECG signals with expert-labelled apnea annotations and machine-generated QRS annotations.

apnea sleep multiparameter challenge ecg

Published: Feb. 10, 2000. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MIMIC-III and eICU-CRD: Feature Representation by FIDDLE Preprocessing

Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, Michael Sjoding, Jenna Wiens

Features and labels from MIMIC-III and eICU-CRD produced by FIDDLE, an EHR preprocessing pipeline.

preprocessing machine learning electronic health record

Published: April 28, 2021. Version: 1.0.0


Database Open Access

PTB-XL, a large publicly available electrocardiography dataset

Patrick Wagner, Nils Strodthoff, Ralf-Dieter Bousseljot, Wojciech Samek, Tobias Schaeffter

The PTB-XL ECG dataset is a large dataset of 21801 clinical 12-lead ECGs from 18869 patients of 10 second length. The raw signal data has been annotated by up to two cardiologists with 71 different ECG statements and is supplemented by rich metadata.

ptb-xl ptb electrocardiography ecg

Published: Nov. 9, 2022. Version: 1.0.3

Visualize waveforms

Database Credentialed Access

BRAX, a Brazilian labeled chest X-ray dataset

Eduardo Pontes Reis, Joselisa Paiva, Maria Carolina Bueno da Silva, Guilherme Alberto Sousa Ribeiro, Victor Fornasiero Paiva, Lucas Bulgarelli, Henrique Lee, Paulo Victor dos Santos, vanessa brito, Lucas Amaral, Gabriel Beraldo, Jorge Nebhan Haidar Filho, Gustavo Teles, Gilberto Szarf, Tom Pollard, Alistair Johnson, Leo Anthony Celi, Edson Amaro

BRAX contains 24,959 chest radiography exams and 40,967 images acquired in a large general Brazilian hospital. All images have been read by trained radiologists and 14 labels were derived from Brazilian Portuguese reports using NLP.

chest x-ray artificial intelligence dataset

Published: June 17, 2022. Version: 1.1.0


Database Open Access

Continuous Cuffless Monitoring of Arterial Blood Pressure via Graphene Bioimpedance Tattoos

Bassem Ibrahim, Dmitry Kireev, Kaan Sel, Neelotpala Kumar, Ali Akbari, Roozbeh Jafari, deji akinwande

Cuffless blood pressure data repository that includes raw time data for 4-channel Bioimpedance signals using Graphene Tattoos from the wrist with synchronized continuous blood pressure and PPG signals from 7 subjects

blood pressure ppg graphene bioimpedance

Published: June 4, 2022. Version: 1.0.0


Database Credentialed Access

FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark

Mingjie Li, Wenjia Cai, Rui Liu, Yuetian Weng, Xiaoyun Zhao, Cong Wang, Xin Chen, Zhong Liu, Caineng Pan, Mengke Li, Yingfeng Zheng, Yizhi Liu, Flora Salim, Karin Verspoor, Xiaodan Liang, Xiaojun Chang

Benchmark dataset for report generation based on fundus fluorescein angiography images and reports.

fundus fluorescein angiography explainable and reliable evaluation vision and language medical report generation

Published: Sept. 21, 2021. Version: 1.0.0


Database Credentialed Access

Paediatric Intensive Care database

Haomin Li, Xian Zeng, Gang Yu

PIC (Paediatric Intensive Care) is a large paediatric-specific, single-centre, bilingual database comprising information relating to children admitted to critical care units at a large children’s hospital in China.

intensive care pediatrics critical care natural language processing

Published: Nov. 12, 2020. Version: 1.1.0