Resources


Database Credentialed Access

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Matthew Lungren, Andrew Ng, Curtis Langlotz, Pranav Rajpurkar

RadGraph is a dataset of entities and relations in full-text chest X-ray radiology reports, which are obtained using a novel information extraction (IE) schema to capture clinically relevant information in a radiology report.

radiology entity and relation extraction graph multi-modal natural language processing

Published: June 3, 2021. Version: 1.0.0


Database Open Access

Wilson Central Terminal ECG Database

Hossein Moeinzadeh, Gaetano Gargiulo

Wilson Central Terminal ECG signals recorded from 92 patients.

wilson central terminal limb potential unipolar lead electrocardiography ecg

Published: Nov. 13, 2019. Version: 1.0.1

Visualize waveforms

Database Open Access

Wrist PPG During Exercise

Photoplethysmogram recorded from 8 volunteers during walking, running and bike riding.

multiparameter photoplethysmogram accelerometer movement ecg

Published: Oct. 20, 2017. Version: 1.0.0

Visualize waveforms

Database Open Access

Apnea-ECG Database

Seventy ECG signals with expert-labelled apnea annotations and machine-generated QRS annotations.

apnea sleep multiparameter challenge ecg

Published: Feb. 10, 2000. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark

Mingjie Li, Wenjia Cai, Rui Liu, Yuetian Weng, Xiaoyun Zhao, Cong Wang, Xin Chen, Zhong Liu, Caineng Pan, Mengke Li, Yingfeng Zheng, Yizhi Liu, Flora Salim, Karin Verspoor, Xiaodan Liang, Xiaojun Chang

Benchmark dataset for report generation based on fundus fluorescein angiography images and reports.

fundus fluorescein angiography explainable and reliable evaluation vision and language medical report generation

Published: Sept. 21, 2021. Version: 1.0.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 Chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Software Credentialed Access

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

Luis R Soenksen, Yu Ma, Cynthia Zeng, Leonard David Jean Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly Wiberg, Michael Li, Ignacio Fuentes, Dimitris Bertsimas

Code for generating the HAIM multimodal dataset of MIMIC-IV clinical data and x-rays

database code multimodality

Published: Aug. 23, 2022. Version: 1.0.1


Database Open Access

Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset

Shawn Tan, Satya Ortiz-Gagné, Nicolas Beaudoin-Gagnon, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

This is a dataset of continuous raw electrocardiogram (ECG) signals for representation learning containing 11 thousand patients and 2 billion labelled beats.

representation learning ecg

Published: April 12, 2022. Version: 1.0

Visualize waveforms

Database Open Access

Santa Fe Time Series Competition Data Set B

This is a multivariate data set recorded from a patient in the sleep laboratory of the Beth Israel Hospital (now the Beth Israel Deaconess Medical Center) in Boston, Massachusetts. This data set was extracted from record slp60 of the MIT-BIH Polysom…

sleep multiparameter

Published: Jan. 6, 2000. Version: 1.0.0


Database Restricted Access

VinDr-PCXR: An open, large-scale pediatric chest X-ray dataset for interpretation of common thoracic diseases

Hieu Huy Pham, Tien Thanh Tran, Ha Quy Nguyen

An open, large-scale pediatric chest X-ray dataset that contains both lesion-level labels and image-level labels for multiple findings and diseases for interpretation of common thoracic diseases.

Published: March 21, 2022. Version: 1.0.0