Resources


Database Credentialed Access

Chest X-ray Dataset with Lung Segmentation

Wimukthi Indeewara, Mahela Hennayake, Kasun Rathnayake, et al.

CXLSeg (Chest X-ray with Lung Segmentation) is a comparatively large dataset of segmented chest X-ray radiographs based on the MIMIC-CXR dataset. It contains segmentation results for 243,324 frontal-view images and their corresponding masks.

segmentation medical reports u-net chest radiographs mimic-cxr chest x-ray

Published: Feb. 8, 2023. Version: 1.0.0


Database Credentialed Access

Chest X-ray segmentation images based on MIMIC-CXR

Li-Ching Chen, Po-Chih Kuo, Ryan Wang, et al.

A chest X-ray segmentation dataset derived from MIMIC-CXR, produced with a deep learning algorithm and human examination.

segmentation chest x-rays cxr

Published: Aug. 18, 2022. Version: 1.0.0


Database Restricted Access

Smartphone-Captured Chest X-Ray Photographs

Po-Chih Kuo, ChengChe Tsai, Diego M Lopez, et al.

Smartphone-captured CXR images including photographs taken from MIMIC-CXR and CheXpert, photographs taken by resident doctors, and photographs taken with different devices.

smartphone photograph cxr

Published: Sept. 27, 2020. Version: 1.0.0


Database Contributor Review

Medical Information Mart for Intensive Care Brazil (MIMIC-BR): a Brazilian Dataset of Anonymized Hospital and ICU Clinical Data

Gabriela Steil, Adhara Brandão Lima Vanhoz, Mateus de Lima Freitas, et al.

Medical Information Mart for Intensive Care Brazil (MIMIC-BR) is a Brazilian dataset of anonymized data from adult hospital and ICU patients. It includes 31,789 admissions to the Einstein Hospital Israelita during a 3-year period within the last 10 years.

critical care dataset artificial intelligence intensive care unit machine learning tertiary healthcare data anonymization inpatients

Published: May 21, 2026. Version: 1.0.0


Database Restricted Access

EchoNext: A Dataset for Detecting Echocardiogram-Confirmed Structural Heart Disease from ECGs

Pierre Elias, Joshua Finer

EchoNext is a curated dataset of electrocardiograms (ECGs) paired with echocardiogram-confirmed structural heart disease labels, designed to support the development and validation of machine learning models.

clinical decision support artificial intelligence digital health structural heart disease electrocardiogram health equity ecg heart failure transthoracic echocardiogram ai model deployment valvular heart disease cardiovascular screening ai in healthcare left ventricular dysfunction deep learning population health aortic stenosis machine learning

Published: April 30, 2026. Version: 1.1.1


Challenge Credentialed Access

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

Gregory Holste, Mingquan Lin, Song Wang, et al.

CXR-LT 2024 was a challenge for long-tailed, multi-label, and zero-shot thorax disease classification on chest X-rays, held at MICCAI 2024. This page contains long-tailed labels for 45 diseases from the CXR-LT 2024 and 2023 challenges.

disease classification artificial intelligence chest x-ray computer-aided diagnosis long-tailed learning cardiopulmonary disease zero-shot learning deep learning

Published: March 19, 2025. Version: 2.0.0


Database Open Access

Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease

Sophia Klehs, Daniel Franke, Bayhas Alhamad, et al.

This annotated ECG database for paediatric and CHD patients features 12-lead and intracardiac recordings, supporting advanced diagnostic algorithms.

12-lead artificial intelligence arrhythmias chd intracardiac recordings annotated congenital heart disease ecg

Published: March 19, 2025. Version: 1.0.0


Database Open Access

Brno University of Technology Smartphone PPG Database (BUT PPG)

Andrea Nemcova, Radovan Smisek, Eniko Vargova, et al.

BUT PPG is a database created for evaluating PPG signal quality and heart rate estimation. The data comprises 3,888 10-second PPG recordings captured by smartphone, with associated ECG and ACC signals and annotations.

heart rate ppg artificial intelligence acc signal quality assessment annotations accelerometric data photoplethysmography electrocardiogram ecg

Published: Aug. 23, 2024. Version: 2.0.0


Database Credentialed Access

CORAL: expert-Curated medical Oncology Reports to Advance Language model inference

Madhumita Sushil, Vanessa Kennedy, Divneet Mandair, et al.

Medical oncology progress notes annotated with advanced, comprehensive oncology-relevant concepts and relationships.

information extraction artificial intelligence oncology natural language processing electronic health records large language models

Published: Feb. 7, 2024. Version: 1.0


Database Credentialed Access

BRAX, a Brazilian labeled chest X-ray dataset

Eduardo Pontes Reis, Joselisa Paiva, Maria Carolina Bueno da Silva, et al.

BRAX contains 24,959 chest radiography exams and 40,967 images acquired in a large general Brazilian hospital. All images have been read by trained radiologists and 14 labels were derived from Brazilian Portuguese reports using NLP.

chest x-ray dataset artificial intelligence

Published: June 17, 2022. Version: 1.1.0