Resources


Database Open Access

Heart and lung segmentations for MIMIC-CXR/MIMIC-CXR-JPG and Montgomery County TB databases

Benjamin Duvieusart, Felix Krones, Guy Parsons, Lionel Tarassenko, Bartlomiej W Papiez, Adam Mahdi

Heart and lung segmentations for 200 MIMIC-CXR/MIMIC-CXR-JPG chest X-rays and heart segmentations for 138 Montgomery County tuberculosis chest X-rays.

segmentation heart and lungs montgomery county tb mimic-cxr

Published: Aug. 14, 2023. Version: 1.0.0


Database Restricted Access

MIMIC-Eye: Integrating MIMIC Datasets with REFLACX and Eye Gaze for Multimodal Deep Learning Applications

Chihcheng Hsieh, Chun Ouyang, Jacinto C Nascimento, Joao Pereira, Joaquim Jorge, Catarina Moreira

Published: March 23, 2023. Version: 1.0.0


Database Credentialed Access

MS-CXR-T: Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Max Ilse, Daniel Coelho de Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anton Schwaighofer, Maria Teodora Wetscherek, Hannah Richardson, Tristan Naumann, Javier Alvarez Valle, Ozan Oktay

MS-CXR-T is a multimodal benchmark that extends the MIMIC-CXR v2 dataset with expert-verified annotations. Its goal is to evaluate biomedical vision-language processing models on the temporal semantics extracted from images and text.

multimodal chest x-ray radiology cxr disease progression vision-language processing

Published: March 17, 2023. Version: 1.0.0


Software Open Access

Waveform Database Software Package (WFDB) for Python

Chen Xie, Lucas McCullum, Alistair Johnson, Tom Pollard, Brian Gow, Benjamin Moody

Tools for working with waveforms in Python.

waveform wfdb python

Published: Jan. 24, 2023. Version: 4.1.0


Database Open Access

Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset

Shawn Tan, Satya Ortiz-Gagné, Nicolas Beaudoin-Gagnon, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

A dataset of continuous raw electrocardiogram (ECG) signals for representation learning, covering 11 thousand patients and 2 billion labelled beats.

representation learning ecg

Published: April 12, 2022. Version: 1.0


Database Credentialed Access

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade

RuMedNLI is the full Russian-language counterpart of the MedNLI dataset.

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0


Database Open Access

Pulse Transit Time PPG Dataset

Philip Mehrgardt, Matloob Khushi, Simon Poon, Anusha Withana

Time-synchronised multi-site PPG dataset for PTT, including sensors’ attachment pressures, temperatures, inertial data from accelerometer and gyroscope, annotated ECG data, blood pressures, and blood oxygen saturation levels (SpO2).

blood pressure finger attachment pressure ptt imu spo2 gyroscope attachment force pulse transit time ppg accelerometer ecg

Published: March 18, 2022. Version: 1.1.0


Database Open Access

Haaglanden Medisch Centrum sleep staging database

Diego Alvarez-Estevez, Roselyne Rijsman

A collection of 151 whole-night polysomnographic (PSG) sleep recordings from the Haaglanden Medisch Centrum (HMC, The Netherlands) sleep center, containing different traces of ExG activity and expert scoring of sleep stages.

edf inter-database generalization sleep staging psg deep learning

Published: March 18, 2022. Version: 1.1


Database Open Access

Wide-field calcium imaging sleep state database

Eric Landsness, Xiaohui Zhang, Wei Chen, Hanyang Miao, Michelle Tang, Lindsey Brier, Mark Anastasio, Jin-Moo Lee, Joseph Culver

Wide-field calcium imaging database consisting of annotated sleep recordings collected from transgenic mice at the Washington University in St. Louis School of Medicine.

sleep machine learning wide-field calcium imaging sleep state classification sleep staging

Published: March 17, 2022. Version: 1.0.1


Model Credentialed Access

EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain

Chen Lin, Steven Bethard, Guergana Savova, Timothy Miller, Dmitriy Dligach

Pretraining of models with a broad representation of biomedical terminology (PubMedBERT) on the MIMIC-III corpus, with or without a novel entity-centric masking strategy.

Published: March 17, 2022. Version: 1.0.1