Resources


Database Credentialed Access

Chest ImaGenome Dataset

Joy Wu, Nkechinyere Agu, Ismini Lourentzou, Arjun Sharma, Joseph Paguio, Jasper Seth Yao, Edward Christopher Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo Anthony Celi, Tanveer Syeda-Mahmood, Mehdi Moradi

The Chest ImaGenome dataset is a scene graph dataset with additional chronological comparison relations for chest X-rays. It is automatically derived from the MIMIC-CXR dataset. A manually annotated gold standard is also available for 500 patients.

machine learning multimodal radiology chest x-ray scene graph visual question answering visual dialogue object detection semantic reasoning bounding box relation extraction knowledge graph explainability reasoning chest cxr deep learning disease progression

Published: July 13, 2021. Version: 1.0.0


Database Credentialed Access

Eye Gaze Data for Chest X-rays

Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Matthew Tong, Arjun Sharma, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth Krupinski, Mehdi Moradi

This dataset was a collected using an eye tracking system while a radiologist interpreted and read 1,083 public CXR images. The dataset contains the following aligned modalities: image, transcribed report text, dictation audio and eye gaze data.

audio convolutional network heatmap eye tracking machine learning multimodal radiology chest x-ray explainability chest cxr deep learning

Published: Sept. 12, 2020. Version: 1.0.0


Database Credentialed Access

MS-CXR-T: Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Max Ilse, Daniel Coelho de Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anton Schwaighofer, Maria Teodora Wetscherek, Hannah Richardson, Tristan Naumann, Javier Alvarez Valle, Ozan Oktay

The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.

multimodal radiology chest x-ray cxr disease progression vision-language processing

Published: March 17, 2023. Version: 1.0.0


Database Credentialed Access

Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project

Nicholas Kuo, Simon Finfer, Louisa Jorm, Sebastiano Barbieri

This repository hosts the original Health Gym datasets of Acute Hypotension and Sepsis

sepsis machine learning acute hypotension reinforcement learning synthetic dataset generative modelling wasserstein generative adversarial network

Published: Feb. 23, 2022. Version: 1.0.0


Database Open Access

A multi-camera and multimodal dataset for posture and gait analysis

Manuel Palermo, João Mendes Lopes, João André, Joao Cerqueira, Cristina Santos

Multimodal dataset with 166k samples for vision-based applications with a smart walker used in gait and posture rehabilitation. It is equipped with a pair of Depth cameras with data synchronized with an inertial MoCap system worn by the participant.

computer vision inertial motion capture smart walker human pose estimation gait and posture analysis depth rehabilitation deep learning

Published: Nov. 1, 2021. Version: 1.0.0


Database Open Access

In-Gauge and En-Gage: Understanding Occupants' Behaviour, Engagement, Emotion, and Comfort Indoors with Heterogeneous Sensors and Wearables

Nan Gao, Max Marschall, Jane Burry, Simon Watkins, Flora Salim

The project aims to understand occupants’ behaviour, engagement, emotion, and comfort indoors with heterogeneous sensors and wearables.

heart rate electrodermal activity environmental sensing thermal comfort modelling physiological signals occupant behaviour sensing emotion sensing human behavoural modelling smart building

Published: Feb. 13, 2023. Version: 1.0.0


Database Credentialed Access

MIMIC-III - SequenceExamples for TensorFlow modeling

Jonas Kemp, Kun Zhang, Andrew Dai

MIMIC-III data converted into TensorFlow SequenceExample format, for use in modeling pipelines.

tensorflow sequence modeling machine learning deep learning

Published: Sept. 29, 2020. Version: 1.0.0


Database Open Access

Wide-field calcium imaging sleep state database

Eric Landsness, Xiaohui Zhang, Wei Chen, Hanyang Miao, Michelle Tang, Lindsey Brier, Mark Anastasio, Jin-Moo Lee, Joseph Culver

Wide-field calcium imaging database that consists of annotated sleep recording collected from transgenic mice at Washington University of St Louis School of Medicine.

sleep machine learning wide-field calcium imaging sleep state classification sleep staging

Published: March 17, 2022. Version: 1.0.1


Database Credentialed Access

Learning to Ask Like a Physician: a Discharge Summary Clinical Questions (DiSCQ) Dataset

Eric Lehman

Dataset of questions asked by medical experts about patients. Medical experts will read a discharge summary line-by-line and (1) ask any question that they may have and (2) record what in the text "triggered" them to ask their question.

machine learning question generation question answering

Published: July 28, 2022. Version: 1.0


Database Credentialed Access

Chest X-ray Dataset with Lung Segmentation

Wimukthi Indeewara, Mahela Hennayake, Kasun Rathnayake, Thanuja Ambegoda, Dulani Meedeniya

CXLSeg dataset: Chest X-ray with Lung Segmentation, a comparatively large dataset of segmented Chest X-ray radiographs based on the MIMIC-CXR dataset. This contains segmentation results of 243,324 frontal view images and corresponding masks.

segmentation chest x-ray medical reports mimic-cxr u-net chest radiographs

Published: Feb. 8, 2023. Version: 1.0.0