Resources


Database Contributor Review

Salzburg Intensive Care database (SICdb), a freely accessible intensive care database

Niklas Rodemund, Andreas Kokoefer, Bernhard Wernly, Crispiana Cozowicz

The SICdb dataset, version 1.0.8 contains 27350 admissions to an ICU in an Austrian tertiary care institution.

clinical intensive care critical care open data machine learning

Published: Sept. 10, 2024. Version: 1.0.8


Database Open Access

A multi-camera and multimodal dataset for posture and gait analysis

Manuel Palermo, João Mendes Lopes, João André, Joao Cerqueira, Cristina Santos

Multimodal dataset with 166k samples for vision-based applications with a smart walker used in gait and posture rehabilitation. It is equipped with a pair of Depth cameras with data synchronized with an inertial MoCap system worn by the participant.

computer vision inertial motion capture smart walker human pose estimation gait and posture analysis depth rehabilitation deep learning

Published: Nov. 1, 2021. Version: 1.0.0


Database Open Access

Wide-field calcium imaging sleep state database

Eric Landsness, Xiaohui Zhang, Wei Chen, Hanyang Miao, Michelle Tang, Lindsey Brier, Mark Anastasio, Jin-Moo Lee, Joseph Culver

Wide-field calcium imaging database that consists of annotated sleep recording collected from transgenic mice at Washington University of St Louis School of Medicine.

sleep wide-field calcium imaging sleep state classification sleep staging machine learning

Published: March 17, 2022. Version: 1.0.1


Database Contributor Review

COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales

Álvaro Ritoré, Andreea M Oprescu, Alberto Estirado Bronchalo, Miguel Ángel Armengol de la Hoz

COVID Data for Shared Learning (CDSL) is a multimodal database comprising de-identified structured health data and radiological images from 4,479 patients with COVID-19, as a comprehensive toolkit for developing predictive models.

covid-19 multimodal database radiological images open data healthcare data machine learning and ai

Published: Oct. 25, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-IV

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Brian Gow, Benjamin Moody, Steven Horng, Leo Anthony Celi, Roger Mark

Large database of de-identified health information from patients admitted to Beth Israel Deaconess Medical Center

critical care intensive care unit machine learning mimic

Published: Oct. 11, 2024. Version: 3.1


Database Open Access

VTaC: A Benchmark Dataset of Ventricular Tachycardia Alarms from ICU Monitors

Li-wei Lehman, Benjamin Moody, Lucas McCullum, Hasan Saeed, Harsh Deep, Diane Perry, Tristan Struja, Qiao Li, Gari Clifford, Roger Mark

VTaC is an annotated ventricular tachycardia (VT) arrhythmia alarm database containing over 5,000 waveform recordings with VT alarms from ICU monitors, with each alarm labeled as either true or false by at least two human expert annotators.

arrhythmia icu false alarms benchmark dataset ventricular tachycardia machine learning

Published: Oct. 1, 2024. Version: 1.0

Visualize waveforms

Database Credentialed Access

Comprehensive Polysomnography (CPS) Dataset: A Resource for Sleep-Related Arousal Research

Stefan Kraft, Andreas Theissler, Vera Wienhausen-Wilke, Philipp Walter, Gjergji Kasneci

This dataset includes polysomnographic sleep recordings from a study on sleep-related arousal diagnostics, featuring raw and derived data channels, annotated event types, and questionnaire data.

polysomnography sleep disorders machine learning in healthcare sleep arousal diagnostics pulse wave analysis

Published: Sept. 18, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-ECG-Ext-ICD: Diagnostic labels for MIMIC-IV-ECG

Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp

Dataset that links ECG records from MIMIC-IV-ECG to ED discharge and hospital discharge diagnoses, which enables to train general ECG prediction models based on clinical labels and facilitates the retrieval of further clinical metadata from MIMIC-IV.

electrocardiography machine learning mimic

Published: Aug. 30, 2024. Version: 1.0.1


Database Credentialed Access

EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images

Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi

We present EHRXQA, the first multi-modal EHR QA dataset combining structured patient records with aligned chest X-ray images. EHRXQA contains a comprehensive set of QA pairs covering image-related, table-related, and image+table-related questions.

question answering benchmark evaluation visual question answering electronic health records multi-modal question answering deep learning ehr question answering semantic parsing machine learning chest x-ray

Published: July 23, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

computer vision chest x-rays natural language processing radiology machine learning mimic

Published: July 23, 2024. Version: 2.1.0