Resources


Database Credentialed Access

CAD-Chest: Comprehensive Annotation of Diseases based on MIMIC-CXR Radiology Report

Mengliang Zhang, Xinyue Hu, Lin Gu, Tatsuya Harada, Kazuma Kobayashi, Ronald Summers, Yingying Zhu

The CAD-Chest dataset provides comprehensive annotations of disease, including disease severity, uncertainty, and location based on the MIMIC-CXR radiologist reports.

chest x-ray disease label

Published: Dec. 8, 2023. Version: 1.0


Database Credentialed Access

Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database

Marco Guevara, Shan Chen, Spencer Thomas, Danielle Bitterman

Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database notes.

natural language processing social determinants of health

Published: Nov. 24, 2023. Version: 1.0.0


Database Open Access

Simulated Obstructive Disease Respiratory Pressure and Flow

Jaimey Anne Clifton, Ella Frances Sophia Guy, Trudy Caljé-van der Klei, Jennifer Knopp, James Geoffrey Chase

A pressure, flow, and volume dataset collected using a modular device to simulate the effects of obstructive pulmonary disease in healthy people. Twenty healthy subjects were included in this dataset.

Published: Nov. 13, 2023. Version: 1.0.0


Database Open Access

ScientISST MOVE: Annotated Wearable Multimodal Biosignals recorded during Everyday Life Activities in Naturalistic Environments

João Areias Saraiva, Mariana Abreu, Ana Sofia Carmo, Hugo Plácido da Silva, Ana Fred

Multimodal (ECG, EMG, EDA, PPG, TEMP, ACC) biosignal dataset of everyday activities. Created with 3 wearable devices based on ScientISST Sense and Empatica E4.

multimodal wearable run uncontrolled environments jump greet lift walk gesticulate

Published: Nov. 13, 2023. Version: 1.0.0


Database Open Access

Respiratory dataset from PEEP study with expiratory occlusion

Ella Frances Sophia Guy, Jaimey Anne Clifton, Trudy Caljé-van der Klei, Rongqing Chen, Jennifer Knopp, Knut Moeller, James Geoffrey Chase

A pressure, flow, volume, dynamic circumference, and EIT-assessed aeration dataset from resting breathing with REO at increasing CPAP PEEP settings. Vapers, asthmatics, smokers, and otherwise healthy people were included in the trial.

Published: Nov. 10, 2023. Version: 1.0.0


Database Credentialed Access

BOLD, a blood-gas and oximetry linked dataset

João Matos, Tristan Struja, Jack Gallifant, Luis Filipe Nakayama, Marie Charpignon, Xiaoli Liu, Jaime dos Santos Cardoso, Leo Anthony Celi, An Kwok Wong

An open-source pulse oximetry and arterial blood gas dataset, derived from MIMIC-III, MIMIC-IV, and eICU-CRD.

electronic health records health equity pulse oximetry intensive care unit

Published: Nov. 8, 2023. Version: 1.0


Database Credentialed Access

INSPIRE, a publicly available research dataset for perioperative medicine

Hyung-Chul Lee, Leerang Lim

A public dataset that contains information related to surgery, anesthesia, laboratory results, medications, diagnosis, and outcomes from 50% of the patients who received surgery at Seoul National University Hospital between 2011 and 2020.

perioperative medicine surgery multi-center open dataset

Published: Nov. 3, 2023. Version: 1.1


Database Credentialed Access

CHIFIR: Cytology and Histopathology Invasive Fungal Infection Reports

Vlada Rozova, Anna Khanina, Jasmine Teng, Joanne Teh, Leon Worth, Monica Slavin, Karin Thursky, Karin Verspoor

A corpus of cytology and histopathology reports annotated for terminology relevant to fungal infections. Ideal for validation of named entity recognition and relation extraction methods.

nlp clinical documentation information extraction invasive fungal infections

Published: Nov. 2, 2023. Version: 1.0.1


Database Contributor Review

CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools

Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger

CARMEN-I is a Spanish corpus of 2,000 clinical records from Hospital Clínic, Barcelona. It covers COVID-19 patients and comorbidities, and serves as a resource for training clinical NLP models and for researchers applying NLP to clinical documents.

de-identification anonymization clinical ner

Published: Nov. 2, 2023. Version: 1.0


Database Open Access

Induced Cesarean EHG DataSet (ICEHG DS): An open dataset with electrohysterogram records of pregnancies ending in induced and cesarean section delivery

Franc Jager

The design and development of ICEHG DS was funded by the Slovenian Research Agency (ARRS) under the research project Metabolic and inborn factors of reproductive health, birth III.

neuroelectric pregnancy electrohysterogram cesarean-section delivery induced delivery

Published: Oct. 8, 2023. Version: 1.0.1
