Resources


Database Open Access

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Sebastian A Cajas, David Restrepo, Dana Moukheiber, Kuan Ting Kuo, Chenwei Wu, David Santiago Garcia Chicangana, Atika Rahman Paddo, Mira Moukheiber, Lama Moukheiber, Sulaiman Moukheiber, Saptarshi Purkayastha, Diego M Lopez, Po-Chih Kuo, Leo Anthony Celi

Multi-Modal Satellite imagery Dataset in Colombia: A public health analysis with spatiotemporally aligned satellite images and its corresponding metadata across 81 municipalities (2016-2018), facilitating multimodal AI applications.

multimodality satellite imagery

Published: Jan. 30, 2024. Version: 1.0.0


Database Restricted Access

mcPHASES: A Dataset of Physiological, Hormonal, and Self-reported Events and Symptoms for Menstrual Health Tracking with Wearables

Blue Lin, Jin Yi Li, Kaavya Kalani, Khai Truong, Alex Mariakakis

This initial version of the PHASES dataset includes multimodal menstrual health data—hormone levels, wearable sensor metrics, and self-reported symptoms—collected across two study intervals from 42 young adults.

wearables hormones menstrual health multimodal health health sensor data womens health

Published: Sept. 9, 2025. Version: 1.0.0


Database Open Access

Image-derived cardiomegaly biomarker values for 96K chest X-rays in MIMIC-CXR/MIMIC-CXR-JPG

Benjamin Duvieusart, Felix Krones, Guy Parsons, Lionel Tarassenko, Bartlomiej W Papiez, Adam Mahdi

Automatically extracted cardiomegaly biomarkers - cardiothoracic ratio (CTR) and cardiopulmonary area ratio (CPAR) - for all posterior-anterior chest x-ray scans in MIMIC-CXR/MIMIC-CXR-JPG.

biomarkers mimic-cxr cpar ctr cardiomegaly

Published: Aug. 23, 2024. Version: 1.0.0


Database Open Access

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Sebastian A Cajas, David Restrepo, Dana Moukheiber, Kuan Ting Kuo, Chenwei Wu, David Santiago Garcia Chicangana, Atika Rahman Paddo, Mira Moukheiber, Lama Moukheiber, Sulaiman Moukheiber, Saptarshi Purkayastha, Diego M Lopez, Po-Chih Kuo, Leo Anthony Celi

Multi-Modal Satellite imagery Dataset in Colombia: A public health analysis with spatiotemporally aligned satellite images and its corresponding metadata across 81 municipalities (2016-2018), facilitating multimodal AI applications.

multimodality satellite imagery

Published: Jan. 30, 2024. Version: 1.0.0


Database Credentialed Access

Chest ImaGenome Dataset

Joy Wu, Nkechinyere Agu, Ismini Lourentzou, Arjun Sharma, Joseph Paguio, Jasper Seth Yao, Edward Christopher Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo Anthony Celi, Tanveer Syeda-Mahmood, Mehdi Moradi

The Chest ImaGenome dataset is a scene graph dataset with additional chronological comparison relations for chest X-rays. It is automatically derived from the MIMIC-CXR dataset. A manually annotated gold standard is also available for 500 patients.

scene graph visual dialogue object detection semantic reasoning bounding box knowledge graph explainability reasoning relation extraction chest disease progression cxr chest x-ray radiology multimodal deep learning visual question answering machine learning

Published: July 13, 2021. Version: 1.0.0


Database Credentialed Access

Eye Gaze Data for Chest X-rays

Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Matthew Tong, Arjun Sharma, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth Krupinski, Mehdi Moradi

This dataset was a collected using an eye tracking system while a radiologist interpreted and read 1,083 public CXR images. The dataset contains the following aligned modalities: image, transcribed report text, dictation audio and eye gaze data.

convolutional network heatmap eye tracking explainability audio chest cxr chest x-ray radiology multimodal deep learning machine learning

Published: Sept. 12, 2020. Version: 1.0.0


Database Open Access

CGMacros: a scientific dataset for personalized nutrition and diet monitoring

Ricardo Gutierrez-Osuna, David Kerr, Bobak Mortazavi, Anurag Das

CGMacros contains information from two continuous glucose monitors (CGM), food macronutrients, food photographs, physical activity, and anonymized participant demographics, anthropometric measurements and health parameters.

diabetes continuous glucose monitors obesity postprandial glucose response food macronutrients metabolic models food photographs personalized nutrition machine learning

Published: Jan. 28, 2025. Version: 1.0.0


Database Credentialed Access

LLaVA-Rad MIMIC-CXR Annotations

Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu-Hsin Wei, Tristan Naumann, Muhao Chen, Matthew Lungren, Akshay Chaudhari, Serena Yeung, Curtis Langlotz, Sheng Wang, Hoifung Poon

This dataset provides GPT-4 extracted sections of radiology reports from MIMIC-CXR, complementing rule-based section extractions with additional reports with findings, and removing references to priors from findings.

Published: Jan. 24, 2025. Version: 1.0.0


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answers from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0


Database Open Access

Behavioral and autonomic dynamics during propofol-induced unconsciousness

Sandya Subramanian, Patrick Purdon, Riccardo Barbieri, Emery Brown

Multimodal point process indices for heart rate variability and electrodermal activity for 9 subjects who are undergoing a controlled propofol sedation experiment where the concentration was increased and then decreased in stages.

autonomic nervous system anesthesia propofol heart rate variability electrodermal activity

Published: July 30, 2021. Version: 1.0