Resources


Database Restricted Access

Electrocardiogram-Capable Smartwatches: Assessing Their Clinical Accuracy and Application

Joaquin Recas, Mauro Buelga Suárez, Sergio González-Cabeza, Mario Sanz-Guerrero, Marian Diaz-Vicente, Alfonso Rebolleda, Luis Piñuel Moreno, Gonzalo Luis Alonso Salinas

This database allows the study of the feasibility of using ECG-capable smartwatches as diagnostic tools, focusing on their compliance with clinical standards and their ability to measure critical ECG parameters beyond AF detection.

ischemia fitbit sense ambulatory samsung galaxy watch st-segment apple watch withings scanwatch smartwatch

Published: April 9, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-III-Ext-tPatchGNN

Chenlong Yin, Weijia Zhang

The processed MIMIC-III dataset for the benchmark of Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks Approach.

Published: April 9, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext-CEKG: A Process-Oriented Dataset Derived from MIMIC-IV for Enhanced Clinical Insights

Milad Naeimaei Aali, Felix Mannhardt, Pieter Jelle Toussaint

The MIMIC-IV-Ext-CEKG dataset is crafted for object-centric process mining in healthcare, specifically to create clinical event knowledge graphs for patients with multimorbidity, as well as for data mining and machine learning tasks.

mimic process mining multi entity process mining object centric event log clinical event knowledge graph

Published: April 8, 2025. Version: 1.0.0


Challenge Credentialed Access

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

Gregory Holste, Mingquan Lin, Song Wang, Yiliang Zhou, Yishu Wei, Hao Chen, Atlas Wang, Yifan Peng

CXR-LT 2024 was a challenge for long-tailed, multi-label, and zero-shot thorax disease classification on chest X-rays, held at MICCAI 2024. This page contains long-tailed labels for 45 diseases from the CXR-LT 2024 and 2023 challenges.

disease classification artificial intelligence chest x-ray deep learning computer-aided diagnosis long-tailed learning cardiopulmonary disease zero-shot learning

Published: March 19, 2025. Version: 2.0.0


Database Credentialed Access

MIMIC-IV-Ext Triage Instruction Corpus

Qingyang Shen, Quan Guo

MIMIC-IV-Ext Triage Instruction Corpus includes 9,629 ED triage cases organized by the five-level ESI, enabling LLMs to improve triage accuracy. It provides CSV data, generation prompts, expert validation samples, and SQL QC scripts.

nlp clinical decision support large language models machine learning emergency severity index emergency triage

Published: March 4, 2025. Version: 1.0.0


Database Restricted Access

OpenOximetry Repository

Nicholas Fong, Michael Lipnick, Philip Bickler, John Feiner, Tyler Law

A repository of matched arterial oxygen and pulse oximeter readings obtained under controlled conditions, with high-frequency physiologic waveforms and skin color measurements.

Published: Feb. 28, 2025. Version: 1.1.1


Database Credentialed Access

MIMIC-IV-Ext-BHC: Labeled Clinical Notes Dataset for Hospital Course Summarization

Asad Aali, Dave Van Veen, Yamin Arefeen, Jason Hom, Christian Bluethgen, Eduardo Pontes Reis, Sergios Gatidis, Namuun Clifford, Joseph Daws, Arash Tehrani, Jangwon Kim, Akshay Chaudhari

This dataset presents a collection of preprocessed and labeled clinical notes derived from "MIMIC-IV-Note", and aims to facilitate the development of ML models focused on summarizing brief hospital courses (BHC) from clinical notes.

natural language processing clinical notes brief hospital course text summarization machine learning

Published: Feb. 3, 2025. Version: 1.2.0


Database Open Access

Synthetic Mention Corpora for Disease Entity Recognition and Normalization

Kuleen Sasse, John David Osborne

We present the Synthetic Mention Corpora for Disease Entity Recognition and Normalization, containing 128000 disease mentions from the UMLS disorder group, generated by an LLM. This corpus aims to improve these tasks in biomedical and clinical texts.

nlp named entity recognition machine learning data augmentation entity normalization

Published: Feb. 3, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV on FHIR

Alex Bennett, Joshua Wiedekopf, Hannes Ulrich, Philip van Damme, Piotr Szul, John Grimes, Alistair Johnson

MIMIC-IV and MIMIC-IV-ED data mapped into FHIR resources.

mimic-iv fhir electronic health record us core fast healthcare interoperability resources mimic

Published: Nov. 12, 2024. Version: 2.1


Database Credentialed Access

C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text

Oliver Bear Don't Walk IV, Adrienne Pichon, Harry Reyes Nieva, Tony Sun, Jaan Lı, Joshua Winston Joseph, Sivan Kinberg, Lauren R Richter, Salvatore Crusco, Kyle Kulas, Shaan Ahmed, Daniel Snyder, Ashkon Rahbari, Benjamin Ranard, Pallavi Juneja, Dina Demner-Fushman, Noemie Elhadad

Two sets of gold-standard annotations for race and ethnicity information from clinical notes in MIMIC-III. Contains race and ethnicity label assignments and related information such as country of origin and spoken language.

clinical notes patient country information race and ethnicity patient language information

Published: Oct. 21, 2024. Version: 1.0.0