Resources


Database Credentialed Access

Medical-Diff-VQA: A Large-Scale Medical Dataset for Difference Visual Question Answering on Chest X-Ray Images

Xinyue Hu, Lin Gu, Qiyuan An, Mengliang Zhang, liangchen liu, Kazuma Kobayashi, Tatsuya Harada, Ronald Summers, Yingying Zhu

MIMIC-Diff-VQA provides a large-scale dataset for Difference visual question answering in medical chest x-ray images.

chest x-ray visual question answering difference vqa vqa difference visual question answering

Published: Sept. 15, 2023. Version: 1.0.0


Database Credentialed Access

Radiology Report Expert Evaluation (ReXVal) Dataset

Feiyang Yu, Mark Endo, Rayan Krishnan, Ian Pan, Andy Tsai, Eduardo Pontes Reis, Eduardo Kaiser Ururahy Nunes Fonseca, Henrique Lee, Zahra Shakeri, Andrew Ng, Curtis Langlotz, Vasantha Kumar Venugopal, Pranav Rajpurkar

The Radiology Report Expert Evaluation (ReXVal) Dataset is a publicly available dataset of radiologist evaluations of errors in automatically generated radiology reports.

Published: June 20, 2023. Version: 1.0.0


Database Credentialed Access

Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization

Yanjun Gao, John Caskey, Timothy Miller, Brihat Sharma, Matthew Churpek, Dmitriy Dligach, Majid Afshar

We introduce a hierarchical annotation suite of tasks addressing clinical text understanding, reasoning and abstraction over evidence, and diagnosis summarization. One task is section tagging major section and the other task is diagnosis generation.

Published: Sept. 30, 2022. Version: 1.0.0


Database Credentialed Access

Learning to Ask Like a Physician: a Discharge Summary Clinical Questions (DiSCQ) Dataset

Eric Lehman

Dataset of questions asked by medical experts about patients. Medical experts will read a discharge summary line-by-line and (1) ask any question that they may have and (2) record what in the text "triggered" them to ask their question.

machine learning question generation question answering

Published: July 28, 2022. Version: 1.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 Chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Software Open Access

Model for Simulating ECG and PPG Signals with Arrhythmia Episodes

Andrius Sološenko, Andrius Petrėnas, Birutė Paliakaitė, Vaidotas Marozas, Leif Sörnmo

A model is capable of simulating sinus rhythm, atrial fibrillation and ectopic beats in ECGs and PPGs as well as extreme bradycardia and ventricular tachycardia in PPGs. Different types of noises and artifacts can also be added to the waveforms.

arrhythmia atrial fibrillation noise ppg tachycardia detection motion artifacts simulation bradycardia ecg

Published: May 2, 2022. Version: 1.3.1


Database Credentialed Access

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade

RuMedNLI is the full counterpart dataset of MedNLI in Russian language.

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0


Database Restricted Access

REFLACX: Reports and eye-tracking data for localization of abnormalities in chest x-rays

Ricardo Bigolin Lanfredi, Mingyuan Zhang, William Auffermann, Jessica Chan, Phuong-Anh Duong, Vivek Srikumar, Trafton Drew, Joyce Schroeder, Tolga Tasdizen

This dataset contains 3032 cases of eye-tracking data collected while five radiologists dictated reports for frontal chest x-rays, synchronized timestamped dictation transcription, and manual labels for validation of localization of abnormalities.

eye tracking radiology report computer vision machine learning chest x-rays radiology reflacx fixations gaze deep learning

Published: Sept. 27, 2021. Version: 1.0.0


Database Credentialed Access

Chest ImaGenome Dataset

Joy Wu, Nkechinyere Agu, Ismini Lourentzou, Arjun Sharma, Joseph Paguio, Jasper Seth Yao, Edward Christopher Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo Anthony Celi, Tanveer Syeda-Mahmood, Mehdi Moradi

The Chest ImaGenome dataset is a scene graph dataset with additional chronological comparison relations for chest X-rays. It is automatically derived from the MIMIC-CXR dataset. A manually annotated gold standard is also available for 500 patients.

multimodal machine learning chest x-ray radiology scene graph visual dialogue object detection semantic reasoning bounding box relation extraction knowledge graph explainability reasoning chest cxr visual question answering deep learning disease progression

Published: July 13, 2021. Version: 1.0.0


Database Credentialed Access

RadNLI: A natural language inference dataset for the radiology domain

Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz, Dan Jurafsky

A radiology NLI dataset introduced in the paper: Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Published: June 29, 2021. Version: 1.0.0