Resources
Database Restricted Access
Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports
Daniel Reichenpfader, Sandro von Däniken, Harald Marcel Bonel
radiology mammography structured reporting bi-rads
Published: June 24, 2025. Version: 1.0.1
Database Open Access
Hillel Yaffe Glaucoma Dataset (HYGD): A Gold-Standard Annotated Fundus Dataset for Glaucoma Detection
Or Abramovich, Hadas Pizem, Jonathan Fhima, Eran Berkowitz, Ben Gofrit, Jan Van Eijgen, Eytan Blumenthal, Joachim Behar
ophthalmology retina dfi gold-standard gon fundus glaucoma
Published: June 3, 2025. Version: 1.0.0
Database Credentialed Access
Medical-CXR-VQA dataset: A Large-Scale LLM-Enhanced Medical Dataset for Visual Question Answering on Chest X-Ray Images
Xinyue Hu, Lin Gu, Kazuma Kobayashi, liangchen liu, Mengliang Zhang, Tatsuya Harada, Ronald Summers, Yingying Zhu
Published: Jan. 21, 2025. Version: 1.0.0
Database Credentialed Access
ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation
Oishi Banerjee, Hong-Yu Zhou, Subathra Adithan, Stephen Kwak, Kay Wu, Pranav Rajpurkar
chest x-rays reinforcement learning hallucination
Published: Aug. 14, 2024. Version: 1.0.0
Database Credentialed Access
RadGraph2: Tracking Findings Over Time in Radiology Reports
Adam Dejl, Sameer Khanna, Patricia Therese Pile, Kibo Yoon, Steven QH Truong, Hanh Duong, Agustina Saenz, Pranav Rajpurkar
chest x-rays relation extraction disease progression information extraction radiology reports named entity recognition
Published: Aug. 8, 2024. Version: 1.0.0
Database Credentialed Access
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi
question answering chest x-ray benchmark evaluation multi-modal question answering ehr question answering semantic parsing machine learning electronic health records deep learning visual question answering
Published: July 23, 2024. Version: 1.0.0
Model Credentialed Access
Me-LLaMA: Foundation Large Language Models for Medical Applications
Qianqian Xie, Qingyu Chen, Aokun Chen, Cheng Peng, Yan Hu, Fongci Lin, Xueqing Peng, Jimin Huang, Jeffrey Zhang, Vipina Keloth, Xinyu Zhou, Huan He, Lucila Ohno-Machado, Yonghui Wu, Hua Xu, Jiang Bian
Published: June 5, 2024. Version: 1.0.0
Model Credentialed Access
Asclepius-R : Clinical Large Language Model Built On MIMIC-III Discharge Summaries
Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi
clinical notes synthetic clinical notes synthetic notes asclepius open-source llm clinical llm large language model
Published: March 25, 2024. Version: 1.1.0
Software Open Access
Transformer-DeID: Deidentification of free-text clinical notes with transformers
Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson
deidentification neural networks transformers
Published: Nov. 2, 2023. Version: 1.0.0
Database Credentialed Access
ReFiSco: Report Fix and Score Dataset for Radiology Report Generation
Katherine Tian, Sina J Hartung, Andrew A Li, Jaehwan Jeong, Fardad Behzadi, Juan Calle-Toro, Subathra Adithan, Michael Pohlen, David Osayande, Pranav Rajpurkar
Published: Aug. 23, 2023. Version: 0.0