Resources


Database Credentialed Access

MS-CXR-T: Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Max Ilse, Daniel Coelho de Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anton Schwaighofer, Maria Teodora Wetscherek, Hannah Richardson, Tristan Naumann, Javier Alvarez Valle, Ozan Oktay

The MS-CXR-T is a multimodal benchmark that enhances the MIMIC-CXR v2 dataset by including expert-verified annotations. Its goal is to evaluate biomedical visual-language processing models in terms of temporal semantics extracted from image and text.

multimodal chest x-ray radiology cxr disease progression vision-language processing

Published: March 17, 2023. Version: 1.0.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 Chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Software Open Access

FECGSYN Toolbox

The FECGSYN toolbox is a reference open-source platform for NI-FECG research, product of a collaboration between the Department of Engineering Science, University of Oxford (DES-OX), the Institute of Biomedical Engineering, TU Dresden (IBMT-TUD) and…

fetal ecg

Published: Nov. 4, 2014. Version: 1.0.0


Model Credentialed Access

Asclepius-R : Clinical Large Language Model Built On MIMIC-III Discharge Summaries

Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi

Asclepius: Publicly Available Clinical Large Language Models with Synthetic Clinical Notes Asclepius-R: A instruction-finetuned large language model with MIMIC-III clinical notes

clinical notes large language model synthetic clinical notes synthetic notes asclepius open-source llm clinical llm

Published: March 25, 2024. Version: 1.1.0


Database Credentialed Access

ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection

Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee Sung, Joel Reisman, Wenjun Li, Robert Kerns, William Becker, Hong Yu

Opioid-related aberrant behaviors (ORABs) detection Dataset (ODD) which is a large-size, expert-annotated, and multi-label classification benchmark dataset corresponding to the task

natural language processing substance use opioid related aberrant behavior

Published: Jan. 11, 2024. Version: 1.0.0


Software Open Access

Transformer-DeID: Deidentification of free-text clinical notes with transformers

Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson

Fine tune transformer-based neural networks to deidentify clinical text data.

deidentification neural networks transformers

Published: Nov. 2, 2023. Version: 1.0.0


Database Open Access

Integration of Electroencephalogram and Eye-Gaze Datasets for Performance Evaluation in Fundamentals of Laparoscopic Surgery (FLS) Tasks

Somayeh B Shafiei, Saeed Shadpour

Brain activity and eye gaze data were collected from a group of 25 participants who completed the FLS tasks using a trainer box (Pyxus®). Each participant performed the tasks five times, and their performance was evaluated by an expert rater.

Published: Aug. 23, 2023. Version: 1.0.0

Visualize waveforms

Database Open Access

Electroencephalogram and eye-gaze datasets for robot-assisted surgery performance evaluation

Somayeh B Shafiei, Saeed Shadpour, James Mohler, Mehdi Seilanian Toussi, Philippa Doherty, Zhe Jing

The brain activity and eye gaze data were recorded from 25 participants performing surgical tasks using a robot simulator. The performance score was created by the simulator. Data can be used to evaluate surgical performance.

Published: July 14, 2023. Version: 1.0.0

Visualize waveforms

Model Credentialed Access

EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain

Chen Lin, Steven Bethard, Guergana Savova, Timothy Miller, Dmitriy Dligach

Pretraining of models with a broad representation of biomedical terminology (PubMedBERT) on MIMIC-III corpus along with or without a novel entity-centric masking strategy.

Published: March 17, 2022. Version: 1.0.1


Database Open Access

Brno University of Technology ECG Signal Database with Annotations of P Wave (BUT PDB)

Lucie Maršánová, Andrea Nemcova, Radovan Smisek, Lukas Smital, Martin Vitek

BUT PDB is an ECG signal database with marked peaks of P waves created for the development, and objective comparison of P wave detection algorithms. The database consists of 50 2-minute 2-lead ECG signal records with various types of pathology.

p wave ecg

Published: Jan. 19, 2021. Version: 1.0.0

Visualize waveforms