Resources


Database Open Access

PhysioZoo - mammalian NSR databases

Ori Shemla, Joachim Behar

PhysioZoo is a collaborative platform dedicated to the study of the heart rate variability in electrophysiological recordings from mammals

heart rate variabillity electrophysiology mammals ecg

Published: Aug. 27, 2019. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 Chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Challenge Credentialed Access

Analysis of Clinical Text: Task 14 of SemEval 2015

Guergana Savova

This is the dataset for SemEval 2014 and 2015, Analysis of Clinical Text

nlp semeval

Published: Dec. 28, 2014. Version: 2.0


Database Open Access

MIMIC-IV Clinical Database Demo

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

An openly available subset of patients in the MIMIC-IV database.

mimic critical care electronic health record

Published: June 22, 2022. Version: 1.0


Database Credentialed Access

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade

RuMedNLI is the full counterpart dataset of MedNLI in Russian language.

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0


Database Restricted Access

Gout Emergency Department Chief Complaint Corpora

John David Osborne, Tobias O'Leary, Amy Mudano, James Booth, Giovanna Rosas, Gurusai Sujitha Peramsetty, Anthony Knighton, Jeff Foster, Ken Saag, Maria Ioana Danila

A corpus of chief complaints tagged with predicted gout flare status and chart reviewed gout flare status. Ideal for input to masked language model training to supplement lengthy clinical text notes.

emergency department gout nlp

Published: Oct. 19, 2020. Version: 1.0


Database Open Access

MIMIC-III Clinical Database Demo

Alistair Johnson, Tom Pollard, Roger Mark

An open source demo of the MIMIC-III Clinical Database

mimic critical care electronic health records

Published: April 24, 2019. Version: 1.4


Software Open Access

De-Identification Software Package

The deid software package includes code and dictionaries for automated location and removal of protected health information (PHI) in free text from medical records.

anonymization deidentification phi

Published: Dec. 18, 2007. Version: 1.1


Software Open Access

plt - Software for 2D Plots

plt is a non-interactive plotting utility originally written for Unix by Paul Albrecht. plt can produce publication-quality 2D plots in PostScript from easily-produced text or binary data files, and can also create screen plots under the X Window Sy…

multiparameter visualization

Published: Nov. 7, 2002. Version: 2.5


Database Open Access

Santa Fe Time Series Competition Data Set B

This is a multivariate data set recorded from a patient in the sleep laboratory of the Beth Israel Hospital (now the Beth Israel Deaconess Medical Center) in Boston, Massachusetts. This data set was extracted from record slp60 of the MIT-BIH Polysom…

multiparameter sleep

Published: Jan. 6, 2000. Version: 1.0.0