Resources


Database Open Access

A Wearable Exam Stress Dataset for Predicting Cognitive Performance in Real-World Settings

Computational Medicine, Md Rafiul Amin, Dilranjan Wickramasuriya, Rose T Faghih

The data contains electrodermal activity, heart rate, blood volume pulse, skin surface temperature, interbeat interval and accelerometer data recorded during three exam sessions (midterm 1, midterm 2 and finals), along with the corresponding grades.

Published: May 26, 2022. Version: 1.0.0


Database Restricted Access

Hospitalized patients with heart failure: integrating electronic healthcare records and external outcome data

Zhongheng Zhang, Linghong Cao, Yan Zhao, Ziyin Xu, Rangui Chen, Lukai Lv, Ping Xu

The new version adds beta blockers to the dat_md.csv file. The dataset comprises hospital-level data on patients admitted with heart failure to Zigong Fourth People’s Hospital, Sichuan, China, between 2016 and 2019.

heart failure china electronic health record

Published: May 22, 2022. Version: 1.3


Database Restricted Access

KINECAL

Sean Maudsley-Barton, Moi Hoon Yap

A dataset for balance falls-risk assessment and balance impairment analysis

posturography postural sway balance age-related changes clinical tests falls-risk

Published: May 16, 2022. Version: 1.0.0


Database Credentialed Access

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay

MS-CXR is a new dataset containing 1162 chest X-ray bounding box labels paired with radiology text descriptions, annotated and verified by two board-certified radiologists.

chest x-ray vision-language processing

Published: May 16, 2022. Version: 0.1


Database Credentialed Access

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

exams natural language processing tertiary care prescriptions clinical notes

Published: May 13, 2022. Version: 1.0


Database Open Access

The CirCor DigiScope Phonocardiogram Dataset

Jorge Oliveira, Francesco Renna, Paulo Costa, Marcelo Nogueira, Ana Cristina Oliveira, Andoni Elola, Carlos Ferreira, Alipio Jorge, Ali Bahrami Rad, Matthew Reyna, Reza Sameni, Gari Clifford, Miguel Coimbra

A large collection of multi-location heart sound signals, with 5272 records collected from 1568 subjects. Heart murmurs have been annotated by a human annotator based on their time, shape, pitch, grading, quality, location and location intensity.

signal processing murmur pitch george b moody physionet challenge 2022 murmur grading murmur location murmur timing phonocardiogram pregnant murmur shape pediatric murmur detection murmur intensity murmur quality

Published: May 10, 2022. Version: 1.0.3


Software Open Access

Model for Simulating ECG and PPG Signals with Arrhythmia Episodes

Andrius Sološenko, Andrius Petrėnas, Birutė Paliakaitė, Vaidotas Marozas, Leif Sörnmo

A model capable of simulating sinus rhythm, atrial fibrillation and ectopic beats in ECGs and PPGs, as well as extreme bradycardia and ventricular tachycardia in PPGs. Different types of noise and artifacts can also be added to the waveforms.

arrhythmia atrial fibrillation noise tachycardia detection motion artifacts ppg simulation bradycardia ecg

Published: May 2, 2022. Version: 1.3.1


Database Credentialed Access

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Jayetri Bardhan, Anthony Colas, Kirk Roberts, Daisy Zhe Wang

DrugEHRQA is a QA dataset containing question-answer pairs drawn from MIMIC-III tables and discharge summaries.

question-answer qa

Published: April 12, 2022. Version: 1.0.0


Database Open Access

Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset

Shawn Tan, Satya Ortiz-Gagné, Nicolas Beaudoin-Gagnon, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

This is a dataset of continuous raw electrocardiogram (ECG) signals for representation learning, covering 11 thousand patients and 2 billion labelled beats.

representation learning ecg

Published: April 12, 2022. Version: 1.0


Database Credentialed Access

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade

RuMedNLI is the full Russian-language counterpart of the MedNLI dataset.

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0