Resources


Database Credentialed Access

Phenotype Annotations for Patient Notes in the MIMIC-III Database

Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt

Clinical notes, annotated by at least two expert annotators for over ten patient phenotypes, including advanced cancer, substance abuse, and treatment non-adherence.

patient classification natural language processing

Published: March 5, 2020. Version: 1.20.03


Software Open Access

Waveform Database Software Package (WFDB) for Python

Chen Xie, Lucas McCullum, Alistair Johnson, Tom Pollard, Brian Gow, Benjamin Moody

Tools for working with waveforms in Python.

waveform wfdb python

Published: Jan. 24, 2023. Version: 4.1.0


Database Open Access

EPHNOGRAM: A Simultaneous Electrocardiogram and Phonocardiogram Database

Arsalan Kazemnejad, Peiman Gordany, Reza Sameni

An open-access database recorded during the EPHNOGRAM project, consisting of simultaneous electrocardiogram (ECG) and phonocardiogram (PCG) recordings from young healthy adults, during stress-test experiments.

stress-test electrocardiogram phonocardiogram

Published: June 11, 2021. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

Curated Data for Describing Blood Glucose Management in the Intensive Care Unit

Aldo Robles Arévalo, Roselyn Mateo-Collado, Leo Anthony Celi

The data subsets consist of time series files that includes all the curated entries of glucose readings and insulin inputs from MIMIC-III database.

insulin replacement therapy glycemic control critical care

Published: April 19, 2021. Version: 1.0.1


Database Credentialed Access

MedNLI for Shared Task at ACL BioNLP 2019

Chaitanya Shivade

Data for the MedNLI Shared Task at the 2019 ACL BioNLP 2019 Workshop on Biomedical Language Processing

mimic natural language inference recognizing textual entailment

Published: Nov. 28, 2019. Version: 1.0.1


Challenge Contributor Review

BioNLP Workshop 2023 Shared Task 1A: Problem List Summarization

Yanjun Gao, Timothy Miller, Majid Afshar, Dmitriy Dligach

This is the data storage for BioNLP Workshop Shared Task 1A: Problem List Summarization.

clinical natural language processing bionlp electronic health record summarization

Published: Jan. 19, 2023. Version: 1.0.0


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

deep learning pressure injury risk prediction acute kidney injury anemia forecasting natural language processing

Published: Sept. 15, 2022. Version: 1.0


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions exams tertiary care natural language processing clinical notes

Published: July 14, 2022. Version: 1.1


Database Open Access

The CirCor DigiScope Phonocardiogram Dataset

Jorge Oliveira, Francesco Renna, Paulo Costa, Marcelo Nogueira, Ana Cristina Oliveira, Andoni Elola, Carlos Ferreira, Alipio Jorge, Ali Bahrami Rad, Matthew Reyna, Reza Sameni, Gari Clifford, Miguel Coimbra

A large collection of multi-location heart sound signals, with 5272 records collected from 1568 subjects. Heart murmurs have been annotated by a human annotator based on their time, shape, pitch, grading, quality, location and location intensity.

signal processing murmur pitch george b moody physionet challenge 2022 murmur grading murmur location murmur timing phonocardiogram pregnant murmur shape pediatric murmur detection murmur intensity murmur quality

Published: May 10, 2022. Version: 1.0.3

Visualize waveforms

Software Open Access

Heart Vector Origin Point Detection and Time-Coherent Median Beat Construction

Erick Andres Perez Alday, Larisa Tereshchenko

The algorithm finds the heart vector origin point and constructs the time-coherent median beat. VCG origin point is defined as the electrically quiet or isoelectric state of the heart when the heart vector does not move in 3D space.

baseline vectorcardiogram origin point heart vector signal processing electrocardiogram

Published: May 25, 2021. Version: 1.0.0