Resources


Database Open Access

Synthetic Mention Corpora for Disease Entity Recognition and Normalization

Kuleen Sasse, John David Osborne

We present the Synthetic Mention Corpora for Disease Entity Recognition and Normalization, containing 128000 disease mentions from the UMLS disorder group, generated by an LLM. This corpus aims to improve these tasks in biomedical and clinical texts.

nlp machine learning named entity recognition data augmentation entity normalization

Published: Feb. 3, 2025. Version: 1.0.0


Database Open Access

Synthetic Mention Corpora for Disease Entity Recognition and Normalization

Kuleen Sasse, John David Osborne

We present the Synthetic Mention Corpora for Disease Entity Recognition and Normalization, containing 128000 disease mentions from the UMLS disorder group, generated by an LLM. This corpus aims to improve these tasks in biomedical and clinical texts.

nlp machine learning named entity recognition data augmentation entity normalization

Published: Feb. 3, 2025. Version: 1.0.0


Database Open Access

Normal Sinus Rhythm RR Interval Database

Beat annotation files for 54 long-term ECG recordings of subjects in normal sinus rhythm.

sinus normal interbeat rr interval

Published: March 3, 2003. Version: 1.0.0

Visualize waveforms

Challenge Open Access

Is the normal heart rate chaotic?

George Moody

In its June 2008 issue, the editors of Chaos announced a new feature, "Controversial Topics in Nonlinear Dynamics". The first controversial topic to be aired is "Is the Normal Heart Rate Chaotic?". This project provides data to explore this question.

Published: Oct. 30, 2008. Version: 1.0.0


Database Open Access

Normal Sinus Rhythm RR Interval Database

Beat annotation files for 54 long-term ECG recordings of subjects in normal sinus rhythm.

sinus normal interbeat rr interval

Published: March 3, 2003. Version: 1.0.0

Visualize waveforms

Database Open Access

MIT-BIH Normal Sinus Rhythm Database

Long-term ECG recordings of 18 subjects referred to the Arrhythmia Laboratory at Boston's Beth Israel Hospital.

sinus normal ecg

Published: Aug. 3, 1999. Version: 1.0.0

Visualize waveforms

Database Restricted Access

CXRGraph: Using Information Extraction to Normalize the Training Data for Automatic Radiology Report Generation

Yuxiang Liao, Hoisang Heung, Hantao Liu, et al.

CXRGraph is a structured radiology report dataset built upon RadGraph and tailored for the Automatic Radiology Report Generation task. It can identify more task-relevant information such as abnormalities and hallucinated prior references.

relation extraction information extraction natural language processing named entity recognition structured radiology report

Published: Feb. 3, 2025. Version: 1.0.0


Database Open Access

Evoked Auditory Responses in Normals

Auditory Brainstem Response and Otoacoustic Emission recordings generated as part of a study examining evoked potentials and loudness growth.

loudness auditory neuroelectric

Published: Feb. 3, 2011. Version: 1.0.0

Visualize waveforms

Database Open Access

MIT-BIH Normal Sinus Rhythm Database

Long-term ECG recordings of 18 subjects referred to the Arrhythmia Laboratory at Boston's Beth Israel Hospital.

sinus normal ecg

Published: Aug. 3, 1999. Version: 1.0.0

Visualize waveforms

Database Open Access

Respiratory and Pulse Oximetry Waveforms from Healthy Adults During Simulated Apnoea Events

Jordan Hill, Ella Frances Sophia Guy, Jaimey Anne Clifton, et al.

This dataset contains airway pressure, flow and pulse oximetry waveforms from 20 healthy adults during simulated apnoea events, including arterial and venous PPG signals for developing and validating OSA detection and oxygenation models.

pulse oximetry respiratory obstructive sleep apnea

Published: March 4, 2026. Version: 1.0.0