Resources


Database Open Access

CHB-MIT Scalp EEG Database

John Guttag

EEG recordings from pediatric subjects with intractable seizures, collected at the Children’s Hospital Boston.

medication seizure eeg neuroelectric

Published: June 9, 2010. Version: 1.0.0

Visualize waveforms

Software Open Access

De-Identification Software Package

The deid software package includes code and dictionaries for automated location and removal of protected health information (PHI) in free text from medical records.

phi deidentification anonymization

Published: Dec. 18, 2007. Version: 1.1


Software Open Access

CVSim

CVSim is a lumped-parameter model of the human cardiovascular system that has been developed and used for research and for teaching quantitative physiology courses at MIT and Harvard Medical School since 1984. The versions presented here have a grap…

cardiovascular simulation

Published: March 16, 2007. Version: 1.0.0


Database Open Access

Long-term Recordings of Gait Dynamics

Stride interval fluctuations were studied in ten young, healthy men. Participants had no history of any neuromuscular, respiratory or cardiovascular disorders, and were taking no medications. Mean age was 21.7 years (range: 18-29 years). Height was …

gait

Published: Aug. 16, 2001. Version: 1.0.0


Database Open Access

BIDMC Congestive Heart Failure Database

Long-term ECG recordings from 15 subjects with severe congestive heart failure.

congestive heart failure chf ecg

Published: Oct. 14, 2000. Version: 1.0.0

Visualize waveforms

Database Open Access

MIMIC Database

The MIMIC Database includes data recorded from over 90 ICU patients. The data in each case include signals and periodic measurements obtained from a bedside monitor as well as clinical data obtained from the patient's medical record. The recordi…

health record icu ehr critical care mimic

Published: March 15, 2000. Version: 1.0.0

Visualize waveforms

Database Open Access

Santa Fe Time Series Competition Data Set B

This is a multivariate data set recorded from a patient in the sleep laboratory of the Beth Israel Hospital (now the Beth Israel Deaconess Medical Center) in Boston, Massachusetts. This data set was extracted from record slp60 of the MIT-BIH Polysom…

sleep multiparameter

Published: Jan. 6, 2000. Version: 1.0.0


Database Open Access

MIT-BIH Normal Sinus Rhythm Database

Long-term ECG recordings of 18 subjects referred to the Arrhythmia Laboratory at Boston's Beth Israel Hospital.

sinus normal ecg

Published: Aug. 3, 1999. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

RadVLM Instruction Dataset

Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Sonia Laguna, Alain Ryser, Koji Fujimoto, Mizuho Nishio, Thomas Sutter, Julia Vogt, Jonas Kluckert, Thomas Frauenfelder, Christian Bluethgen, Farhad Nooralahzadeh, Michael Krauthammer

This dataset is designed to construct RadVLM, a vision–language model for chest X-ray interpretation. It includes instruction data for tasks such as report generation, abnormality detection, and region grounding, and multitask conversation.

chest x-rays vision-language models medical ai

Published: Sept. 25, 2025. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples

Zhenbang Wu, Anant Dadu, Mike Nalls, Faraz Faghri, Jimeng Sun

This dataset contains 450K open-ended instruction-following examples generated using GPT-3.5 based on the MIMIC-IV EHR database.

large language models medical question answering instruction tuning

Published: Sept. 9, 2025. Version: 1.0.0