Resources


Software Open Access

Transformer-DeID: Deidentification of free-text clinical notes with transformers

Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson

Fine tune transformer-based neural networks to deidentify clinical text data.

deidentification neural networks transformers

Published: Nov. 2, 2023. Version: 1.0.0


Software Open Access

Transformer-DeID: Deidentification of free-text clinical notes with transformers

Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson

Fine tune transformer-based neural networks to deidentify clinical text data.

deidentification neural networks transformers

Published: Nov. 2, 2023. Version: 1.0.0


Model Credentialed Access

Transformer models trained on MIMIC-III to generate synthetic patient notes

Ali Amin-Nejad, Julia Ive, Sumithra Velupillai

Machine learning models that have been trained using MIMIC-III to enable the creation of synthetic discharge summaries.

Published: May 27, 2020. Version: 1.0.0


Database Open Access

MIMIC-IV demo data in the OMOP Common Data Model

Michael Kallfelz, Anna Tsvetkova, Tom Pollard, Manlik Kwong, Gigi Lipori, Vojtech Huser, Jeffrey Osborn, Sicheng Hao, Andrew Williams

Preliminary work to transform a MIMIC-IV demo dataset to the OMOP Common Data Model

omop common data model

Published: June 21, 2021. Version: 0.9


Database Credentialed Access

RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports

Sarvesh Soni, Kirk Roberts

RadQA is an electronic health record question answering dataset containing clinical questions that can be answered using the Findings and Impressions sections of radiology reports

electronic health records clinical notes question answering radiology reports machine reading comprehension

Published: Dec. 9, 2022. Version: 1.0.0


Model Credentialed Access

EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain

Chen Lin, Steven Bethard, Guergana Savova, Timothy Miller, Dmitriy Dligach

Pretraining of models with a broad representation of biomedical terminology (PubMedBERT) on MIMIC-III corpus along with or without a novel entity-centric masking strategy.

Published: March 17, 2022. Version: 1.0.1


Database Restricted Access

KURIAS-ECG: a 12-lead electrocardiogram database with standardized diagnosis ontology

Hakje Yoo, Yunjin Yum, Soowan Park, Jeong Moon Lee, Moonjoung Jang, Yoojoong Kim, Jong-Ho Kim, Hyun-Joon Park, Kap Su Han, Jae Hyoung Park, Hyung Joon Joo

The KURIAS-ECG database is a high-quality 12-lead ECG DB including standard vocabulary (SNOMED CT, OMOP-CDM), and ECG diagnoses of our DB are grouped into 10 diagnoses by applying the minnesota code.

snomed 12-lead minnesota ecg

Published: Nov. 8, 2021. Version: 1.0


Software Open Access

Apnea Detection from the ECG

Hilbert Transform based Sleep Apnea Detection using a Single Lead Electrocardiogram.

apnea sleep ecg

Published: Feb. 4, 2002. Version: 1.0.0