Database Restricted Access

Dataset for Segmentation and Classification of Cardiac Implantable Electronic Devices in Chest X-Rays

Keno Bressem Felix Busch Andrei Zhukov Lisa Adams

Published: March 4, 2025. Version: 1.0.0


When using this resource, please cite: (show more options)
Bressem, K., Busch, F., Zhukov, A., & Adams, L. (2025). Dataset for Segmentation and Classification of Cardiac Implantable Electronic Devices in Chest X-Rays (version 1.0.0). PhysioNet. https://doi.org/10.13026/swv0-cd11.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

This dataset encompasses a comprehensive collection of 2,321 chest radiographs (originally DICOM, converted to PNG) and 11,072 smartphone images documenting various cardiac implantable electronic devices (CIEDs) such as implantable pacemakers, cardioverter defibrillators, cardiac resynchronization therapy devices, and cardiac monitors, collected from 896 patients at the Charité - Universitätsmedizin Berlin over a decade (January 2012 to January 2022). The dataset's primary objective is to advance the development and validation of automated techniques for precise segmentation and classification of CIEDs across both traditional and smartphone-generated imagery. It includes a diverse array of 27 CIED models from four manufacturers (Medtronic, Biotronic, Boston Scientific, Abbott Laboratories including St. Jude Medical), captured in anterior-posterior or posterior-anterior chest radiographs using five different smartphone brands. This dataset aims to underpin the creation of deep learning models for accurately identifying CIED types, manufacturers, and models. Designed to be a robust foundation for research and clinical applications, it facilitates the exploration of innovative machine learning solutions for CIED identification leveraging both original radiogrpahs and smartphone images, thereby addressing a critical need in cardiac care technology.


Background

Cardiac implantable electronic devices (CIEDs), which include implantable cardiac monitors (ICMs), pacemakers (PMs), and implantable cardioverter defibrillators (ICDs), are critical diagnostic and therapeutic tools used in the management of a variety of cardiac conditions. These devices are essential for electrical modulation, prevention of sudden cardiac death, and continuous cardiac monitoring [1]. As cardiovascular disease continues to impact global health, the need for CIEDs is increasing [2]. This increase underscores the importance of accurately identifying CIEDs in clinical scenarios, particularly in emergency settings where patient-provided information might be lacking.

While traditional identification algorithms have been somewhat effective, they often require labor-intensive and time-consuming manual interpretation. Artificial intelligence (AI) methods, particularly convolutional neural networks (CNNs), have shown promise in improving the speed and accuracy of this identification process [3, 4]. However, a major limitation has been the lack of publicly available, high-quality datasets and algorithms, which limits the external validation, transparency, and reproducibility of these methods [5, 6]. This gap is particularly glaring when it comes to identifying not only the manufacturer, but also the specific CIED model.

In response to these challenges, this dataset was curated with the goal of advancing the field by providing the first publicly available resource for developing deep learning models for accurate CIED segmentation and classification. The dataset includes both original images converted to PNG and smartphone images, serving the dual purpose of validating the algorithms in traditional settings and extending their applicability to point-of-care scenarios via smartphone applications.


Methods

Inclusion Criteria:

The study ran from January 2012 to January 2022, and included adult patients (≥18 years) with documented CIEDs on chest radiographs. Informed consent was not required. Eligible images included those from bedside anterior-posterior or standing posterior-anterior chest radiography.

Data Collection:

A comprehensive database search was performed to identify patients who met the eligibility criteria. Images from the same patient taken at different time points were included. Exclusion criteria were images with sagittal beam path, insufficient quality, incomplete device acquisition, or multiple devices of the same type on an image. Two reviewers independently extracted CIED manufacturer and model classifications for each DICOM image from the associated medical reports, with discrepancies resolved by consensus-based discussion. Rare types of CIEDs were classified as other.

Data Pre-processing:

The original DICOM images were anonymized and converted to PNG format. This removed all sensitive header information. Anonymization was then performed in the following steps.

First, rare CEID types were not explicitly classified, but grouped into a different category. This avoids identification of patients by their rare CEID type. Second, center cropping was used to remove potentially burned-in patient information, which is most often found in the corners of the image. Third, an object character recognition (OCR) algorithm was used to black out any image information remaining after center cropping. Finally, each image was again manually inspected for any remaining information, which was then manually removed.

Each of the converted images was then photographed up to five times with one of five different smartphones at different angles from a monitor display.

After re-matching the original and smartphone images using the patient ID, this ID was replaced with a new randomly generated ID. Finally, the patient ID, examination ID, and all other identifiers were deleted to prevent re-identification of the patients, even by the authors.

For segmentation model training, ground truth annotations were manually generated by two board-certified radiologists with 7-9 years of experience in image processing using 3D Slicer software.


Data Description

The dataset consists of a total of 13,393 images, radiographs converted to PNG and smartphone photographs. The images capture a range of CIEDs, including pacemakers, implantable cardioverter defibrillators, cardiac resynchronization therapy devices, and implantable cardiac monitors.

Image Types:

  • Chest radiographs: These are high-resolution medical images obtained from the Imaging Department of Charité - University Medicine Berlin. These images serve as ground truth for CIED manufacturer and model identification..

  • Smartphone photos: A variety of smartphones were used to capture these images from different angles. The breakdown of smartphone types is as follows:

    • iPhone 13 mini: N=695 (5.2%)
    • Samsung Galaxy S4: N=1,675 (12.5%)
    • iPhone 8: N=2,890 (21.5%)
    • iPhone 13: N=2,902 (21.6%)
    • Samsung Galaxy S5: N=2,910 (21.7%)

Annotations:

Ground-truth annotations for CIED segmentation were manually created on each image by board-certified radiologists. These annotations serve as a reference standard for the evaluation of the segmentation algorithms. They are available as PNG in the `seg_masks` folder. In these images a pixel value of 0 represents the background, 1 represents a pacemaker or defibrillator and 2 represents an event monitor.

Variables and Labels:

  • Manufacturer: The manufacturer of the CIED is labeled, providing the ability to train and validate manufacturer-specific algorithms.

  • Device Model: Each CIED is labeled with its specific model, facilitating more granular analysis and classification. For details on the distribution of device types and models, please refer to the accompanying table.

File Format:

  • Anonymized original images are provided in PNG format.
  • Smartphone photos are also available in PNG format.

The table below presents an overview of the various devices included in the study. Bolded rows provide a summary of the total number of devices from each vendor.

Manufacturer/ device

Total (N)

Training dataset (N)

Validation dataset (N)

Test dataset (N)

Device category

Medtronic

1218

849

240

129

Attesta DR MRI SureScan

263

186

51

26

Dual chamber PM

Mirro MRI VR SureScan

237

168

45

24

Single chamber ICD

Reveal LINQ

94

69

16

9

ICM

Serena Quad CRT-P MRI SureScan

77

52

17

8

CRT-P

Compia MRI Quad CRT-D SureScan

51

36

9

6

CRT-D

Ensura DR MRI SureScan

39

22

10

7

Dual chamber PM

Reveal XT

31

19

7

5

ICM

Q20 SR MRI SureScan

342

238

71

33

Single chamber PM

Other

84

59

14

11

Biotronik

517

373

77

67

Enticos 4 DR

111

76

26

9

Dual chamber PM

Enticos 4 SR

48

38

6

4

Single chamber PM

Enitra 8 HF-T QP

79

60

10

9

CRT-P

Ilivia 7 VR-T

34

24

5

5

Single chamber ICD

Ilivia 7 HF-T QP

22

17

3

2

CRT-D

Ecuro SR-T

29

20

4

5

Single chamber ICD

Rivacor 7 HF-T QP

25

17

3

5

CRT-D

Rivacor 5 VR-T DX

34

21

5

8

Single chamber ICD

Etrinsa 8 HF-T

22

15

2

5

CRT-P

Enitra 6 DR-T

20

15

3

2

Dual chamber PM

Other

93

70

10

13

Abbott Laboratories (including St. Jude Medical)

401

279

78

44

Confirm Rx DM3500

137

90

32

15

ICM

Ellipse VR

97

64

23

10

Single chamber ICD

Quadra Allure MP

84

63

12

9

CRT-P

Quadra Assura MP

39

28

5

6

CRT-D

SJM Confirm

26

20

5

1

ICM

Other

18

14

1

3

Boston Scientific

185

124

35

26

Charisma X4 CRT-D

64

44

12

8

CRT-D

Valitude X4 CRT-P

32

19

7

6

CRP-P

Inogen EL ICD VR

27

19

3

5

Single chamber ICD

Origen Mini ICD

23

14

4

5

Single chamber ICD

Other

39

28

9

2

Other devices total

234

171

34

29

Data Structure:

The data is arranged as follows (please also refer to the README.md).

Directory Structure
.
├── README.md  # Data description
├── galaxys4   # Folder with smartphone photos
│   ├── fe40d960c739bbbd3002d1759155bad92d3818b5ec88e459117e64c6df2a0fb2.png
│   ├── fe5b9ad8b3a0046eacc334578a49cabe0837d813a9b8f588538132045c38a2be.png
│   ├── ff76fce90f157c2cdf328e5e012698c26c563b1443a37156e24ac1f7be4c7c76.png
│   ├── ff88824115f5e1f3d237fc6769bca905e8fcf1a89765e1fffdbf0b6aeb1dd89a.png
|   .
│   └── ffeb08d7f946d32c9c7ff6a6235c012b38887b634e246fddfd2e19f9b92d6613.png
├── galaxys5
├── iphone13
├── iphone13mini
├── iphone8
├── originals  # original images
├── seg_masks  # segmentation masks
└── data.json  # meta data, containing labels and IDs.  

Metadata

data.json Contains metadata for the images, such as device type and manufacturer.
You can read them with pandas, which should give you the following dataframe:

df = pd.read_json("data.json")
df.head()
pat_id exam_id fname dataset manufacturer model seg_mask
19920cfafb68... a733519257f3... 8911e52a3fc7... galaxys4 Biotronik Enticos 4 DR 8911e52a3fc9...
19920cfafb68... a733519257f3... 1ca664a0ac53... iphone8 Biotronik Enticos 4 DR
19920cfafb68... a733519257f3.. 51b2c95e84b5... iphone8 Biotronik Enticos 4 DR
19920cfafb68... a733519257f3... baa1e73a3dc0... iphone8 Biotronik Enticos 4 DR

pat_id: Anonymized patient ID. Use this to split your data, as a patient can have multiple images
fname: The filename of the image
dataset: Either the smartphone type, the photo was taken from or originals. The levels are 'galaxys4', 'iphone8', 'originals', 'galaxys5', 'iphone13' or 'iphone13mini'.
manufacturer: The CIED manufacturer with levels 'Biotronik', 'Medtronic', 'St. Jude Medical' or 'Boston Scientific'
model: The exact CIED model (28 classes, including the 'other' category). The classes are: Attesta DR MRI SureScan, Charisma X4 CRT-D, Compia MRI Quad CRT-D SureScan, Confirm Rx DM3500, Ecuro SR, Ellipse VR, Enitra 6 DR-T, Enitra 8 HF-T QP, Ensura DR MRI SureScan, Enticos 4 DR, Enticos 4 SR, Etrinsa 8 HF-T, Ilivia 7 HF-T QP, Ilivia 7 VR-T, Inogen EL ICD VR, Mirro MRI VR SureScan, Origen Mini ICD, Other, Q20 SR MRI SureScan, Quadra Allure MP, Quadra Assura, Reveal LINQ, Reveal XT, Rivacor 5 VR-T DX, Rivacor 7 HF-T QP, SJM Confirm, Serena Quad CRT-P MRI ScureScan and Valitude X4 CRT-P .
seg_mask: For certain images, segmentation masks are provided, stored within the seg_masks directory. It's important to note that when viewed in an image viewer, the masks might seem entirely black. This appearance is attributed to the pixel values assigned: 0 represents the background, 1 indicates a CIED (pacemaker or defibrillator), and 2 signifies a heart monitor. Given that PNG images are 8-bit, the contrast between pixel values of 0 and 2 is minimal, rendering the masks seemingly black. However, upon accessing these masks using numpy, the distinct labels assigned to each pixel become apparent.


Usage Notes

This dataset is designed to support multiple research applications in medical image analysis and artificial intelligence. Researchers can utilize it to develop and validate deep learning models for automated CIED identification in both clinical and point-of-care settings. The dataset enables three primary applications: device segmentation in chest radiographs, manufacturer classification, and specific model identification. The inclusion of smartphone images alongside traditional radiographs allows for the development of transfer learning approaches and mobile applications for bedside device identification.

Dataset limitations include the single-center origin of the data, which may affect generalizability across different healthcare settings. The dataset covers devices from four major manufacturers but excludes some smaller vendors. Additionally, the smartphone images were captured from monitor displays rather than printed films, which may influence image quality characteristics. To preserve patient anonymity, some rare device types are grouped under the "other" category, which may limit training for these specific models.

For practical implementation, we provide example code and model architectures at our GitHub repository [7]. This includes complete training pipelines for both segmentation and classification tasks, preprocessing scripts for both radiographs and smartphone images, and evaluation metrics for model performance assessment.


Ethics

The institutional review board-approved this retrospective single-center study (EA4/042/20). The need for informed consent was waived as the data was collected retrospectively and was anonymized.


Conflicts of Interest

None


References

  1. Goldenberg I, Huang DT, Nielsen JC. The role of implantable cardioverter-defibrillators and sudden cardiac death prevention: indications, device selection, and outcome. Eur Heart J. 2020;41(21):2003-11.
  2. Vaduganathan M, Mensah GA, Turco JV, Fuster V, Roth GA. The global burden of cardiovascular diseases and risk: a compass for future health. J Am Coll Cardiol. 2022;80(25):2361-71.
  3. Howard JP, Fisher L, Shun-Shin MJ, Keene D, Arnold AD, Ahmad Y, et al. Cardiac rhythm device identification using neural networks. JACC Clin Electrophysiol. 2019;5(5):576-86.
  4. Kim UH, Kim MY, Park EA, Lee W, Lim WH, Kim HL, et al. Deep learning-based algorithm for the detection and characterization of MRI safety of cardiac implantable electronic devices on chest radiographs. Korean J Radiol. 2021;22(11):1918.
  5. Lauzier PT, Gomes DG, Weng W, Sadek MM, Zakutney T, Bernier ML, Birnie D. Detection and identification of cardiac implanted electronic devices in a large data set of chest radiographs using semi-supervised artificial intelligence methods. Heart Rhythm. 2023;20(4):642-3.
  6. Weinreich M, Weinreich B, Chudow JJ, Krumerman T, Rahgozar K, Nag T, et al. Computer-aided detection and identification of implanted cardiac devices on chest radiography utilizing deep convolutional neural networks, a form of machine learning. J Am Coll Cardiol. 2019;73(9S1):307.
  7. Cardiac Device Identification [Internet]. GitHub; 2024. Available from: github.com/AndreasZhukov/cied [Accessed 2024 Feb 3].

Share
Access

Access Policy:
Only registered users who sign the specified data use agreement can access the files.

License (for files):
PhysioNet Restricted Health Data License 1.5.0

Data Use Agreement:
PhysioNet Restricted Health Data Use Agreement 1.5.0

Corresponding Author
You must be logged in to view the contact information.

Files