Database Open Access

MIMIC-III Waveform Database Matched Subset

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database Matched Subset (version 1.0). PhysioNet. https://doi.org/10.13026/c2294b.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database Matched Subset contains 22,317 waveform records, and 22,247 numerics records, for 10,282 distinct ICU patients. These recordings typically include digitized signals such as ECG, ABP, respiration, and PPG, as well as periodic measurements such as heart rate, oxygen saturation, and systolic, mean, and diastolic blood pressure.

This database is a subset of the MIMIC-III Waveform Database, representing those records for which the patient has been identified, and their corresponding clinical records are available in the MIMIC-III Clinical Database.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

An ICU bedside monitor collects a great deal of data, from which it is possible to infer something about a patient’s physiological state. However, in order to understand how these waveforms are influenced by disease state and treatment, and the extent to which phenomena observed in the waveform can serve as indicators of disease, it is necessary to look at the broader context: patient demographics, diagnoses, medications, lab tests, and other information that is recorded by caregivers in the electronic medical record.

Collecting this broad clinical context is the task of the MIMIC-III Clinical Database, which was created in parallel with the Waveform Database and contains information about many of the same patients. The Matched Subset consists of all of the waveform and numerics recordings for which the corresponding clinical record is also available.


Methods

The bedside monitors used for collecting this database were not directly linked to the hospital medical record system. The monitor could be configured to display the patient’s name and medical record number, for ease of identifying patients at the central station, but this was not automatically updated when a patient was admitted or transferred to the ICU. This information was only available when the ICU staff entered it manually into the monitoring system, and since entering this information was not critical to patient care, it was frequently omitted or incomplete. Furthermore, limitations of the data archiving software made it possible to identify the care unit from which a recording originated, but not the precise room or bed number.

As a result, only a subset of the waveform recordings actually contained enough information to reliably identify the patient, and of those, not all overlapped with the time period represented by the MIMIC-III Clinical Database [1]. Using all of the available information, through a process of mostly automated matching with some manual corrections, a total of 22,317 waveform records (34%) and 22,247 numerics records (35%) were found that could be linked to a corresponding patient in the Clinical Database.

For each of those records, a new WFDB header file was created, incorporating the subject ID as well as the surrogate date and time of the recording. Note that the raw signal files (such as 3314767_0004.dat and 3314767n.dat) and segment header files (such as 3314767_0004.hea) are identical to those in the original numbered records.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

All data associated with a particular patient have been placed into a single subdirectory, named according to the patient's MIMIC-III subject_ID. These subdirectories are further divided into ten intermediate-level directories (matched/p00 to matched/p09).

The name of each matched waveform record is of the form matched/pXX/pXXNNNN/pXXNNNN-YYYY-MM-DD-hh-mm, where XXNNNN is the matching MIMIC-III Clinical Database Subject_ID, and YYYY, MM, DD, hh, and mm are the surrogate year, month (01-12), and day (01-31), and the real hour (00-23) and minute (00-59), derived from the starting date and time of day of the record. The surrogate dates match those of the corresponding MIMIC-III Clinical Database records.

In most cases, the waveform record is paired with a numerics record, which has the same name as the associated waveform record, with an n added to the end.

Frequently there are multiple waveform and numerics record pairs associated with a given clinical record; all of them will appear in the same subdirectory in such a case, and their names will indicate their chronologic sequence. For example, MIMIC-III Clinical Database record p000079 has been matched with two waveform and numerics record pairs, named:

  • p000079-2175-09-26-01-25 and p000079-2175-09-26-01-25n
  • p000079-2175-09-26-12-28 and p000079-2175-09-26-12-28n

Each mimic3wdb/matched record is also an undated mimic3wdb record (i.e., it also belongs to the full MIMIC-III Waveform Database). Only the surrogate-dated mimic3wdb/matched header (.hea) files are unique to the Matched Subset; the others, with names of the form 3*.hea and 3*.dat, are copies of the like-named files in the full database.


Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory p04 contains all records with names that begin with p04 (patients with a subject_id between 40000 and 49999.)
  • All files associated with patient 44083 are contained within the directory p04/p044083. This directory contains two waveform records (p044083-2112-05-04-19-50 and p044083-2112-05-23-12-22) and two corresponding numerics records (p044083-2112-05-04-19-50n and p044083-2112-05-23-12-22n), recorded from two separate ICU stays.
  • The master waveform header file for the first stay (p044083-2112-05-04-19-50.hea) indicates that the record is 20342033 sample intervals (about 45 hours) in length, and begins at 19:50 on May 4, 2112. This date, as with all dates in MIMIC-III, has been anonymized by shifting it by a random number of days into the future. See header(5) in the WFDB Applications Guide for more information about the format of this file.
  • This waveform record consists of 41 segments (3314767_0001 through to 3314767_0041), as indicated by the master header file. The layout header file (3314767_layout.hea) indicates that four ECG signals (II, AVR, V, and MCL) were recorded, along with a respiration signal, photoplethysmogram, and arterial blood pressure. Not all of these signals are available simultaneously.
  • The header file for segment number 4 (3314767_0004.hea) shows us that during this segment, five signals are available: three ECG leads (II, V, and AVR), a respiration signal (RESP), and a PPG signal (PLETH).
  • The numerics header file (p044083-2112-05-04-19-50n.hea) shows us that a variety of measurements were recorded, including heart rate, invasive and non-invasive blood pressure, respiratory rate, ST segment elevation, oxygen saturation, and cardiac rhythm statistics. Just as with waveforms, not all of these measurements are available at all times.

Referring to the MIMIC-III Clinical Database Demo, we can see from the PATIENTS table that this patient was male, and his anonymized date of birth was November 15, 2057 (making him 54 years old at the time of this ICU stay):

subject_id gender dob dod
44083 M 2057-11-15 00:00:00 2114-02-20 00:00:00

The ICUSTAYS table shows us that he was admitted once to the SICU and twice to the CCU:

subject_id hadm_id icustay_id first_careunit intime outtime
44083 125157 265615 SICU 2112-05-04 19:03:39 2112-05-06 17:21:01
44083 131048 282640 CCU 2112-05-23 12:32:06 2112-05-25 14:59:50
44083 198330 286428 CCU 2112-05-29 02:01:33 2112-06-01 16:50:40

The first of these admissions corresponds to the waveform record above, as indicated by the date (2112-05-04). Note that the starting and ending date and time of the waveform record will not always match the precise admission or discharge time.

The hadm_id (125157) and icustay_id (265615) are linked to other tables in MIMIC-III that provide further information about this particular ICU stay, such as vital signs, laboratory tests, medications, and diagnoses.


Release Notes

This database is a subset of version 1.0 of the MIMIC-III Waveform Database. It also represents a superset of the records in the previously-released MIMIC-II Waveform Database Matched Subset. However, it uses a different directory structure (see Data Description above), as well as different subject IDs and surrogate dates. This version corresponds to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database Matched Subset was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2294b

DOI (latest version):
https://doi.org/10.13026/v217-zr73

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 2.4 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/p02
Name Size Modified
Parent Directory
p020013
p020018
p020060
p020062
p020066
p020071
p020101
p020115
p020124
p020128
p020129
p020132
p020172
p020181
p020190
p020199
p020238
p020242
p020246
p020263
p020265
p020268
p020303
p020312
p020316
p020324
p020326
p020327
p020345
p020354
p020372
p020375
p020389
p020403
p020407
p020410
p020448
p020450
p020459
p020460
p020471
p020474
p020476
p020479
p020486
p020545
p020546
p020564
p020575
p020577
p020582
p020589
p020598
p020612
p020620
p020624
p020643
p020658
p020677
p020678
p020679
p020689
p020704
p020705
p020726
p020742
p020748
p020763
p020766
p020789
p020794
p020795
p020801
p020836
p020839
p020840
p020846
p020848
p020856
p020858
p020860
p020865
p020900
p020908
p020919
p020922
p020923
p020929
p020931
p020936
p020940
p020966
p020968
p020984
p020986
p020990
p021002
p021011
p021013
p021015
p021025
p021030
p021048
p021050
p021071
p021072
p021081
p021088
p021090
p021093
p021108
p021115
p021123
p021138
p021139
p021148
p021150
p021151
p021152
p021155
p021156
p021161
p021162
p021179
p021187
p021192
p021195
p021202
p021219
p021242
p021244
p021247
p021258
p021265
p021270
p021271
p021275
p021284
p021305
p021306
p021308
p021317
p021318
p021321
p021323
p021328
p021334
p021349
p021373
p021397
p021416
p021419
p021431
p021438
p021443
p021444
p021447
p021448
p021449
p021460
p021481
p021483
p021484
p021496
p021504
p021507
p021514
p021517
p021521
p021538
p021543
p021548
p021559
p021561
p021570
p021575
p021580
p021584
p021630
p021642
p021663
p021666
p021667
p021673
p021683
p021688
p021706
p021709
p021710
p021712
p021730
p021734
p021737
p021739
p021747
p021766
p021769
p021771
p021773
p021775
p021786
p021792
p021797
p021805
p021809
p021811
p021817
p021819
p021845
p021857
p021860
p021873
p021876
p021900
p021901
p021910
p021920
p021939
p021954
p021965
p021968
p021974
p021975
p021986
p022017
p022018
p022034
p022039
p022049
p022068
p022071
p022077
p022104
p022114
p022118
p022120
p022122
p022130
p022134
p022138
p022140
p022152
p022156
p022159
p022165
p022180
p022181
p022200
p022207
p022218
p022221
p022225
p022231
p022234
p022241
p022242
p022264
p022266
p022281
p022285
p022289
p022298
p022303
p022304
p022306
p022310
p022316
p022322
p022326
p022335
p022336
p022337
p022339
p022348
p022354
p022364
p022365
p022373
p022383
p022384
p022389
p022393
p022401
p022414
p022418
p022423
p022429
p022432
p022438
p022442
p022450
p022461
p022462
p022464
p022466
p022491
p022495
p022496
p022499
p022505
p022508
p022537
p022550
p022557
p022565
p022577
p022584
p022585
p022588
p022600
p022603
p022606
p022616
p022642
p022648
p022657
p022664
p022669
p022673
p022687
p022714
p022718
p022722
p022731
p022735
p022752
p022766
p022769
p022771
p022774
p022782
p022788
p022791
p022795
p022801
p022804
p022809
p022817
p022836
p022859
p022862
p022879
p022880
p022888
p022904
p022908
p022918
p022921
p022930
p022932
p022933
p022936
p022937
p022942
p022954
p022956
p022960
p022961
p022962
p022980
p022983
p023001
p023015
p023020
p023028
p023030
p023034
p023038
p023042
p023047
p023048
p023060
p023061
p023065
p023085
p023091
p023097
p023100
p023105
p023120
p023126
p023130
p023132
p023150
p023154
p023162
p023178
p023180
p023193
p023197
p023200
p023201
p023238
p023264
p023270
p023291
p023292
p023298
p023299
p023318
p023321
p023324
p023325
p023336
p023339
p023344
p023351
p023363
p023364
p023368
p023371
p023380
p023384
p023390
p023401
p023413
p023440
p023448
p023450
p023451
p023452
p023456
p023459
p023468
p023469
p023470
p023474
p023475
p023503
p023510
p023529
p023539
p023550
p023552
p023568
p023575
p023577
p023578
p023580
p023582
p023584
p023590
p023591
p023594
p023599
p023603
p023613
p023617
p023619
p023620
p023626
p023627
p023637
p023642
p023652
p023657
p023666
p023673
p023674
p023675
p023677
p023678
p023687
p023693
p023696
p023707
p023749
p023761
p023762
p023771
p023778
p023780
p023782
p023787
p023790
p023811
p023824
p023826
p023847
p023869
p023874
p023875
p023876
p023885
p023888
p023890
p023893
p023895
p023907
p023913
p023922
p023929
p023933
p023934
p023944
p023959
p024004
p024007
p024018
p024029
p024030
p024042
p024063
p024064
p024076
p024084
p024099
p024123
p024129
p024133
p024142
p024152
p024157
p024177
p024185
p024218
p024227
p024228
p024232
p024238
p024242
p024244
p024271
p024276
p024281
p024282
p024283
p024289
p024308
p024320
p024327
p024355
p024357
p024387
p024411
p024417
p024431
p024438
p024443
p024446
p024447
p024455
p024457
p024460
p024461
p024475
p024477
p024508
p024514
p024532
p024547
p024548
p024552
p024556
p024559
p024560
p024562
p024567
p024569
p024573
p024591
p024597
p024605
p024609
p024622
p024626
p024646
p024656
p024666
p024690
p024693
p024711
p024730
p024743
p024746
p024748
p024792
p024793
p024799
p024804
p024807
p024810
p024822
p024825
p024828
p024856
p024860
p024865
p024876
p024897
p024899
p024902
p024922
p024923
p024924
p024925
p024927
p024938
p024942
p024949
p024958
p024967
p024975
p024979
p024984
p024986
p025006
p025016
p025017
p025024
p025030
p025039
p025049
p025058
p025073
p025081
p025104
p025107
p025111
p025115
p025116
p025117
p025131
p025140
p025141
p025167
p025168
p025171
p025174
p025178
p025189
p025197
p025203
p025206
p025207
p025222
p025225
p025228
p025229
p025234
p025255
p025271
p025284
p025297
p025299
p025304
p025313
p025317
p025318
p025326
p025328
p025329
p025332
p025354
p025356
p025367
p025372
p025373
p025400
p025404
p025428
p025429
p025446
p025452
p025466
p025471
p025505
p025506
p025522
p025528
p025553
p025557
p025574
p025575
p025581
p025585
p025602
p025603
p025610
p025621
p025627
p025630
p025635
p025658
p025659
p025662
p025664
p025668
p025679
p025699
p025708
p025724
p025725
p025729
p025741
p025757
p025759
p025770
p025772
p025800
p025835
p025851
p025857
p025858
p025860
p025862
p025882
p025886
p025915
p025916
p025939
p025949
p025954
p025987
p025988
p025989
p026018
p026024
p026027
p026031
p026037
p026039
p026043
p026054
p026055
p026063
p026069
p026079
p026085
p026087
p026094
p026097
p026105
p026109
p026133
p026134
p026136
p026151
p026156
p026161
p026192
p026211
p026212
p026219
p026221
p026228
p026233
p026256
p026267
p026270
p026271
p026274
p026277
p026282
p026285
p026288
p026296
p026300
p026303
p026306
p026318
p026324
p026325
p026351
p026356
p026377
p026380
p026381
p026382
p026391
p026395
p026398
p026399
p026406
p026421
p026435
p026446
p026459
p026467
p026469
p026472
p026480
p026494
p026502
p026504
p026506
p026511
p026519
p026523
p026560
p026568
p026575
p026576
p026579
p026594
p026628
p026632
p026637
p026639
p026661
p026673
p026688
p026693
p026695
p026698
p026705
p026709
p026710
p026711
p026712
p026714
p026715
p026732
p026734
p026737
p026747
p026759
p026761
p026769
p026771
p026781
p026827
p026837
p026845
p026863
p026868
p026872
p026879
p026884
p026893
p026897
p026901
p026905
p026925
p026926
p026930
p026964
p026978
p026990
p026996
p027002
p027022
p027026
p027060
p027077
p027083
p027084
p027102
p027106
p027119
p027132
p027147
p027148
p027155
p027162
p027172
p027177
p027185
p027192
p027193
p027194
p027195
p027197
p027200
p027202
p027210
p027212
p027213
p027215
p027221
p027223
p027232
p027235
p027237
p027241
p027245
p027266
p027282
p027321
p027326
p027329
p027337
p027338
p027343
p027351
p027355
p027367
p027372
p027374
p027379
p027398
p027423
p027425
p027428
p027429
p027434
p027436
p027439
p027441
p027446
p027456
p027463
p027464
p027486
p027504
p027530
p027539
p027540
p027542
p027551
p027554
p027555
p027577
p027584
p027585
p027599
p027616
p027636
p027638
p027639
p027643
p027648
p027661
p027677
p027687
p027689
p027691
p027695
p027696
p027697
p027708
p027710
p027778
p027791
p027793
p027796
p027799
p027800
p027801
p027823
p027829
p027833
p027842
p027845
p027850
p027860
p027861
p027884
p027887
p027890
p027891
p027905
p027910
p027925
p027927
p027953
p027961
p027962
p027969
p027971
p027981
p028019
p028037
p028039
p028044
p028048
p028052
p028061
p028065
p028073
p028075
p028077
p028079
p028083
p028085
p028089
p028093
p028094
p028095
p028110
p028149
p028166
p028170
p028172
p028180
p028187
p028189
p028221
p028260
p028270
p028281
p028291
p028294
p028331
p028338
p028339
p028340
p028354
p028364
p028365
p028386
p028416
p028419
p028423
p028443
p028460
p028461
p028496
p028499
p028505
p028507
p028508
p028510
p028511
p028514
p028525
p028530
p028531
p028536
p028541
p028587
p028594
p028611
p028616
p028625
p028627
p028628
p028629
p028644
p028654
p028660
p028671
p028676
p028684
p028698
p028702
p028706
p028707
p028721
p028727
p028729
p028753
p028758
p028762
p028765
p028772
p028774
p028775
p028777
p028785
p028789
p028806
p028808
p028813
p028827
p028865
p028868
p028869
p028875
p028880
p028882
p028883
p028887
p028897
p028900
p028901
p028902
p028903
p028905
p028909
p028910
p028911
p028920
p028927
p028930
p028941
p028955
p028961
p029005
p029007
p029027
p029035
p029043
p029049
p029057
p029066
p029073
p029093
p029100
p029102
p029106
p029116
p029120
p029125
p029127
p029131
p029133
p029137
p029148
p029164
p029167
p029191
p029199
p029215
p029216
p029251
p029262
p029270
p029299
p029300
p029336
p029337
p029343
p029358
p029377
p029378
p029388
p029411
p029426
p029463
p029466
p029468
p029470
p029477
p029478
p029493
p029503
p029507
p029509
p029511
p029512
p029527
p029529
p029530
p029541
p029544
p029553
p029556
p029569
p029570
p029573
p029576
p029581
p029619
p029620
p029622
p029629
p029638
p029660
p029664
p029678
p029697
p029712
p029730
p029767
p029769
p029770
p029799
p029826
p029829
p029840
p029861
p029862
p029866
p029869
p029871
p029872
p029875
p029878
p029884
p029937
p029946
p029949
p029961
p029967
p029968
p029969
p029972
p029999