Database Open Access

MIMIC-III Waveform Database Matched Subset

Benjamin Moody George Moody Mauricio Villarroel Gari Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G., & Silva, I. (2020). MIMIC-III Waveform Database Matched Subset (version 1.0). PhysioNet. https://doi.org/10.13026/c2294b.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database Matched Subset contains 22,317 waveform records, and 22,247 numerics records, for 10,282 distinct ICU patients. These recordings typically include digitized signals such as ECG, ABP, respiration, and PPG, as well as periodic measurements such as heart rate, oxygen saturation, and systolic, mean, and diastolic blood pressure.

This database is a subset of the MIMIC-III Waveform Database, representing those records for which the patient has been identified, and their corresponding clinical records are available in the MIMIC-III Clinical Database.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

An ICU bedside monitor collects a great deal of data, from which it is possible to infer something about a patient’s physiological state. However, in order to understand how these waveforms are influenced by disease state and treatment, and the extent to which phenomena observed in the waveform can serve as indicators of disease, it is necessary to look at the broader context: patient demographics, diagnoses, medications, lab tests, and other information that is recorded by caregivers in the electronic medical record.

Collecting this broad clinical context is the task of the MIMIC-III Clinical Database, which was created in parallel with the Waveform Database and contains information about many of the same patients. The Matched Subset consists of all of the waveform and numerics recordings for which the corresponding clinical record is also available.


Methods

The bedside monitors used for collecting this database were not directly linked to the hospital medical record system. The monitor could be configured to display the patient’s name and medical record number, for ease of identifying patients at the central station, but this was not automatically updated when a patient was admitted or transferred to the ICU. This information was only available when the ICU staff entered it manually into the monitoring system, and since entering this information was not critical to patient care, it was frequently omitted or incomplete. Furthermore, limitations of the data archiving software made it possible to identify the care unit from which a recording originated, but not the precise room or bed number.

As a result, only a subset of the waveform recordings actually contained enough information to reliably identify the patient, and of those, not all overlapped with the time period represented by the MIMIC-III Clinical Database [1]. Using all of the available information, through a process of mostly automated matching with some manual corrections, a total of 22,317 waveform records (34%) and 22,247 numerics records (35%) were found that could be linked to a corresponding patient in the Clinical Database.

For each of those records, a new WFDB header file was created, incorporating the subject ID as well as the surrogate date and time of the recording. Note that the raw signal files (such as 3314767_0004.dat and 3314767n.dat) and segment header files (such as 3314767_0004.hea) are identical to those in the original numbered records.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

All data associated with a particular patient have been placed into a single subdirectory, named according to the patient's MIMIC-III subject_ID. These subdirectories are further divided into ten intermediate-level directories (matched/p00 to matched/p09).

The name of each matched waveform record is of the form matched/pXX/pXXNNNN/pXXNNNN-YYYY-MM-DD-hh-mm, where XXNNNN is the matching MIMIC-III Clinical Database Subject_ID, and YYYY, MM, DD, hh, and mm are the surrogate year, month (01-12), and day (01-31), and the real hour (00-23) and minute (00-59), derived from the starting date and time of day of the record. The surrogate dates match those of the corresponding MIMIC-III Clinical Database records.

In most cases, the waveform record is paired with a numerics record, which has the same name as the associated waveform record, with an n added to the end.

Frequently there are multiple waveform and numerics record pairs associated with a given clinical record; all of them will appear in the same subdirectory in such a case, and their names will indicate their chronologic sequence. For example, MIMIC-III Clinical Database record p000079 has been matched with two waveform and numerics record pairs, named:

  • p000079-2175-09-26-01-25 and p000079-2175-09-26-01-25n
  • p000079-2175-09-26-12-28 and p000079-2175-09-26-12-28n

Each mimic3wdb/matched record is also an undated mimic3wdb record (i.e., it also belongs to the full MIMIC-III Waveform Database). Only the surrogate-dated mimic3wdb/matched header (.hea) files are unique to the Matched Subset; the others, with names of the form 3*.hea and 3*.dat, are copies of the like-named files in the full database.


Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory p04 contains all records with names that begin with p04 (patients with a subject_id between 40000 and 49999.)
  • All files associated with patient 44083 are contained within the directory p04/p044083. This directory contains two waveform records (p044083-2112-05-04-19-50 and p044083-2112-05-23-12-22) and two corresponding numerics records (p044083-2112-05-04-19-50n and p044083-2112-05-23-12-22n), recorded from two separate ICU stays.
  • The master waveform header file for the first stay (p044083-2112-05-04-19-50.hea) indicates that the record is 20342033 sample intervals (about 45 hours) in length, and begins at 19:50 on May 4, 2112. This date, as with all dates in MIMIC-III, has been anonymized by shifting it by a random number of days into the future. See header(5) in the WFDB Applications Guide for more information about the format of this file.
  • This waveform record consists of 41 segments (3314767_0001 through to 3314767_0041), as indicated by the master header file. The layout header file (3314767_layout.hea) indicates that four ECG signals (II, AVR, V, and MCL) were recorded, along with a respiration signal, photoplethysmogram, and arterial blood pressure. Not all of these signals are available simultaneously.
  • The header file for segment number 4 (3314767_0004.hea) shows us that during this segment, five signals are available: three ECG leads (II, V, and AVR), a respiration signal (RESP), and a PPG signal (PLETH).
  • The numerics header file (p044083-2112-05-04-19-50n.hea) shows us that a variety of measurements were recorded, including heart rate, invasive and non-invasive blood pressure, respiratory rate, ST segment elevation, oxygen saturation, and cardiac rhythm statistics. Just as with waveforms, not all of these measurements are available at all times.

Referring to the MIMIC-III Clinical Database Demo, we can see from the PATIENTS table that this patient was male, and his anonymized date of birth was November 15, 2057 (making him 54 years old at the time of this ICU stay):

subject_id gender dob dod
44083 M 2057-11-15 00:00:00 2114-02-20 00:00:00

The ICUSTAYS table shows us that he was admitted once to the SICU and twice to the CCU:

subject_id hadm_id icustay_id first_careunit intime outtime
44083 125157 265615 SICU 2112-05-04 19:03:39 2112-05-06 17:21:01
44083 131048 282640 CCU 2112-05-23 12:32:06 2112-05-25 14:59:50
44083 198330 286428 CCU 2112-05-29 02:01:33 2112-06-01 16:50:40

The first of these admissions corresponds to the waveform record above, as indicated by the date (2112-05-04). Note that the starting and ending date and time of the waveform record will not always match the precise admission or discharge time.

The hadm_id (125157) and icustay_id (265615) are linked to other tables in MIMIC-III that provide further information about this particular ICU stay, such as vital signs, laboratory tests, medications, and diagnoses.


Release Notes

This database is a subset of version 1.0 of the MIMIC-III Waveform Database. It also represents a superset of the records in the previously-released MIMIC-II Waveform Database Matched Subset. However, it uses a different directory structure (see Data Description above), as well as different subject IDs and surrogate dates. This version corresponds to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database Matched Subset was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 2.4 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/p01
Name Size Modified
Parent Directory
p010013
p010023
p010030
p010042
p010045
p010049
p010061
p010069
p010075
p010077
p010083
p010086
p010096
p010124
p010134
p010152
p010186
p010188
p010205
p010209
p010241
p010246
p010247
p010257
p010277
p010289
p010305
p010315
p010317
p010320
p010337
p010342
p010384
p010391
p010419
p010423
p010425
p010428
p010432
p010433
p010434
p010455
p010464
p010475
p010485
p010487
p010510
p010513
p010525
p010532
p010534
p010552
p010564
p010581
p010595
p010600
p010604
p010611
p010618
p010629
p010635
p010638
p010651
p010652
p010653
p010655
p010656
p010667
p010674
p010679
p010686
p010689
p010694
p010710
p010721
p010725
p010738
p010748
p010766
p010769
p010774
p010782
p010785
p010799
p010814
p010842
p010847
p010852
p010856
p010859
p010872
p010906
p010917
p010924
p010925
p010926
p010928
p010939
p010957
p010973
p010985
p010995
p011003
p011018
p011032
p011043
p011055
p011061
p011066
p011086
p011096
p011099
p011137
p011138
p011143
p011147
p011154
p011161
p011162
p011171
p011172
p011187
p011191
p011200
p011205
p011232
p011235
p011236
p011242
p011244
p011247
p011261
p011268
p011279
p011280
p011291
p011292
p011318
p011320
p011323
p011328
p011330
p011341
p011342
p011347
p011370
p011372
p011380
p011388
p011403
p011421
p011431
p011442
p011446
p011464
p011467
p011473
p011486
p011509
p011512
p011529
p011546
p011558
p011561
p011563
p011590
p011591
p011596
p011597
p011600
p011604
p011609
p011622
p011638
p011641
p011658
p011667
p011679
p011681
p011684
p011688
p011694
p011698
p011700
p011703
p011710
p011723
p011727
p011728
p011744
p011752
p011756
p011757
p011762
p011763
p011764
p011785
p011787
p011801
p011815
p011827
p011829
p011840
p011850
p011852
p011855
p011861
p011870
p011876
p011877
p011901
p011907
p011912
p011931
p011945
p011948
p011949
p011951
p011957
p011978
p011988
p011992
p011995
p011998
p012000
p012008
p012013
p012014
p012070
p012077
p012078
p012090
p012094
p012104
p012110
p012112
p012113
p012115
p012116
p012122
p012124
p012140
p012141
p012167
p012169
p012171
p012174
p012175
p012181
p012182
p012184
p012187
p012203
p012212
p012215
p012217
p012251
p012267
p012274
p012277
p012284
p012294
p012306
p012319
p012331
p012344
p012351
p012365
p012367
p012371
p012372
p012375
p012383
p012388
p012400
p012403
p012410
p012411
p012435
p012445
p012461
p012473
p012481
p012482
p012508
p012531
p012536
p012564
p012565
p012567
p012573
p012575
p012577
p012581
p012586
p012589
p012599
p012619
p012631
p012632
p012663
p012673
p012679
p012693
p012704
p012708
p012709
p012712
p012724
p012727
p012733
p012739
p012748
p012752
p012753
p012772
p012788
p012795
p012797
p012799
p012806
p012807
p012812
p012821
p012823
p012829
p012831
p012834
p012849
p012856
p012869
p012878
p012883
p012903
p012914
p012915
p012920
p012941
p012942
p012947
p012968
p012974
p012982
p012987
p013002
p013013
p013033
p013049
p013052
p013071
p013072
p013096
p013099
p013101
p013110
p013121
p013123
p013136
p013144
p013146
p013149
p013150
p013171
p013181
p013183
p013191
p013195
p013212
p013214
p013218
p013226
p013253
p013259
p013265
p013266
p013273
p013274
p013286
p013295
p013308
p013314
p013316
p013329
p013333
p013353
p013354
p013355
p013361
p013364
p013373
p013378
p013422
p013432
p013435
p013436
p013437
p013438
p013439
p013463
p013476
p013480
p013485
p013489
p013494
p013500
p013501
p013508
p013532
p013536
p013561
p013564
p013569
p013570
p013582
p013593
p013599
p013600
p013615
p013618
p013628
p013629
p013640
p013646
p013649
p013667
p013668
p013689
p013694
p013705
p013710
p013715
p013716
p013719
p013720
p013728
p013731
p013739
p013752
p013754
p013759
p013772
p013782
p013793
p013818
p013830
p013837
p013839
p013840
p013844
p013850
p013852
p013854
p013868
p013877
p013902
p013927
p013948
p013960
p013970
p013973
p013993
p014005
p014014
p014019
p014034
p014036
p014037
p014048
p014050
p014054
p014057
p014058
p014059
p014070
p014079
p014094
p014096
p014098
p014104
p014106
p014123
p014131
p014161
p014167
p014168
p014174
p014179
p014186
p014189
p014190
p014197
p014204
p014205
p014233
p014240
p014245
p014251
p014256
p014263
p014266
p014279
p014286
p014297
p014298
p014299
p014321
p014322
p014325
p014328
p014330
p014334
p014346
p014370
p014386
p014391
p014448
p014458
p014469
p014478
p014486
p014495
p014497
p014506
p014520
p014524
p014529
p014532
p014533
p014539
p014542
p014544
p014551
p014561
p014573
p014579
p014584
p014592
p014603
p014611
p014622
p014626
p014651
p014664
p014669
p014679
p014692
p014702
p014703
p014714
p014724
p014728
p014749
p014755
p014761
p014763
p014766
p014772
p014777
p014780
p014784
p014793
p014822
p014824
p014828
p014836
p014855
p014857
p014862
p014863
p014865
p014873
p014884
p014897
p014898
p014899
p014900
p014909
p014914
p014918
p014919
p014922
p014928
p014929
p014932
p014935
p014936
p014938
p014946
p014947
p014953
p014975
p014982
p014995
p015013
p015021
p015026
p015046
p015052
p015055
p015079
p015082
p015093
p015110
p015119
p015124
p015128
p015141
p015144
p015150
p015168
p015181
p015185
p015198
p015208
p015218
p015226
p015230
p015243
p015254
p015266
p015268
p015270
p015279
p015295
p015298
p015301
p015302
p015303
p015308
p015315
p015329
p015333
p015361
p015369
p015382
p015385
p015426
p015427
p015435
p015458
p015461
p015464
p015465
p015466
p015470
p015474
p015480
p015485
p015488
p015490
p015509
p015514
p015524
p015531
p015538
p015545
p015557
p015558
p015563
p015567
p015569
p015583
p015595
p015610
p015619
p015624
p015625
p015631
p015632
p015637
p015640
p015644
p015645
p015652
p015654
p015669
p015679
p015683
p015684
p015687
p015701
p015703
p015716
p015725
p015727
p015733
p015749
p015769
p015770
p015775
p015809
p015810
p015817
p015821
p015831
p015838
p015841
p015852
p015877
p015883
p015885
p015900
p015902
p015903
p015904
p015911
p015917
p015922
p015924
p015929
p015963
p015964
p015965
p015974
p015977
p015982
p015997
p016002
p016013
p016019
p016024
p016032
p016046
p016063
p016071
p016076
p016088
p016105
p016112
p016115
p016117
p016121
p016122
p016127
p016129
p016139
p016156
p016161
p016164
p016172
p016192
p016196
p016199
p016210
p016216
p016236
p016256
p016258
p016261
p016265
p016275
p016279
p016280
p016286
p016296
p016322
p016336
p016337
p016343
p016352
p016353
p016360
p016365
p016373
p016391
p016392
p016399
p016423
p016436
p016447
p016455
p016463
p016490
p016492
p016499
p016504
p016511
p016516
p016533
p016550
p016552
p016554
p016560
p016561
p016565
p016568
p016581
p016590
p016592
p016607
p016608
p016619
p016637
p016639
p016640
p016642
p016666
p016677
p016684
p016685
p016691
p016709
p016715
p016723
p016727
p016740
p016748
p016775
p016776
p016798
p016804
p016806
p016810
p016821
p016827
p016839
p016849
p016853
p016856
p016860
p016864
p016873
p016874
p016876
p016881
p016888
p016909
p016916
p016924
p016927
p016932
p016949
p016961
p016967
p016992
p017002
p017018
p017021
p017026
p017028
p017038
p017041
p017054
p017058
p017069
p017072
p017075
p017083
p017092
p017097
p017105
p017112
p017122
p017128
p017133
p017145
p017149
p017152
p017182
p017196
p017216
p017218
p017231
p017236
p017246
p017260
p017262
p017280
p017285
p017286
p017290
p017293
p017299
p017300
p017312
p017313
p017318
p017337
p017344
p017366
p017372
p017394
p017401
p017412
p017419
p017421
p017423
p017440
p017443
p017451
p017456
p017457
p017472
p017483
p017488
p017497
p017516
p017522
p017539
p017557
p017582
p017589
p017616
p017617
p017629
p017646
p017663
p017666
p017667
p017671
p017674
p017690
p017691
p017692
p017696
p017697
p017702
p017712
p017717
p017727
p017735
p017736
p017743
p017748
p017753
p017757
p017761
p017764
p017765
p017774
p017775
p017785
p017791
p017795
p017798
p017803
p017807
p017808
p017810
p017812
p017814
p017822
p017826
p017828
p017847
p017848
p017865
p017875
p017882
p017886
p017901
p017902
p017913
p017920
p017929
p017944
p017948
p017954
p017959
p017976
p017997
p018000
p018035
p018088
p018108
p018123
p018126
p018139
p018166
p018167
p018169
p018200
p018205
p018219
p018225
p018229
p018233
p018239
p018248
p018254
p018258
p018264
p018269
p018280
p018300
p018322
p018333
p018356
p018357
p018358
p018365
p018376
p018377
p018393
p018401
p018402
p018403
p018413
p018418
p018429
p018469
p018487
p018489
p018498
p018516
p018524
p018535
p018546
p018568
p018572
p018584
p018595
p018597
p018614
p018615
p018621
p018624
p018626
p018633
p018642
p018643
p018676
p018681
p018685
p018687
p018688
p018689
p018695
p018696
p018711
p018727
p018733
p018737
p018738
p018739
p018740
p018753
p018786
p018812
p018815
p018818
p018837
p018846
p018852
p018875
p018892
p018896
p018897
p018910
p018921
p018928
p018942
p018952
p018959
p018962
p018970
p018971
p018975
p018982
p018988
p018995
p018996
p018998
p019005
p019012
p019016
p019018
p019029
p019031
p019038
p019040
p019053
p019055
p019059
p019087
p019093
p019099
p019102
p019125
p019145
p019155
p019169
p019208
p019213
p019218
p019220
p019233
p019246
p019248
p019265
p019296
p019297
p019308
p019309
p019311
p019330
p019333
p019338
p019342
p019344
p019346
p019361
p019364
p019371
p019372
p019375
p019405
p019411
p019412
p019418
p019430
p019442
p019445
p019453
p019465
p019470
p019484
p019493
p019513
p019523
p019538
p019553
p019560
p019563
p019578
p019598
p019603
p019604
p019608
p019618
p019620
p019624
p019627
p019634
p019640
p019644
p019649
p019655
p019666
p019675
p019685
p019700
p019718
p019726
p019733
p019734
p019757
p019765
p019769
p019771
p019811
p019815
p019817
p019827
p019834
p019848
p019866
p019872
p019891
p019898
p019918
p019931
p019936
p019938
p019947
p019965
p019975
p019977
p019980
p019981
p019999