Database Open Access

MIMIC-III Waveform Database Matched Subset

Benjamin Moody George Moody Mauricio Villarroel Gari Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G., & Silva, I. (2020). MIMIC-III Waveform Database Matched Subset (version 1.0). PhysioNet. https://doi.org/10.13026/c2294b.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database Matched Subset contains 22,317 waveform records, and 22,247 numerics records, for 10,282 distinct ICU patients. These recordings typically include digitized signals such as ECG, ABP, respiration, and PPG, as well as periodic measurements such as heart rate, oxygen saturation, and systolic, mean, and diastolic blood pressure.

This database is a subset of the MIMIC-III Waveform Database, representing those records for which the patient has been identified, and their corresponding clinical records are available in the MIMIC-III Clinical Database.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

An ICU bedside monitor collects a great deal of data, from which it is possible to infer something about a patient’s physiological state. However, in order to understand how these waveforms are influenced by disease state and treatment, and the extent to which phenomena observed in the waveform can serve as indicators of disease, it is necessary to look at the broader context: patient demographics, diagnoses, medications, lab tests, and other information that is recorded by caregivers in the electronic medical record.

Collecting this broad clinical context is the task of the MIMIC-III Clinical Database, which was created in parallel with the Waveform Database and contains information about many of the same patients. The Matched Subset consists of all of the waveform and numerics recordings for which the corresponding clinical record is also available.


Methods

The bedside monitors used for collecting this database were not directly linked to the hospital medical record system. The monitor could be configured to display the patient’s name and medical record number, for ease of identifying patients at the central station, but this was not automatically updated when a patient was admitted or transferred to the ICU. This information was only available when the ICU staff entered it manually into the monitoring system, and since entering this information was not critical to patient care, it was frequently omitted or incomplete. Furthermore, limitations of the data archiving software made it possible to identify the care unit from which a recording originated, but not the precise room or bed number.

As a result, only a subset of the waveform recordings actually contained enough information to reliably identify the patient, and of those, not all overlapped with the time period represented by the MIMIC-III Clinical Database [1]. Using all of the available information, through a process of mostly automated matching with some manual corrections, a total of 22,317 waveform records (34%) and 22,247 numerics records (35%) were found that could be linked to a corresponding patient in the Clinical Database.

For each of those records, a new WFDB header file was created, incorporating the subject ID as well as the surrogate date and time of the recording. Note that the raw signal files (such as 3314767_0004.dat and 3314767n.dat) and segment header files (such as 3314767_0004.hea) are identical to those in the original numbered records.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

All data associated with a particular patient have been placed into a single subdirectory, named according to the patient's MIMIC-III subject_ID. These subdirectories are further divided into ten intermediate-level directories (matched/p00 to matched/p09).

The name of each matched waveform record is of the form matched/pXX/pXXNNNN/pXXNNNN-YYYY-MM-DD-hh-mm, where XXNNNN is the matching MIMIC-III Clinical Database Subject_ID, and YYYY, MM, DD, hh, and mm are the surrogate year, month (01-12), and day (01-31), and the real hour (00-23) and minute (00-59), derived from the starting date and time of day of the record. The surrogate dates match those of the corresponding MIMIC-III Clinical Database records.

In most cases, the waveform record is paired with a numerics record, which has the same name as the associated waveform record, with an n added to the end.

Frequently there are multiple waveform and numerics record pairs associated with a given clinical record; all of them will appear in the same subdirectory in such a case, and their names will indicate their chronologic sequence. For example, MIMIC-III Clinical Database record p000079 has been matched with two waveform and numerics record pairs, named:

  • p000079-2175-09-26-01-25 and p000079-2175-09-26-01-25n
  • p000079-2175-09-26-12-28 and p000079-2175-09-26-12-28n

Each mimic3wdb/matched record is also an undated mimic3wdb record (i.e., it also belongs to the full MIMIC-III Waveform Database). Only the surrogate-dated mimic3wdb/matched header (.hea) files are unique to the Matched Subset; the others, with names of the form 3*.hea and 3*.dat, are copies of the like-named files in the full database.


Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory p04 contains all records with names that begin with p04 (patients with a subject_id between 40000 and 49999.)
  • All files associated with patient 44083 are contained within the directory p04/p044083. This directory contains two waveform records (p044083-2112-05-04-19-50 and p044083-2112-05-23-12-22) and two corresponding numerics records (p044083-2112-05-04-19-50n and p044083-2112-05-23-12-22n), recorded from two separate ICU stays.
  • The master waveform header file for the first stay (p044083-2112-05-04-19-50.hea) indicates that the record is 20342033 sample intervals (about 45 hours) in length, and begins at 19:50 on May 4, 2112. This date, as with all dates in MIMIC-III, has been anonymized by shifting it by a random number of days into the future. See header(5) in the WFDB Applications Guide for more information about the format of this file.
  • This waveform record consists of 41 segments (3314767_0001 through to 3314767_0041), as indicated by the master header file. The layout header file (3314767_layout.hea) indicates that four ECG signals (II, AVR, V, and MCL) were recorded, along with a respiration signal, photoplethysmogram, and arterial blood pressure. Not all of these signals are available simultaneously.
  • The header file for segment number 4 (3314767_0004.hea) shows us that during this segment, five signals are available: three ECG leads (II, V, and AVR), a respiration signal (RESP), and a PPG signal (PLETH).
  • The numerics header file (p044083-2112-05-04-19-50n.hea) shows us that a variety of measurements were recorded, including heart rate, invasive and non-invasive blood pressure, respiratory rate, ST segment elevation, oxygen saturation, and cardiac rhythm statistics. Just as with waveforms, not all of these measurements are available at all times.

Referring to the MIMIC-III Clinical Database Demo, we can see from the PATIENTS table that this patient was male, and his anonymized date of birth was November 15, 2057 (making him 54 years old at the time of this ICU stay):

subject_id gender dob dod
44083 M 2057-11-15 00:00:00 2114-02-20 00:00:00

The ICUSTAYS table shows us that he was admitted once to the SICU and twice to the CCU:

subject_id hadm_id icustay_id first_careunit intime outtime
44083 125157 265615 SICU 2112-05-04 19:03:39 2112-05-06 17:21:01
44083 131048 282640 CCU 2112-05-23 12:32:06 2112-05-25 14:59:50
44083 198330 286428 CCU 2112-05-29 02:01:33 2112-06-01 16:50:40

The first of these admissions corresponds to the waveform record above, as indicated by the date (2112-05-04). Note that the starting and ending date and time of the waveform record will not always match the precise admission or discharge time.

The hadm_id (125157) and icustay_id (265615) are linked to other tables in MIMIC-III that provide further information about this particular ICU stay, such as vital signs, laboratory tests, medications, and diagnoses.


Release Notes

This database is a subset of version 1.0 of the MIMIC-III Waveform Database. It also represents a superset of the records in the previously-released MIMIC-II Waveform Database Matched Subset. However, it uses a different directory structure (see Data Description above), as well as different subject IDs and surrogate dates. This version corresponds to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database Matched Subset was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 2.4 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/p04
Name Size Modified
Parent Directory
p040000
p040013
p040019
p040033
p040042
p040056
p040057
p040059
p040063
p040068
p040083
p040084
p040094
p040102
p040132
p040133
p040161
p040179
p040183
p040187
p040189
p040200
p040206
p040213
p040216
p040227
p040236
p040239
p040241
p040246
p040253
p040269
p040288
p040299
p040305
p040317
p040321
p040334
p040337
p040347
p040352
p040370
p040371
p040387
p040412
p040425
p040435
p040460
p040463
p040472
p040474
p040477
p040483
p040485
p040548
p040566
p040567
p040569
p040577
p040580
p040599
p040601
p040624
p040673
p040689
p040694
p040703
p040706
p040715
p040723
p040724
p040736
p040744
p040745
p040767
p040797
p040798
p040811
p040822
p040826
p040828
p040831
p040833
p040850
p040854
p040866
p040867
p040878
p040882
p040897
p040900
p040904
p040911
p040912
p040929
p040934
p040940
p040950
p040967
p040972
p040973
p040988
p040999
p041002
p041013
p041022
p041024
p041031
p041034
p041035
p041050
p041055
p041061
p041067
p041074
p041078
p041107
p041115
p041121
p041154
p041163
p041192
p041194
p041199
p041204
p041217
p041224
p041254
p041257
p041266
p041279
p041284
p041287
p041302
p041311
p041322
p041332
p041343
p041350
p041359
p041361
p041371
p041373
p041383
p041389
p041405
p041408
p041430
p041441
p041442
p041446
p041447
p041469
p041487
p041493
p041517
p041525
p041546
p041573
p041588
p041589
p041592
p041596
p041603
p041619
p041625
p041631
p041639
p041653
p041661
p041682
p041702
p041705
p041710
p041724
p041733
p041738
p041758
p041768
p041782
p041783
p041795
p041803
p041816
p041823
p041830
p041844
p041863
p041874
p041881
p041882
p041890
p041897
p041902
p041929
p041937
p041943
p041945
p041956
p041958
p041962
p041976
p041981
p041982
p042021
p042033
p042035
p042038
p042049
p042054
p042060
p042071
p042073
p042075
p042093
p042124
p042130
p042131
p042141
p042143
p042155
p042184
p042185
p042188
p042196
p042197
p042199
p042203
p042210
p042211
p042232
p042236
p042243
p042251
p042255
p042261
p042274
p042285
p042302
p042310
p042311
p042327
p042360
p042364
p042367
p042385
p042388
p042396
p042397
p042400
p042402
p042404
p042405
p042410
p042434
p042438
p042444
p042460
p042468
p042477
p042486
p042492
p042496
p042501
p042509
p042510
p042519
p042525
p042530
p042545
p042572
p042574
p042590
p042591
p042604
p042608
p042609
p042621
p042649
p042652
p042663
p042685
p042694
p042696
p042702
p042709
p042721
p042725
p042728
p042733
p042747
p042763
p042781
p042782
p042792
p042795
p042800
p042809
p042815
p042819
p042820
p042829
p042851
p042854
p042858
p042860
p042866
p042870
p042875
p042892
p042898
p042904
p042905
p042919
p042926
p042930
p042937
p042950
p042961
p042965
p042969
p042970
p042995
p043006
p043017
p043033
p043037
p043060
p043061
p043084
p043086
p043089
p043093
p043098
p043115
p043116
p043121
p043143
p043150
p043155
p043160
p043165
p043206
p043209
p043220
p043233
p043243
p043261
p043274
p043296
p043323
p043359
p043383
p043392
p043400
p043402
p043412
p043422
p043426
p043430
p043439
p043446
p043447
p043450
p043459
p043461
p043472
p043482
p043484
p043501
p043520
p043529
p043551
p043559
p043561
p043563
p043571
p043585
p043589
p043601
p043613
p043615
p043624
p043632
p043634
p043649
p043664
p043671
p043673
p043676
p043691
p043700
p043705
p043729
p043731
p043736
p043737
p043738
p043741
p043759
p043770
p043774
p043776
p043786
p043792
p043798
p043803
p043812
p043814
p043817
p043827
p043837
p043866
p043870
p043874
p043911
p043917
p043926
p043937
p043943
p043946
p043948
p043961
p043975
p043982
p043983
p043991
p043995
p044002
p044018
p044023
p044036
p044044
p044052
p044058
p044059
p044061
p044083
p044084
p044115
p044123
p044126
p044128
p044135
p044139
p044141
p044153
p044164
p044166
p044188
p044203
p044206
p044207
p044220
p044232
p044234
p044248
p044255
p044270
p044277
p044298
p044319
p044326
p044340
p044369
p044373
p044375
p044377
p044383
p044408
p044427
p044437
p044454
p044468
p044486
p044500
p044514
p044521
p044532
p044534
p044539
p044553
p044570
p044586
p044597
p044600
p044605
p044622
p044624
p044625
p044630
p044633
p044644
p044653
p044666
p044685
p044706
p044715
p044721
p044723
p044732
p044735
p044741
p044742
p044748
p044751
p044763
p044773
p044781
p044784
p044787
p044788
p044789
p044793
p044797
p044799
p044806
p044807
p044808
p044820
p044827
p044829
p044837
p044856
p044870
p044874
p044908
p044917
p044920
p044922
p044929
p044941
p044955
p044969
p044976
p044979
p045012
p045032
p045040
p045064
p045072
p045088
p045104
p045124
p045127
p045129
p045132
p045138
p045141
p045152
p045170
p045176
p045180
p045186
p045199
p045213
p045226
p045227
p045232
p045249
p045269
p045276
p045292
p045293
p045300
p045309
p045310
p045315
p045317
p045320
p045321
p045329
p045344
p045346
p045355
p045359
p045409
p045431
p045434
p045477
p045492
p045495
p045524
p045531
p045542
p045580
p045583
p045601
p045604
p045608
p045619
p045622
p045631
p045632
p045635
p045650
p045655
p045657
p045671
p045684
p045703
p045709
p045719
p045724
p045736
p045745
p045765
p045768
p045770
p045772
p045774
p045788
p045791
p045797
p045801
p045805
p045806
p045816
p045838
p045842
p045843
p045851
p045866
p045910
p045914
p045918
p045936
p045942
p045949
p045962
p045974
p045979
p046000
p046028
p046034
p046041
p046054
p046057
p046063
p046067
p046077
p046080
p046081
p046092
p046093
p046109
p046116
p046119
p046123
p046125
p046132
p046144
p046148
p046154
p046156
p046163
p046189
p046192
p046195
p046197
p046201
p046205
p046208
p046214
p046217
p046223
p046228
p046230
p046237
p046242
p046243
p046252
p046254
p046260
p046262
p046264
p046268
p046287
p046297
p046305
p046315
p046320
p046321
p046339
p046373
p046380
p046389
p046399
p046415
p046427
p046429
p046446
p046449
p046467
p046471
p046473
p046480
p046489
p046497
p046498
p046502
p046510
p046527
p046528
p046534
p046545
p046550
p046551
p046560
p046566
p046608
p046611
p046641
p046642
p046651
p046667
p046672
p046695
p046723
p046728
p046734
p046740
p046744
p046775
p046776
p046781
p046792
p046793
p046796
p046797
p046802
p046809
p046816
p046817
p046837
p046851
p046857
p046858
p046878
p046884
p046904
p046910
p046915
p046923
p046926
p046927
p046934
p046936
p046938
p046950
p046968
p046983
p046984
p046996
p047013
p047035
p047045
p047046
p047058
p047084
p047087
p047093
p047118
p047127
p047132
p047136
p047137
p047146
p047157
p047183
p047203
p047216
p047232
p047233
p047234
p047247
p047255
p047263
p047266
p047270
p047272
p047275
p047287
p047288
p047289
p047306
p047309
p047311
p047319
p047326
p047335
p047342
p047385
p047398
p047406
p047409
p047410
p047419
p047420
p047424
p047430
p047444
p047453
p047460
p047473
p047477
p047478
p047492
p047511
p047543
p047546
p047547
p047563
p047569
p047582
p047613
p047634
p047637
p047654
p047660
p047667
p047673
p047677
p047698
p047709
p047715
p047718
p047724
p047731
p047733
p047747
p047749
p047757
p047758
p047785
p047790
p047795
p047808
p047814
p047816
p047827
p047835
p047858
p047874
p047884
p047887
p047892
p047914
p047918
p047937
p047940
p047949
p047956
p047963
p047967
p047978
p047980
p047983
p047989
p047995
p048006
p048011
p048032
p048037
p048038
p048051
p048056
p048058
p048076
p048078
p048087
p048095
p048118
p048121
p048123
p048124
p048145
p048149
p048159
p048189
p048196
p048204
p048212
p048217
p048238
p048239
p048253
p048267
p048274
p048281
p048297
p048314
p048327
p048340
p048342
p048351
p048380
p048388
p048390
p048391
p048397
p048398
p048414
p048417
p048425
p048479
p048480
p048498
p048504
p048514
p048520
p048523
p048536
p048542
p048546
p048555
p048556
p048580
p048612
p048637
p048640
p048647
p048656
p048666
p048667
p048674
p048677
p048688
p048690
p048693
p048701
p048705
p048707
p048730
p048732
p048734
p048736
p048755
p048756
p048770
p048774
p048777
p048779
p048780
p048794
p048804
p048812
p048821
p048826
p048827
p048830
p048843
p048872
p048882
p048895
p048910
p048915
p048935
p048936
p048939
p048942
p048946
p048958
p048968
p048982
p048996
p048999
p049015
p049022
p049023
p049024
p049037
p049038
p049053
p049058
p049067
p049068
p049080
p049098
p049106
p049118
p049138
p049140
p049144
p049168
p049190
p049191
p049197
p049224
p049245
p049255
p049261
p049268
p049292
p049295
p049304
p049311
p049315
p049322
p049328
p049340
p049367
p049375
p049377
p049380
p049382
p049392
p049407
p049431
p049447
p049453
p049456
p049471
p049480
p049482
p049499
p049500
p049513
p049520
p049534
p049544
p049545
p049554
p049555
p049556
p049567
p049575
p049578
p049582
p049583
p049586
p049604
p049611
p049613
p049619
p049622
p049623
p049632
p049635
p049649
p049650
p049654
p049658
p049683
p049685
p049692
p049723
p049739
p049747
p049750
p049780
p049788
p049836
p049839
p049840
p049844
p049858
p049868
p049872
p049879
p049881
p049925
p049955
p049963
p049970
p049971
p049976
p049984
p049995
p049999