MIMIC-III Waveform Database Matched Subset

Benjamin Moody, George Moody, Mauricio Villarroel, Gari Clifford, Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite:
Moody, B., Moody, G., Villarroel, M., Clifford, G., & Silva, I. (2020). MIMIC-III Waveform Database Matched Subset (version 1.0). PhysioNet. https://doi.org/10.13026/c2294b.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet:
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database Matched Subset contains 22,317 waveform records and 22,247 numerics records for 10,282 distinct ICU patients. These recordings typically include digitized signals such as ECG, ABP, respiration, and PPG, as well as periodic measurements such as heart rate, oxygen saturation, and systolic, mean, and diastolic blood pressure.

This database is a subset of the MIMIC-III Waveform Database. It contains those records for which the patient has been identified and whose corresponding clinical records are available in the MIMIC-III Clinical Database.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

An ICU bedside monitor collects a great deal of data, from which it is possible to infer something about a patient’s physiological state. However, in order to understand how these waveforms are influenced by disease state and treatment, and the extent to which phenomena observed in the waveform can serve as indicators of disease, it is necessary to look at the broader context: patient demographics, diagnoses, medications, lab tests, and other information that is recorded by caregivers in the electronic medical record.

Collecting this broad clinical context is the task of the MIMIC-III Clinical Database, which was created in parallel with the Waveform Database and contains information about many of the same patients. The Matched Subset consists of all of the waveform and numerics recordings for which the corresponding clinical record is also available.


Methods

The bedside monitors used for collecting this database were not directly linked to the hospital medical record system. The monitor could be configured to display the patient’s name and medical record number, for ease of identifying patients at the central station, but this was not automatically updated when a patient was admitted or transferred to the ICU. This information was only available when the ICU staff entered it manually into the monitoring system, and since entering this information was not critical to patient care, it was frequently omitted or incomplete. Furthermore, because of limitations of the data archiving software, it was possible to identify the care unit from which a recording originated, but not the precise room or bed number.

As a result, only a subset of the waveform recordings actually contained enough information to reliably identify the patient, and of those, not all overlapped with the time period represented by the MIMIC-III Clinical Database [1]. Using all of the available information, through a process of mostly automated matching with some manual corrections, a total of 22,317 waveform records (34%) and 22,247 numerics records (35%) were found that could be linked to a corresponding patient in the Clinical Database.

For each of those records, a new WFDB header file was created, incorporating the subject ID as well as the surrogate date and time of the recording. Note that the raw signal files (such as 3314767_0004.dat and 3314767n.dat) and segment header files (such as 3314767_0004.hea) are identical to those in the original numbered records.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

All data associated with a particular patient have been placed into a single subdirectory, named according to the patient's MIMIC-III subject_id. These subdirectories are in turn grouped into ten intermediate-level directories (matched/p00 to matched/p09).

The name of each matched waveform record is of the form matched/pXX/pXXNNNN/pXXNNNN-YYYY-MM-DD-hh-mm, where XXNNNN is the matching MIMIC-III Clinical Database subject_id (zero-padded to six digits); YYYY, MM (01-12), and DD (01-31) are the surrogate year, month, and day; and hh (00-23) and mm (00-59) are the real hour and minute. All of these are derived from the starting date and time of day of the record. The surrogate dates match those of the corresponding MIMIC-III Clinical Database records.

In most cases, the waveform record is paired with a numerics record, which has the same name as the associated waveform record, with an n added to the end.

Frequently there are multiple waveform and numerics record pairs associated with a given clinical record; all of them will appear in the same subdirectory in such a case, and their names will indicate their chronologic sequence. For example, MIMIC-III Clinical Database record p000079 has been matched with two waveform and numerics record pairs, named:

  • p000079-2175-09-26-01-25 and p000079-2175-09-26-01-25n
  • p000079-2175-09-26-12-28 and p000079-2175-09-26-12-28n
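
The naming convention above can be unpacked mechanically. The following is a minimal sketch, not part of the database tooling: the regular expression, the `MatchedRecord` type, and the `parse_record_name` helper are all hypothetical names introduced here, and the pattern is inferred only from the description and examples in this section.

```python
import re
from datetime import datetime
from typing import NamedTuple

# Pattern assumed from the convention described in the text:
# pXXNNNN-YYYY-MM-DD-hh-mm, with an optional trailing "n" for numerics records.
RECORD_NAME = re.compile(
    r"^p(?P<subject>\d{6})-(?P<stamp>\d{4}-\d{2}-\d{2}-\d{2}-\d{2})(?P<numerics>n?)$"
)


class MatchedRecord(NamedTuple):
    subject_id: int      # MIMIC-III Clinical Database subject_id
    start: datetime      # surrogate date combined with the real time of day
    is_numerics: bool    # True when the name ends in "n"
    path: str            # record path relative to the matched/ directory


def parse_record_name(name: str) -> MatchedRecord:
    """Split a matched-subset record name into its documented components."""
    m = RECORD_NAME.match(name)
    if m is None:
        raise ValueError(f"not a matched-subset record name: {name!r}")
    subject = m.group("subject")
    start = datetime.strptime(m.group("stamp"), "%Y-%m-%d-%H-%M")
    # Intermediate directory pXX, then the per-patient directory pXXNNNN.
    path = f"p{subject[:2]}/p{subject}/{name}"
    return MatchedRecord(int(subject), start, m.group("numerics") == "n", path)


if __name__ == "__main__":
    for name in ("p000079-2175-09-26-01-25", "p000079-2175-09-26-12-28n"):
        print(parse_record_name(name))
```

Because every field in the name is zero-padded, sorting the names of one patient as plain strings should also sort them chronologically.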

Each mimic3wdb/matched record is also an undated mimic3wdb record (i.e., it also belongs to the full MIMIC-III Waveform Database). Only the surrogate-dated mimic3wdb/matched header (.hea) files are unique to the Matched Subset; the others, with names of the form 3*.hea and 3*.dat, are copies of the like-named files in the full database.


Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory p04 contains all records with names that begin with p04 (patients with a subject_id between 40000 and 49999).
  • All files associated with patient 44083 are contained within the directory p04/p044083. This directory contains two waveform records (p044083-2112-05-04-19-50 and p044083-2112-05-23-12-22) and two corresponding numerics records (p044083-2112-05-04-19-50n and p044083-2112-05-23-12-22n), recorded from two separate ICU stays.
  • The master waveform header file for the first stay (p044083-2112-05-04-19-50.hea) indicates that the record is 20342033 sample intervals (about 45 hours) in length, and begins at 19:50 on May 4, 2112. This date, as with all dates in MIMIC-III, has been anonymized by shifting it by a random number of days into the future. See header(5) in the WFDB Applications Guide for more information about the format of this file.
  • This waveform record consists of 41 segments (3314767_0001 through 3314767_0041), as indicated by the master header file. The layout header file (3314767_layout.hea) indicates that four ECG signals (II, AVR, V, and MCL) were recorded, along with a respiration signal, photoplethysmogram, and arterial blood pressure. Not all of these signals are available simultaneously.
  • The header file for segment number 4 (3314767_0004.hea) shows us that during this segment, five signals are available: three ECG leads (II, V, and AVR), a respiration signal (RESP), and a PPG signal (PLETH).
  • The numerics header file (p044083-2112-05-04-19-50n.hea) shows us that a variety of measurements were recorded, including heart rate, invasive and non-invasive blood pressure, respiratory rate, ST segment elevation, oxygen saturation, and cardiac rhythm statistics. Just as with waveforms, not all of these measurements are available at all times.
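
Two of the figures in the walkthrough above can be reproduced with a few lines of arithmetic. This is an illustrative sketch only: `subject_directory` and `duration_hours` are hypothetical helpers, and the default sampling frequency of 125 samples per second is an assumption that happens to agree with the "about 45 hours" figure. In practice the frequency should be read from the record's header file.

```python
def subject_directory(subject_id: int) -> str:
    """Return the directory for a patient, relative to the matched/ directory."""
    if not 0 <= subject_id <= 99999:
        # The text describes ten intermediate directories, p00 to p09.
        raise ValueError(f"subject_id out of range: {subject_id}")
    padded = f"{subject_id:06d}"           # 44083 -> "044083"
    return f"p{padded[:2]}/p{padded}"      # -> "p04/p044083"


def duration_hours(n_samples: int, fs_hz: float = 125.0) -> float:
    """Convert a length in sample intervals to hours.

    fs_hz is an assumed sampling frequency, not a value taken from the text.
    """
    return n_samples / fs_hz / 3600.0


if __name__ == "__main__":
    print(subject_directory(44083))
    print(round(duration_hours(20342033), 1))
```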

Referring to the MIMIC-III Clinical Database Demo, we can see from the PATIENTS table that this patient was male, and his anonymized date of birth was November 15, 2057 (making him 54 years old at the time of this ICU stay):

subject_id  gender  dob                  dod
44083       M       2057-11-15 00:00:00  2114-02-20 00:00:00
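
The stated age can be checked directly from the surrogate dates. A small sketch, using a hypothetical `age_in_years` helper:

```python
from datetime import date


def age_in_years(dob: date, on: date) -> int:
    """Whole years elapsed between a date of birth and a later date."""
    had_birthday = (on.month, on.day) >= (dob.month, dob.day)
    return on.year - dob.year - (0 if had_birthday else 1)


if __name__ == "__main__":
    # Surrogate date of birth and the start date of the first waveform record.
    print(age_in_years(date(2057, 11, 15), date(2112, 5, 4)))
```

Both dates are shifted surrogates, so only their difference is meaningful.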

The ICUSTAYS table shows us that he was admitted once to the SICU and twice to the CCU:

subject_id  hadm_id  icustay_id  first_careunit  intime               outtime
44083       125157   265615      SICU            2112-05-04 19:03:39  2112-05-06 17:21:01
44083       131048   282640      CCU             2112-05-23 12:32:06  2112-05-25 14:59:50
44083       198330   286428      CCU             2112-05-29 02:01:33  2112-06-01 16:50:40

The first of these admissions corresponds to the waveform record above, as indicated by the date (2112-05-04). Note that the starting and ending date and time of the waveform record will not always match the precise admission or discharge time.

The hadm_id (125157) and icustay_id (265615) are linked to other tables in MIMIC-III that provide further information about this particular ICU stay, such as vital signs, laboratory tests, medications, and diagnoses.
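
Because record start times need not fall exactly inside the recorded stay interval, linking a waveform record to an ICU stay benefits from some tolerance. The sketch below is one possible approach under stated assumptions, not the matching procedure used to build the database: `find_stay` is a hypothetical helper, and the one-hour `slack` is an arbitrary illustrative value. The rows are copied from the ICUSTAYS excerpt above.

```python
from datetime import datetime, timedelta
from typing import List, Optional, Tuple

Stay = Tuple[int, datetime, datetime]  # (icustay_id, intime, outtime)

# Rows copied from the ICUSTAYS excerpt for subject_id 44083.
STAYS: List[Stay] = [
    (265615, datetime(2112, 5, 4, 19, 3, 39), datetime(2112, 5, 6, 17, 21, 1)),
    (282640, datetime(2112, 5, 23, 12, 32, 6), datetime(2112, 5, 25, 14, 59, 50)),
    (286428, datetime(2112, 5, 29, 2, 1, 33), datetime(2112, 6, 1, 16, 50, 40)),
]


def find_stay(
    record_start: datetime,
    stays: List[Stay],
    slack: timedelta = timedelta(hours=1),  # assumed tolerance, not from the text
) -> Optional[int]:
    """Return the first icustay_id whose widened interval contains record_start."""
    for icustay_id, intime, outtime in stays:
        if intime - slack <= record_start <= outtime + slack:
            return icustay_id
    return None


if __name__ == "__main__":
    # Start times taken from the two waveform record names for this patient.
    print(find_stay(datetime(2112, 5, 4, 19, 50), STAYS))
    print(find_stay(datetime(2112, 5, 23, 12, 22), STAYS))
```

The second record starts about ten minutes before the recorded intime of the first CCU stay, so a strict containment test (`slack=timedelta(0)`) would leave it unmatched.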


Release Notes

This database is a subset of version 1.0 of the MIMIC-III Waveform Database. It also represents a superset of the records in the previously-released MIMIC-II Waveform Database Matched Subset. However, it uses a different directory structure (see Data Description above), as well as different subject IDs and surrogate dates. This version corresponds to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interest to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database Matched Subset was derived from: Please cite them when using this project.
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Files

Total uncompressed size: 2.4 TB.

Folder Navigation: <base>/p06
p060034
p060038
p060048
p060049
p060053
p060057
p060072
p060074
p060079
p060083
p060104
p060115
p060118
p060122
p060130
p060146
p060164
p060169
p060170
p060180
p060181
p060198
p060207
p060219
p060226
p060228
p060235
p060254
p060262
p060272
p060274
p060294
p060295
p060303
p060309
p060325
p060341
p060343
p060353
p060355
p060380
p060389
p060393
p060403
p060409
p060419
p060422
p060424
p060432
p060436
p060441
p060518
p060523
p060531
p060534
p060550
p060593
p060598
p060603
p060614
p060641
p060659
p060674
p060675
p060680
p060685
p060716
p060719
p060726
p060727
p060735
p060737
p060739
p060747
p060749
p060753
p060767
p060773
p060775
p060778
p060782
p060783
p060798
p060809
p060825
p060829
p060838
p060850
p060852
p060868
p060892
p060894
p060919
p060929
p060949
p060952
p060958
p060961
p060968
p060969
p060975
p060977
p060983
p060985
p060987
p060994
p060998
p061003
p061012
p061024
p061030
p061041
p061053
p061073
p061095
p061106
p061118
p061119
p061121
p061132
p061144
p061149
p061150
p061156
p061157
p061170
p061182
p061187
p061195
p061196
p061201
p061207
p061213
p061223
p061248
p061282
p061295
p061296
p061301
p061316
p061378
p061390
p061406
p061434
p061441
p061458
p061472
p061492
p061496
p061516
p061519
p061522
p061538
p061551
p061568
p061569
p061574
p061588
p061590
p061591
p061597
p061619
p061620
p061622
p061624
p061630
p061642
p061648
p061656
p061658
p061663
p061667
p061685
p061729
p061738
p061742
p061748
p061751
p061764
p061771
p061787
p061802
p061809
p061823
p061828
p061833
p061835
p061846
p061856
p061876
p061877
p061898
p061913
p061924
p061928
p061932
p061943
p061949
p061956
p061959
p061980
p061984
p061991
p062004
p062032
p062033
p062035
p062061
p062087
p062114
p062126
p062160
p062183
p062186
p062194
p062212
p062215
p062227
p062232
p062239
p062248
p062254
p062259
p062261
p062268
p062284
p062295
p062298
p062305
p062323
p062346
p062380
p062389
p062393
p062395
p062415
p062416
p062430
p062438
p062447
p062457
p062466
p062478
p062497
p062512
p062514
p062515
p062522
p062528
p062530
p062538
p062539
p062543
p062561
p062562
p062564
p062571
p062586
p062603
p062606
p062608
p062613
p062622
p062637
p062641
p062646
p062647
p062650
p062664
p062674
p062681
p062684
p062689
p062693
p062699
p062715
p062717
p062721
p062722
p062731
p062734
p062735
p062750
p062751
p062765
p062782
p062791
p062795
p062798
p062824
p062829
p062833
p062835
p062841
p062854
p062884
p062913
p062917
p062919
p062926
p062937
p062950
p062965
p062974
p062980
p063003
p063017
p063024
p063028
p063039
p063053
p063059
p063063
p063073
p063074
p063076
p063107
p063113
p063116
p063130
p063131
p063133
p063135
p063138
p063145
p063173
p063187
p063192
p063201
p063219
p063220
p063226
p063236
p063237
p063238
p063245
p063249
p063272
p063290
p063320
p063327
p063359
p063364
p063368
p063372
p063383
p063402
p063403
p063404
p063431
p063437
p063444
p063447
p063453
p063456
p063461
p063467
p063471
p063482
p063486
p063489
p063492
p063496
p063499
p063507
p063512
p063525
p063531
p063541
p063544
p063552
p063559
p063563
p063571
p063579
p063582
p063598
p063599
p063601
p063621
p063628
p063639
p063646
p063659
p063660
p063669
p063683
p063700
p063701
p063706
p063710
p063721
p063733
p063741
p063750
p063755
p063756
p063757
p063762
p063764
p063771
p063773
p063785
p063792
p063834
p063865
p063878
p063890
p063898
p063899
p063912
p063922
p063925
p063934
p063936
p063938
p063944
p063952
p063958
p063961
p063987
p063992
p063999
p064004
p064008
p064024
p064025
p064026
p064027
p064047
p064067
p064082
p064083
p064089
p064099
p064100
p064101
p064112
p064120
p064136
p064137
p064145
p064150
p064160
p064188
p064191
p064192
p064194
p064195
p064206
p064216
p064230
p064238
p064253
p064260
p064277
p064280
p064282
p064287
p064296
p064297
p064298
p064332
p064334
p064336
p064361
p064368
p064374
p064377
p064384
p064411
p064421
p064450
p064459
p064465
p064472
p064485
p064491
p064502
p064519
p064523
p064524
p064538
p064550
p064557
p064558
p064601
p064621
p064652
p064663
p064666
p064673
p064687
p064700
p064701
p064717
p064719
p064721
p064741
p064752
p064771
p064772
p064785
p064798
p064830
p064833
p064839
p064845
p064846
p064873
p064897
p064906
p064916
p064925
p064927
p064935
p064944
p064952
p064965
p064967
p064969
p064970
p064988
p064992
p064994
p064996
p065003
p065006
p065013
p065033
p065049
p065050
p065052
p065055
p065057
p065072
p065097
p065107
p065112
p065144
p065147
p065161
p065175
p065176
p065180
p065187
p065192
p065217
p065240
p065247
p065256
p065263
p065267
p065268
p065310
p065342
p065358
p065370
p065393
p065401
p065404
p065411
p065421
p065425
p065431
p065442
p065444
p065448
p065476
p065481
p065484
p065490
p065513
p065515
p065516
p065527
p065535
p065537
p065560
p065565
p065570
p065575
p065594
p065599
p065604
p065611
p065634
p065636
p065645
p065652
p065656
p065659
p065665
p065675
p065689
p065703
p065710
p065719
p065728
p065732
p065743
p065753
p065759
p065760
p065779
p065787
p065793
p065810
p065824
p065833
p065835
p065837
p065843
p065849
p065854
p065861
p065871
p065879
p065890
p065893
p065895
p065900
p065915
p065925
p065946
p065951
p065956
p065970
p065979
p065999
p066031
p066033
p066037
p066043
p066046
p066055
p066058
p066061
p066063
p066067
p066068
p066072
p066083
p066093
p066105
p066110
p066152
p066157
p066170
p066172
p066184
p066189
p066200
p066206
p066208
p066213
p066217
p066228
p066232
p066244
p066264
p066280
p066288
p066298
p066310
p066311
p066320
p066322
p066326
p066338
p066346
p066365
p066383
p066384
p066386
p066405
p066412
p066419
p066424
p066437
p066463
p066473
p066474
p066479
p066481
p066483
p066496
p066504
p066505
p066507
p066530
p066534
p066542
p066556
p066559
p066560
p066566
p066571
p066572
p066580
p066595
p066598
p066603
p066614
p066643
p066654
p066677
p066682
p066686
p066692
p066696
p066706
p066710
p066717
p066720
p066727
p066753
p066756
p066765
p066770
p066772
p066786
p066796
p066804
p066807
p066811
p066818
p066822
p066823
p066825
p066831
p066851
p066859
p066871
p066878
p066880
p066892
p066894
p066898
p066899
p066903
p066910
p066919
p066936
p066955
p066957
p066965
p066974
p066986
p067005
p067017
p067042
p067050
p067051
p067054
p067058
p067072
p067087
p067101
p067111
p067112
p067140
p067149
p067154
p067158
p067172
p067213
p067222
p067223
p067227
p067239
p067245
p067248
p067256
p067283
p067284
p067301
p067316
p067321
p067323
p067341
p067343
p067344
p067347
p067348
p067358
p067367
p067377
p067380
p067384
p067386
p067388
p067413
p067415
p067418
p067429
p067442
p067446
p067449
p067451
p067461
p067468
p067475
p067477
p067511
p067533
p067546
p067583
p067589
p067617
p067619
p067620
p067625
p067633
p067636
p067639
p067651
p067653
p067659
p067675
p067680
p067684
p067687
p067689
p067696
p067710
p067711
p067715
p067717
p067718
p067722
p067735
p067740
p067744
p067747
p067758
p067761
p067763
p067771
p067772
p067774
p067788
p067796
p067798
p067803
p067813
p067821
p067831
p067853
p067856
p067858
p067877
p067903
p067912
p067924
p067931
p067939
p067956
p067976
p067996
p068001
p068003
p068007
p068024
p068036
p068059
p068065
p068068
p068075
p068094
p068095
p068099
p068116
p068123
p068126
p068127
p068128
p068172
p068174
p068228
p068231
p068242
p068244
p068251
p068268
p068285
p068296
p068297
p068299
p068300
p068307
p068344
p068356
p068375
p068385
p068391
p068396
p068401
p068422
p068425
p068426
p068435
p068439
p068450
p068453
p068457
p068464
p068475
p068505
p068515
p068526
p068527
p068531
p068533
p068534
p068542
p068543
p068564
p068589
p068605
p068607
p068611
p068621
p068623
p068624
p068629
p068645
p068651
p068655
p068663
p068673
p068676
p068685
p068690
p068703
p068710
p068720
p068724
p068728
p068736
p068738
p068744
p068780
p068785
p068789
p068797
p068806
p068807
p068808
p068812
p068824
p068836
p068857
p068860
p068870
p068875
p068890
p068892
p068902
p068905
p068907
p068909
p068911
p068915
p068916
p068919
p068922
p068946
p068947
p068949
p068956
p068962
p068964
p068965
p068980
p068989
p068991
p069000
p069006
p069011
p069020
p069027
p069039
p069047
p069052
p069074
p069081
p069082
p069088
p069093
p069108
p069118
p069138
p069144
p069146
p069157
p069162
p069169
p069170
p069182
p069194
p069201
p069215
p069219
p069221
p069233
p069250
p069251
p069265
p069268
p069270
p069272
p069274
p069293
p069296
p069339
p069343
p069344
p069351
p069352
p069354
p069367
p069370
p069371
p069388
p069395
p069396
p069407
p069411
p069434
p069435
p069442
p069447
p069450
p069464
p069465
p069483
p069484
p069498
p069501
p069512
p069518
p069522
p069531
p069537
p069559
p069563
p069577
p069578
p069579
p069586
p069591
p069596
p069598
p069615
p069620
p069624
p069626
p069639
p069650
p069651
p069654
p069675
p069676
p069679
p069681
p069684
p069693
p069694
p069696
p069709
p069746
p069763
p069764
p069776
p069778
p069786
p069799
p069857
p069871
p069890
p069891
p069903
p069905
p069925
p069943
p069956