Database Open Access

MIMIC-III Waveform Database

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, these records represent a random sample of patients in those specific ICUs.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories (listed below). The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since the storage savings that would be achieved by doing so would be relatively little.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified/unknown filtering delays and/or unknown inter-channel delays, which may not be constant in a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 314595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 15 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously-released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 6.7 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/matched/p04
Name Size Modified
Parent Directory
p040000
p040013
p040019
p040033
p040042
p040056
p040057
p040059
p040063
p040068
p040083
p040084
p040094
p040102
p040132
p040133
p040161
p040179
p040183
p040187
p040189
p040200
p040206
p040213
p040216
p040227
p040236
p040239
p040241
p040246
p040253
p040269
p040288
p040299
p040305
p040317
p040321
p040334
p040337
p040347
p040352
p040370
p040371
p040387
p040412
p040425
p040435
p040460
p040463
p040472
p040474
p040477
p040483
p040485
p040548
p040566
p040567
p040569
p040577
p040580
p040599
p040601
p040624
p040673
p040689
p040694
p040703
p040706
p040715
p040723
p040724
p040736
p040744
p040745
p040767
p040797
p040798
p040811
p040822
p040826
p040828
p040831
p040833
p040850
p040854
p040866
p040867
p040878
p040882
p040897
p040900
p040904
p040911
p040912
p040929
p040934
p040940
p040950
p040967
p040972
p040973
p040988
p040999
p041002
p041013
p041022
p041024
p041031
p041034
p041035
p041050
p041055
p041061
p041067
p041074
p041078
p041107
p041115
p041121
p041154
p041163
p041192
p041194
p041199
p041204
p041217
p041224
p041254
p041257
p041266
p041279
p041284
p041287
p041302
p041311
p041322
p041332
p041343
p041350
p041359
p041361
p041371
p041373
p041383
p041389
p041405
p041408
p041430
p041441
p041442
p041446
p041447
p041469
p041487
p041493
p041517
p041525
p041546
p041573
p041588
p041589
p041592
p041596
p041603
p041619
p041625
p041631
p041639
p041653
p041661
p041682
p041702
p041705
p041710
p041724
p041733
p041738
p041758
p041768
p041782
p041783
p041795
p041803
p041816
p041823
p041830
p041844
p041863
p041874
p041881
p041882
p041890
p041897
p041902
p041929
p041937
p041943
p041945
p041956
p041958
p041962
p041976
p041981
p041982
p042021
p042033
p042035
p042038
p042049
p042054
p042060
p042071
p042073
p042075
p042093
p042124
p042130
p042131
p042141
p042143
p042155
p042184
p042185
p042188
p042196
p042197
p042199
p042203
p042210
p042211
p042232
p042236
p042243
p042251
p042255
p042261
p042274
p042285
p042302
p042310
p042311
p042327
p042360
p042364
p042367
p042385
p042388
p042396
p042397
p042400
p042402
p042404
p042405
p042410
p042434
p042438
p042444
p042460
p042468
p042477
p042486
p042492
p042496
p042501
p042509
p042510
p042519
p042525
p042530
p042545
p042572
p042574
p042590
p042591
p042604
p042608
p042609
p042621
p042649
p042652
p042663
p042685
p042694
p042696
p042702
p042709
p042721
p042725
p042728
p042733
p042747
p042763
p042781
p042782
p042792
p042795
p042800
p042809
p042815
p042819
p042820
p042829
p042851
p042854
p042858
p042860
p042866
p042870
p042875
p042892
p042898
p042904
p042905
p042919
p042926
p042930
p042937
p042950
p042961
p042965
p042969
p042970
p042995
p043006
p043017
p043033
p043037
p043060
p043061
p043084
p043086
p043089
p043093
p043098
p043115
p043116
p043121
p043143
p043150
p043155
p043160
p043165
p043206
p043209
p043220
p043233
p043243
p043261
p043274
p043296
p043323
p043359
p043383
p043392
p043400
p043402
p043412
p043422
p043426
p043430
p043439
p043446
p043447
p043450
p043459
p043461
p043472
p043482
p043484
p043501
p043520
p043529
p043551
p043559
p043561
p043563
p043571
p043585
p043589
p043601
p043613
p043615
p043624
p043632
p043634
p043649
p043664
p043671
p043673
p043676
p043691
p043700
p043705
p043729
p043731
p043736
p043737
p043738
p043741
p043759
p043770
p043774
p043776
p043786
p043792
p043798
p043803
p043812
p043814
p043817
p043827
p043837
p043866
p043870
p043874
p043911
p043917
p043926
p043937
p043943
p043946
p043948
p043961
p043975
p043982
p043983
p043991
p043995
p044002
p044018
p044023
p044036
p044044
p044052
p044058
p044059
p044061
p044083
p044084
p044115
p044123
p044126
p044128
p044135
p044139
p044141
p044153
p044164
p044166
p044188
p044203
p044206
p044207
p044220
p044232
p044234
p044248
p044255
p044270
p044277
p044298
p044319
p044326
p044340
p044369
p044373
p044375
p044377
p044383
p044408
p044427
p044437
p044454
p044468
p044486
p044500
p044514
p044521
p044532
p044534
p044539
p044553
p044570
p044586
p044597
p044600
p044605
p044622
p044624
p044625
p044630
p044633
p044644
p044653
p044666
p044685
p044706
p044715
p044721
p044723
p044732
p044735
p044741
p044742
p044748
p044751
p044763
p044773
p044781
p044784
p044787
p044788
p044789
p044793
p044797
p044799
p044806
p044807
p044808
p044820
p044827
p044829
p044837
p044856
p044870
p044874
p044908
p044917
p044920
p044922
p044929
p044941
p044955
p044969
p044976
p044979
p045012
p045032
p045040
p045064
p045072
p045088
p045104
p045124
p045127
p045129
p045132
p045138
p045141
p045152
p045170
p045176
p045180
p045186
p045199
p045213
p045226
p045227
p045232
p045249
p045269
p045276
p045292
p045293
p045300
p045309
p045310
p045315
p045317
p045320
p045321
p045329
p045344
p045346
p045355
p045359
p045409
p045431
p045434
p045477
p045492
p045495
p045524
p045531
p045542
p045580
p045583
p045601
p045604
p045608
p045619
p045622
p045631
p045632
p045635
p045650
p045655
p045657
p045671
p045684
p045703
p045709
p045719
p045724
p045736
p045745
p045765
p045768
p045770
p045772
p045774
p045788
p045791
p045797
p045801
p045805
p045806
p045816
p045838
p045842
p045843
p045851
p045866
p045910
p045914
p045918
p045936
p045942
p045949
p045962
p045974
p045979
p046000
p046028
p046034
p046041
p046054
p046057
p046063
p046067
p046077
p046080
p046081
p046092
p046093
p046109
p046116
p046119
p046123
p046125
p046132
p046144
p046148
p046154
p046156
p046163
p046189
p046192
p046195
p046197
p046201
p046205
p046208
p046214
p046217
p046223
p046228
p046230
p046237
p046242
p046243
p046252
p046254
p046260
p046262
p046264
p046268
p046287
p046297
p046305
p046315
p046320
p046321
p046339
p046373
p046380
p046389
p046399
p046415
p046427
p046429
p046446
p046449
p046467
p046471
p046473
p046480
p046489
p046497
p046498
p046502
p046510
p046527
p046528
p046534
p046545
p046550
p046551
p046560
p046566
p046608
p046611
p046641
p046642
p046651
p046667
p046672
p046695
p046723
p046728
p046734
p046740
p046744
p046775
p046776
p046781
p046792
p046793
p046796
p046797
p046802
p046809
p046816
p046817
p046837
p046851
p046857
p046858
p046878
p046884
p046904
p046910
p046915
p046923
p046926
p046927
p046934
p046936
p046938
p046950
p046968
p046983
p046984
p046996
p047013
p047035
p047045
p047046
p047058
p047084
p047087
p047093
p047118
p047127
p047132
p047136
p047137
p047146
p047157
p047183
p047203
p047216
p047232
p047233
p047234
p047247
p047255
p047263
p047266
p047270
p047272
p047275
p047287
p047288
p047289
p047306
p047309
p047311
p047319
p047326
p047335
p047342
p047385
p047398
p047406
p047409
p047410
p047419
p047420
p047424
p047430
p047444
p047453
p047460
p047473
p047477
p047478
p047492
p047511
p047543
p047546
p047547
p047563
p047569
p047582
p047613
p047634
p047637
p047654
p047660
p047667
p047673
p047677
p047698
p047709
p047715
p047718
p047724
p047731
p047733
p047747
p047749
p047757
p047758
p047785
p047790
p047795
p047808
p047814
p047816
p047827
p047835
p047858
p047874
p047884
p047887
p047892
p047914
p047918
p047937
p047940
p047949
p047956
p047963
p047967
p047978
p047980
p047983
p047989
p047995
p048006
p048011
p048032
p048037
p048038
p048051
p048056
p048058
p048076
p048078
p048087
p048095
p048118
p048121
p048123
p048124
p048145
p048149
p048159
p048189
p048196
p048204
p048212
p048217
p048238
p048239
p048253
p048267
p048274
p048281
p048297
p048314
p048327
p048340
p048342
p048351
p048380
p048388
p048390
p048391
p048397
p048398
p048414
p048417
p048425
p048479
p048480
p048498
p048504
p048514
p048520
p048523
p048536
p048542
p048546
p048555
p048556
p048580
p048612
p048637
p048640
p048647
p048656
p048666
p048667
p048674
p048677
p048688
p048690
p048693
p048701
p048705
p048707
p048730
p048732
p048734
p048736
p048755
p048756
p048770
p048774
p048777
p048779
p048780
p048794
p048804
p048812
p048821
p048826
p048827
p048830
p048843
p048872
p048882
p048895
p048910
p048915
p048935
p048936
p048939
p048942
p048946
p048958
p048968
p048982
p048996
p048999
p049015
p049022
p049023
p049024
p049037
p049038
p049053
p049058
p049067
p049068
p049080
p049098
p049106
p049118
p049138
p049140
p049144
p049168
p049190
p049191
p049197
p049224
p049245
p049255
p049261
p049268
p049292
p049295
p049304
p049311
p049315
p049322
p049328
p049340
p049367
p049375
p049377
p049380
p049382
p049392
p049407
p049431
p049447
p049453
p049456
p049471
p049480
p049482
p049499
p049500
p049513
p049520
p049534
p049544
p049545
p049554
p049555
p049556
p049567
p049575
p049578
p049582
p049583
p049586
p049604
p049611
p049613
p049619
p049622
p049623
p049632
p049635
p049649
p049650
p049654
p049658
p049683
p049685
p049692
p049723
p049739
p049747
p049750
p049780
p049788
p049836
p049839
p049840
p049844
p049858
p049868
p049872
p049879
p049881
p049925
p049955
p049963
p049970
p049971
p049976
p049984
p049995
p049999