Database Open Access
Term-Preterm EHG DataSet with Tocogram
Published: Aug. 29, 2018. Version: 1.0.0
Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PCh, Mark RG, Mietus JE, Moody GB, Peng C-K, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals (2003). Circulation. 101(23):e215-e220.
The Term-Preterm ElectroHysteroGram DataSet with Tocogram (TPEHGT DS) contains 26 four-signal 30-min uterine EHG records, i.e., three EHG signals accompanied by a simultaneously recorded external tocogram measuring mechanical uterine activity (TOCO signal) of pregnant women, and another five 30-min uterine records (EHG signals and TOCO signal) of non-pregnant women.
The records of the pregnant women belong to pregnancies that resulted in spontaneous preterm delivery (13 preterm records from eight pregnancies), and to pregnancies that resulted in spontaneous term delivery (13 term records from ten pregnancies). The main objective of the dataset is to provide a set of annotated contraction intervals (annotated intervals related to uterine contractions), and another set of annotated non-contraction intervals (dummy intervals, i.e., intervals out of uterine contractions). The annotated contraction and dummy intervals of the records allow:
- characterization of uterine records, contraction intervals, and dummy intervals during pregnancy;
- development of algorithms to classify between preterm and term uterine records, and/or, between labor and nonlabor uterine records.
The dataset was developed at the Faculty of Computer and Information Science (Laboratory for Biomedical Computer Systems and Imaging), University of Ljubljana, Ljubljana, while the records were collected at the University Medical Centre Ljubljana, Department of Obstetrics and Gynecology. (Obtaining of the uterine records and the research were approved by the National Medical Ethics Committee of the Republic Slovenia: No. 32/01/97 and No. 108/09/09. The recording protocol (including position of the electrodes) was those which was also used during collecting the records of the Term-Preterm EHG Database (TPEHG DB).
The women participating in the study represented a sample of the general population. The TPEHGT DS contains 31 uterine records of which:
- 13 preterm records are with spontaneous delivery (eight pregnancies), the mean delivery time was 33.7 (± 1.97) weeks, and contain 47 annotated contraction intervals and 47 annotated dummy intervals;
- 13 term records are with spontaneous delivery (ten pregnancies), the mean delivery time was 38.1 (± 1.04) weeks, and contain 53 annotated contraction intervals and 53 annotated dummy intervals;
- five records are from non-pregnant women, and contain 53 annotated dummy intervals.
The records of pregnant women were obtained during regular check-ups in pregnancy around 31st week of pregnancy. The mean recording time and standard deviation of the records of pregnant women was 30.2 (± 2.76) weeks of pregnancy. For the manual annotating procedure, we used a graphic user interface and annotation editor. Besides visualizing original signals and annotation editing, the graphic user interface also allows calculating, and visualizing, spectra and spectrograms of the signals. Consensus about the annotated intervals was reached by two annotators.
The EHG signals of the records were collected from the abdominal surface. The electrodes (AgCl2) to measure EHG signals of the records were placed in two horizontal rows, symmetrically above and under the navel, spaced 7 cm apart:
- the first electrode (E1) was placed 3.5 cm to the left and 3.5 cm above the navel;
- the second electrode (E2) was placed 3.5 cm to the right and 3.5 cm above the navel;
- the third electrode (E3) was placed 3.5 cm to the right and 3.5 cm below the navel;
- the fourth electrode (E4) was placed 3.5 cm to the left and 3.5 cm below the navel;
- the reference electrode was attached to the woman’s thigh.
The differences in the electrical potentials of the electrodes were recorded, producing three signals:
- S1 = E2 - E1 (first EHG signal);
- S2 = E2 - E3 (second EHG signal);
- S3 = E4 - E3 (third EHG signal).
Prior to sampling, the EHG signals were filtered using an analog three-pole Butterworth filter with the bandwidth from 0.0 Hz to 5.0 Hz. The fourth simultaneous signal was the analog signal corresponding to an external tocogram (TOCO signal) measuring mechanical uterine pressure acquired using a cardiotocograph (model HP8030) attached at the top of the fundus. The analog TOCO signal was lead to one of the amplifiers of the A/D converter (the value of 150 μV corresponds to a pressure of 1 Pa). The sampling frequency for the EHG and TOCO signals was 20 samples per second per signal, with 16-bit resolution.
The original EHG and TOCO signals of the records were further filtered using the four-pole band-pass digital Butterworth filter with cut-off frequencies at 0.08 Hz and 5.0 Hz applied bi-directionally. The records of the TPEHGT DS contain both, the original and filtered signals.
Record names of the TPEHGT DS are of the following format: tpehgt_TXXX, where capital T refers to the type of record: p - Preterm, t - Term, and n - Non-pregnant, and XXX is the record number of that type.
The records of the dataset are in the WFDB format. Each records consists of three files, a header file (.hea) containing information regarding the record, the data file (.dat) containing original and filtered signal data, and the annotation file (.atr) containing annotations for the record.
The comment section in the header files (.hea) includes clinical information, such as:
- record ID [record name];
- record type [Preterm/Term/Non-pregnant];
- pregnancy duration (gestation) [weeks];
- gestation duration at the time of recording [weeks];
- maternal age [years];
- number of previous deliveries (parity) [number];
- number of previous abortions [number];
- weight at the time of recording [kg];
- placental position [front/end];
- smoker [yes/no];
If data was not available, the value is marked as None, if data was not applicable, the value is marked as N/A.
The signal data in the data files (.dat) are in the following order:
- first EHG signal S1, unfiltered;
- first EHG signal S1, filtered using a 4-pole band-pass Butterworth filter from 0.08 Hz to 5.0 Hz, bi-directionally;
- second EHG signal S2, unfiltered;
- second EHG signal S2, filtered using a 4-pole band-pass Butterworth filter from 0.08 Hz to 5.0 Hz, bi-directionally;
- third EHG signal S3, unfiltered;
- third EHG signal S3, filtered using a 4-pole band-pass Butterworth filter from 0.08 Hz to 5.0 Hz, bi-directionally;
- TOCO signal, unfiltered;
- TOCO signal, filtered using a 4-pole band-pass Butterworth filter from 0.08 Hz to 5.0 Hz, bi-directionally;
The annotations files (.atr) cantain manual annotations of contraction and dummy intervals:
- BC - the beginning of contraction interval;
- EC - the end of contraction interval;
- BD - the beginning of dummy interval;
- ED - the end of dummy interval;
For further information, please contact:
Laboratory of Biomedical Computer Systems and Imaging
Faculty of Computer and Information Science
University of Ljubljana
Večna pot 113
1000 Ljubljana, Slovenia
- Jager F, Libenšek S, Geršak K (2018). Characterization and automatic classification of preterm and term uterine records. PLoS ONE 13(8): e0202125. https://doi.org/10.1371/journal.pone.0202125.
Anyone can access the files, as long as they conform to the terms of the specified license.
License (for files):
Open Data Commons Attribution License v1.0
Total uncompressed size: 39.8 MB.
Access the files
- Download the ZIP file (39.8 MB)
- Access the files using the Google Cloud Storage Browser here. Login with a Google account is required.
- Access the data using Google Cloud "gsutil":
gsutil -m cp -r gs://tpehgt-1.0.0.physionet.org DESTINATION
- Download the files using your terminal:
wget -r -N -c -np https://physionet.org/files/tpehgt/1.0.0/