24 October 2009: Read the preliminary announcement of Challenge 2010 (below). Challenge data and further details will be posted beginning on 1 December 2009, and participants may begin submitting preliminary results on 20 March 2010.
1 December 2009: Set A, a set of training data for the Challenge, is now available.
20 March 2010: The Challenge is now underway! Registration is open, and participants can submit reconstructions of target signals to the on-line scoring system.
23 March 2010: The scoring method used in event 1 has been revised slightly (see below).
28 December 2010: Read the papers presented at CinC 2010 by participants in this Challenge.
Sources for the open-source entries are also available.
In settings ranging from sleep studies to surgery to sports medicine to intensive care, real-time monitoring of a variety of physiologic signals has become an essential tool for clinicians and researchers. Transient corruption or loss of one or more signals, common in all of these settings, can be disruptive, especially when continuous observations are required in order to rule out rare events or as a basis for forecasting. Signal corruption can be particularly challenging when it mimics features that are associated with pathologic states.
Humans can be remarkably adept at dealing with transient noise and signal loss in these settings. Filling in gaps, and making use of context to recognize and ignore noise, are processes that our sensory and cognitive abilities leave us well-equipped to do. Can algorithmic solutions that take account of the same data, in broader contexts and without fatigue, do as well?
The aim of this year's challenge is to develop robust methods for filling in gaps in multiparameter physiologic data (including ECG signals, continuous blood pressure waveforms, and respiration). In a real-world monitoring application, these methods can be applied for many purposes, including:
In this Challenge, participants are asked to reconstruct, using any combination of available prior and concurrent information, segments of signals that have been removed from multiparameter recordings of patients in intensive care units (ICUs).
Participants are provided with three data sets of 100 records each. Each ten-minute record contains 6, 7, or 8 signals acquired from bedside ICU patient monitors. The recorded signals vary across records, and they include ECG, continuous invasive blood pressure, respiration, fingertip plethysmograms, and occasional other signals. In one of these signals, the final 30-second segment (the target signal) has been replaced by a gap (a flat line signal), and the goal is to reconstruct this missing 30-second target signal in each record.
Records in all three Challenge data sets are provided in standard PhysioBank (compact binary) formats; read them using the WFDB software package, use the PhysioBank ATM to convert them to text, CSV, Matlab, or EDF formats, or see How to obtain PhysioBank data in text form for notes on other methods for converting them to text. The Set A files named with the suffix .missing are the target signals; these files are in text format.
The data sets, together with the target signals for Set A, are available here. To download them efficiently, use rsync as described here (retrieve the challenge-2010 "module"). For convenience, a tarball containing the full contents of all three data sets is available here (196,792,152 bytes); see this note for information about unpacking tarballs.
A detailed reconstruction of a signal, reproducing all characteristics of the original signal accurately, is more than is necessary in order to be useful. For example, a reconstruction of an invasive blood pressure signal may be useful if it allows a good estimation of mean pressure, or pulse pressure, even if none of the other details of the signal can be reproduced accurately; as another example, a reconstruction of an ECG signal that allows accurate estimation of RR intervals may be useful, even in the absence of other details.
For this reason, this challenge makes use of two scoring algorithms, detailed below. The first, used in event 1, measures the overall accuracy of the reconstruction, and as such will tend to favor those reconstruction methods that succeed in recovering properties of the signal that are derivable from the signal amplitude, such as MAP and pulse pressure. The second, used in event 2, is based on the correlation between the original signal and its reconstruction, and tends to favor those methods that accurately recover the timing of the major fluctuations in the original signal, such as QRS complexes in the ECG needed to derive RR interval series. Each reconstruction is scored using both algorithms, so a single submission is entered in both events of the Challenge.
Each reconstruction, Vrec, is compared with the corresponding target (reference) signal, Vref. Target signals are known in all cases, but those for the Set B and Set C records are not provided to participants.
Each submitted reconstruction is scored using two different methods, one for each of the two events. Since the scoring methods for the two events have different biases, a change in your reconstruction of a given record may improve your event 1 score and decrease your event 2 score, or vice versa.
The target signal is subtracted (sample by sample) from the reconstruction to obtain the residual signal, Vres. The sum of the squares of the residuals, ssvres, is normalized by the energy (sample variance) of the target signal, Eref.
In event 1, the quality, Q1, of a reconstruction is defined as 1 - ssvres/Eref, or 0, whichever is larger. If ssvres is 0, Q1 is 1, even if Eref is also 0.
Use of a figure of merit based on the residual signal reflects the importance in many cases of obtaining a good estimate of target signal levels (such as systolic, mean, and diastolic pressures in a continuous blood pressure signal).
[Note that Q1 was previously defined as 1 - Eres/Eref, or 0, whichever is larger (where Eres is the energy of the residual signal). This definition was changed because it was independent of the magnitude of any constant (DC) component of the residual signal, unlike the revised definition.]
In event 2, the quality Q2 of a reconstruction is defined as the correlation coefficient of Vref and Vrec, or 0, whichever is larger. (Correlation coefficients can of course be negative; for the purposes of this challenge, an anticorrelated reconstruction is equivalent to an uncorrelated one, however.)
Use of the correlation coefficient as a figure of merit is motivated by the observation that reconstruction of a filtered signal may be useful in many cases. Such a reconstruction might, for example, provide a basis for reliable estimation of the timing of major fluctuations in a signal (such as QRS complexes in an ECG signal), even if absolute signal levels are not recovered. Unlike Q1, Q2 is relatively insensitive to misestimation of the amplitudes of fluctuations.
The final ranking of participants is based on summing the Q scores obtained for records in Set C. Participants are encouraged, but not required, to provide reconstructions of all records in Set C.
Both Q1 and Q2 are defined so they can vary between 0 and 1, and higher values are better. If you have submitted more than one reconstruction of the same record, only the last one submitted determines your Q1 and Q2 for that record. If you have not submitted a reconstruction for a given record, your Q1 and Q2 scores for that record are zero.
For event 1, your Set C summed score, C1, is the sum of the your final Q1 scores for each record in Set C. Similarly, for event 2, your Set C summed score, C2, is the sum of the your final Q2 scores for each record in Set C. Since Set C contains 100 records, C1 and C2 can vary between 0 and (in theory!) 100.
The summed scores are not normalized by the number of target signals reconstructed, to provide a strong incentive to submit reconstructions of as many of the Set C records as possible.
To register as a participant in the Challenge:
To be eligible for an award (see below), also:
If your abstract is accepted, you will be expected to prepare a four-page paper for publication in Computing in Cardiology, and to present a talk or poster about your work at CinC.
Reconstructions should be in the same format as the .missing files containing the Set A target signals (text format, with one sample of the reconstructed signal per line). Each file should contain 3750 samples (125 samples/second, for 30 seconds). Although the target signals consist entirely of samples with integer values, reconstructions may contain integer or floating point values. Use a decimal point (.) rather than a comma (,) to separate the integral and fractional components of floating point values.
Follow the links from your PhysioNetWorks home page (see above) to submit reconstructions and to view your Set A and B individual record scores, Q1 and Q2, as well as your Set A and B summed scores, A1, A2, B1, and B2.
You may replace any previously submitted reconstruction simply by submitting a new one. The corresponding scores are updated immediately.
Scores are calculated using the code in c2010-score.c, a short C program that you may download and use to compare any two time series.
Participants may submit reconstructions of as many Set A, B, and C target signals as they wish. Each reconstruction is scored separately. For each event, the final ranking of participants is based on the sum of the scores obtained for reconstructions of Set C target signals.
Keep in mind that Sets A and B are intended for experimentation with your methods for reconstruction. You can obtain scores for your Set A and B reconstructions at any time during the Challenge. Set C is the "final examination", and although you have until September to study it, you will not be able to obtain intermediate scores. "Tuning" your methods to Set C will not be possible, so learn what you can from Sets A and B and then apply it to Set C.
Aggregate and individual scores for Set C records will be calculated based on reconstructions submitted no later than noon GMT on 1 September 2010. These scores, which will appear on your page when they have been calculated, determine the final ranking of participants.
During a plenary session of Computing in Cardiology in September, four awards will be presented to the eligible participants in attendance with the best final scores as follows:
As in previous years, the Challenge includes an open source division. You may enter the open source division by sending the source code for your challenge entry by email, before noon GMT on Wednesday, 1 September 2010, to PhysioNet. Use the subject line "Challenge 2010 entry source", and be sure to include:
Each source file submitted should begin with a comment block containing the names of its authors and a reference to the open source license you have chosen for it, if any; for example:
/* fill.c - fill signal gaps using large wads of duct tape Copyright (C) 2010 Herman Foobar <email@example.com> This software is released under the terms of the GNU General Public License (http://www.gnu.org/copyleft/gpl.html). */
Source files in C, C++, Fortran, or Matlab m-code are preferred; other languages may be acceptable, but please ask first. Do not submit any code that cannot be freely redistributed. Following the conclusion of the Challenge, selected entries will be posted, with full credit to their authors, on PhysioNet.
Must there be an automatic process to choose the set of signals used in the reconstruction of the target signal or is it acceptable that the choice of signals is not done by a program but by the participant?
Either method is acceptable. Keep in mind that the final rankings will be determined by your scores on set C, and that you will receive only one score for each of the set C records (in other words, you will not be able to try several different signal subsets for each set C record and choose the subset that yields the best score in each case).
There are some files in the challenge dataset, where there are several signals with zeros in the last 30 seconds. The determination of the target signal is ambiguous at the following signals: c21, c54, c67. What should be the way of handling this problem?
In each of these cases, there is only one signal that becomes flat (all zero values) after 9 minutes and 30 seconds (i.e., beginning at sample 71250). Reconstruct that signal, not the one that has been flat for a longer period.