Abstract

To obtain reliable transient auditory evoked potentials (AEPs) from EEGs recorded using high stimulus rate (HSR) paradigm, it is critical to design the stimulus sequences of appropriate frequency properties. Traditionally, the individual stimulus events in a stimulus sequence occur only at discrete time points dependent on the sampling frequency of the recording system and the duration of stimulus sequence. This dependency likely causes the implementation of suboptimal stimulus sequences, sacrificing the reliability of resulting AEPs. In this paper, we explicate the use of continuous-time stimulus sequence for HSR paradigm, which is independent of the discrete electroencephalogram (EEG) recording system. We employ simulation studies to examine the applicability of the continuous-time stimulus sequences and the impacts of sampling frequency on AEPs in traditional studies using discrete-time design. Results from these studies show that the continuous-time sequences can offer better frequency properties and improve the reliability of recovered AEPs. Furthermore, we find that the errors in the recovered AEPs depend critically on the sampling frequencies of experimental systems, and their relationship can be fitted using a reciprocal function. As such, our study contributes to the literature by demonstrating the applicability and advantages of continuous-time stimulus sequences for HSR paradigm and by revealing the relationship between the reliability of AEPs and sampling frequencies of the experimental systems when discrete-time stimulus sequences are used in traditional manner for the HSR paradigm.

1. Introduction

In studying the auditory evoked potentials (AEPs), high stimulus-rate (HSR) paradigm featuring shorter and irregular interstimulus intervals (ISIs) has been proposed by Delgado and Özdamar [1] and applied to various investigations [24]. The specific technique proposed in [1] is generally named as continuous loop average deconvolution (CLAD). For CLAD, the presence of stimulus and silence is, respectively, represented by “1” and “0.” As such a stimulus sweep containing multiple stimulus events is described by a binary sequence. A typical sweep contains a number of “1s” and a large number “0s.” Due to the fact that the ISIs have to be so short for the HSR paradigm that the brain responses to consecutive stimulus events overlap, this binary sequence that constitutes a sweep of stimulus and the sweep response that contains a number of overlapped transient-responses are transformed into the frequency domain to solve the overlapping problem in order to recover the transient responses (see Section 2.1 for more details). This process is usually termed as deconvolution.

As shown by Jewett et al. [5], any chosen stimulus sweep needs to satisfy the noise attenuation property to avoid distorting transient responses in the deconvolution process. As far as this peroperty is concerned, the ISIs must be irregularly distributed in a sweep rather than a fixed ISI as in the conventional recording paradigm. On the other hand, most practical applications require that such ISI-jitters should be as small as possible so that the linear convolution model is valid [5, 6]. The noise attenuation property, which is a major criterion for judging its appropriateness, can be straightforwardly understood in Fourier domain (for details, please see (1d) and (5) below). The generation of stimulus sequences with the desired property is in essence an optimization problem, which is important and dependent on various factors in practice. Typically, a minimal temporal resolution (i.e., the analogy-to-digital (AD) conversion rate for EEG recording) and the number of stimulus events are first chosen for the stimulus sweep for a given experiment. Since the stimulus sequence is optimized at given temporal resolution, we term it as discrete-time sequence. The temporal resolution is crucial for finding a good discrete-time stimulus sequence in that the AD rate is supposed to be as high as possible in order to increase the searching space for a sequence optimization method. However, the use of high temporal resolution imposes challenges for the search of optimal sequence. It either makes the optimization prone to local minima issue or increases computational expense when exhaustive searching strategy is used. Another issue is that when it is chosen, the optimal stimulus sequence should be used at the specific AD rate according to the chosen temporal resolution at which the optimal stimulus sequence is established.

In reality, different recording systems may not always be operated at the frequency exactly identical to that chosen for stimulus sweep design. When a recording system works at a different frequency, the timing of the onsets of stimulus events in the discrete-time stimulus sweep is to be resampled. It is unclear about the impacts of the resampling and actual AD rate on the deconvolution performance. Moreover, many optimization methods can only deal with continuous variables for the convenience of being exposed to certain mathematic operations [7, 8]. In this case, the optimal timing of stimulus events is a continuous variable, and we term the stimulus sweep as the continuous-time stimulus sequence, which need to be discretized in time domain to be used in actual experiments. The temporal resolution of the discretization is also determined by the AD rate for the actual application. As such, it is important to understand how the AD rate influences the performance of HSR paradigm no matter a resampling or discretization of optimal stimulus sequence is necessary.

To address these critical questions, in this paper, we derived the frequency representation of a continuous-time impulse sequence to solve the deconvolution problem for the HSR paradigm. Using simulated EEGs (based on real AEP data and simulated noises) and four optimized continuous-time stimulus sequences, we demonstrate the applicability and the advantages of continuous-time stimulus sequences for HSR paradigm. We also illustrate the relationship between the AD rate for discretizing a continuous-time sequence and the errors in terms of temporal locations of stimulus impulse, frequency properties of discretized stimulus sequence, and the deconvolved AEP as compared to the ground truth AEP.

2. The Convolution Model for HSR Paradigm

2.1. Discrete-Time Convolution

Under discrete HSR condition, the observed sweep-response can be modeled as a circulant discrete convolution between the binary stimulus sequence and a transient response , that is, where denotes circulant convolution; represents an additive noise term for any undesired contribution to ; is the interval between two discrete samples, or in other word, the reciprocal of the analogue-to-digital (AD) rate (i.e., ); is the number of discrete samples for the duration of a stimulus sweep.

Note that the circulant convolution is adopted in (1a). This is because the stimulus sequence is delivered to the subject repetitively in the HSR paradigm. As such, , , and are of the same length , or the same time duration . In practice, the recorded raw electroencephalograms (EEGs) are epoched according to the stimulus sweep rather than to individual stimulus impulse (each stimulus sweep contains a number of stimulus impulses depending on the particular experiment design), and averaged over a number of sweeps to attenuate the noise level. Equation (1a) is usually referred to the case after averaging.

Equation (1a) can be represented in Fourier domain as where and the capital letters correspond to the discrete Fourier transforms of their counterparts (e.g., denotes the discrete Fourier transform of ),

From (1b) one can solve the transient response in the frequency domain (i.e., ) in a straightforward way using an inverse filter as done by the CLAD [1]. Consider Based on (1d), it is obvious that distortion of the solution can happen at some frequency bins where approaches to zeros, since small can amplify the noise components .

To address this issue, Jewett et al. [5] propose a scheme to examine the values of within the frequency band of interest and make sure that these values are larger than a preset threshold. In a following study, Jewett et al. [7] offered a number of binary stimulus sequences according to this criterion.

2.2. Continuous-Time Convolution

Knowing the frequency property of a stimulus sequence can assist the assessment of the noise attenuation performance of the inverse filter for the deconvolution problem [8]. Early studies such as those by Jewett et al. [5, 7] guiding the selection of appropriate stimulus sequences have made the HSR scheme practical for actual applications and thus significantly contributed to the development of HSR paradigms. However, the optimal stimulus sequences used in the literature [7] so far are AD rate dependent and thus not generalizable due to the need of resampling when the actual AD rate differs from that used for stimulus sequence optimization. As such, there are two major drawbacks in optimizing the stimulus sequences in discrete form. First, the optimized sequence may not be optimal if the sequence is resampled with a different rate. This means that it is impossible to generate a sequence for general use; instead, optimization algorithm is to be employed to generate a good sequence according to the AD rate for each given experiment. Second, optimization in discrete form makes it hard to know how the AD rates influence the performance of an optimized sequence.

In this section, we derive the continuous-time convolution relationship between the stimulus sequence and transient response, and the estimation of transient response in a general form.

Similar to the in (1a), the observable EEG sweep can be represented as the circulant convolution between transient response and stimulus , with error term :

Since all the variables (except the error term ) in (2) can be considered as periodical functions (with a period of ) due to the repetitive stimulation manner, the key difference here from (1a) is that the stimulus events in can happen at any time point within the period , rather than the discrete time points determined by the AD rate. The Fourier transform of such signals as is in discrete form: where denotes Fourier transform operator, and represents the repetition rate of the stimulus sweep and thus the discrete frequency resolution for signals (e.g., in (3)) to be presented in Fourier domain. We can rewrite (2) in Fourier domain:

The transient response can be estimated in Fourier domain as

Distortion of the estimated transient response is introduced in the second term of the right hand side of (5). The noise term is to be amplified by the inverse filter at some frequency bins where is small. As such, it is critical to study in detail the properties of the inverse filter and we will do so in Section 2.3.

Equation (5) shows that as far as these continuous periodical signals are concerned, the error term’s contribution to the estimation of depends only on but not the AD rate as in (1d). Note that the frequency range of the solution can be infinite in theory, but in practice, the energy of transient signal is bounded within a relatively narrow frequency band of interest and only the continuous-time stimulus sequences can be of unlimited frequency range, which is to be detailed below.

2.3. The Properties of Continuous-Time Stimulus Sequence

A stimulus sequence for one experiment sweep can be described as a impulses train. We use delta functions of delay to represent the occurrence of stimulus impulses and their summation to represent the stimulus sweep:

In this continuous-time form of stimulus sequence, the is a continuous variable, that is, . Figure 1(a) shows an example sweep with stimulus impulses occurring at 17 time points. The Fourier transform of (6) is

This is a continuous function in the frequency domain [9]. In a given experiment, the stimulus sweep (Figure 1(a)) is repetitively delivered to the subject to stimulate the brain responses for EEG recording. As such, the whole stimulus sequence (Figure 1(b)) is modeled as a convolution between the impulses train and a summation of delta functions : where is a periodic function with period defined as

The Fourier transform of (9) is

Equation (10) shows that the Fourier transform of a periodical delta sequence is also a delta train with the interval of .

Based on (8) and (10), the Fourier transform of the periodical stimulus is

Equation (11) indicates that the spectrum is a discrete sampling of the continuous function in (7) by the delta function train at the interval of . Plugging in (7) into (11), we get which can be rewritten as follow given the sampling property of the delta function :

Equation (13) is the Fourier spectrum of the stimulus impulse-sequence, which includes infinite number of frequencies defined by since can be any integer. In real experiment, however, signals and noises are limited within a frequency band, say . While and can be of arbitrary real values in theory, we should round off them to the multiples of in practice, say , in the discrete frequency domain: where and are the frequency domain indexes for the frequency band of interest.

As shown in (5), the inverse filter critically determines the error term’s contribution to the distortion of deconvolved transient response . To limit the distortion, it is necessary to constrain the Fourier energy distribution of the stimulus sequence as follows so that the noise term in (5) is at least not amplified. Assume where is a threshold usually set at 1 to make sure that the noise term within the frequency bins of interest is at most maintaining its original energy if not attenuated by the inverse filter.

Figure 1(c) illustrates that the example stimulus loop in Figure 1(a) meets this criterion in that the inverse filter satisfies in the chosen frequency band .

To further evaluate the overall quality of the stimulus sequence in (6), a measure called noise gain factor (NGF) can be defined accordingly [10] as which represents the average of noise gain factor at each frequency .

3. Experiments and Results

3.1. Stimulus Impulse Sequences

In this section, we generate continuous-time stimulus sequences to be used for examining the impact of AD rates on the performance of inverse filtering in solving the transient AEPs. Using various optimization methods [11], these stimulus sequences can be found to satisfy the constraint in (15). Here we employed a modified optimization method called differential evolution algorithm [12] to obtain the optimal continuous-time stimulus sequences. The value of threshold in (15) was set to 1. Since the details of the optimization are beyond the scope of this paper, we directly provide the four impulse sequences (Table 1) generated for our study.

These four sequences are given in the form of ISI-series which can be expressed as , where if . Note that the temporal resolution of the stimulus events in the sequence is infinity in theory. In Table 1, we show in the second decimal place only. These sequences are used to study the characteristics of 40 Hz steady-state responses which is a main application of HSR paradigms [2, 4]. Here, the jitter ratio (JR) in Table 1 is defined as in percentage to measure the inhomogeneity of the stimulus interval. The applications of HSR paradigms usually requires a low jitter (small JR) stimulation to approach the case of steady state recordings while satisfying the constraint of (15) within the frequency band of interest (8–122 Hz, see Figures 4 and 5) in which the majority of energy of the simulated EEG signal falls.

3.2. EEG Data Simulation

In actual experiments, the recorded EEGs (including transient AEPs and noises) are band-limited signals. They are digitized in time and amplitude according to Nyquist-Shannon sampling theorem. In this study, we use a real AEP signal previously measured using CLAD method [8] as the AEP component () and the additive noise (i.e., ) generated from a process [13] to simulate background EEGs mixed with inherent artifacts and noise. In our practice, the is filtered by a band-pass filter to eliminate frequency outside Hz as a recording system usually does in experiments for the recording of middle latency response and 40 Hz steady state response. Note that and are both band-limited signals which in this case fall in frequency range  Hz. In theory, there is no error when resampling them at a different AD rate as long as the sampling theorem is satisfied. In this study the original signals and were obtained at AD rate of 20 kHz and then resampled into other rates as needed. We chose a frequency band  Hz for the simulated EEG signal to emulate the actual signal from recorded EEGs, which usually have a cut off frequency at 500 Hz in real experiment. However, this frequency band is broader than that (Hz) we used in optimizing our stimulus sequence in Section 3.1. Theoretically, we should avoid this inconsistency by either generating stimulus sequence satisfactory within  Hz or filtering the EEG signal to Hz. In practice, we found it was much more difficult, if not impossible, to optimize the stimulus sequence for broad frequency range. Although the stimulus sequences in this paper are optimized for a frequency band narrower than that for the EEG signal, there was no data point of measure within the range of Hz to be extremely large. Moreover, the energy of EEG signals beyond range  Hz is very low since the AEP is a very narrow band signal and the noise is simulated using a process. As such, noise will not be over amplified by the inverse filtering in this study and their contribution to the distortion of deconvolved AEP is limited.

3.3. Errors Introduced by the Temporal Discretization

The interstimulus interval of continuous-time stimulus sequence can be discretized at required temporal resolution. The discretization will introduce round off errors no matter how high is sampling frequency . Accordingly, the Fourier of the discretized counterpart will differ from that of the original continuous-time stimulus sequence.

The discretized temporal location of a stimulus impulse at with respect to the onset of a stimulus-sweep is

The normalized root-mean-square error is defined as an overall measure of the errors caused by the temporal discretization of the continuous-time stimulus impulse sequence

To examine the error introduced by the temporal discretization of four stimulus sequences in Table 1, we calculated at different temporal resolutions determined by the AD rates (from 1 kHz to 25 kHz). The results are presented in Figure 2. Consistent with results for the four sets of stimulus sequences, the decreases as sampling frequency increases. The results can be fit by a reciprocal function: , () and the location of maximal curvature  kHz.

As seen above, the temporal discretization introduces errors of the temporal location of the stimulus impulses, causing the difference between and . We need to examine whether an optimized continuous-time stimulus sequence still satisfies the constraint in (15) after discretization. We define relative root-mean-square error in frequency domain as

Using the same four dataset in Table 1, we calculate with respect to each AD rate from 1 kHz to 25 kHz. The resulting is shown in Figure 3. Again, the data points can be well-fit with a reciprocal function: , (), and the location of its maximal curvature is at  kHz.

Figure 3 shows that the error of stimulus sequence in Fourier domain caused by the temporal discretization decreases with the increase of sampling rate. But the error will not completely disappear. Based on the fit function , the maximal curvature occurs at  kHz, indicating that, in the range [0, 3.77] kHz, decreases rapidly with the increase of . However, decreases at a much low rate when is greater than 3.77 kHz. These results suggest that the is very sensitive to low AD rate, and that it is critical to increase the AD rate to a frequency higher than 3.77 kHz.

Here we use one stimulus sequence to exemplify the differences caused by discretization at two different AD rates:  kHz, which is below the critical (3.77 kHz) and  kHz, which is above the critical (3.77 kHz). Figure 4(a) illustrates the measure of the continuous-time stimulus sequence (data points labeled with cross “”) and that of the corresponding discrete-time stimulus sequence (data points labeled with open circle “”) at an AD rate of  kHz. Within the frequency range of interest [ Hz,  Hz], the inverse filter based on the discretized stimulus sequence satisfies (15) at most frequency bins. However, at some frequency bins, its measure is greater than 1. The difference between the continuous-time and discrete-time stimulus sequence at each frequency bin is plotted in Figure 4(b). Clearly, the discretization causes errors at most frequency bins at amplitudes within the range from −0.25 to 0.25.

Figure 5 shows the similar contents as Figure 4, except that the continuous-time stimulus sequence is discretized at a higher AD rate of  kHz. Figure 5(a) depicts the measures for continuous-time (data points labeled with “”) and discrete-time (data points in open circles “”) stimulus sequences. Again, the measure within most frequency bins for the discrete-time stimulus sequence satisfies (15). However, we can also see some data points stick out of the threshold of 1. The difference between the measures for the continuous-time and discrete-time stimulus sequences is shown in Figure 5(b). We can see that at  kHz, the difference at most frequency bins is close to zero. When compared to Figure 4(b), Figure 5(b) clearly show that  kHz has substantial advantage over  kHz in maintaining the discrete stimulus sequence satisfactory to the condition described in (15). Importantly, if the AD rate is low, the discrete stimulus sequence may violate the condition and cause overamplification of noise components in the recorded EEG and distort the deconvolved transient AEPs. The results also suggest that using AD rate much higher than the Nyquist sampling rate is necessary to minimize the error introduced from temporal discretization.

3.4. Noise Attenuation Effects of Discretization Frequency

Although the continuous-time stimulus sequence is frequency unlimited ( in Fourier domain, where can be any integer), the frequency range of interest is determined by the nonzero values of the product of and as indicated in (4). In this study, the recorded EEG is band-pass filtered within Hz. The transient response is obtained by applying inverse Fourier transform on in (5):

Likewise, the inverse Fourier transform of in (1d) will give the time domain signal for the discrete binary sequence:

To quantify the difference between the estimated and the true solutions for both continuous and discrete stimulus sequence, we can define root mean square error for

By replacing with in (22), we can define the root mean square error for . As stated previously, an original copy of additive random noise is then rescaled and/or resampled to accommodate conditions with different AD rates and signal-to-noise ratios (SNRs), where the latter is defined as

To examine the relationship between AD rate and , we simulated the discretization at = 1–25 kHz and three SNR conditions (9.5 dB, 0 dB, and −6.0 dB). The resulting is shown in Table 2, where in the first column (AD rate = Inf) means the errors for the continuous-time stimulus sequence condition and results for this condition serve the references for all other AD rates. From Table 2, we can see that the errors for high SNR EEGs are much smaller than those for low SNR EEGs, and the errors decrease as AD rate increases. These phenomena are consistent across four stimulus sequences. Figure 6 is a graphic representation of the results. It shows that low SNR EEGs are more vulnerable to low AD rates, and the increase AD rate is more beneficial for low SNR signal on the other hand.

To gain more straightforward understanding of above results, we choose to present the resulting AEPs from different SNR conditions and at two representative AD rates. Figure 7(a) shows the results at AD rate of 1 kHz. It is clear that the AEPs recovered from high SNR EEGs (the top panel) using continuous-time and discrete-time stimulus sequence are both very close to the original AEP used for the simulation study. However, the difference between the recovered AEPs becomes significant when the SNR is low (the third panel in Figure 7(a)). At same time, we can see that the AEPs recovered using continuous-time stimulus sequence are closer than the ones from discrete-time stimulus sequence to the original AEP. Figure 7(b) shows the results at AD rate of 20 kHz. Similar to the phenomena in Figure 7(a), the AEPs (the top panel in Figure 7(b)) recovered from high SNR EEGs are very close to the original AEP used in this study. When the SNR is getting worse, for example −6 dB as in the third panel in Figure 7(b), the recovered AEP is less close to the original AEP. Consistent to that exemplified in Figure 7(a), the AEPs recovered based on continuous-time stimulus sequence better resemble the original AEP than those based on discrete-time stimulus sequence. Moreover, the AEPs from high AD rate ( kHz) of discretization resemble more closely to the original AEP than those from low AD rate of discretization. As such, it becomes obvious that increasing AD rate will reduce the distortions in the recovered transient AEP. More importantly, this positive impact of high AD rate is more significant for low SNR recordings.

4. Discussion and Conclusion

Solving the transient AEPs in frequency domain under HSR paradigm has been investigated and implemented recently using discrete-time stimulus sequences (e.g., in [15, 7, 8, 10]). Since the frequency characteristics of a stimulus sequence critically influence the performance of deconvolution algorithms in the presence of noises in EEG recordings, one needs to select or generate appropriate sequences to satisfy certain criteria (e.g., (15)). However, it is challenging to generate an optimal discrete-time stimulus sequence when the occurring time points of stimulus impulses are subject to the constraint of the AD rate of the experiment systems. This is because theoretically it only has a limited number of possible solutions. Although continuous sequences generated by an optimum method might be ready for use, one needs to answer two more questions: (1) how to present such sequences into the practical recordings in digital form; (2) how to quantify the errors and impacts due to the discretization of the continuous-time stimulus sequences.

This paper answers the first question by explicating the frequency presentation of continuous-time stimulus sequences and its use in solving the transient AEP using deconvolution algorithm. As detailed before, the repetitive continuous-time stimulus sequences have discrete frequencies in the Fourier domain and as such the convolution model and deconvolution algorithm can be similarly presented as for the discrete-time time system. In practice, the continuous-time stimulus sequences can be approximated using very high temporal resolution discrete-time stimulus sequences. For example, the temporal resolution for the stimulus sequences in Table 1 can be rounded to the 4th decimal place, equivalent to a sampling frequency at MHz level, which is well above the characteristic frequency (~4 kHz) in Figures 2 and 3 and makes the discretization errors negligible in applications. Furthermore, simulation studies are used to examine the applicability of the theory and quantify the errors and impacts when a continuous-time stimulus sequence is temporally discretized. The results show that using continuous-time stimulus sequences in the HSR paradigm has significant advantages over using discrete-time stimulus sequences in recovering the transient AEPs. This study also reveals a reciprocal relationship between errors introduced by the discretization of continuous-time stimulus sequence into discrete-time counterpart and the discretization frequencies. Note that we here used the absolute errors to quantify the impact of sampling rate of discrete-time stimulus sequences. But absolute errors should not be considered the only choice. In fact, other metrics such as correlation coefficient can be a good measure when the shape of waveform is of the researchers’ only concern. In typical AEP studies, researchers are usually interested in both the shape and amplitude of the signals. In any case, the results in this paper suggest that when discrete-time stimulus sequence is used, a high frequency system is more likely to offer reliable recovered AEPs under HSR paradigm.

To conclude, this study demonstrates the applicability and advantages of continuous-time stimulus sequences for AEP studies using HSR paradigm and reveals the reciprocal relationship between the errors in recovered AEPs and the sampling frequencies of the experimental systems when discrete-time stimulus sequences are used in traditional manner for the HSR paradigm.

Acknowledgments

The authors wish to thank Professor Sung-Phil Kim as well as the anonymous reviewers for useful comments on an earlier version of this paper. This work was supported by the National Natural Science Foundation of China (no. 61172033 and no. 61271154) and by Talent Introduction Foundation (2009) for the Institution of Higher Education in Guangdong.