Abstract

Sparse representation is a powerful tool in signal denoising, and visual evoked potentials (VEPs) have been proven to have strong sparsity over an appropriate dictionary. Inspired by this idea, we present in this paper a novel sparse representation-based approach to solving the VEP extraction problem. The extraction process is performed in three stages. First, instead of using the mixed signals containing the electroencephalogram (EEG) and VEPs, we utilise an EEG from a previous trial, which did not contain VEPs, to identify the parameters of the EEG autoregressive (AR) model. Second, instead of the moving average (MA) model, sparse representation is used to model the VEPs in the autoregressive-moving average (ARMA) model. Finally, we calculate the sparse coefficients and derive VEPs by using the AR model. Next, we tested the performance of the proposed algorithm with synthetic and real data, after which we compared the results with that of an AR model with exogenous input modelling and a mixed overcomplete dictionary-based sparse component decomposition method. Utilising the synthetic data, the algorithms are then employed to estimate the latencies of P100 of the VEPs corrupted by added simulated EEG at different signal-to-noise ratio (SNR) values. The validations demonstrate that our method can well preserve the details of the VEPs for latency estimation, even in low SNR environments.

1. Introduction

Evoked potentials (EPs) are bioelectrical signals that are generated by the central nervous system when the latter is stimulated by well-defined external stimuli. Depending on the modality of stimulation, EPs are categorised into auditory evoked potential (AEP), visual evoked potential (VEP), and somatosensory evoked potential (SEP). In clinical environments, these signals are used to reflect the various functions of auditory, optic, and sensory nerve sense-conducting pathways. In this paper, we concentrate on the second type, namely, the VEPs. Generally speaking, there exist three prominent components (N75, P100, and N145) in the VEP signal, whereas the preceding and following segments are almost flat. Of the three components, the P100 wave is the most significant and stable; hence, it is the most important component in clinical applications [1].

VEP signals have time-locked (quasiperiodic) characteristics and are always accompanied by ongoing electroencephalogram (EEG) signals. Moreover, the signal-to-noise ratio (SNR) of VEP records is usually low (−5 to −10 dB). Ensemble averaging (EA) is the most widely used method for estimating VEP against a noisy background. However, EA cannot be used to detect latency and amplitude variations from one trial to another; thus, single-trial analysis is better suited for investigations into the dynamics of brain activation. The single-trial VEP estimation is very meaningful in cognitive science research and clinical applications, such as brain-computer interfacing and intraoperative monitoring [2].

Many single-trial EP estimation methods have been proposed over the past two decades. These methods can be divided into two categories, namely, denoising methods and separation methods. The denoising methods assume that the measurement of the VEP is corrupted by noise and that the main source of noise is the EEG. Many conventional denoising methods have been applied, such as the Wiener filter [3], Kalman filter [4], and ARX [5]. Among these methods, ARX is widely recognised and has previously been applied to monitor the depth of anaesthesia during surgery. In ARX, the EEG can be viewed as an autoregressive (AR) model driven by white noise, and the EP can be modelled by an ARMA filter with a known signal accurately. The known signal is typically the average of the reference EPs (AREP). The orders and parameters of the AR and ARMA models can be estimated by utilising various optimisation techniques, such as the final prediction error (FPE) [6] and the least-squares (LS) method [7]. The EPs can then be reconstructed by ARMA filtering with the AREP. Recently, Cerutti et al. [6] found that EP extraction using ARX modelling is only capable of extracting latency EP variations in relatively high SNRs and that it is completely invalid because the latency varies greatly compared with the AREP from systemic experiments. The separation methods separate the VEP and EEG signals by modelling them based on their characteristics, such as wavelet transformation and sparse representation.

Meanwhile, Causevic et al. [8] and Martazi et al. [9] used wavelet transformation to separate the EP and EEG signals. Sparse coding is a powerful tool in analysing nonstationary signals, and it has shown significant success in signal denoising and separation. Xu and Yao [10] proposed the mixed overcomplete dictionary-based sparse component decomposition method (MOSCA), which decomposes the EP and EEG signals in the wavelet dictionary and discrete cosine transform (DCT) dictionary, respectively. However, given that EEG is not considered white noise and that many components of EP and EEG look alike in a single trial, their components are represented by the wrong dictionaries and their corresponding coefficients. Therefore, MOSCA cannot separate the EP and EEG signals sufficiently [11, 12].

In this paper, we present a novel sparse representation-based approach to solving the VEP extraction problem. Instead, of the mixed signals from the EEG and EP, we utilised an EEG in a previous trial that did not contain VEP to identify the parameters of the EEG AR model. Then, we used sparse representation in the ARMA model, instead of MA, to simulate the VEP. The sparse coefficients can be calculated by an optimisation method. Finally, the VEP can be derived from the AR model. Experiments carried out on synthetic and real data confirm the superior performance of our method. The rest of the paper is organised as follows. Section 2 provides the details of our single-trial estimation algorithm. Section 3 contains experimental results obtained from the proposed method and a comparison with ARX and MOSCA. Section 4 provides the conclusions.

2. Method

Let the VEP signal to be estimated be corrupted by noise from ongoing background activities. The main source of noise is the spontaneous EEG . The measurement is given byWe need to design a method that can remove the noise from , getting as close as possible to the original EP signal [13].

2.1. The VEP Signal

In ARX, VEP is derived by filtering the reference , which is chosen to be the average of a sufficient number of trials and can represent the general form of the evoked response under analysis, by the ARMA model parameters; that is, where and are the -transform of and and , . Sparse coding is a powerful tool for the analysis of nonstationary signals; it has achieved significant success in signal denoising and separation. Compared with ARMA, sparse coding is more flexible and uses the dictionary and the corresponding coefficient to represent signals. VEP has been proven to have strong sparsity over an appropriate dictionary in our previous paper [12]. Thus, in the current paper, we use sparse coding to represent the single-trial VEP instead of the MA model in ARMA. Therefore, formula (2) can be rewritten as where and are the dictionary and sparse coefficient of , respectively. The transfer function merely represents a mechanism to incorporate deterministic VEP variations into the reference signal , rather than a physiologically meaningful process.

2.2. Dictionary Construction

Inspired by the modelling method in [14], we proposed a dictionary construction method for the EP signal, as reported in our previous paper. This method assumes that the atoms in the dictionary can be extracted from a reference signal and that the single-trial EP can be decomposed sparsely by the dictionary. Many previous experiments have demonstrated this result.

The reference signal consists of a superposition of components expressed as can be acquired by AREP. can be extracted from using a certain filtering window function, such as Hamming window and Blackman window. The central location and width of the window are determined by the location of point of peak (and valley) amplitude and peak (and valley) width of the th component. The dictionary can be represented by where, and Then, can be represented by the dictionary and the coefficient , as In this paper, we aim to construct the dictionary . The transfer function represents a mechanism, which incorporates deterministic single-trial EP variations into the reference signal, rather than a physiologically meaningful process. From this, it follows that whereandare usually small positive integers. Given that is sparse on dictionary , is also sparse on dictionary . Thus, in this paper, .

2.3. EEG Signal

Similar to ARX, in this paper, the EEG is viewed as an AR model driven by white noise ; that is,

The parameters can be estimated using the least-squares method. We assume that the statistical characteristics of the EEG in the successive trials are similar, as has been reported in many papers [7, 8]. Thus, in the current paper, instead of the mixed signal of EEG and EP, we utilise the EEG from a previous trial, to estimate the parameters of AR model. This EEG does not contain EP.

2.4. Single-Trial Extraction

Substituting formulas (3) and (5) into formula (1), we get Then,Let ; then, formula (11) can be simplified as Hence,where is determined by the variance of the EEG. Formula (10) can be solved by using optimisation methods, such as basis pursuit (BP) [15], orthonormal matching pursuit (OMP) [16], and Lasso [17].

The single-trial VEP can then be reconstructed by using

3. Experimental Results

3.1. Analysis of the Simulations

Computer simulation is conducted to verify the performance of our proposed VEP signal extraction method. Depending on the characteristics of the VEP, the simulated VEP is constructed with three components and is expressed asThe three Gaussian functions represent a prominent VEP with similar morphological characteristics to those of the negative (N75), positive (P100), and negative (N145) peaks of a real VEP, respectively. The simulated VEP is shown in Figure 1.

The background EEG that is superimposed on the EP signal is simulated by an AR process [18], which is given bywhere is the Gaussian white noise. The simulated VEP is shown in Figure 1.

In this paper, we assume that the AR parameters of spontaneous EEG in two consecutive trials are extremely similar. In order to validate this assumption, three consecutive trials of spontaneous EEG signals are chosen randomly for the experiment. We compute their least-squares AR model with an approach. We set

Then, each is used to transform the EEG signal in the other trial, so these parameters are, respectively, changed asThe three EEG signals are transformed by ; that is, , where and are the -transform of and , respectively. In Figures 2 and 3, we, respectively, provide the frequency content and independence of . As can be seen in the figures, compared with , the energy of each frequency band of is more uniform, and the autocorrelation coefficients of are lower.

Let represent the parameter vectors of and represent the difference between and . From formula (13), we obtain and . In order to test the robustness of the proposed method in case inaccurate estimations of the AR coefficients are obtained, we use to estimate the AR parameter and and to generate the measurements and , respectively. Then, the extracted VEP2 and VEP3 from and with our proposed method are shown in Figure 4. With our method, the VEP2 and VEP3 are extracted from and . As shown in the figure, when , both VEP2 and VEP3 show results that approach the simulated VEP.

Table 1 shows the mean and standard deviation of SNR obtained from 100 extracted VEP2 or VEP3. As shown in the table, with the same SNR, the SNR values of VEP2 and VEP3 are similar, although is two times larger than .

During estimation, the observed SNR values may change over time due to the nonstationary characteristics of the EEG. Therefore, in this experiment, the performance of our method is examined under various SNR conditions. The EEGs are generated with formula (16).

As shown in Figure 5, although the estimation performance degrades with decreasing SNR, the prominent morphological characteristics (N75, P100, and N145) are preserved in all the SNR values.

The average values of SNR obtained with our method, MOSCA [10], and ARX [6] are shown in Figure 6. The results are acquired by identifying the average of 100 trials for each piece of data. In our method, the dictionary of VEP is constructed by using formula (15) where . Similarly, in ARX, the reference VEP is generated by formula (15) where . In this experiment, we change the latency of P100 by setting and . We can see from this figure that our method consistently demonstrates the greatest improvement in all methods. Compared with sparse coding, ARX shows superior performance at low initial SNRs. However, when the latencies change greatly , ARX method degrades seriously. We can also see that the latency change has hardly any impact on the estimation performance of our method.

To increase the objectivity of the evaluation, for each SNR and , we generate data from 50 trials and then estimate the latencies of P100. As shown in Table 2, we change from −10 to 10 and the SNR from −10 dB to 0 dB and estimate the single-trial VEP signal. Results show that, with the decrease of SNR, all standard deviations also increase. The RMSE value depends primarily on the SNR, rather than on the variations of latency , thereby indicating that our method is appropriate for tracking the latency variations when SNR ≥ −10 dB.

3.2. Analysis of the Real VEP

To further evaluate the performance of our method, we collected VEPs from three pairs of eyes of three human subjects during pattern reversal VEP experiments. The basic data obtained from the three subjects are shown in Table 3.

An example from the 50 trials is selected randomly from the original recorded VEPs of subject 2’s right eye. Figure 7 shows the corresponding average VEP.

We extract the VEPs with our method, MOSCA, and ARX, and the results are shown in Figure 8. Clearly, the three components N75, P100, and N145 of VEPs extracted with our method are prominent, and the signals estimated using our method are more similar to the ensemble average signal.

Then, our method is used to estimate the amplitudes and latencies of P100 in 200 trials. As shown in Figure 9, the variations in amplitudes and latencies are significant, whereas most amplitudes are between −7 and −4 and most latencies are between 100 and 115. These results have good agreement with those observed in practice.

4. Conclusions

Single-trial EP estimation is a very useful tool in cognitive science-related studies and clinical applications. Many investigations have been carried out and some amount of success has been achieved. However, only a few practical methods have been proposed. ARX modelling is a classical method that has been applied in clinical practice for several years. However, this method has limitations regarding the tracking of latency variations and is only capable of extracting latency variations of an EP under relatively high SNR values. Meanwhile, sparse coding is a powerful tool in signal denoising, and EPs have been proven to have strong sparsity over an appropriate dictionary. Inspired by this idea, in this paper, we introduce sparse coding into the ARX model and propose a novel single-trial VEP extraction method based on ARX and sparse coding. Compared with ARMA, sparse coding is more flexible. It uses the best matching atoms from the dictionary to represent the EP signal without needing to estimate the number of atoms beforehand. By transforming the electroencephalography signal into white noise, the single-trial EP estimation is transformed into a signal denoising problem for white noise. With the dictionary constructed specially for EPs, the EP signal can be extracted easily with sparse coding. Moreover, since the location of the atom in the dictionary has no influence on the effectiveness of sparse decomposition, variations of the amplitude and latency of EPs have only a minor impact on the performance of the proposed method. The proposed method can thus track EP signal variations. We conducted a series of experiments on synthetic and real data, and the results have been evaluated using waveform observation and several metrics. The validations demonstrate that our method can well preserve the EP details of latency and amplitude estimation simultaneously, even under low SNR conditions.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant no. 13KJB51-0010), the Natural Science Foundation of Jiangsu province, China (Grant no. BK20130230), and the Natural Science Foundation of China (Grants nos. 61401181 and 61403174).