Abstract

An efficient frequency recognition method is very important for SSVEP-based BCI systems to improve the information transfer rate (ITR). To address this aspect, for the first time, likelihood ratio test (LRT) was utilized to propose a novel multichannel frequency recognition method for SSVEP data. The essence of this new method is to calculate the association between multichannel EEG signals and the reference signals which were constructed according to the stimulus frequency with LRT. For the simulation and real SSVEP data, the proposed method yielded higher recognition accuracy with shorter time window length and was more robust against noise in comparison with the popular canonical correlation analysis- (CCA-) based method and the least absolute shrinkage and selection operator- (LASSO-) based method. The recognition accuracy and information transfer rate (ITR) obtained by the proposed method was higher than those of the CCA-based method and LASSO-based method. The superior results indicate that the LRT method is a promising candidate for reliable frequency recognition in future SSVEP-BCI.

1. Introduction

Brain-computer interface (BCI) can provide online communication between a human or animal brain and external devices without depending on the normal output pathways of peripheral nerves and muscles [1]. Research interest has increased because of its potential application value in neural engineering and neuroscience [2, 3]. Many EEG signals could serve as control signals in BCI systems [1, 3]. In recent years, steady-state visual evoked potentials (SSVEPs) have been widely used in SSVEP-based BCI [2, 46]. SSVEP has the same fundamental frequency as well as harmonics of the flickering visual stimulus, and it has high signal-to-noise ratio (SNR) and stable spectrum [7]. Accordingly, SSVEP-based BCI usually requires little training effort and achieving high information transfer rate (ITR) and becomes an important branch for designing BCI applications [8, 9].

In the SSVEP-based BCI system, the targets are encoded by a single frequency or various combinations of frequencies. A command can be transmitted by shifting the subject’s attention to the corresponding target. Although SSVEP has the aforementioned characteristics, it is likely to be contaminated by spontaneous EEG activities and other noises [10, 11]. How to efficiently recognize the target with short time window and low error rate is one of the key topics to boost the IRT of the SSVEP-based BCI systems. Therefore, it is necessary to develop an efficient frequency recognition method for SSVEP-based BCI. The existing traditional recognition methods are power spectral density analysis (PSDA) [12] and stability coefficients (SC) [10], which are mainly based on the single EEG channel. These methods are sensitive to noise and need long time window to perform the recognition, which may limit the real-time performance of SSVEP-based BCIs. In addition, because users usually have shown large intervariation in the SSVEP amplitude and distribution, additional calibration is required for parameter optimization (e.g., channel selection and appropriate data length) with these traditional methods [8, 11].

To overcome the drawbacks in the single channel based recognition, the multichannel recognition methods have aroused wide interests. Lin et al. proposed a method based on the canonical correlation analysis (CCA) [13]. Another multichannel recognition method is minimum energy combination (MEC) method proposed by Friman and his colleagues [11]. These two methods showed superior performance as compared to the traditional recognition methods [8, 9]. In addition, Nan and his colleagues had shown that CCA-based method could achieve better performance than MEC [14], and CCA has been widely adopted for frequency recognition in SSVEP-based BCI systems. Zhang et al. using LASSO-based frequency recognition method showed that the sparse regression model greatly improved the classification performance over CCA [15]. Therefore, in current work, we will use the CCA-based method and the LASSO-based method as the baselines to evaluate the performance of the proposed method by us.

LRT is a tool to test the independence of two sets of multivariate variables [16]. In this paper, we proposed a novel frequency recognition method based on likelihood ratio test (LRT) to further improve the frequency recognition accuracy for SSVEP-BCIs. For the first time, the LRT was utilized to calculate the correlation between the multichannel EEG signals and the reference signals. Experimental results based on the simulation and the real EEG data from eleven subjects demonstrate that the proposed method showed better performance as compared to the CCA-based method and the LASSO-based method.

2. Materials and Methods

This study was approved by the Institution Research Ethics Board at the University of Electronic Science and Technology of China. All participants were asked to read and sign an informed consent form before participating in the study. All the participants received monetary compensation for their time and effort following completion of the experiment.

2.1. LRT-Based Frequency Recognition

Suppose that is a -dimensional normal distribution vector. and are - and -dimensional vector, respectively, and . The vector mean and covariance matrix of are given by

The null hypothesis of two independent sets of variables and is represented as

Suppose that are the samples drawn from . According to likelihood ratio test [16], the likelihood ratio statistic for formula (2) is as follows: where denotes the complex conjugate transpose of vectors or matrices. Then we have the statistical measurement , where

In fact, is also a metric to measure the coefficient of alienation between and [16]. Therefore, a measure of association is 1 minus . Furthermore, the association can be modified to take account of the number of dimensions [16]. Accordingly, we can get the measure of association as follows: ranges from 0 to 1. If and are identical, then . If and are independent, .

Suppose that is the multichannel SSVEP data and are the reference signal matrices which are formed by the following formula at the stimulus frequencies : where is the number of harmonics and is the sampling rate.

For frequency recognition, we can calculate the correlation coefficients () between and with the formulas (4)–(6). Then the frequency of is recognized as the stimulus frequency corresponding to the maximal correlation coefficient.

2.2. CCA-Based Frequency Recognition

CCA is a multivariable statistical method to explore the underlying correlation between two sets of variables [17]. When using CCA for frequency recognition, we also require the reference signals described in formula (7) [13]. With CCA, we can find the weight vectors and to obtain the maximum canonical correlation between and () by solving the following optimization problem:

For and each reference signal (), we can obtain a maximum canonical correlation by formula (8) and use these coefficients to recognize the frequency of . Similar to LRT-based method, the frequency of is the stimulus frequency corresponding to the maximal correlation coefficient [13].

2.3. LASSO-Based Frequency Recognition

LASSO-based stimulus frequency recognition model showed that the sparse regression model greatly improved the classification performance over CCA [15]. For a SSVEP response , and the design matrix , we can model a linear regression model by adding the noise as follows: where are the reference signal matrices corresponding to stimulus frequencies, respectively, represents the regression coefficients, and represents a noise vector with zero mean and unit variance. For the LASSO estimate, the is given by the following formula: where and denote the -norm and -norm, respectively. is a penalty parameter which controls the sparsity of solution . The function “lasso.m” in Matlab (MathWorks) was used as implement to calculate the .

Here, the number of harmonics was two, and the number of the stimulus frequency was 4. Therefore, we can denote as . For each channel, we could get a (). After that, the contribution degree (CD) of th stimulus frequency and its harmonic to the EEG signal can be calculated as follows: where is the number of the used channels and is the number of harmonics.

Similar to CCA-based method, the frequency of is the stimulus frequency corresponding to the maximal contribution degree [15]. For the details about the LASSO-based method, please refer to the reference [15].

2.4. Simulation

The main purpose of this simulation was to study the antinoise capability of CCA-based method, LASSO-based method, and LRT-based method. We chose four frequencies, that is, 7.5 Hz, 8.6 Hz, 10 Hz, and 12 Hz, to simulate the SSVEP signals. For each frequency, we generated 8 sinusoidal signals to simulate 8 channels of SSVEPs. The sampling rate was 250 Hz, and the signals lasted for 10 seconds. Then, Gaussian white noise was added to the sinusoidal signals to simulate the noise-contaminated signals. Finally, the two methods were used for frequency recognition, and the time window length was 1 s as that used in the simulation by Lin et al. [13]. The accuracy was used to evaluate the recognition result, which was the ratio of the number of correct recognition operations to the 40 recognition operations. At each SNR level, the procedure was repeated 50 times, and the mean accuracies across 50 runs are reported. To show the influence of the SNR on the accuracy of these methods, SNRs ranging from −7 db to −20 db were considered to add the noises. The SNR is defined as follows: where and are the power of the signal and the power of the noise, respectively. is the amplitude of the sinusoidal signals, and is the variance of the noise [13, 14].

2.5. Offline Experiment

To further evaluate the performance of the three methods, the real SSVEP data was also used. The SSVEP data was from an offline SSVEP-based BCI experiment in our lab with 4 frequencies, that is, 7.5 Hz, 8.6 Hz, 10 Hz, and 12 Hz. The flickering stimulus was presented by a computer through a control program realized by C++Builder and Windows DirectX API. A laptop with a 13′′ screen and a 60 Hz refresh rate was used to present the stimuli.

The experiment was performed in a normal room. EEG signals were recorded from the scalp via 8 Ag/AgCl recording electrodes with a Symtop Amplifier (NIL System, Chengdu, China). The electrodes were placed at P3, P4, O1, O2, Pz, Oz, PO7, and PO8. Fcz and Afz were adopted as the reference and ground, respectively. Data were sampled at 1000 Hz and filtered with a band-pass filter from 0.5 to 30 Hz and a 50 Hz notch filter. Impedances were kept below 5 kΩ. Eleven healthy right-handed subjects (two female and nine male, age ranging from 21 to 25 years) participated in this study. All subjects had normal or corrected-to-normal vision. These subjects did not have any history of epileptic seizure or mental disease. Six were naive to the SSVEP-based BCI equipment and paradigm. During the experiment, the subjects were seated in a comfortable armchair, 60 cm away from the center of the laptop monitor. The subjects were instructed to gaze binocularly at each frequency flickering stimulus for 30 s, followed by a rest period of approximately 1-2 min.

Based on the EEG data, we evaluated the performances of the three methods using different time window, that is, 0.5 s, 0.75 s, 1 s, 1.25 s, 1.5 s, 1.75 s, and 2 s. For each time window, we extracted nonoverlapping segments from the 30 s data of each frequency and pooled all the segments for the four frequencies together. Afterward, we used the three methods to conduct the frequency recognition. The accuracy, which was the ratio of the number of segments correctly classified to the number of total segments, was used to evaluate the performances of the three methods.

2.6. Information Transfer Rate (ITR)

In this study, information transfer rate (ITR) was adopted as the criteria to evaluate the BCI system [1]. If possible selections exist in one trial, if each selection is of the identical probability to be selected by the user, if the probability () that the desired selection will actually be selected is always the same, and each of the other (i.e., undesired) selections has the same probability of being selected (i.e., ), then the bit rate (in bits min−1) can be computed as follows:

Then, the ITR (bits/min) is equal to Bt multiplied by the selection speed (i.e., trials per minute).

For our offline analysis, we used a simulation method to conduct a simulated online ITR test [4]. A 0.5 s was set to simulate the interval which was given to the subjects to shift gaze as the online situation. Therefore, a trial period was 0.5 plus the window length which was used to obtain a frequency recognition result.

3. Results

For the simulation, Figure 1 shows the average recognition accuracies of the 2 methods at various SNR levels. The LRT-based method significantly differed from the CCA-based method when the SNR was lower than −13 db and from the LASSO-based methods when the SNR was lower than −15 db, which demonstrates that the LRT-based method showed higher accuracy and better robustness to decreased SNRs.

For the offline EEG data, Table 1 summarizes the recognition accuracies and Table 2 shows the ITR for the eleven subjects with different time window lengths by the three methods. At each time window length, most of the subjects showed better performance by LRT-based method than the other two methods. Figures 2 and 3 present the paired -test significance test results for the recognition accuracies and ITR of the three methods, respectively. The results show that the proposed method is significantly better than the CCA-based method at most time window lengths, especially for the shorter time window lengths. It also suggests that our method is more efficient and robust than CCA-based method. From Figures 2 and 3, the results of the LASSO-based method were worse than those of the CCA-based method, which was different from the simulation result as in Figure 1 (not consistent with the results in [15]). The use of signals within a broader area can introduce more noise and negatively impact recognition accuracy. Multichannel detection methods, that is, the LRT-based and the CCA-based methods, benefit from an optimized combination of multiple signals and have greater robustness against noise, thus improving the results. For the LASSO-based method, it calculates the classification features from each channel independently, such that those features from the low SNR channels can deteriorate its recognition performance.

4. Discussion and Conclusion

Efficient frequency recognition is critical for a high performance SSVEP-based BCI system. The popular multichannel frequency recognition methods benefit from an optimized combination of multiple signals and have better robustness against noise. These methods always achieve higher recognition accuracy and increase the convenience of the BCI system for users due to the nonrequirement for specific channel selection and data calibration [8, 9, 11, 13].

In this study, a multichannel frequency recognition method based on LRT is proposed, which adopts LRT to calculate the correlation coefficient between EEG data and reference signals for frequency recognition. From the simulation and offline experiment, we could see that LRT-based method can achieve higher recognition accuracy in shorter time window and is of better robustness against noise than the CCA-based method and the LASSO-based method. The accurate detection of the intention of the user with short data lengths is crucial for developing a high-performance SSVEP-based BCI system [10]. Furthermore, short data acquisition can prevent fatigue to some extent because of shorter gazing time. In current study, we just demonstrated the superiority of LRT based on offline analysis. In the future, we will realize LRT-based method in the online BCI system to further test its online performance.

For the LASSO-based method, it can yield better performance in extracting robust and detectable features of SSVEP, and the ITR obtained by the LASSO model is significantly higher than that of the CCA-based method when only three channels O1, O2, and Oz are used [15]. However, in our offline data analysis, the results of the LASSO-based method were worse than those of the CCA-based method when using eight channels (Figures 2 and 3). We further used the data from O1, O2, and Oz to run the frequency recognition as in the reference [15]. The result was shown in Figure 4. It seems that the LASSO-based method was better than the CCA-based method (consistent with the results in [15]). The proposed LRT-based method was better than the CCA-based method and showed similar performance to the LASSO-based method under most time windows. At this point, we may infer that the LRT-based method can effectively extract robust and detectable features of SSVEP that are interfered by other noises. Although the results were similar for the LRT-based method and LASSO-based method when only using channels O1, O2, and Oz, the former did not require a penalty parameter that is necessary for the latter to generate the desired performance. Accordingly, the LRT-based method is indicated to be a promising candidate for the frequency recognition.

The linear correlation may not extract the nonlinear structure in multichannel EEG signals. To further improve the performance of LRT-based method, we will take into account the nonlinearity between two EEG signals in our future study. In addition, the correlation computed with LRT-based may be used for brain activity analysis in fMRI data [18], EEG data [19], multimodal data [20], and so forth. It may be another important future direction.

The reference signals used in this study were the preconstructed sine-cosine waves according to the stimulus frequencies. These reference signals may fail to provide the subject-specific and intertrial feature information. In order to further improve the accuracy of the frequency recognition methods, two methods, that is, L1-regularized multiway canonical correlation analysis and multiset canonical correlation analysis, were presented to refine the reference signals [21, 22]. For each subject, these two methods generate the optimized reference signals which extract the SSVEP features from the training data. Hence, it is worthy of further study to adopt the refined reference signals in our proposed frequency method to further improve its performance and fuse the existed frequency recognition method to generate more robust and efficient BCI system [13, 15, 23].

The stimulus frequency set may be an important parameter for high performance BCI system. The frequencies may influence the recognition performance to some degree because different subjects may have their favored frequencies [4, 11]. In this study, the 10 Hz was chosen because it is an integer divider of the screen refresh rate (60 Hz) and produces strong SSVEP. In the experiment, we did not observe significant negative effects from the alpha rhythm on SSVEP. One future improvement of the system could be to add a frequency selection procedure to choose the optimal frequency combination for each user. With the frequency selection procedure, we may avoid the possible negative role of the alpha and provide the more efficient BCI system for the users.

In summary, a novel frequency recognition method was proposed based on the LRT, and its efficiency was validated with both simulation data and offline real EEG data. The results indicated that the new method outperformed the popular CCA-based method and the LASSO-based method in some concerned aspects like short time window and robustness to noise. It may be a new promising candidate for frequency recognition to develop SSVEP-based BCI systems with high performance.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors would like to thank all the reviewers for their constructive comments. This work was supported by the Doctoral Research Fund of Southwest University of Science and Technology, the NSFC (nos. 81401484, 31070881, 91232725, 61175117, and 61305028), the 973 Project (2011CB707803), the 863 Project (2012AA011601), the Program for New Century Excellent Talents in University (no. NCET-12-0089), and the National Science & Technology Pillar Program 2012BAI16B02.