Abstract

Feature extraction of motor imagery electroencephalogram (MI-EEG) has shown good application prospects in the field of medical health. Also, multivariate entropy-based feature extraction methods have been gradually applied to analyze complex multichannel biomedical signals, such as EEG and electromyography. Compared with traditional multivariate entropies, refined composite multivariate multiscale fuzzy entropy (RCmvMFE) overcomes the defect of unstable entropy values caused by the scale factor increase and is beneficial towards obtaining richer feature information. However, the coarse-grained process of RCmvMFE is mean filtered, which weakens Gaussian noise and is powerless against random impulse noise interference. This yields poor quality feature information and low accuracy classification. In this paper, RCmvMFE is improved (IRCmvMFE) by using composite filters in the coarse-grained procedure to enhance filter performance. Median filters are employed to remove the impulse noise interference from multichannel MI-EEG signals, and these filtered MI-EEGs are further smoothed by the mean filters. The multiscale IRCmvMFEs are calculated for all channels of composite filtered MI-EEGs, forming a feature vector, and a support vector machine is used for pattern classification. Based on two public datasets with different motor imagery tasks, the recognition results of 10 × 10-fold cross-validation achieved 99.43% and 99.86%, respectively, and the statistical analysis of experimental results was completed, showing the effectiveness of IRCmvMFE, as well. The proposed IRCmvMFE-based feature extraction method is superior compared to entropy-based and traditional methods.

1. Introduction

Brain-computer interface (BCI) is a new type of human-computer interaction technology that enables the brain to control external devices [1, 2]. Motor imagery electroencephalogram- (MI-EEG-) based BCI has great prospects in the field of rehabilitation medical engineering. One of the key technologies of BCI is the ability to effectively extract features from complex multichannel MI-EEG signals.

Previous studies focus on time-frequency methods in MI-EEG feature extraction, including wavelet transform (WT) [3], discrete WT (DWT) [4], Hilbert–Huang transform (HHT) [3], dual-tree complex WT (DTCWT) [5], empirical mode decomposition (EMD) [6], and common spatial pattern- (CSP-) based methods, including CSP, filter bank CSP (FBCSP), discriminant FBCSP (DFBCSP), sparse FBCSP (SFBCSP), and spectrally weighted CSP (SWCSP) [79]. With the development of nonlinear dynamics, entropy-based methods have been widely utilized in the analysis of biomedical signals. Specifically, the most prevalent methods are approximate entropy (ApEn) and sample entropy (SampEn) because of the power these methods have to quantify the complexity of a time series [1012]. Nevertheless, sensitivity to selected parameters will lead to entropy mutation. To solve this weakness, fuzzy entropy (FE) was proposed for EEG analysis, where fuzzy membership functions replace Heaviside functions [1315]. Research shows that FE alleviates the problem of entropy mutation; however, these methods analyze at a single scale, which loses useful information. Therefore, multiscale SampEn (MSE) [1619], multiscale FE (MFE) [20, 21], and improved MFE (IMFE) [22] were put forward to explore deeper information. Unfortunately, using larger scale factors for short time series may cause inaccurate MSEs and MFEs. To achieve more reliable results, composite MFE (CMFE) [23, 24] was developed as a rolling bearing fault detection method.

Although the above univariate methods have shown good performance, they are only suitable for single-channel recording analyses. They fail to measure multichannel data synchronously and ignore the dynamic characteristics across channels [25]. So, SampEn was extended to produce multivariate SampEn (mvSE) [26] and multivariate MSE (mvMSE) [2629] to analyze multichannel signals more effectively. Considering the disadvantages of SampEn in mvSE and mvMSE, multivariate FE (mvFE) and multivariate MFE (mvMFE) [30, 31] were yielded by replacing SampEn with FE. Recently, as an improvement of mvMFE, a refined composite mvMFE (RCmvMFE) was proposed to analyze fault signals and biomedical signals [32, 33]. In RCmvMFE, the entropy stability is improved and the signals’ length sensitivity is reduced. However, the coarse-grained process of RCmvMFE is a mean filter that smoothens signals but does not eliminate random impulse noise interference. It is inevitable to produce high-amplitude electrooculogram and electromyography interference during the acquisition of MI-EEG. This is not conducive to extracting valid feature information from multichannel MI-EEG signals. In this paper, improved RCmvMFE (IRCmvMFE) is developed by combining median [34] and mean filters in the coarse-grained process to further improve filter effect, i.e., first the median filter is applied to each channel to remove pulse interference, and then the mean filter is used for further smoothing. Subsequently, IRCmvMFE is proposed to extract features from multichannel MI-EEG signals. The experimental research shows the effectiveness of IRCmvMFE.

The rest of the paper is described as follows: Section 2 introduces the process of extracting MI-EEG features using IRCmvMFE, Section 3 describes the experiments performed, Section 4 discusses the results, and Section 5 provides the conclusions.

2. Feature Extraction with IRCmvMFE

By combining median filters and mean filters in coarse-grained processes, RCmvMFE is improved to produce IRCmvMFE, which is applied to extract features of MI-EEG. The main steps are as follows: preprocessing, optimal channel selecting, performing multivariate coarse-grained analysis of preprocessed MI-EEG data, calculating IRCmvMFE, and constructing a feature vector. The support vector machine (SVM) was used to classify the feature vector. The block diagram of the proposed method is displayed in Figure 1.

2.1. Preprocessing MI-EEG Signals

For two-class motor imagery tasks, assume that represents the channel MI-EEG sequence of the task, where , ; and represent the sample points and the number of total channels, respectively. is bandpass filtered to the frequency band associated with the tasks and is expressed as . The motor imagery time period is taken as the optimal sampling interval, and MI-EEG signals in the segment are summarized as , where represents the sampled MI-EEG points within the optimal sampling interval.

2.2. Channel Selection

When the brain is engaged in motor imagery, only parts of channels are activated in the form of the power spectrum. Extracting the features of all channels not only increases the computational complexity but also increases the feature information redundancy and reduces the classification accuracy [35]. Therefore, the choice of optimal channels is important. In this paper, the Fisher score of the average power spectrum of is calculated to select channels according to the following equation:where and represent the average power spectrum on the channel of class 1 and class 2 motor imagery tasks, respectively. is the variance and represents the channel Fisher score. The larger the , the greater contribution of the channel. The signals of channels with the top are selected for subsequent research. is rewritten as , where , in which stands for the number of selected channels.

2.3. Coarse-Graining of IRCmvMFE

Step 1. In the coarse-grained process of IRCmvMFE, the median filter is first performed on . Supposing the filter size is or , the data in the window would be sorted in ascending order with the filter output beingwhere and means the maximum value in the window.

Step 2. For the scale factor , the coarse-grained sequence on channel of class task iswhere represents the sample points of the coarse-grained sequence. Therefore, multivariate coarse-grained sequences are obtained and described as , , …, .

2.4. IRCmvMFE Calculation

Step 1. The multivariate coarse-grained sequence is executed for multivariate embedded reconstruction, with the multivariate composite delay vectors calculated aswhere and and are the embedding dimension vector and time delay vector, respectively. Additionally, and .

Step 2. The distance of any two multivariate composite delay vectors and is computed in the following equation:where , , and .

Step 3. Given a threshold , suppose the fuzzy membership function is , the similarity between and is

Step 4. The average membership grade can be obtained using the following equation:

Step 5. Repeat the above steps, extend the dimension of the multivariate composite delay vector from to and derive . For each , we get and . The average and are calculated. The definition of IRCmvMFE is as follows:The procedure for calculating IRCmvMFE is summarized in Algorithm 1.

Input: Channels selected data
(1)Coarse-graining of IRCmvMFE
Step 1. Calculate the output of median filter on :
Step 2. Calculate the output of mean filter on :
(2)IRCmvMFE calculation
Step 1. Set embedding dimension m and scale factor , calculate the multivariate reconstruction vectors:
Step 2. For k = 1 to
  Calculate the distance and the similarity of and using equations (5) to (6)
  Calculate by equation (7)
  End
Step 3. Repeat the above steps, extend the dimension from to , and calculate
Step 4. Set and , and calculate IRCmvMFE
Output: IRCmvMFE at scale
2.5. Determination of a Maximum Scale Factor

As the number of scale factors increases, multivariate coarse-grained sequences become smoother. Scale factors that are too large omit useful information and reduce classification accuracy. Therefore, the impact on sequence smoothness and classification accuracy should be considered comprehensively to determine its maximum scale factor .

2.6. Construction of a Feature Vector

For , IRCmvMFE at scale in the class task, i.e., , is estimated and combined to form the feature vector :

The feature vectors of the two tasks are fused in parallel to obtain the feature vector of MI-EEG:

3. Experimental Research

3.1. Data Description and Preprocessing

MI-EEG data were obtained from dataset III in the BCI Competition II [36] and dataset IVa in the BCI Competition III [37]. MI-EEG signals on channels C3, Cz, and C4 were recorded in dataset III of BCI Competition II, where the data were from a healthy subject who imagined left-right hand movement. Left- and right-motor imagery tasks were each performed 140 times for a total of 280 experimental trials. The signals were sampled at 128 Hz and filtered to 0.5–30 Hz. The MI-EEG collection timing scheme is shown in Figure 2(a). The subject was at rest for the first 2 s, and the corresponding motor imagery task was completed according to the screen prompts from 3 s to 9 s. To better distinguish the two-class tasks, this paper used the sampling interval [451, 900].

The dataset IVa of BCI Competition III recorded the MI-EEG signals of five healthy subjects using 118 channels during right-hand (RH) and right-foot (RF) motor imagery tasks. The original sampling rate was 1000 Hz, but we downsampled these data to 100 Hz. The subjects performed the corresponding imaginary movement according to the prompts in the first 3.5 s and then rested for a random epoch between 1.75 s and 2.25 s. The timing scheme of MI-EEG collection during the right-hand-foot motor imagery task is shown in Figure 2(b). Each subject performed 280 trials, with 140 each of the RH and RF motor imagery tasks. In this paper, MI-EEG related to mu rhythm (8–13 Hz) and beta rhythm (14–32 Hz) related to motor imagery tasks were selected, i.e., the original MI-EEG signals were preprocessed by a bandpass filter of 8–32 Hz. The data between 0.5 s and 3.5 s were used for subsequent experimental research.

3.2. Channel Selection

Channel selection directly affects the quality of feature information and classification accuracy. It is essential to select the optimal channels before extracting MI-EEG features. There was a close relationship between the signals on channels C3, Cz, and C4 in the left-right-hand motor imagery task, so the data of these three channels were used for feature extraction. When RH and RF motor imagery tasks were conducted in dataset IVa from BCI Competition III, the Fisher Score of each channel was calculated by equation (1). The scores of different subjects are shown in Figure 3.

For each subject, the score of each channel is different and for different subjects, scores from the same channel are different. Thus, the optimal channels for each subject are different due to individual differences. The channels with the top three Fisher scores can be used as the optimal channels. The detailed information is shown in Table 1.

3.3. Comparison of Coarse-Grained Sequences between IRCmvMFE and RCmvMFE Methods

To confirm the effectiveness of IRCmvMFE in extracting MI-EEG features, the coarse-grained processes of RCmvMFE and IRCmvMFE were compared. The relevant parameters were selected as follows: , where SD represents the standard deviation of . According to Table 1, the channel k with the highest Fisher score of each subject was selected. The experimental process was as follows: when a motor imagery task was performed, at scale, the first points of were removed in turn. The RCmvMFEs of the remaining points were calculated separately, and they were composed of time series recorded as , where , , and represents the number of experiments. The of experiments were superimposed and averaged to obtain the average time series . The average time series of IRCmvMFE was obtained the same way as . When imaging left-right-hand motor imagery, training set data were used for analysis, i.e., . Similarly, was selected as 140 when the RH and RF motor imagery were performed. The amplitude of the original MI-EEG signals and coarse-grained sequences of RCmvMFE and IRCmvMFE during left-right-hand motor imagery are displayed in Figure 4. Similarly, the experimental results from imaging right-hand-foot movement are shown in Figure 5.

It can be seen from Figure 4 that the original MI-EEG signals had larger fluctuations, which was obviously improved after the coarse-grained process of both RCmvMFE and IRCmvMFE; and the smoothness of IRCmvMFE was better than RCmvMFE. In Figure 5, there are different intensity impulse noise interferences for different subjects. The coarse-grained sequences of both RH and RF motor imagery tasks using the RCmvMFE and IRCmvMFE of each subject changed with the fluctuations of the original MI-EEG but oscillated more smoothly. For subject “aw,” the impulse noise is not obvious, and the coarse-grained sequence of IRCmvMFE had larger fluctuations than that of RCmvMFE. But the intensity of impulse noise interference is higher for other subjects. Both and showed better smoothness, and was superior to for rapid MI-EEG changes. The reason is that the coarse-grained process of RCmvMFE is equivalent to a mean filter, which has the effect of low-pass filtering and smoothing and can remove some random interference. However, it is helpless against impulse noise caused by sudden factors such as eye-movements, blinks, and motion. In the coarse-grained IRCmvMFE, the median filter is assigned to remove the impulse noise interference, and then the filtered signals are smoothed by a mean filter.

3.4. Selection of Parameters in IRCmvMFE

The parameter selection will affect the estimate of IRCmvMFE. According to equation (8), the estimation of IRCmvMFE is not only related to the preprocessed MI-EEG but also involves selecting an embedding dimension vector , time delay vector , threshold r, and scale factor . The selection of parameter M was similar to reference [32], i.e., . Parameter does not have any proven standards, so for simplicity, was selected as 1. The threshold r was determined as .

In addition, the selection of influenced the filter effect in the coarse-grained process of MI-EEG and affected the extracted features and the classification results in turn. The larger the , the larger the calculation and the better the recognition. In contrast, a smaller resulted in poor filter performance [34]. When , the IRCmvMFEs with imaging left-right-hand movements were estimated and then classified by SVM. Gaussian kernel function was employed in this paper, and SVM optimized by grid search. When , the same experiment was performed with right-hand-foot motor imagery tasks. The 10 × 10-fold cross-validation (CV) was used to eliminate the contingency in the feature extraction process of MI-EEG. The average classification accuracy of the 10 × 10-fold CV is shown in Figure 6.

In Figure 6(a), the classification results gradually increased as the scale factor increased. When was from 55 to 75, the classification accuracy tended to be stable and close to 100%, and the highest recognition was obtained at 65 scale. Therefore, the maximum about left-right-hand motor imagery was selected as 65. In Figure 6(b), with the increased , the average recognition rate of each subject first increased and then later decreased. In this paper, the values of subjects “aa”, “al”, “av”, “aw,” and “ay” during right hand-foot-motor imagery were chosen as 41, 37, 33, 38, and 39, respectively. And, is related to the mathematical model of the coarse-grained process of IRCmvMFE. There is a significant difference in during different types of two-class motor imagery tasks, while the difference between multiple subjects during the same type of tasks is not obvious.

3.5. Comparison of Multiple Entropy-Based Feature Extraction Methods

In this section, the comparative study of IRCmvMFE and various entropy-based feature extraction methods was conducted. To make the comparison process more objective, the same dataset was selected as reference [13, 22], i.e., dataset III from BCI Competition II, and SVM was used for classification. The classification result of IMFE was derived from [22], and the related parameters of other entropy-based methods were selected as references [13]. The average recognition results of 10 × 10-fold CV and standard deviations are displayed in Figure 7.

In Figure 7, the classification result of MFE was higher than SampEn, FE, and MSE. Because the fuzzy membership function was used to enhance the stability of MFE, richer feature information from the multiscale was collected. At the same scale, the information of multiple coarse-grained sequences was integrated by CMFE, yielding a slightly better result. Based on the parameters’ independent optimization strategy, the preferred parameters were used by IMFE to extract features from the MI-EEG, and the recognition accuracy was further improved. Despite the results of mvSE, mvFE, and mvMSE being poor, mvMFE, RCmvMFE, and IRCmvMFE showed the advantages of multivariate entropy methods over traditional univariate entropies, both in terms of classification accuracy and standard deviation. This was mainly because these feature extraction methods evaluated the multivariate complexity of multichannel data and expressed the dynamic relationships and synchronizations across channels.

IRCmvMFE, RCmvMFE, and mvMFE methods displayed superiority on dataset III from BCI Competition II. To further illustrate the improvement of IRCmvMFE, a comparative study of these three methods was performed based on dataset IVa and using SVM for classification. The classification results with 10×10-fold CV are shown in Table 2. For each subject, the recognition rates obtained by using RCmvMFE to extract features of MI-EEG were higher than those of mvMFE because the multivariate feature of RCmvMFE was considered at the same scale, and the defect of unstable entropy values, i.e., coarse-grained time series shortening with scale factor increases, was overcome. Moreover, a composite filter technique was applied in the coarse-grained process of IRCmvMFE to eliminate burst-like impulse noise and the Gaussian noise of the MI-EEG, which produced better quality information. For different subjects, IRCmvMFE achieved better recognition accuracy and a smaller standard deviation than RCmvMFE, illustrating the stability and superiority of IRCmvMFE. Further, according to Figure 5, the impulse noise interference was not obvious for subject “aw,” and the recognition result by IRCmvMFE was slightly better than RCmvMFE. However, there was greater impulse noise interference for most subjects (“aa,” “al,” “av,” “ay”), after using IRCmvMFE to enhance the filter effect, the recognition results were obviously improved.

3.6. Statistical Analysis

In this section, statistical analysis was performed to further describe the development of IRCmvMFE. The kappa coefficient, which was designed to measure the classification precision and the comparison of performance in multiclass tasks, was made fairer. This method is a common indicator for evaluating the performance of BCI systems [38, 39]. The calculation of κ coefficient was expressed aswhere represents the classification accuracy and means the probability of opportunity consistency. For a two-class task, if the number of samples across classes was equal, then the value of was 0.5. Using equation (11), the mean kappa coefficients of IRCmvMFE, RCmvMFE, and mvMFE with 10 × 10-fold CV were calculated. The results are shown in Table 3.

Comparing the mean kappa values, the results of MI-EEG feature extraction from each subject was highest when using IRCmvMFE; this result revealed that IRCmvMFE had better consistency than those of RCmvMFE and mvMFE.

3.7. Comparison of Multiple Traditional Feature Extraction Methods

A variety of traditional feature extraction methods [39] were compared with the method presented in this paper, using SVM as a classifier. In Table 4, the top classification results and average classification of 10 × 10-fold CV of referenced feature extraction methods [37] on BCI competition II are displayed. IRCmvMFE achieved the highest classification accuracy over the referenced methods, and its 10 × 10-fold CV results were also better; it also showed the ability of IRCmvMFE to quantify the complexity of multichannel signals and implied its superiority in extracting features from MI-EEG signals.

The CSP-based feature extraction methods have been extensively studied on BCI competition III. The experimental results of 10 × 10-fold CV with CSP, filter bank CSP (FBCSP), discriminant FBCSP (DFBCSP), sparse FBCSP (SFBCSP), and spectrally weighted CSP (SWCSP) methods were from references [8, 9]. The method presented in this paper was compared with these methods, and the recognition rates are shown in Table 5. The results of CSP-based feature extraction were lower than those of IRCmvMFE. CSP-based methods only considered the spatial characteristics of MI-EEG signals, ignoring the features in other domains. IRCmvMFE effectively extracted nonlinear dynamic features of MI-EEG, correctly analyzed multichannel signals, and had good applicability in multiple subjects.

4. Discussion

In this paper, IRCmvMFE was proposed as a feature extraction method for MI-EEG signals. In IRCmvMFE, a composite filter technique was applied to improve the coarse-grained process of RCmvMFE, which eliminated impulse noise interference due to random factors, produced smoother MI-EEG time series, and enhanced the filter results. The optimal channels and the optimal parameters were selected to calculate IRCmvMFE for each subject when imaging left-right-hand or right-hand-foot movement. Multiscale IRCmvMFEs were constructed as a feature vector. Entropy-based and traditionally referenced feature extraction methods were compared on two public datasets. The kappa coefficients of IRCmvMFE, RCmvMFE, and mvMFE were calculated for statistical analysis. The results implied the superiority and applicability of IRCmvMFE for the analysis of two-class motor imagery tasks. In the future, we will continue to focus on the research of multiclass motor imagery tasks.

5. Conclusions

A novel nonlinear dynamics method based on RCmvMFE, called IRCmvMFE, was introduced in this study. This method provides a potential tool for the nonlinear dynamic analysis of multichannel MI-EEG signals. RCmvMFE was developed using a composite filter technique in the coarse-grained process, which effectively removes impulse noise interference, better reflects the dynamic correlations both within and across channels, and is more closely matched the nonlinear and time-varying characteristics of MI-EEG and produced better features and classification accuracy. IRCmvMFE was applied to the analysis of multichannel MI-EEG signals and was compared to other commonly used feature extraction methods. IRCmvMFE yielded the highest classification results and improved stability; it also displayed the applicability of IRCmvMFE for MI-EEG feature extraction and provided a useful tool for the analysis of other complex, two-class biological signals.

Data Availability

Two previously reported datasets were used to support this study and are available at http://bbci.de/competition/ii/ and http://www.bbci.de/competition/iii. These datasets are cited at relevant places within the text as references [36, 37].

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

Acknowledgments

This research was financially supported by the National Natural Science Foundation of China (Nos. 81471770, 11882003, and 61672070) and the Natural Science Foundation of Beijing (No. 7132021).