Abstract

This paper presents a heuristic method for electroencephalography (EEG) grouping and feature classification using harmony search (HS) for improving the accuracy of the brain-computer interface (BCI) system. EEG, a noninvasive BCI method, uses many electrodes on the scalp, and a large number of electrodes make the resulting analysis difficult. In addition, traditional EEG analysis cannot handle multiple stimuli. On the other hand, the classification method using the EEG signal has a low accuracy. To solve these problems, we use a heuristic approach to reduce the complexities in multichannel problems and classification. In this study, we build a group of stimuli using the HS algorithm. Then, the features from common spatial patterns are classified by the HS classifier. To confirm the proposed method, we perform experiments using 64-channel EEG equipment. The subjects are subjected to three kinds of stimuli: audio, visual, and motion. Each stimulus is applied alone or in combination with the others. The acquired signals are processed by the proposed method. The classification results in an accuracy of approximately 63%. We conclude that the heuristic approach using the HS algorithm on the BCI is beneficial for EEG signal analysis.

1. Introduction

The brain-computer interface (BCI) is a system that connects a human brain and a machine. Without using an external control device, humans would be able to control devices such as brain-controlled mobile robots, wheelchairs, and robot manipulators [13] with their thoughts. Therefore, we greatly anticipate that the BCI system will become a technology not only for the average person but also for the disabled including people suffering from a neurodegenerative disorder such as amyotrophic lateral sclerosis and brainstem stroke [4]. To implement the BCI, various signal acquisition methods have been proposed. Among the various BCI systems, the electroencephalography (EEG)-type BCI is promising technology. The BCI system based on EEG does not require surgical operation and is considered as the ultimate goal of the systems for practical use because of its relative convenience [5]. Furthermore, the EEG signal has a higher temporal resolution than functional magnetic resonance imaging (fMRI) or functional near-infrared spectroscopy (fNIRS) signals. This characteristic provides the detailed states of brain activity in a very short time. However, several restrictions exist in the use of the EEG signals in the BCI system. Although the EEG signal has a high temporal resolution, it has very poor spatial resolution. If we want to observe the details of the brain state in a small area, we have to use many electrodes on the scalp, which will result in channel increment; that is, channel increment is an ineluctable requirement and induces a “curse of dimensionality.”

Similar to other systems, the BCI system consists of three stages: sensing, signal processing, and applications. The brain-wave signal from a sensor is used for the application according to the class of the signal. The curse of dimension causes a problem in signal processing, which adds more complexity, increases the computational time, and complicates the classification. To avoid a high-dimensional computational problem resulting from the use of many channels, existing research suggests two primary methods: the multimodal integration method [6, 7] and the channel-selection method. The first method, which combines several signals, requires the solution to several limitations. fMRI or fNIRS requires a larger space for the equipment, and building the entire system is expensive [8]. Further, the multimodal experimental method can be potentially contaminated [9]. On the other hand, the channel-selection method is a preferable and convenient method although some data are dropped. This method restricts the channels using the brain area or mathematical models [1012]. However, this method focuses only on one stimulus at a time. Therefore, it cannot handle simultaneous stimuli in the human brain. The BCI system that employs the EEG signal for actual use overcomes the disadvantage of the curse of dimension.

In the BCI system, the performance is dependent on feature extraction and classification. From the raw-data signal, we can extract the EEG features. In this process, the common spatial pattern (CSP) algorithm is commonly used as a feature extraction algorithm [13, 14]. In the classification, even though many approaches have been attempted and the support vector machine (SVM) is preferred [1517], a dominant algorithm is absent. The linear discriminant analysis and the original SVM algorithm are linear classifications, and their performance in nonlinear EEG signal analysis is poor. The SVM algorithm with a kernel function is not only redundant when the dimension becomes higher but also difficult in unsupervised cases. The artificial neural network is powerful and can classify nonlinear data; however, it suffers from a weakness because the length of time needed for learning cannot be approximated. Thus, a new approach to BCI feature classification is needed.

The present paper presents a solution to the two problems of conventional channel selection and feature classification using the harmony search (HS) algorithm, a music-inspired metaheuristic algorithm. The metaheuristic algorithm finds the solution using rules and randomness; thus, it does not require too many computational resources [18]. In non-deterministic polynomial-time hard (NP-hard) problems, we can solve the problem using the heuristic algorithm faster than that using the exact computational method. The HS algorithm, one of the nature-inspired heuristic algorithms, possesses advantages that are not available in the other heuristic algorithms. HS can maintain diversity; thus, the search solution effectively and easily escapes from the local optimum [19]. In addition, the HS algorithm is easy to use, has a simple structure, and requires only two parameters. By using the HS algorithm, we can determine the group of EEG signal channels to eliminate the restriction of the curse of dimension in signal processing and develop a classification for BCI system applications. In Section 2, we present the theoretical background of the HS and feature extraction algorithms. In Section 3, we show the proposed methods. Sections 4 and 5 present the experiment and analysis of the channel grouping and classification. We conclude this paper in Section 6.

2. Theoretical Background

2.1. Harmony Search Algorithm

As the name suggests, the HS algorithm originates from musical harmony [20]. If we assume that the variables are the instruments, the pitches of the instruments are the values of the variables. Similar to the orchestra player who determines the harmony to create a good balance, the HS algorithm searches the best state of a given problem defined by an objective function. The HS algorithm has a harmony memory (HM) that saves the previous results, and its structure is described in a matrix form: where is the selected solution of the th variable, HMS is the HM size, and is the number of variables. The HM obtains the latest best result; thus, the HM members change when the result of the new value is better than the worst result in the HM. The new value is selected in three ways: selected by HM, randomly selected from the range of possible solutions, or selected by a local area search called the pitch adjustment. The selection by the HM is performed using the HM consideration rate (HMCR). The pitch adjustment, a mutation of the HM member, also has an occurrence rate, which is called the pitch adjustment rate (PAR). The random search and the pitch adjustment prevent premature convergence and maintain the diversity of the solution. These processes are represented as follows: where is the pitch adjustment size, and , , and are the probability of the random search, HMCR, and PAR, respectively.

2.2. Parameter-Setting-Free Harmony Search

In the basic HS algorithm principle, we use several parameters, namely, HMS, HMCR, and PAR. Generally, the HMS value ranges from 10 to 100. Although this value depends on the iteration number, the result is not sensitive to the HMS. In contrast, the HMCR and PAR control the result of the HS algorithm. To select a proper parameter value, some investigations have been conducted [21, 22]. The parameter-setting-free (PSF) HS proposed by Geem and Sim [23] is one of the parameter control methods for automatic selection of the parameters. This method uses one more memory to represent the method that generates the HM values. The memory is the same as that of the HM. Then, we obtain the new parameter values on the basis of the following equations:

In each iteration time, the PSF HS calculates the HMCR and PAR. Then, we obtain the proper adaptive parameters. Using this PSF technique, the value to be initialized is only the HMS.

2.3. Common Spatial Patterns

The CSP algorithm is one of the useful methods for feature extraction of the EEG signals. For two different classes and , we obtain multiple spatial filters that maximize the variance of one class and minimize that of the other class. The spatial filter matrix is obtained by solving the following equation [24]: where and are the covariance matrices of signal given in each class. The matrix consists of eigenvectors of . Thus, the final goal of the CSP is to obtain the linear transform , which is represented by the following equation:

3. Methodology

3.1. Mean Square Error and Euclidean Matrices

To derive the objective function, the mean square error (MSE) and the Euclidean distances were used. The MSE matrix was used for grouping the EEG channels. A low MSE means that the two signals are almost the same. Moreover, if the MSE is zero, the two signals are the same. Therefore, the group with a minimum MSE has similar characteristics and can be regarded to have the same reaction for the same stimulus. For the grouping of the signals, we first normalized the EEG data acquired from 64 channels. We divided the EEG signals with the maximum values in each channel. Then, the elements in the th row and th column of the MSE matrix were represented as follows (here, the diagonal members of the MSE matrix were zero):

In (6), and are the normalized signal vectors of theth and th channels at time , respectively.

For the feature extraction, we used the Euclidean distance matrix. Similar to grouping, we assumed that the same class members were located close to one another. From this assumption, we determined the group with the shortest overall distance. After feature extraction was completed, the features were expressed in a vector space. Then, the () elements of the Euclidean distance matrix were represented as follows (the diagonal of the MSE matrix is zero, and represents the mean feature vector in the - space):

3.2. Minimum-Seeking Harmony Search

To represent the member of a group or class, we used a binary switch vector (BSV), which indicates which channel or vector was selected. If the th member of the BSV is one, the th channel or vector is included in the group or class. Therefore, we determine the BSV values that generate the minimum result of the objective function using the HS algorithm. The objective function used in this study is the following:

In (8), is the channel number or feature selected by the BSV, and is a constant determined by . The numeral “1” in the numerator is used to prevent the value of (8) from becoming zero. is the summation of the entire MSE or the distances selected by the BSV. In (9), is the selected member of the MSE matrix or the Euclidean distance matrix selected by the BSV.

BSV contains the information in which the data, especially the channels or features, are selected. To apply the BSV to the HS algorithm, we can use the binary HS algorithm [25]. However, the binary HS algorithm has difficulty in applying pitch adjustment and requires more memory than the basic HS algorithm. This characteristic affects the speed performance and accuracy of the binary HS algorithm. Thus, in the HS, we converted each binary number in the BSV to a -radix. Then, the solution ranged from 0 to , and the pitch adjustment size was . In this study, we used . When the value was above the range of the pitch adjustment, we used the circular method.

The value of was determined by the following process: when the value of is from 0 to , the objective function is represented as follows:

We assumed that the optimal value of was , and the value of the objective function with was represented as . Then, if , the objective function satisfies the following equation:

From the definition of , the following relationship should be satisfied:

If , the objective function satisfies the following equation:

From the definition of , the following relationship should also be satisfied:

From (11) and (12), we obtain the following relationship:

From (10), . Then, (15) is transformed into the following relationship:

The same process can be applied to (13) and (14). From (13), (14), and (16), we obtain the following:

Equation (17) should be satisfied for all values. Therefore, and can be, respectively, converted to the maximum summation of , which has a value of one in the BSV , and the minimum summation of , which has a value of zero in the BSV . Equation (17) is rewritten as follows:

In this equation, and are constant, and is determined by the HS process. Therefore, the value of should satisfy (18).

4. Experiment

In the experiments, we used a commercial product, the Neuroscan EEG signal acquisition device (Neuroscan system, Compumedics, USA), to acquire the EEG signals. This device consists of the Synamps2 and Stim2 hardware, and the software for acquisition and preprocessing is Curry 7. We used an EEG cap with 64 electrodes for the EEG signals and five additional electrodes for noise reduction. The subjects were three healthy men in their mid-twenties. The subjects were selected without using any selection criteria to ensure generality of the results. All subjects had undergone one or more brain-wave experiments not related to this study. The subjects were fully aware of the entire experimental process, risk, and objective of the experiment. The entire experimental process was conducted under the experimental ethics and experimental safety regulations of the Chung-Ang University. The subjects wore EEG caps on their heads and sat in a comfortable position in front of the monitor. Electrodes were placed in accordance with the international 10–20 system [26]. The sampling rate of the EEG signals was 250 Hz, and we used a band-pass filter to obtain signals from 8 to 70 Hz. Commonly, a low-frequency signal below 8 Hz contains biological noise such as blinking eyes and heartbeat. The following Figure 1 shows the location of the electrodes. The actual electrodes were placed on the human scalp so that the real position of the electrodes represented an arc. In this study, however, we installed the electrodes in a line for ease in understanding the position of the electrodes.

The subjects received three types of stimuli without feedback. To reduce noise, the subject movements were restricted. The experiment was repeated 10 times per subject. One experiment involved six sessions, and one session consisted of seven epochs. The order of the epochs was random. The same epoch was repeated 60 times, and each stimulus was repeatedly administered 240 times. The nonstationary characteristic of the EEG signals allowed us to obtain results from a number of trials despite the small number of subjects.

We used three types of stimuli in this study: audio, visual, and finger movement. Each stimulus was given in 1 s. A monitor, an earphone, and a button pad were used to introduce the stimuli, which were given in rotation. The timing of the stimuli in one session is presented in the following timing chart, Figure 2.

S1, S2, and S3 represent the three stimuli, and their order was randomly changed. The resting time between stimuli was 2 s, and the entire session lasted for 20 s. First, one stimulus was administered three times. Subsequently, two stimuli were concurrently administered. At the final, three stimuli were given at once. At 300 ms before the motion stimulus was given, a white cross mark appeared on the black screen. The audio stimulus was a 1000-Hz stereo pure tone made by Stim2. The monitor screen was black when the audio stimulus was administered, with the earphone controlled by a specific amplifier for the least delay stimulus. The visual stimulus was administered in 1 s when the monitor screen was green, and the other stimuli were controlled so that the unrelated channels were unaffected. The motion stimulus was quite different from the other stimuli because the subject made a physical movement by pressing the button pad using their fingers. The motion image was difficult to notice in the experiments and affected the concentration of the subject. Therefore, we used real movements in this stimulus even though noise from the movements was present. To reduce the noise, the subject used only one finger and kept the other body parts still. The motion stimulus indicates a white cross symbol on the screen to prepare the subject, and a white sentence “press button” appeared on the screen to signal to the subject to push the button pad.

4.1. Channel Grouping

For the channel grouping, we used the PSF HS algorithm and the MSE method for the objective function. In the previous research, we used the binary HS and the correlated coefficient. However, the previous method exhibited poor performance in terms of computational time. In this study, we used the proposed method to improve the speed and accuracy. The number of iterations of the HS and HMS used in this study was 5000 and 20, respectively. The number of trials for one epoch of the HS algorithm was 10. We identified the groups by determining the number of times each channel was grouped with the other channels. All 30 experiments for the three subjects were summarized in three results in Section 5.1.

4.2. Feature Classification

In the classification, we used the HS algorithm with PSF where the HMS was 20 and the number of iterations was 50000. The results of channel grouping which dimension of the signal data decreased were used for classification. The CSP algorithm was used for feature extraction. To match the number of channels, we selected a minimum number of channels among the stimuli. In the CSP procedure, we used the data from the given two stimuli and determined the eigenvectors of the filtered signal. The data were classified without using any training method.

5. Results and Analysis

5.1. Channel Grouping

The results of the channel grouping are shown in the following Figures 3, 4, and 5 and Table 1. We searched the group by comparing all cases. The results for subject 1 exhibited were distinct. The audio stimulus activated the front of the brain, the visual stimulus activated the backside of the brain, and the motion stimulus activated the middle of the brain. Generally, the functions of the brain are known to be specialized by the location. The area starting with “C” includes the vicinity from the Cz channel to the C6 channel, which is also known as the motor cortex. The visual stimulus activated the visual cortex and the back of the brain. A similar tendency could be observed in the other subjects. All subjects showed the relationship between the motion stimulus and the “C” area. Although slight differences existed, the results agreed with those of the existing research, which also explained the visual and audio stimuli. Therefore, we conclude that the results of the grouping are related to the given stimuli.

Furthermore, the similar group made by a single stimulus also could be found in the groups subjected to two or three stimuli including a previous single stimulus. Therefore, we can analyze the EEG and the brain by separated stimuli. Additionally, the number of channels used in the analysis was reduced by the proposed method. These characteristics can make the EEG analysis easier.

However, in some cases, determining the relationship between the stimulus and the group was difficult. This problem implied two limitations of this work. First is the difficulty in strictly controlling the experimental conditions including the state of the subject. A human brain constantly and unconsciously receives stimuli. If an unwanted stimulus is administered, the accuracy would worsen. The second is the problem resulting from averaging the signals. EEG signals have a high temporal resolution, and the data sampling rate used in this study was 250 Hz. However, the grouping method focused on the summation of the MSEs in 1 s. Therefore, the advantage of EEG was not well presented.

5.2. Feature Classification

We used the HS algorithm for classification. The classification in the feature space was attempted 10 times for 120 features. Figures 6, 7, and 8 show the results of the classification for two stimuli.

For Subject 1, the best classification result was 74.17%, and the worst classification result was 55%. The average accuracy rate was 62.83%. For Subject 2, the best classification result was 77.50%, and the worst classification result was 55%. The average accuracy rate was 64.61%. For Subject 3, the best classification result was 72.50%, and the worst classification result was 50.83%. The average accuracy rate was 62.53%. The HS algorithm is a heuristic algorithm; thus, the results had an accuracy-rate range. However, without any learning method or prior information, the proposed method exceeded the decision boundary of the two classes by a rate of approximately 63%. Furthermore, the classified data were nonlinear high-dimensional data.

6. Conclusion

In this study, we have built a group to easily analyze EEG signals and proposed an EEG signal classification method based on HS. Signal processing and classification are important, as they determine the accuracy of the entire system. Therefore, we focused on solving the problems of a BCI system in signal processing and classification. First, we employed EEG signal grouping using the HS algorithm to reduce the channel complexity. From the EEG signal grouping, we did not only reduce the channel used in the analysis but also separately analyzed the signal of each stimulus. Therefore, the computational time and the complexity of the data were reduced. The proposed method could be the smartest solution for EEG signal analysis for problems with large data and dimension. Next, we classified the EEG signals using HS. We proposed a novel classification method with a metaheuristic algorithm for nonlinear data such as the EEG signals. The proposed method also focused on unsupervised classification; thus, we could classify the data without training or prior information. We believe that the proposed two methods could be helpful in enhancing the BCI implementation and the EEG analysis field.

Acknowledgment

This work was supported by the Mid-Career Researcher Program through an NRF Grant funded by the MEST (no. 2012-0008726).