Abstract

The purpose of this study is to classify EEG data on imagined speech in a single trial. We recorded EEG data while five subjects imagined different vowels: /a/, /e/, /i/, /o/, and /u/. We divided each single-trial dataset into thirty segments and extracted features (mean, variance, standard deviation, and skewness) from each segment. To reduce the dimension of the feature vector, we applied a feature selection algorithm based on a sparse regression model. The selected features were classified using a support vector machine with a radial basis function kernel, an extreme learning machine, and two variants of the extreme learning machine with different kernels. Because each single trial consisted of thirty segments, our algorithm determined the label of a trial by selecting the most frequent output among the outputs of the thirty segments. As a result, we observed that the extreme learning machine and its variants achieved better classification rates than the support vector machine with a radial basis function kernel and linear discriminant analysis. Our results therefore suggest that EEG responses to imagined speech can be successfully classified in a single trial using an extreme learning machine with a radial basis function or linear kernel. This classification of imagined speech may contribute to the development of silent speech BCI systems.

1. Introduction

People communicate with each other by exchanging verbal and visual expressions. However, paralyzed patients with neurological diseases such as amyotrophic lateral sclerosis and cerebral ischemia have difficulty with daily communication because they cannot control their bodies voluntarily. In this context, the brain-computer interface (BCI) has been studied as a communication tool for such patients. BCI is a computer-aided control technology based on brain activity signals such as EEG, which is well suited to BCI systems because of its noninvasive nature and convenience of recording [1, 2].

The classification of EEG signals recorded during the motor imagery paradigm has been widely studied as a BCI controller [3–5]. According to these studies, different imagined tasks induce different EEG patterns over the contralateral hemisphere, mainly in the mu (7.5–12.5 Hz) and beta (13–30 Hz) frequency bands. Many researchers have successfully constructed BCI systems based on the limb movement imagination paradigm, such as right hand, left hand, and foot movement [5–7]. However, EEG signals recorded during the imagination of speech, without any movement of the mouth or tongue, are still difficult to classify. Nevertheless, this topic has attracted researchers because speech imagination closely resembles real voice communication. For example, Deng et al. proposed a method to classify the imagined syllables /ba/ and /ku/ in three different rhythms using Hilbert spectrum methods, and the classification results were significantly greater than the chance level [8]. In addition, DaSalla et al. classified /a/ and /u/ as vowel speech imagery for EEG-based BCI [9]. Furthermore, a study to discriminate syllables embedded in spoken and imagined words using an electrocorticogram (ECoG) has been conducted [10].

For a BCI system, the use of optimized classification algorithms that categorize data into different classes is essential. These algorithms are usually divided into five groups: linear classifiers, neural networks, nonlinear Bayesian classifiers, nearest neighbor classifiers, and combinations of classifiers [11]. For speech classification in particular, various algorithms have been used, such as the k-nearest neighbor classifier (KNN) [12], the support vector machine (SVM) [9, 13], and linear discriminant analysis (LDA) [8].

The extreme learning machine (ELM) is a type of feedforward neural network for classification proposed by Huang et al. [14]. ELM offers high speed and good generalization performance compared to classic gradient-based learning algorithms. There is growing interest in applying ELM and its variants in the biomedical field, including epileptic EEG pattern recognition [15, 16], MRI studies [17], and BCI [18].

In this study, we measured EEG activity during speech imagination and attempted to classify the signals using the ELM algorithm and its kernel variants. In addition, we compared the results with those of a support vector machine with a radial basis function kernel (SVM-R) and linear discriminant analysis (LDA). As far as we know, applications of ELM as a classifier for EEG data of imagined speech have rarely been studied. In the present study, we examine the validity of using ELM and its variants for the classification of imagined speech and the potential of our method for BCI systems based on silent speech.

2. Materials and Methods

2.1. Participants

Five healthy male participants (age range: 26–32 years) took part in this study. All participants were native Korean speakers with normal hearing and were right-handed. None had any known neurological disorders or other significant health problems. All participants gave written informed consent, and the experimental protocol was approved by the Institutional Review Board (IRB) of the Gwangju Institute of Science and Technology (GIST). The IRB approval process complies with the Declaration of Helsinki.

2.2. Experimental Paradigm

Participants were seated in a comfortable armchair and wore earphones (ER-4P, Etymotic Research, Inc., IL, USA) providing the auditory stimuli. Five Korean syllables, /a/, /e/, /i/, /o/, and /u/, as well as a mute (zero volume) sound, were used in the experiment. Figure 1 describes the overall experimental paradigm. At the beginning of each trial, a beep sound was presented to prepare participants for perception of the target syllable. The six auditory cues (including the mute sound) were recorded using GoldWave software (GoldWave, Inc., St. John's, Newfoundland, Canada), and the source audio came from Oddcast's online demo (http://www.oddcast.com/home/demos/tts/tts_example.php?sitepa). The five vowels and the mute sound were presented in random order. One second after the onset of the target syllable, two beep sounds were presented sequentially with a 300 ms interval between them. After the two beep sounds, participants were instructed to imagine the same syllable heard at the beginning of the trial. The imagination period lasted 3 s in each trial. Participants performed 5 sessions, each consisting of 10 trials per syllable, with 1 min rests between sessions. Therefore, 50 trials were recorded for each syllable and the mute sound, and the total time for the experiment was approximately 10 min. All sessions were carried out in one day.

The experimental procedure was designed with e-Prime 2.0 software (Psychology Software Tools, Inc., Sharpsburg, PA, USA). A HydroCel Geodesic Sensor Net with 64 channels and Net Amps 300 amplifiers (Electrical Geodesics, Inc., Eugene, OR, USA) were used to record the EEG signals, using a 1000 Hz sampling rate (Net Station version 4.5.6).

2.3. Data Processing and Classification Procedure
2.3.1. Preprocessing

First, we resampled the acquired EEG data to 250 Hz to speed up the preprocessing procedure. The EEG data were then bandpass filtered between 1 and 100 Hz. Next, an IIR notch filter (Butterworth; order: 4; stopband: 59–61 Hz) was applied to remove the 60 Hz power line noise.
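As a rough illustration, the following Python sketch (our reconstruction with SciPy, not the authors' code) reproduces this preprocessing chain; filter design details beyond those stated above are assumptions.

    # Sketch of the preprocessing chain: resample 1000 Hz -> 250 Hz,
    # bandpass 1-100 Hz, then notch out 59-61 Hz line noise.
    import numpy as np
    from scipy import signal

    FS_RAW, FS_NEW = 1000, 250        # original and target sampling rates (Hz)

    def preprocess(eeg):
        """eeg: array of shape (n_channels, n_samples) sampled at FS_RAW."""
        # Resample each channel to 250 Hz.
        n_out = int(eeg.shape[1] * FS_NEW / FS_RAW)
        eeg = signal.resample(eeg, n_out, axis=1)
        # Bandpass 1-100 Hz (Butterworth, zero-phase; order is an assumption).
        b, a = signal.butter(4, [1, 100], btype="bandpass", fs=FS_NEW)
        eeg = signal.filtfilt(b, a, eeg, axis=1)
        # 4th-order Butterworth bandstop at 59-61 Hz, as stated in the text.
        b, a = signal.butter(4, [59, 61], btype="bandstop", fs=FS_NEW)
        return signal.filtfilt(b, a, eeg, axis=1)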

In general, EEG classification suffers from poor generalization performance and overfitting because the number of samples is much smaller than the dimension of the features. Therefore, to obtain enough samples for training and testing the classifier, we divided each 3 s imagination trial into 30 time segments of 0.2 s length with 0.1 s overlap. This yielded a total of 9000 segments (6 conditions × 50 trials per condition × 30 segments) for training and testing the classifier. We calculated the mean, variance, standard deviation, and skewness of each segment to form the feature vector for the classifier. The dimension of the feature vector is 240 (4 feature types × 60 channels). To reduce this dimension, we applied a feature selection algorithm based on the sparse regression model. The selected set of features extracted from all segments was used to train and test the classifier. Because a trial consists of thirty segments, each trial produces thirty classifier outputs; the label of a test trial was therefore determined by selecting the most frequent output among the thirty segments. The classifier was trained and tested using segments extracted only from the training data and testing data, respectively. Finally, to accurately estimate the classification performance, we applied 10-fold cross-validation. The classification accuracies of ELM, the extreme learning machine with a linear kernel (ELM-L), the extreme learning machine with a radial basis function kernel (ELM-R), and SVM-R were compared across all five subjects to select the optimal classifier for discriminating vowel imagination. The overall signal processing procedure is briefly described in Figure 2.
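A minimal sketch of the segmentation, feature extraction, and majority-vote steps, under our assumptions about array shapes (the paper reports thirty segments per trial; strict boundary handling below yields 29, so the exact windowing may differ):

    # Segment a 3 s trial into 0.2 s windows with 0.1 s overlap and extract
    # mean, variance, standard deviation, and skewness per channel.
    import numpy as np
    from scipy.stats import skew

    FS = 250                                   # sampling rate after preprocessing
    SEG, HOP = int(0.2 * FS), int(0.1 * FS)    # window and hop sizes in samples

    def segment_features(trial):
        """trial: (n_channels, n_samples) for one 3 s imagination period."""
        feats = []
        for start in range(0, trial.shape[1] - SEG + 1, HOP):
            seg = trial[:, start:start + SEG]
            f = np.concatenate([seg.mean(axis=1), seg.var(axis=1),
                                seg.std(axis=1), skew(seg, axis=1)])
            feats.append(f)                    # 4 features x 60 channels = 240
        return np.asarray(feats)               # (n_segments, 240)

    def trial_label(segment_predictions):
        """Majority vote: the most frequent label over a trial's segments."""
        values, counts = np.unique(segment_predictions, return_counts=True)
        return values[np.argmax(counts)]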

2.3.2. Sparse-Regression-Model-Based Feature Selection

Tibshirani developed a sparse regression model known as the Lasso estimate [19]. In this study, we employed the sparse regression model to select a discriminative set of features to classify the EEG responses to covert articulation. The feature selection problem based on the sparse regression model can be written as

$$\hat{\beta} = \arg\min_{\beta} \left( \frac{1}{2} \| y - X\beta \|_2^2 + \lambda \| \beta \|_1 \right), \tag{1}$$

where $\|\cdot\|_1$ denotes the $\ell_1$-norm, $\beta$ is the sparse vector to be learned, and $\hat{\beta}$ indicates the optimal sparse vector. $y \in \mathbb{R}^N$ is the vector of true class labels for the $N$ training samples, and $\lambda$ is a positive regularization parameter that controls the sparsity of $\beta$. $X = [x_1, x_2, \ldots, x_d] \in \mathbb{R}^{N \times d}$ is the matrix that consists of the mean, variance, standard deviation, and skewness for each channel, where $x_j$ is the $j$th column vector of $X$. The coordinate descent algorithm is adopted to solve the optimization problem in (1) [20].

The column vectors of $X$ corresponding to the zero entries of $\hat{\beta}$ are excluded to form an optimized feature set, $X_{\mathrm{opt}}$, that is of lower dimensionality than $X$.
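A hedged sketch of this selection step using scikit-learn's Lasso (which also uses coordinate descent, as in [20]); the function name and the default λ value are ours, not the authors':

    # Fit a Lasso on the training features and keep only the columns of X
    # whose coefficients in beta-hat are nonzero.
    import numpy as np
    from sklearn.linear_model import Lasso

    def select_features(X_train, y_train, lam=0.08):
        """X_train: (n_samples, n_features); y_train: numeric class labels."""
        model = Lasso(alpha=lam, max_iter=10000)   # alpha plays the role of lambda
        model.fit(X_train, y_train)
        return np.flatnonzero(model.coef_)         # indices of retained features

    # Usage: fit on training data only, then apply the same mask to test data.
    # keep = select_features(X_tr, y_tr)
    # X_tr_opt, X_te_opt = X_tr[:, keep], X_te[:, keep]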

2.3.3. Extreme Learning Machine

Conventional feedforward neural networks require the weights and biases of all layers to be adjusted by gradient-based learning algorithms. However, tuning the parameters of all layers is very slow because the procedure is repeated many times, and its solutions easily fall into local optima. For this reason, Huang et al. proposed ELM, which randomly assigns the input weights and analytically calculates only the output weights. Consequently, ELM learns much faster than conventional learning algorithms and shows outstanding generalization performance [21–23]. Assume $N$ training samples $(\mathbf{x}_i, \mathbf{t}_i)$, where $\mathbf{x}_i \in \mathbb{R}^d$ is a $d$-dimensional feature vector and $\mathbf{t}_i \in \mathbb{R}^m$ is the true label vector over the $m$ classes. A standard single-hidden-layer feedforward network (SLFN) with $L$ hidden neurons and activation function $g(\cdot)$ can be formulated as follows:

$$\sum_{j=1}^{L} \boldsymbol{\beta}_j\, g(\mathbf{w}_j \cdot \mathbf{x}_i + b_j) = \mathbf{o}_i, \quad i = 1, \ldots, N,$$

where $\mathbf{w}_j$ is the weight vector for the input layer between the $j$th hidden neuron and the input neurons, $\boldsymbol{\beta}_j$ is the weight vector for the hidden layer between the $j$th hidden neuron and the output neurons, $\mathbf{o}_i$ is the output vector of the network, and $b_j$ is the bias of the $j$th hidden neuron. The operator $\cdot$ indicates the inner product. We can reformulate the equation in matrix form as

$$H\boldsymbol{\beta} = O,$$

where

$$H = \begin{bmatrix} g(\mathbf{w}_1 \cdot \mathbf{x}_1 + b_1) & \cdots & g(\mathbf{w}_L \cdot \mathbf{x}_1 + b_L) \\ \vdots & \ddots & \vdots \\ g(\mathbf{w}_1 \cdot \mathbf{x}_N + b_1) & \cdots & g(\mathbf{w}_L \cdot \mathbf{x}_N + b_L) \end{bmatrix}_{N \times L}, \qquad \boldsymbol{\beta} = \begin{bmatrix} \boldsymbol{\beta}_1^{T} \\ \vdots \\ \boldsymbol{\beta}_L^{T} \end{bmatrix}_{L \times m},$$

matrix $H$ is the output matrix of the hidden layer, and the operator $T$ indicates the transpose of the matrix. Because the ELM algorithm randomly selects the input weights $\mathbf{w}_j$ and biases $b_j$, we only need to find the weights for the hidden layer, $\boldsymbol{\beta}$, by solving the following optimization problem:

$$\min_{\boldsymbol{\beta}} \| H\boldsymbol{\beta} - T \|,$$

where $T = [\mathbf{t}_1, \ldots, \mathbf{t}_N]^{T}$ is the matrix of true labels for the training samples. The above problem is a linear system optimization problem, and its unique least-squares solution with minimum norm is

$$\hat{\boldsymbol{\beta}} = H^{\dagger} T,$$

where $H^{\dagger}$ is the Moore–Penrose generalized inverse of the matrix $H$. According to the analysis of Bartlett and Huang, the ELM algorithm achieves not only the minimum squared training error but also the best generalization performance on novel test samples [14, 24].

In this paper, the activation function was chosen to be a sigmoid function, and the input weights and biases were drawn from a uniform distribution.
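The training rule above reduces to a few lines of linear algebra. Below is a minimal NumPy sketch of such an ELM, assuming the sigmoid activation and uniform random initialization stated in the text; the hidden-layer size is an arbitrary choice of ours:

    import numpy as np

    class ELM:
        def __init__(self, n_hidden=100, seed=0):
            self.L = n_hidden
            self.rng = np.random.default_rng(seed)

        def _hidden(self, X):
            # H with entries g(w_j . x_i + b_j), g = sigmoid
            return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

        def fit(self, X, T):
            """X: (N, d) features; T: (N, m) one-hot label matrix."""
            d = X.shape[1]
            # Random input weights and biases from a uniform distribution
            self.W = self.rng.uniform(-1, 1, size=(d, self.L))
            self.b = self.rng.uniform(-1, 1, size=self.L)
            H = self._hidden(X)
            # beta-hat = pinv(H) @ T, the minimum-norm least-squares solution
            self.beta = np.linalg.pinv(H) @ T
            return self

        def predict(self, X):
            return np.argmax(self._hidden(X) @ self.beta, axis=1)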

3. Results and Discussion

3.1. Time-Frequency Analysis for Imagined Speech EEG Data

We computed the time-frequency representation (TFR) of the imagined speech EEG data for every subject to identify speech-related brain activities. The TFR of each trial was calculated using a Morlet wavelet and averaged over all trials. Among the five subjects, we plotted the TFRs of subjects 2 and 5, which showed notable patterns in the gamma frequency range. As shown in Figure 3, the gamma-band (30–70 Hz) power of the five vowel conditions (/a/, /e/, /i/, /o/, and /u/) in the left temporal area is clearly distinct from, and much higher than, that of the control condition (mute sound). In addition, a topographical head plot of subject 5 is presented in Figure 4. Increased gamma activity was observed in both temporal regions when the subject imagined vowels.
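As one possible reconstruction of this analysis (the paper does not name its TFR implementation), the following sketch computes Morlet-wavelet power with MNE-Python on placeholder data:

    import numpy as np
    from mne.time_frequency import tfr_array_morlet

    trials = np.random.randn(50, 60, 750)   # placeholder: 50 trials x 60 ch x 3 s at 250 Hz
    freqs = np.arange(4.0, 71.0, 2.0)       # theta through low gamma (4-70 Hz)

    # power has shape (n_trials, n_channels, n_freqs, n_times)
    power = tfr_array_morlet(trials, sfreq=250.0, freqs=freqs,
                             n_cycles=freqs / 2.0, output="power")
    tfr = power.mean(axis=0)                # average over trials, as in Figure 3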

3.2. Classification Results

Figure 5 shows the classification accuracies averaged over all pairwise classifications for the five subjects using ELM, ELM-L, ELM-R, SVM-R, and LDA. We also ran SVM and SVM with a linear kernel, but their results are excluded because these classifiers did not converge within many iterations (100,000). All classification accuracies were estimated by 10 × 10-fold cross-validation. For subjects 1, 3, and 4, ELM-L shows the best classification performance among the five classifiers, whereas ELM-R shows the best classification accuracies for subjects 2 and 5. For all subjects, the classification accuracies of ELM, ELM-L, and ELM-R are much better than those of SVM-R, which remain at approximately the chance level of 50%. To identify the best classifier for discriminating vowel imagination, we conducted paired t-tests between the classification accuracies of ELM-R and those of the other classifiers. The classification performance of ELM-R is significantly better than those of ELM, LDA, and SVM-R; however, there is no significant difference between the classification accuracies of ELM-R and ELM-L.
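A small sketch of such a paired comparison with SciPy; the accuracy arrays below are placeholders, not the study's data:

    import numpy as np
    from scipy.stats import ttest_rel

    acc_elm_r = np.array([0.81, 0.84, 0.79, 0.86, 0.82])   # hypothetical accuracies
    acc_svm_r = np.array([0.52, 0.49, 0.51, 0.50, 0.53])   # hypothetical accuracies

    t_stat, p_value = ttest_rel(acc_elm_r, acc_svm_r)      # paired t-test
    print(f"t = {t_stat:.2f}, p = {p_value:.4f}")          # significant if p < 0.05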

Table 1 describes the classification accuracies of subject 2, who showed the highest overall accuracies among all subjects, after 10 × 10-fold cross-validation for all pairwise combinations. In almost all pairwise combinations, ELM-R outperformed the other four classifiers. The most discriminative pairwise combination for subject 2 is /a/ versus /i/, which reached 100% classification accuracy using ELM-R.

Table 2 contains the ELM-R results for the pairwise combinations, showing the top five classification performances for each subject. No pairwise combination was selected by all five subjects; however, /a/ versus mute and /i/ versus mute were selected by four subjects, and /a/ versus /i/ by three.

Table 3 presents the confusion matrix for all pairwise combinations and subjects using ELM, ELM-L, ELM-R, SVM-R, and LDA. In terms of sensitivity and specificity, ELM-L is the best classifier for our EEG data. Although SVM-R shows higher specificity than the other classifiers in this table, it classified almost all conditions as negative, resulting in poor sensitivity; the high specificity of SVM-R is therefore likely spurious. Thus, SVM-R might be an unsuitable classifier for our study.

3.3. Discussion

Overall, ELM, ELM-L, and ELM-R showed better performance than the SVM-R and LDA algorithms in this study. In several previous studies, ELM achieved similar or better classification accuracy with much less training time than other algorithms on EEG data [16, 25–27]. However, we could not find studies on the classification of imagined speech using ELM algorithms. Deng et al. reported classification rates for imagined speech using LDA with a highest accuracy of 72.67%, but the average results were not much better than the chance level [8]. DaSalla et al., using SVM, reported a best accuracy of approximately 82% and an overall average of 73% [9]. Huang et al. reported that ELM tends to have a much higher learning speed and comparable generalization performance in binary classification [21], and in another study Huang argued that ELM has fewer optimization constraints owing to its special separability feature, resulting in simpler implementation, faster learning, and better generalization performance [23]. Thus, our results are consistent with previous research using ELM and show similar or better classification results for imagined speech compared to research using other algorithms. Recently, ELM algorithms have been extensively applied in many other medical and biomedical studies [28–31]. More detailed information about ELM can be found in a recent review [32].

In this study, each trial was divided into thirty 0.2 s time segments with a 0.1 s overlap. Each time segment was treated as a sample for training the classifier, and the final label of a test trial was determined by selecting the most frequent output (see Figure 2). We also compared the classification accuracy of our method with that of a conventional method that does not divide the trials into multiple time segments, and our method showed superior classification accuracy. In our view, dividing the trials increases the number of samples available for classifier training, and each 0.2 s segment likely retains enough information to discriminate EEG vowel imagination. Generally, EEG classification suffers from poor generalization performance and overfitting because of the small number of samples available for the classifier; an increased number of samples obtained by dividing trials could mitigate these problems. However, further analyses are required to verify these assumptions in subsequent studies.

To reduce the dimension of the feature vector, we employed a feature selection algorithm based on the sparse regression model. In this algorithm, the regularization parameter $\lambda$ in equation (1) must be carefully selected because $\lambda$ determines the dimension of the optimized feature set. When the selected $\lambda$ is too large, the algorithm excludes discriminative features from the optimal feature set $X_{\mathrm{opt}}$; when $\lambda$ is set too small, redundant features are not excluded from $X_{\mathrm{opt}}$. Therefore, the optimal value of $\lambda$ was selected by cross-validation on the training session in our study. For example, the change in classification accuracy caused by varying $\lambda$ for subject 1 is illustrated in Figure 6. In the case of /a/ versus /i/ using ELM-R, the classification accuracy reached a plateau at $\lambda = 0.08$ and declined after 0.14. However, the optimal values of $\lambda$ differ considerably across pairwise combinations and subjects.
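As an illustration of this tuning loop, the sketch below selects λ by cross-validation on the training data only, reusing select_features and the ELM class from the earlier sketches; the candidate grid, fold count, and one_hot helper are our assumptions, not the study's settings:

    import numpy as np
    from sklearn.model_selection import StratifiedKFold

    def one_hot(y):
        """Map integer labels 0..m-1 to one-hot rows (for the ELM sketch above)."""
        return np.eye(int(y.max()) + 1)[y.astype(int)]

    def pick_lambda(X, y, grid=np.arange(0.02, 0.22, 0.02), n_folds=10):
        best_lam, best_acc = grid[0], -np.inf
        for lam in grid:
            accs = []
            for tr, va in StratifiedKFold(n_folds).split(X, y):
                keep = select_features(X[tr], y[tr], lam)  # Lasso sketch above
                if keep.size == 0:
                    break          # lambda too large: every coefficient zeroed
                clf = ELM().fit(X[tr][:, keep], one_hot(y[tr]))
                accs.append(np.mean(clf.predict(X[va][:, keep]) == y[va]))
            if accs and np.mean(accs) > best_acc:
                best_lam, best_acc = lam, float(np.mean(accs))
        return best_lam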

Furthermore, our best results were achieved in the gamma frequency band (30–70 Hz). We also tested other frequency ranges, such as beta (13–30 Hz), alpha (8–13 Hz), and theta (4–8 Hz); however, the classification rates for those bands were not much better than the chance level for any subject or pairwise combination of syllables. In addition, the results of our TFR and topographical analyses (Figures 3 and 4) support some relationship between gamma activity and imagined speech processing. As far as we know, only a few studies on the EEG classification of imagined speech have examined differences between multiple frequency bands including the gamma band [33, 34]. Therefore, our study might be the first to report that the gamma frequency band can serve as an important feature for the EEG classification of imagined speech. Moreover, several ECoG studies reported quite good results in the gamma band for imagined speech classification [35, 36], which is consistent with our results. Other studies have suggested a role for the gamma band in speech processing from a neurophysiological perspective [37–39]; however, those studies usually used intracranial recordings and focused on the high gamma (70–150 Hz) band, so directly relating their results to our classification study is not easy. The relation between information in the low gamma band as a classification feature and its neurophysiological implications will be specified in future studies.

Currently, communication systems based on various BCI technologies have been developed for disabled people [40]. For instance, the P300 speller is one of the most widely researched BCI technologies for decoding verbal thoughts from EEG [41]. Despite many efforts toward better and faster performance, the P300 speller is still insufficient for normal conversation [42, 43]. Independent of the P300 component, efforts to extract and analyze EEG or ECoG induced by imagined speech have also been made [44, 45]. In this context, our high-performance results from the application of ELM and its variants have the potential to advance BCI research on silent speech communication. However, the pairwise combinations with the highest accuracies (see Table 2) differed across subjects. After the experiment, each participant reported different patterns of vowel discrimination. For example, one subject reported that he could not discriminate /e/ from /i/, while another reported that a different pair was hard to distinguish. Although those reports did not exactly match the classification results, such discrepancies in subjective perception might be related to the process of imagining speech and to the classification results. In addition, we did not attempt multiclass classification in this study, although some attempts at multiclass classification of imagined speech have been made by others [8, 46, 47]. These issues of intersubject variability and multiclass systems should be addressed in future work to develop more practical and generalized BCI systems based on silent speech.

4. Conclusions

In the present study, we applied classification algorithms to EEG data of imagined speech. In particular, we compared ELM and its variants with the SVM-R and LDA algorithms and observed that ELM and its variants performed better on our data. These results might contribute to the development of silent speech BCI systems.

Competing Interests

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Authors’ Contributions

Beomjun Min and Jongin Kim equally contributed to this work.

Acknowledgments

This research was supported by the GIST Research Institute (GRI) in 2016 and the Pioneer Research Center Program through the National Research Foundation of Korea funded by the Ministry of Science, ICT & Future Planning (Grant no. 2012-0009462).