Spatial and Time Domain Feature of ERP Speller System Extracted via Convolutional Neural Network

Yoon, Jaehong; Lee, Jungnyun; Whang, Mincheol

doi:https://doi.org/10.1155/2018/6058065

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Methods Results Discussion Conclusions Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Recent Advances in Brain Signal Analysis: Methods and Applications 2018

View this Special Issue

Research Article | Open Access

Volume 2018 | Article ID 6058065 | https://doi.org/10.1155/2018/6058065

Spatial and Time Domain Feature of ERP Speller System Extracted via Convolutional Neural Network

Jaehong Yoon,¹Jungnyun Lee,²and Mincheol Whang²

Academic Editor: Victor H. C. de Albuquerque

Received06 Jan 2018

Revised05 Mar 2018

Accepted01 Apr 2018

Published15 May 2018

Abstract

Feature of event-related potential (ERP) has not been completely understood and illiteracy problem remains unsolved. To this end, P300 peak has been used as the feature of ERP in most brain–computer interface applications, but subjects who do not show such peak are common. Recent development of convolutional neural network provides a way to analyze spatial and temporal features of ERP. Here, we train the convolutional neural network with 2 convolutional layers whose feature maps represented spatial and temporal features of event-related potential. We have found that nonilliterate subjects’ ERP show high correlation between occipital lobe and parietal lobe, whereas illiterate subjects only show correlation between neural activities from frontal lobe and central lobe. The nonilliterates showed peaks in P300, P500, and P700, whereas illiterates mostly showed peaks in around P700. P700 was strong in both subjects. We found that P700 peak may be the key feature of ERP as it appears in both illiterate and nonilliterate subjects.

1. Introduction

A brain–computer interface (BCI) is a system which provides a communication method by utilizing biophysiological signals [1]. BCI system enables the users to communicate with external world through measurements of biological signals and mostly do not require voluntary muscle movement. The system has been utilized to support severe locked-in syndrome (LIS) patients who lack motor ability, such as amyotrophic lateral sclerosis (ALS) and Guillain–Barre syndrome patients, as a means of communication [2–7]. Of many biophysiological signals, electroencephalography (EEG) has been most widely used in BCI field for its easiness in and low cost of measurement [8, 9].

Among different applications of BCI, event-related potential (ERP) based speller system has been one of the most widely used paradigms. The system was pioneered by Farwell and Donchin [10] in 1988 which utilized oddball paradigm in order to induce visual evoked potential (VEP), especially the P300 response. However, there are still illiteracy problems associated with ERP speller system [11, 12]. There has been reports of ERP features other than P300 [13, 14] which may be a key feature of distinguishing identifying illiterates.

One of the most prominent classification methods for ERP system is support vector machine (SVM) [15–18]. SVM is mathematically simple and, with sufficient knowledge of feature matrix, the experimenter can modulate the kernel for the target problem. Unfortunately, the kernel of SVM is sensitive to overfitting [19]. As EEG are measured from multiple electrodes [20–23], feature matrix can have high dimension with possible duplicates, which increase possibility of overfitting. As most of ERP system paradigms are dependent on P300 peak, the information (peak magnitude and latency) from each electrode should be similar. Moreover, it is hard to extract temporal and spatial information of EEG of a single kernel. Although multiple kernel learning (MKL) problem has been suggested [24], it is hard to extract intuition of the given problem through the method.

Recent development of deep learning provides an alternative approach. The convolutional neural network (CNN) can extract the feature from a given feature vector by using convolution. When an optimal filter is applied, the convolution will magnify the feature of interest and reduce the others [25]. CNN has been used in pattern recognition, especially in image recognition and speech recognition, as it provides topological information within the extracted feature [26–30]. Therefore, data with sequence or topological information can be recognized more efficiently as CNN enables extracting both temporal and spatial information within the raw data. As the ERP shows sequence of rise and fall as a response to visual stimuli, pattern recognition technique as CNN can be applied. Moreover, the convolution kernel of CNN can be used as tool for interpreting the spatial correlation among EEG electrodes.

In this paper, we explore the performance of CNN on ERP data to identify the key features that distinguish illiterates of ERP speller system. The convolution kernels of trained model will be explored to analyze the spatial correlation between cortices and pattern within ERP of each electrode. The subjects were grouped as either strong (nonilliterate) or weak (illiterate) depending on clarity of ERP signals. Results of two groups were compared to analyze difference in features.

2. Methods

2.1. ERP Speller Design

6 icons shown in Figure 2 were used as visual stimuli for the speller system of this paper. Rapid serial visual presentation (RSVP) panel design was adopted for the speller system to avoid gaze effect. During the experiment, screen size icons appeared on the center of the monitor in a random sequence [31]. The oddball paradigm was implemented by presenting target icon with distractors in a random sequence [10]. Each icon appeared 20 times per trial. The interstimulus interval (ISI) between icon appearances was set to 300 ms.

2.2. Data Acquisition

For this paper, 33 subjects (13 female, 20 male) participated in the experiment. The subjects’ age ranged from 24 to 30 (mean = 27.25, std = ±1.92). During the experiment, subjects were asked to sit upright on a chair and instructed to keep still. No straps or ties were attached. Subjects were asked to self-report any inconvenience that might bother the concentration.

Each trial was initiated with an acoustic cue instructing the target of the given trial in subjects’ mother tongue (Korean). 10 seconds after the acoustic cue was given, the icons appeared on the monitor according to RSVP design in random sequence. The subjects were instructed to mentally count the target occurrence during each trial (Figure 2(b)). Each session consisted of 12 trials. Each icon was selected as a target during the session twice in random sequence.

All subjects were naive; 10–20-minute preexperiment session was given to get subjects used to the procedure. The subjects were asked to self-report if they felt confident of the procedure. After the preexperiment session ended, the measurements of EEG were made. During the experiment, one training session and online session were conducted as a pair. To minimize subject’s stress level and fatigue, 10-minute break was given in between training and online session. Each subject conducted minimum of 2 pairs of training and online session. No subjects had participated in more than 4 pairs of sessions.

EEG was collected by B-Alert X10 headset from Advanced Brain Monitoring (ABM) with sampling rate of 256 Hz. The EEG electrodes recorded followed international 10/20 system [32] as shown in Figure 2(a). All experiments were held in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of Sangmyung University.

2.3. Convolutional Neural Network

The architecture of CNN for this paper was as shown in Figure 2(c). The CNN consisted of 2 convolutional layers, 2 max-pooling layers, and 2 fully connected layers. Rectified linear unit (ReLU) function was applied as activation function for each convolutional layer since its performance was proven by another [33]. A softmax function was applied to output the last layer to regularize the final output to be between 0 and 1. The output of CNN was vector of 2 elements where each element represented the score of target and nontarget.

The CNN was designed to perform both spatial and temporal filtering. The feature maps of each layer were used to access correlation between adjacent electrodes and temporal feature of target ERP. In the 1st convolutional layer (L₁), a filter of size 6 × 20 was applied to extract correlation of EEG recorded in adjacent electrodes. The row number of the filter was set to 6 as 3 electrodes were placed on each lobe (except for occipital lobe where two electrodes were placed). The size of filter enables analyzing the correlation of all 6 electrodes from adjacent lobes. For analysis of temporal feature of feature map from L₁ among different lobes, a filter with size of 1 × 12 was applied for 2nd convolutional layer (L₂) whose window size was approximately 100 ms in time scale.

To reduce the receptive field size for ease of calculation and prevent overfitting, max-pooling layers (M₁ and M₂) were inserted after each convolutional layer [27, 34]. The max-pooling layers downsample the feature map by applying a sliding window without overlap. As the name implies, the maximum value within the window is extracted. As the max-pooling introduces downsampling effect, a generalization of feature map was achieved which prevented overfitting of the model. Sliding window sizes of M₁ and M₂ were 2 × 2 and 1 × 10, respectively.

To further reduce the possibility of overfitting while training the model, drop-out technique was applied on the first fully connected layer (F₁). The drop-out technique padded zeros to randomly selected rows in the given feature map. By intentionally losing the data within the feature map, generalization was achieved for the feature map which prevented the model from being overfitted by the training data [35, 36].

The size of input matrix fed into the CNN was 14 × 300 where each row corresponded to EEG collected from each electrode in Figure 2.

The CNN architecture was implemented in Python via TensorFlow on Python [37, 38]. The Adam optimizer was used to train the CNN which controls the learning rate to use larger step size. 10,000 iterations were conducted for training the model for each subject’s data.

2.4. Tie Breaking

Ideally, if the model is perfect, only one icon will be identified as the target for a given trial. However, the system identified multiple icons as the targets in several trials. On the other extreme, the system failed to identify any target icon for some trials. For each case, the tie breaking rule was applied as follows.(i)Multiple icons cases: When multiple icons were thought to be the target of a given trial by the CNN, the tie breaking rule was applied to select the target among these candidates. Since the first element of output vector represents the icons affiliation to target ERP property, the icon with the greatest value of the element was selected as the target of the trial.(ii)No target case: When the system failed to find the association of the ERP from any icons to property of target ERP, that is, no icons were identified as the target, same rules as those in multiple icons case were applied to select the target for the given trial. In this case, the first elemenet of output vector from all icons was compared. The icon whose first element of output vector was the greatest was selected as the target of the trial.

2.5. Analysis

Both qualitative and quantitative analysis were performed to analyze the characteristics of filters of each convolutional layer. The subjects were divided into two groups according to their relative strength of ERP as follows:(i)ERP detection: if the target icon was detected as positive in a given trial, the ERP is considered detected. The subjects were divided accordingly into either H or L group (H and L for high and low) ERP detection group. The threshold between H and L group was 50%.(ii)Feature map: feature maps from L₁ and L₂ were drawn in color map. As higher weights of feature map denote high discriminant power, the colormap can qualitatively give insight of how each electrode is correlated and at which time the main peak is formed.(iii)Statistical analysis: for quantitative analysis of performance, accuracy, sensitivity, precision, F1 measure, and ROC were calculated for each subject and ANOVA test was held to compare mean difference. The accuracy is defined as the ratio of number of correctly identified trial to total trial numbers. The classical statistic measurements for quantitative evaluation are as follows: (iv)Receiver operating characteristic: receiver operating characteristic (ROC), which plots the sensitivity against specificity, widely used statistical measurement for its diagnostic ability of binary classifier. As the CNN of the paper is a binary classifier, the ROC information is provided to compare the performance of CNN between H and L group.(v)Peak signal to noise ratio: peak signal to noise ratio (PSNR) is used as measurement of qualitative reconstruction method of compression codes [39]. As the performance of filter will depend on how many core features are extracted from raw ERP, the PSNR of L₁s feature map was calculated as a mean of measurement of performance. The greater PSNR shows the presence of significantly high weight inside feature map whereas lower PSNR represents only low weights that are present in the given feature map and the discriminant power of the filter is low.

3. Results

3.1. ERP Detection

Of 33 subjects, 19 were identified as H group. In Figure 3, time course of learning curve and other statistical measurements over the training iteration from H and L subject are presented. The learning curve of L subject shown in Figure 3(a) indicates that although the false negative rate (FN) drops according to the training iteration, reaching 0 eventually, the false positive rate (FP) becomes 1. Although the learning curve shows sharp increase at 1st and 13th iteration, mostly it remains around .2. This indicates that the CNN becomes overtrained to positives (target). Moreover, as the CNN identifies most of the ERP to be positive (high FP and low FN), the result indicates that discriminant feature of target ERP was not found. On the other hand, both FN and FP of H subject drop to around 0 and .2. The learning curve saturates around .85 indicating nonoverfitting of the CNN (Figure 3(b)).

The errors shown in Figures 3(c) and 3(d) are defined as follows for training and online data:Although both H and L group show drop in both training and validation error as training iteration continues, the validation error of L subject is higher than that of H subject.

The ROCs of H and L subject shown in Figure 3(e) indicate the performance of CNN of H group to be greater than that of L group subject.

3.2. Spatial and Temporal Features

The feature map of each convolutional layer did not contain negative weights associated with negative peaks, such as N1 [40] as the activation function was set to ReLU [33].

The target ERP and feature map of L₁ of sampler H and L subject are shown in Figures 4 and 6. The target ERP shown in both figures is target ERP averaged over all trials. To analyze the correlation of frontal and occipital lobe electrodes, the first 3 electrodes (first 3 rows of averaged target ERP matrix) were copied and pasted at the end of ERP matrix. As shown in Figure 4(a), the target ERP of L group subject shows broad peak around P700 range on F3 and CZ. ERP of other lobes did not show any significant positive weight indicating nonsignificant features associated with target being observed and being flat. Feature maps shown in Figures 4(b) through (i) have shown high correlation between ERP from central and parietal lobe electrodes.

On the other hand, the correlation of ERP among adjacent electrodes for H group subject shown in Figure 6 indicates the correlation is restricted to specific time range. Most of the high weights of feature maps shown in Figures 6(b), 6(d), 6(f), and 6(e) show significant positive value around P500 and P700 range for frontal and central lobe electrodes. The correlation between central and parietal lobe is shown in Figure 6(c) around P500 range. Some features around P500 region were found to show high correlation among all electrodes. Unlike that of L group subjects, feature map of L₁ for H group subject showed high correlation among all electrodes, where each case shows specific temporal characteristics.

The temporal features shown in feature map in Figure 5 indicate that temporal features associated with P700 peak are present for L group subjects as expected. In Figures 5(a), 5(b), and 5(c), high positive weights were found around P700 range (row 4 and 6). However, most of the feature maps did not show significant weights or were either flat as in Figure 5(i).

The temporal features of H group subjects showed more variety. Some feature maps showed high positive weights in their feature maps around P300 and P500 range as shown in Figures 7(a), 7(b), 7(c), and 7(d), whereas the others indicated significant positive weight around P700 range as in Figures 7(a)–7(i). However, the weight associated with P700 range is more widely defined than those associated with P300 and P500.

3.3. Statistical Analysis

Comparison of classical statistical measurements and other measurements is shown in Table 1. The accuracy, sensitivity, and precision showed significant mean difference between H and L group ( values were 0.0135, , and 0.0072, resp.). A significant mean difference in F1 measure did not exist between H and L group. The accuracy of H and L group was 0.889 and 0.687, respectively. The sensitivity of H group was higher than that of L group, but the precision of H group was significantly lower than that of L group. The area under ROC of H group was significantly higher than that of L group ( value = 0.0137).

The PSNR for L₁ of H group was significantly lower than that of L group. As all PSNR measured were negative, the absolute value of PSNR of H group was greater than that of L group. On the other hand, no mean difference of the peak time (PeT.) between H and L group was found ( value = 0.965).

4. Discussion

In this study, CNN has been used to investigate the spatial and temporal characteristics of ERP that distinguish the performance difference between illiterates and nonilliterates (L and H group). As a comparison of performance, classical statistic measurements as well as filter comparison measurement had been collected to compare the correlation of ERP taken from different EEG electrodes and identify characteristic temporal features associated with each group.

The statistical measurement shows that the mean performance of CNN with H and L group data had significant difference. The accuracy of H group data was higher than that of L group data. Interestingly, although the sensitivity of H group was higher than that of L group, the precision of H group was significantly lower than that of L group. This reflects the fact that the ERP of L group was not identified as target in most of the cases, and the CNN identified ERP from all 6 icons to be nontarget in more than half of the trials.

The learning curve and errors in Figure 3 demonstrate how the statistical measurement affects the performance of CNN. Although the false negative rate remains mostly near 0, as the false positive rate remains close to 0, the learning curve remains stable around .2 for the L group subject. This again reflects the characteristics of L group ERP who were mostly identified as nontarget. Some of the ERP that were identified as target ERP were mostly from nontarget icons, indicating lack of distinctive feature associated with target ERP. However, both false negative and false positive rate drop as training iteration continues for H group subject’s data, leading to increase of learning accordingly to the iteration. As the ERP of L group does not have sufficient distinctive features, the model becomes slightly overtrained compared to the model of H group subject as shown in validation error plot in Figures 3(c) and 3(d). The comparison of ROC validates the analysis as ROC of H group was significantly higher than that of L group ( value = 0.0137).

As shown in Figure 4, most of the ERP collected from L group were flat in most of the channels. Most of the positive weights in target ERP were observed in frontal and central lobe electrodes (1st and 5th row of Figure 4(a)) which was contrary to the expectation as previous research indicated positive peaks associated with target event were mostly observed in parietal or occipital lobe [41, 42]. The correlation of ERP collected from adjacent electrodes did not show existence of significant correlation between occipital and parietal lobe data in L group subjects. On the other hand, ERP of H group were more invigorated, showing stronger activity in P300 area as shown in Figure 6(a). The ERP correlation indicated in feature map also indicated stronger correlation of ERP data collected from occipital and parietal lobe with other lobes. The spatial correlation shown in feature map of H group also indicated that the correlation was restricted in specific time range corresponding to either P300, P500, or P700.

The feature map of 2nd convolutional layer demonstrated the difference in temporal features between H and L group subjects. In most of L group subjects, the feature map did not show strong positive weights and was flat. Some indication of positive weights was mostly restricted in P700 region. On the other hand, the positive weights of H group were distributed around P300, P500, and P700 and the positive weights found near P300 and P500 range was sharper compared to those found around P700 range. Previous researches have indicated the possibility of existence of different features other than P300 [41, 43, 44] The result of the paper also supports the idea that P300 may not be the only key feaure of ERP speller system. Rather, the P700, which were identified among both L and H group subjects, may represent more universal ERP feature. However, the ERP from central lobe area observed in L group indicates the possibility of effect of stimulus probability [32] (Figure 1(a)).

(a)

(b)

(c)

Figure 1

Experimental paradigm. (a) The position of EEG channels in 10/20 system. The EEG were collected from F3, Fz, F4, C3, Cz, C4, P3, Pz, P4, O1, and O2 positions as indicated by red circle. (b) Experimental setting schematics. Subjects were sat on a chair and were asked to mentally count the occurrence of target icon. The ERP speller system for this paper was implemented with RSVP. The icon appeared on the center of the monitor. (c) Schematics of CNN architecture. The architecture consisted of 2 convolutional layers, 2 max-pool layers, and 2 fully connected layers. The number on top of each layer indicates size of feature map.

(a)

(b)

(c)

(d)

(e)

(f)

(a)

(b)

(c)

(d)

(e)

Figure 3

Learning curve and receiver operating characteristic curve (ROC) of L and H subject. (a) False negative rate (FN) and learning curve of L subject saturates near 0 and .2, respectively, whereas false positive rate (FP) increase to 1. (b) Both FP and FN drop over the time course for H subject and learning curve saturates near .8. (c) Training and validation error of drops over the time course for both L subject and (d) H subject. Both validation and training error are lower for H subject. (e) ROC curve of H and L subjects.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(i)

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(i)

The PSNR indicated that lack of activities in occipital/parietal lobe and broad peak found in P700 affect the performance of spatial filter in L1 as well. As the PSNR measures the maximum power of a signal and the power of corrupting noise [45], the result indicates that the filter was not able to extract distinctive signal of target ERP from background noise for L group subjects’ data. This may be since peaks near P700 were broad and fluctuating. On the other hand, P300 and P500 peaks found in H group subjects were sharper, which made the filter extract relevant features more precisely without being affected by background noise. Interestingly, the major peak of L₂ of H and L group subjects did not differ significantly ( value = 0.965). As the major peak was found by averaging the feature maps from L₂, the difference in each feature map may have been overshadowed. Further statistical analysis to access temporal feature within each feature map must be applied to validate the results found in this study.

5. Conclusions

This study has investigated the difference in spatial and temporal features of ERP between high performance group (H group) and low performance group (L group). The result indicated that the major difference arises from spatial correlation of ERP among other lobes rather than temporal features. Although the temporal feature difference was not found to be quantitative in this study, the qualitative analysis indicated lack of P300 in low performance group. Interestingly, both low and high performance group showed activity near P700 which may be the key activity of ERP speller system instead of traditional P300 peak. Further analysis of individual feature map will be needed to investigate the key temporal feature of ERP speller system.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was partly supported by Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (no. 2015-0-00312, the development of technology for social life logging based on analyzing social emotion and intelligence of convergence contents) and National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (no. 2011-0030079).

References

J. R. Wolpaw, N. Birbaumer, D. J. McFarland, G. Pfurtscheller, and T. M. Vaughan, “Brain-computer interfaces for communication and control,” Clinical Neurophysiology, vol. 113, no. 6, pp. 767–791, 2002.
View at: Publisher Site | Google Scholar
T. Fomina, G. Lohmann, M. Erb, T. Ethofer, B. Schölkopf, and M. Grosse-Wentrup, “Self-regulation of brain rhythms in the precuneus: A novel BCI paradigm for patients with ALS,” Journal of Neural Engineering, vol. 13, no. 6, Article ID 066021, 2016.
View at: Publisher Site | Google Scholar
W. Speier, N. Chandravadia, D. Roberts, S. Pendekanti, and N. Pouratian, “Online BCI typing using language model classifiers by ALS patients in their homes,” Brain-Computer Interfaces, vol. 4, no. 1-2, pp. 114–121, 2017.
View at: Publisher Site | Google Scholar
L. Botrel, E. M. Holz, and A. Kübler, “Using brain painting at home for 5 years: Stability of the P300 during prolonged BCI usage by two end-users with ALS,” Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface, vol. 10285, pp. 282–292, 2017.
View at: Publisher Site | Google Scholar
S. Saeedi, R. Chavarriaga, R. Leeb, and J. d. Millan, “Adaptive Assistance for Brain-Computer Interfaces by Online Prediction of Command Reliability,” IEEE Computational Intelligence Magazine, vol. 11, no. 1, pp. 32–39, 2016.
View at: Publisher Site | Google Scholar
S. Saeedi, R. Chavarriaga, and J. D. R. Millan, “Long-Term Stable Control of Motor-Imagery BCI by a Locked-In User Through Adaptive Assistance,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 25, no. 4, pp. 380–391, 2017.
View at: Publisher Site | Google Scholar
R. Swaminathan and S. Prasad, “Brain computer interface used in health care technologies,” SpringerBriefs in Applied Sciences and Technology, vol. 7, pp. 49–58, 2016.
View at: Publisher Site | Google Scholar
C. Reichert, S. Dürschmid, H.-J. Heinze, and H. Hinrichs, “A comparative study on the detection of covert attention in event-related EEG and MEG signals to control a BCI,” Frontiers in Neuroscience, vol. 11, article no. 575, 2017.
View at: Publisher Site | Google Scholar
D. McFarland and J. Wolpaw, “EEG-based brain–computer interfaces,” Current Opinion in Biomedical Engineering, vol. 4, pp. 194–200, 2017.
View at: Publisher Site | Google Scholar
L. A. Farwell and E. Donchin, “Talking off the top of your head: Toward a mental prosthesis utilizing event-related brain potentials,” Electroencephalography and Clinical Neurophysiology, vol. 70, no. 6, pp. 510–523, 1988.
View at: Publisher Site | Google Scholar
J. Yoon, M. Whang, and J. Lee, “Methodology of improving illiteracy in P3 speller system with ICA blind detection,” Proceedings of HCI Korea, pp. 87–93, 2016.
View at: Publisher Site | Google Scholar
R. Carabalona, “The role of the interplay between stimulus type and timing in explaining BCI-illiteracy for visual P300-based Brain-Computer Interfaces,” Frontiers in Neuroscience, vol. 11, article no. 363, 2017.
View at: Publisher Site | Google Scholar
S. L. Shishkin, I. P. Ganin, I. A. Basyul, A. Y. Zhigalov, and A. Y. Kaplan, “N1 wave in the P300 BCI is not sensitive to the physical characteristics of stimuli,” Journal of integrative neuroscience, vol. 8, no. 4, pp. 471–485, 2009.
View at: Publisher Site | Google Scholar
L. Bianchi, S. Sami, A. Hillebrand, I. P. Fawcett, L. R. Quitadamo, and S. Seri, “Which physiological components are more suitable for visual ERP based brain-computer interface? A preliminary MEG/EEG study,” Brain Topography, vol. 23, no. 2, pp. 180–185, 2010.
View at: Publisher Site | Google Scholar
K. Yoon and K. Kim, “Multiple kernel learning based on three discriminant features for a P300 speller BCI,” Neurocomputing, vol. 237, pp. 133–144, 2017.
View at: Publisher Site | Google Scholar
D. B. Ryan, G. Townsend, N. A. Gates, K. Colwell, and E. W. Sellers, “Evaluating brain-computer interface performance using color in the P300 checkerboard speller,” Clinical Neurophysiology, vol. 128, no. 10, pp. 2050–2057, 2017.
View at: Publisher Site | Google Scholar
V. Guy, M.-H. Soriani, M. Bruno, T. Papadopoulo, C. Desnuelle, and M. Clerc, “Brain computer interface with the P300 speller: Usability for disabled people with amyotrophic lateral sclerosis,” Annals of Physical and Rehabilitation Medicine, 2017.
View at: Publisher Site | Google Scholar
Q. Li, K. Shi, S. Ma, and N. Gao, “Improving classification accuracy of SVM ensemble using random training set for BCI P300-speller,” in Proceedings of the 13th IEEE International Conference on Mechatronics and Automation, IEEE ICMA 2016, pp. 2611–2616, China, August 2016.
View at: Publisher Site | Google Scholar
G. C. Cawley and N. L. Talbot, “On over-fitting in model selection and subsequent selection bias in performance evaluation,” Journal of Machine Learning Research, vol. 11, pp. 2079–2107, 2010.
View at: Google Scholar | MathSciNet
D. J. Krusienski, E. W. Sellers, F. Cabestaing et al., “A comparison of classification techniques for the P300 Speller,” Journal of Neural Engineering, vol. 3, no. 4, article 007, pp. 299–305, 2006.
View at: Publisher Site | Google Scholar
A. Rakotomamonjy and V. Guigue, “BCI competition III: dataset II-ensemble of SVMs for BCI P300 speller,” IEEE Transactions on Biomedical Engineering, vol. 55, no. 3, pp. 1147–1154, 2008.
View at: Publisher Site | Google Scholar
Y. Yu, Z. Zhou, J. Jiang et al., “Toward a Hybrid BCI: Self-Paced Operation of a P300-based Speller by Merging a Motor Imagery-Based “Brain Switch” into a P300 Spelling Approach,” International Journal of Human-Computer Interaction, vol. 33, no. 8, pp. 623–632, 2017.
View at: Publisher Site | Google Scholar
Y. Yu, J. Jiang, Z. Zhou et al., “A self-paced brain-computer interface speller by combining motor imagery and P300 potential,” in Proceedings of the 8th International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2016, pp. 160–163, China, September 2016.
View at: Publisher Site | Google Scholar
S. Sonnenburg, G. Rätsch, C. Schäfer, and B. Schölkopf, “Large scale multiple kernel learning,” Journal of Machine Learning Research, vol. 7, pp. 1531–1565, 2006.
View at: Google Scholar | MathSciNet
Q. V. Le, J. Ngiam, A. Coates, A. Lahiri, B. Prochnow, and A. Y. Ng, “On optimization methods for deep learning,” in Proceedings of the 28th International Conference on Machine Learning (ICML '11), pp. 265–272, Bellevue, Wash, USA, July 2011.
View at: Google Scholar
S. Lawrence, C. L. Giles, A. C. Tsoi, and A. D. Back, “Face recognition: a convolutional neural-network approach,” IEEE Transactions on Neural Networks and Learning Systems, vol. 8, no. 1, pp. 98–113, 1997.
View at: Publisher Site | Google Scholar
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Proceedings of the 26th Annual Conference on Neural Information Processing Systems (NIPS '12), pp. 1097–1105, Lake Tahoe, Nev, USA, December 2012.
View at: Google Scholar
P. Y. Simard, D. Steinkraus, and J. C. Platt, “Best practices for convolutional neural networks applied to visual document analysis,” in Proceedings of the 7th International Conference on Document Analysis and Recognition, vol. 2, pp. 958–963, IEEE Computer Society, Edinburgh, UK, August 2003.
View at: Publisher Site | Google Scholar
O. Abdel-Hamid, A.-R. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, “Convolutional neural networks for speech recognition,” IEEE Transactions on Audio, Speech and Language Processing, vol. 22, no. 10, pp. 1533–1545, 2014.
View at: Publisher Site | Google Scholar
Y. LeCun and Y. Bengio, “Convolutional networks for images, speech, and time series,” The Handbook of Brain Theory and Neural Networks, vol. 3361, no. 10, p. 1995, 1995.
View at: Google Scholar
M. S. Treder, N. M. Schmidt, and B. Blankertz, “Gaze-independent brain-computer interfaces based on covert attention and feature attention,” Journal of Neural Engineering, vol. 8, no. 6, Article ID 066003, 2011.
View at: Publisher Site | Google Scholar
R. W. Homan, J. Herman, and P. Purdy, “Cerebral location of international 10–20 system electrode placement,” Electroencephalography and Clinical Neurophysiology, vol. 66, no. 4, pp. 376–382, 1987.
View at: Publisher Site | Google Scholar
V. Nair and G. E. Hinton, “Rectified linear units improve Restricted Boltzmann machines,” in Proceedings of the 27th International Conference on Machine Learning (ICML '10), pp. 807–814, Haifa, Israel, June 2010.
View at: Google Scholar
G. Benjamin, “Fractional max-pooling, 2014,” https://arxiv.org/abs/1412.6071.
View at: Google Scholar
G. E. Dahl, T. N. Sainath, and G. E. Hinton, “Improving deep neural networks for LVCSR using rectified linear units and dropout,” in Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '13), pp. 8609–8613, May 2013.
View at: Publisher Site | Google Scholar
N. Srivastava, “Improving neural networks with dropout,” University of Toronto, vol. 182, 2013.
View at: Google Scholar
M. Abadi, P. Barham, C. Jianmin et al., “Tensorflow: A system for large-scale machine learning,” 12th USENIX Symposium on Operating Systems Design and Implementation, vol. 16, pp. 265–283, 2016.
View at: Google Scholar
G. Aurélien, “Hands-on machine learning with scikit-learn and tensorflow: concepts, tools, and techniques to build intelligent systems, 2017”.
View at: Google Scholar
Q. Huynh-Thu and M. Ghanbari, “Scope of validity of PSNR in image/video quality assessment,” IEEE Electronics Letters, vol. 44, no. 13, pp. 800-801, 2008.
View at: Publisher Site | Google Scholar
R. Näätänen, Attention and Brain Function, Psychology Press, 1992.
K. Takano, H. Ora, K. Sekihara, S. Iwaki, and K. Kansaku, “Coherent activity in bilateral parieto-occipital cortices during P300-BCI operation,” Frontiers in Neurology, vol. 5, Article ID Article 74, 2014.
View at: Publisher Site | Google Scholar
F. A. Capati, R. P. Bechelli, and M. C. F. Castro, “Hybrid SSVEP/P300 BCI keyboard: Controlled by Visual Evoked Potential,” in Proceedings of the 9th International Conference on Bio-Inspired Systems and Signal Processing, BIOSIGNALS 2016 - Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016, pp. 214–218, ita, February 2016.
View at: Google Scholar
S. Ikegami, K. Takano, M. Wada, N. Saeki, and K. Kansaku, “Effect of the green/blue flicker matrix for P300-based brain-computer interface: An EEG-fMRI study,” Frontiers in Neurology, 2012.
View at: Publisher Site | Google Scholar
W. Speier, A. Deshpande, and N. Pouratian, “A method for optimizing EEG electrode number and configuration for signal acquisition in P300 speller systems,” Clinical Neurophysiology, vol. 126, no. 6, pp. 1171–1177, 2015.
View at: Publisher Site | Google Scholar
C.-Y. Chen, C.-H. Chen, C.-H. Chen, and K.-P. Lin, “An automatic filtering convergence method for iterative impulse noise filters based on PSNR checking and filtered pixels detection,” Expert Systems with Applications, vol. 63, pp. 198–207, 2016.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2018 Jaehong Yoon et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1490

Downloads

1400

Citations