Abstract

A telemedicine system that uses communication and information technology to deliver medical signals such as ECG and EEG for long-distance medical services has become a reality. In both urgent treatment and ordinary healthcare, these signals must be compressed for efficient use of bandwidth. This paper discusses quality on demand compression of EEG signals using neural network predictors for telemedicine applications. The objective is to obtain greater compression gain at a low bit rate while preserving the clinical information content. A two-stage compression scheme with a predictor and an entropy encoder is used. The residue signals obtained after prediction are first thresholded at various threshold levels, then quantized, and finally encoded using an arithmetic encoder. Three neural network models, namely, the single-layer perceptron, the multi-layer perceptron, and the Elman network, are used, and the results are compared with linear predictors such as FIR filters and AR modeling. The fidelity of the reconstructed EEG signal is assessed quantitatively using parameters such as PRD, SNR, cross correlation, and power spectral density. The results show that the quality of the reconstructed signal is preserved at a low PRD, thereby yielding better compression results than those obtained using a lossless scheme.

1. Introduction

Electroencephalography (EEG) is widely used in brain research and clinical diagnosis [13]. Because large electrode arrays and prolonged recordings produce an enormous volume of EEG data, data compression is desirable for efficient archiving and transmission through networks. Furthermore, for telemedicine or telebrowsing applications, transmitting a large amount of digital data through a bandwidth-limited channel becomes a heavy burden. Compression techniques aim at maximum data volume reduction while preserving the significant features upon reconstruction. Data compression can be lossless, when the signal waveform fidelity is totally preserved, or lossy, when a certain amount of distortion in the decompressed data is allowed. Efficient compression of the EEG signal is a difficult task due to the randomness inherent in the signal, and hence high compression rates cannot be achieved with lossless compression methods [4–10]. In [10], an adaptive error modeling technique for lossless compression has been applied to improve the compression efficiency. In [11], Wongsawat et al. have shown the application of the Karhunen-Loève transform (KLT) to the lossless compression of EEGs. A context-based error model using linear and neural network predictors has been shown to remove offset bias and thereby attain some improvement in compression efficiency [6, 12]. The effect of uniform and nonuniform quantization on compression gain in the near-lossless compression of EEG signals has been reported in [6, 13, 14]. Gopikrishna and Makur proposed a near-lossless compression scheme using wavelets and an ARX model [15]. In [16], the author has shown the influence of context-based error modeling on the near-lossless compression of EEG signals.

Lossy compression techniques might be acceptable as long as the reconstructed signal preserves the diagnostic information needed for clinical investigation [17–19]. Further, recent works based on pursuit approaches with wavelet dictionaries, wavelet-SPIHT, and a finite rate of innovation technique exploiting sampling theory have shown some improvement in compression performance [20–22]. The literature shows that compression schemes for ECG signals have emphasized the quality of the reconstructed signal at lower bandwidth utilization [23–26]. To the best of the authors' knowledge, no attempts have been made to address the quality of the reconstructed EEG signal for telemedicine applications while at the same time satisfying the low-bandwidth requirement. This paper discusses the quality on demand compression of EEG signals using neural network predictors. Three neural network models, namely, single-layer perceptron (SLP), multi-layer perceptron (MLP), and Elman network (EN), are used [7–10], and the performances are evaluated in terms of bits per sample (BPS) and fidelity parameters such as percent of root-mean-square difference (PRD), signal-to-noise ratio (SNR), cross correlation (CC), and power spectral density (PSD). The results are also compared with two linear predictors, namely, finite impulse response (FIR) filters and the autoregressive (AR) model.

2. Quality on Demand Compression

It is well known that higher compression can be achieved by sacrificing the quality of the reconstructed signal and vice versa. A trade-off has to be made to obtain a good quality of decoded signal with a considerable amount of compression. For telemedicine applications, a physician at the receiving end should be able to interactively adjust certain parameters of the compression algorithm according to his or her quality requirements. It has been reported in [12, 23–26] that two factors, namely, bits per sample (BPS) and percent of root-mean-square difference (PRD), determine the quality on demand specifications, namely, bandwidth constraints and reconstructed signal quality, respectively. This paper highlights the quality on demand compression scheme for EEG signals using neural network predictors. The fidelity of the reconstructed signal is measured quantitatively by means of four factors, namely, PRD, SNR, CC, and PSD. For EEG signal compression, two-stage lossless compression schemes involving a predictor in the first stage and an entropy encoder in the second stage have been successfully used [6–10]. The main function of the predictor is to estimate the present value of a sample from its past samples and then transmit only the errors (residues), which are generally smaller in magnitude than the original samples. It is assumed that both the encoder and the decoder simulate an identical prediction process [27]. The prediction process starts with the transmission of initial header information consisting of the neural network parameter settings and a selected number of input sample values. At the receiving end, the prediction process is repeated and the original input is recovered by adding the transmitted residues to the predicted values.
If the error signals are thresholded and then quantized before transmission, better compression can be achieved, and the result may also be clinically acceptable as long as the reconstructed signal preserves the required diagnostic features. The compression efficiency can be further improved by using an arithmetic entropy encoder in the second stage [28]. For quality on demand compression of EEG signals, the two-stage compression scheme reported in [7–10, 13] can be modified as shown in Figure 1.
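
As a rough illustration of why zeroing small residues can help the second stage, the sketch below estimates the zeroth-order empirical entropy of a residue sequence, which approximates the bits per sample that an ideal entropy coder would need for a memoryless source. The function name `empirical_entropy_bits` and the sample residues are hypothetical, and an actual arithmetic encoder such as the one in [28] will not match this estimate exactly.

```python
import math
from collections import Counter

def empirical_entropy_bits(symbols):
    """Zeroth-order entropy (bits per symbol) of a sequence of integer residues."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# Hypothetical residues (not taken from the paper's datasets).
residues = [0, 1, -1, 2, -2, 5, -6, 1, 0, -1, 7, 2]

# Zero every residue whose magnitude is below an illustrative threshold of 3.
thresholded = [r if abs(r) >= 3 else 0 for r in residues]

h_before = empirical_entropy_bits(residues)
h_after = empirical_entropy_bits(thresholded)
print(f"entropy before: {h_before:.3f} bits, after: {h_after:.3f} bits")
```

Thresholding concentrates the residue histogram at zero, so the entropy estimate drops; the price is the distortion introduced in the reconstructed signal.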

If $x(n)$ is the current sample and $\hat{x}(n)$ is the predicted sample, then the value of the error (residue) sample $e(n)$ is given by

$$e(n) = x(n) - \hat{x}(n). \tag{1}$$

Figure 1 shows the block diagram of the proposed compression scheme. The thresholded value of $e(n)$, denoted as $e_T(n)$, is determined based on a threshold value $T$ as shown in (2):

$$e_T(n) = \begin{cases} 0, & |e(n)| < T, \\ e(n), & |e(n)| \ge T. \end{cases} \tag{2}$$

According to (2), if the magnitude of the error signal is less than the threshold value $T$, it is assumed to be zero. On the other hand, if the magnitude of the error signal is greater than or equal to $T$, the actual value of the error signal is transmitted. The error signals are then quantized and encoded using an arithmetic encoder; the thresholded and quantized error signal is denoted as $e_Q(n)$. At the receiving end, the output of the counterpart neural network is combined with the transmitted error signal to obtain the resultant signal:

$$y(n) = \hat{x}(n) + e_Q(n), \tag{3}$$

where $y(n)$ is the resultant signal and $e_Q(n)$ is the thresholded and quantized value of the error signal.

The performance of the two-stage compression scheme is evaluated for different values of the threshold $T$.
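
The thresholding step can be sketched as a small encoder and decoder pair. This is a minimal sketch under stated assumptions, not the authors' implementation: `predict` is a hypothetical stand-in that simply repeats the last reconstructed sample (the paper uses neural network predictors), quantization and arithmetic coding are omitted, and the encoder is assumed to predict from the same reconstructed history that the decoder sees, so that both sides stay in step.

```python
def predict(history):
    """Hypothetical stand-in predictor: repeat the last reconstructed sample."""
    return history[-1]

def encode(samples, threshold):
    """Return the first sample (header) and the thresholded residues."""
    recon = [samples[0]]              # history as the decoder will see it
    residues = []
    for x in samples[1:]:
        x_hat = predict(recon)
        e = x - x_hat                 # residue
        e_t = e if abs(e) >= threshold else 0   # zero residues below the threshold
        residues.append(e_t)
        recon.append(x_hat + e_t)     # same value the decoder will rebuild
    return samples[0], residues

def decode(first, residues):
    """Rebuild the signal by adding each transmitted residue to the prediction."""
    recon = [first]
    for e_t in residues:
        recon.append(predict(recon) + e_t)
    return recon

# Hypothetical integer samples (not taken from the paper's datasets).
samples = [100, 102, 101, 110, 111, 109, 120]
first, residues = encode(samples, threshold=3)
reconstructed = decode(first, residues)
```

With a threshold of zero the round trip is lossless; with a positive threshold, and under the closed-loop assumption above, the per-sample reconstruction error stays below the threshold.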

3. Neural Network Predictors

Neural networks possess certain attractive properties such as massive parallelism, robustness, adaptive learning, self-organization, fault tolerance, and generalization, which are useful for enhancing the performance of a predictor [29]. The purpose of the predictor is to decorrelate the input data, thereby reducing the amplitude range of the data and generating a sequence that is approximately white Gaussian. In this paper, the neural network models considered are: (1) single-layer perceptron (SLP), (2) multi-layer perceptron (MLP), (3) Elman network (EN). The architectures of the SLP, MLP, and EN with predictor order $p$ are shown in Figures 2, 3, and 4, respectively [27, 30]. The first two networks are feedforward models whereas the third one is a feedback network.
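
To make the role of the predictor concrete, the following sketch trains a single linear neuron to predict each sample from the previous `order` samples. It assumes a linear activation and a normalized delta-rule (Widrow-Hoff) update, neither of which is specified in this section; the function names, the step size `mu`, and the 10 Hz test sinusoid are all hypothetical choices for illustration.

```python
import math

def train_slp_predictor(signal, order=4, mu=0.5, epochs=20, eps=1e-9):
    """Train one linear neuron to predict signal[n] from the previous `order` samples."""
    weights = [0.0] * order
    for _ in range(epochs):
        for n in range(order, len(signal)):
            past = signal[n - order:n][::-1]      # most recent sample first
            pred = sum(w * u for w, u in zip(weights, past))
            err = signal[n] - pred
            norm = eps + sum(u * u for u in past)  # input energy for normalization
            weights = [w + mu * err * u / norm for w, u in zip(weights, past)]
    return weights

def prediction_residues(signal, weights):
    """Residues left after predicting each sample with the trained weights."""
    order = len(weights)
    out = []
    for n in range(order, len(signal)):
        past = signal[n - order:n][::-1]
        out.append(signal[n] - sum(w * u for w, u in zip(weights, past)))
    return out

# Hypothetical test signal: a 10 Hz sinusoid sampled at 256 Hz.
signal = [math.sin(2 * math.pi * 10 * n / 256) for n in range(512)]
weights = train_slp_predictor(signal)
res = prediction_residues(signal, weights)
```

On a predictable signal such as this sinusoid the residues become much smaller than the samples themselves, which is the decorrelation effect described above; real EEG is far less predictable, so the reduction will be smaller.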

All the neural network predictors are configured with respect to parameters such as the number of hidden neurons, predictor order, activation functions, and learning algorithms in order to ensure network convergence with minimum error. Table 1 shows the configuration details along with the number of iterations required during the training phase [10, 16].

4. Experimental Setup

For our experimental study, we have used EEG signals recorded under three different physiological conditions. EEG dataset1 (DS1) consists of a 16-bit EEG signal recorded under epileptic conditions with sudden seizures, obtained from six channels (Fp1, Fp2, F3, F4, C3, C4) at a sampling rate of 256 Hz. EEG dataset2 (DS2) consists of a 16-bit normal EEG signal with eyes open and closed, recorded using a BIOPAC data acquisition system at a sampling rate of 256 Hz [10]. EEG dataset3 (DS3) contains EEG signals recorded during epileptic seizures exhibiting ictal activity, at a sampling rate of 173.61 Hz [10, 31]. Figure 5 shows exemplary EEG signals corresponding to the three datasets described above.

5. Performance Evaluation Parameters

The performance of the proposed compression schemes is evaluated using one compression parameter, the compression ratio (CR), and five fidelity parameters, namely, percent of root-mean-square difference (PRD), signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR), cross correlation (CC), and power spectral density (PSD).

The compression ratio (CR) is defined in [27] in terms of the following quantities: the total number of samples in the test file, the total number of bits used to represent a sample, the order of the predictor, the number of residues below the threshold, the number of weights, the number of bits used to represent a weight, and the number of bits used to represent a residue.

To validate the reliability of the compression method, the fidelity (quality) of the reconstructed signal has to be assessed. The parameters, PRD, SNR, PSNR, CC, and PSD are used to judge the quality of the reconstructed EEG signal.

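PRD is defined as [32, 33]

$$\mathrm{PRD} = \sqrt{\frac{\sum_{n=1}^{N} \left(x(n) - y(n)\right)^2}{\sum_{n=1}^{N} x(n)^2}} \times 100,$$

where $x(n)$ is the original sample, $y(n)$ is the reconstructed sample, and $N$ is the length of the window over which the PRD is calculated.

SNR is defined as shown [33]:

$$\mathrm{SNR} = 10 \log_{10} \frac{\sum_{n=1}^{N} x(n)^2}{\sum_{n=1}^{N} \left(x(n) - y(n)\right)^2}.$$

PSNR is defined as shown [13]:

$$\mathrm{PSNR} = 20 \log_{10} \frac{x_{\max}}{\mathrm{RMSE}},$$

where $x_{\max}$ is the maximum value of the original EEG signal and RMSE is the root mean square error, which is defined as

$$\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{n=1}^{N} \left(x(n) - y(n)\right)^2}.$$

Cross correlation (CC) denotes the statistical correlation between two signals [34]. The correlation between the original and the reconstructed signal is measured by CC, which is defined as

$$\mathrm{CC} = \frac{\mathrm{cov}(x, y)}{\sqrt{\sigma_x^2 \, \sigma_y^2}},$$

where $\mathrm{cov}(x, y)$ is the covariance between $x$ and $y$, and $\sigma_x^2$ and $\sigma_y^2$ are the variances of $x$ and $y$, respectively.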

Cross correlation plays an important role in judging the resemblance of two signals. The closer CC is to 1, the closer the reconstructed signal is to the original one.
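
The fidelity measures can be computed in a few lines. The sketch below assumes the common energy-ratio forms of PRD and SNR (without mean removal), PSNR relative to the maximum of the original signal, and CC as the Pearson correlation coefficient; the references cited above may use slightly different variants. The function names and the sample vectors are hypothetical.

```python
import math

def prd(x, y):
    """Percent root-mean-square difference between original x and reconstruction y."""
    num = sum((a - b) ** 2 for a, b in zip(x, y))
    den = sum(a * a for a in x)
    return 100.0 * math.sqrt(num / den)

def snr_db(x, y):
    """Signal-to-noise ratio in dB (signal energy over error energy)."""
    noise = sum((a - b) ** 2 for a, b in zip(x, y))
    return 10.0 * math.log10(sum(a * a for a in x) / noise)

def rmse(x, y):
    """Root mean square error."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))

def psnr_db(x, y):
    """Peak signal-to-noise ratio in dB, using the maximum of the original signal."""
    return 20.0 * math.log10(max(x) / rmse(x, y))

def cross_correlation(x, y):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical original and reconstructed samples.
original = [1.0, 2.0, 3.0, 4.0]
reconstructed = [1.0, 2.0, 3.0, 5.0]
print(prd(original, reconstructed), snr_db(original, reconstructed),
      psnr_db(original, reconstructed), cross_correlation(original, reconstructed))
```

Under these particular definitions PRD and SNR carry the same information, since SNR equals $-20 \log_{10}(\mathrm{PRD}/100)$; the functions are undefined for a perfect reconstruction, where the error energy is zero.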

The power spectra of the original and reconstructed EEG signals are compared by estimating the PSD [35, 36]. Similarity between the two power spectra implies that the original and reconstructed EEG signals are approximately identical.
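
One simple way to obtain a PSD estimate for such a comparison is the periodogram. The sketch below is an assumption for illustration only, since the estimator used in [35, 36] is not stated here: it computes the squared magnitude of a direct DFT at the non-negative frequencies, after removing the mean. The function name `periodogram` and the test sinusoid are hypothetical.

```python
import cmath
import math

def periodogram(x, fs):
    """Return (frequencies in Hz, power estimates) at non-negative frequencies."""
    n = len(x)
    mean = sum(x) / n
    centered = [v - mean for v in x]          # remove the DC offset
    freqs, power = [], []
    for k in range(n // 2 + 1):
        s = sum(v * cmath.exp(-2j * math.pi * k * i / n)
                for i, v in enumerate(centered))
        freqs.append(k * fs / n)
        power.append(abs(s) ** 2 / (fs * n))  # squared DFT magnitude, scaled
    return freqs, power

# Hypothetical check: a 10 Hz sinusoid sampled at 256 Hz for one second.
fs = 256
tone = [math.sin(2 * math.pi * 10 * i / fs) for i in range(fs)]
freqs, power = periodogram(tone, fs)
peak_hz = freqs[power.index(max(power))]
```

Applying the same function to the original and the reconstructed signal and overlaying the two curves gives the kind of visual comparison described above; the direct DFT is quadratic in the window length, so an FFT would be preferable for long records.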

The error signal obtained as the difference between the original and the predicted EEG signals is thresholded by varying the threshold $T$ from 1% to 10% of the maximum value of the error signal. The thresholded error signals are further quantized into three levels, namely, Q1, Q2, and Q3. For the experimental datasets, the bits assigned to the three quantization levels (QL) are shown in Table 2.
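
The two steps described above, choosing a threshold as a percentage of the peak and then quantizing with a fixed number of bits, can be sketched as follows. This is an illustration under assumptions: the percentage is taken of the peak residue magnitude, the quantizer is a plain uniform one, and the bit width of 3 is hypothetical because the values of Table 2 are not reproduced here.

```python
def threshold_from_percent(residues, percent):
    """Threshold as a percentage of the largest residue magnitude."""
    return percent / 100.0 * max(abs(r) for r in residues)

def apply_threshold(residues, t):
    """Zero every residue whose magnitude is below t."""
    return [r if abs(r) >= t else 0 for r in residues]

def uniform_quantize(residues, bits):
    """Round residues to a symmetric uniform grid with 2**(bits-1) - 1 positive levels."""
    peak = max(abs(r) for r in residues)
    if peak == 0:
        return list(residues)
    levels = 2 ** (bits - 1) - 1
    step = peak / levels
    return [round(r / step) * step for r in residues]

# Hypothetical residues (not taken from the paper's datasets).
residues = [0.5, -2.0, 10.0, -7.0, 1.0]
t = threshold_from_percent(residues, 10)      # 10% of the peak magnitude
thresholded = apply_threshold(residues, t)
quantized = uniform_quantize(thresholded, bits=3)
```

Fewer bits give a coarser grid and therefore fewer distinct symbols for the entropy coder, at the cost of a quantization error of up to half a step per retained residue.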

6. Experimental Results

The performance of the proposed compression scheme is evaluated in terms of CR by varying the threshold, $T$, and the quantization level, QL, using the two-stage compression with different predictors in the first stage and an arithmetic encoder in the second stage. Tables 3, 4, and 5 show the values of CR obtained for the three experimental datasets DS1, DS2, and DS3 at Q1, Q2, and Q3, respectively. For DS1, the mean values of CR obtained from all six channels are given.

From Tables 3, 4, and 5, it is found that the CR value increases as the threshold $T$ increases. Further, the compression gain increases as the QL increases, and the values of CR obtained at Q3 are the best. Among the predictors, the SLP yields the best results for all three datasets used in this experimental study.

The quality of the reconstructed signal at different thresholds is evaluated in terms of PRD and SNR for the two-stage compression schemes. Figure 6 shows the variation of PRD and SNR with respect to BPS at different thresholds $T$ and quantization levels using different predictors for DS1. The PRD and SNR values represent the mean values obtained for the reconstructed EEG signals of all six channels.

It is found from Figure 6 that an increase in BPS results in a decrease in PRD (and an increase in SNR). Among the different prediction schemes, the SLP yields the best results. It can be seen that the value of BPS decreases as QL increases. The minimum BPS values obtained at Q1, Q2, and Q3 are 5.32, 5.02, and 4.9, respectively.

Figures 7 and 8 show the variations of PRD and SNR with BPS for DS2 and DS3, respectively, using the SLP.

From Figure 7, it is found that the minimum BPS values obtained at Q1, Q2, and Q3 are 5.92, 5.72, and 5.3, respectively. From Figure 8, it is found that the minimum BPS values obtained at Q1, Q2, and Q3 are 5.02, 4.3, and 4.01, respectively. Figure 9 shows the fidelity plot in terms of PSNR using the SLP predictor.
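
To put these BPS figures in perspective, they can be related to the 16-bit source resolution of the datasets. The sketch below is back-of-the-envelope arithmetic under the assumption that header overhead is ignored, so the numbers need not match the CR values in Tables 3, 4, and 5; the helper names are hypothetical.

```python
def cr_from_bps(bps, bits_per_sample=16):
    """Approximate compression ratio implied by a bit rate, ignoring header overhead."""
    return bits_per_sample / bps

def saving_percent(bps, bits_per_sample=16):
    """Approximate percentage of bits saved relative to the uncompressed samples."""
    return 100.0 * (1.0 - bps / bits_per_sample)

# Minimum BPS values quoted above for Q3 on DS1 and DS3.
for label, bps in [("DS1, Q3", 4.9), ("DS3, Q3", 4.01)]:
    print(f"{label}: CR ~ {cr_from_bps(bps):.2f}, saving ~ {saving_percent(bps):.1f}%")
```

For example, 4.9 bits per sample corresponds to roughly a 3.3:1 reduction from 16-bit samples under this simplification.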

The quality of the reconstructed signal is then further assessed by estimating the parameters, CC and PSD. The value of CC is evaluated for the three EEG datasets at different QL. Table 6 shows the CC obtained using SLP which yields the best results.

From Table 6, it is found that the CC value decreases as the value of the threshold, $T$, increases. The correlation between the original and the reconstructed signal increases as the quantization level increases. This holds for all three experimental datasets.

Figures 10–12 depict the PSD estimation of the original signal and the reconstructed signal for the three datasets.

From Figures 10, 11, and 12, it is found that, for each of the datasets DS1, DS2, and DS3, respectively, the PSD of the reconstructed signal resembles that of the original signal up to a threshold value that depends on the quantization level (Q1, Q2, or Q3).

7. Discussion

The main idea of the proposed compression scheme is to obtain a reconstructed EEG signal suitable for clinical diagnosis at a lower BPS and PRD. The thresholding and quantization levels determine the quality on demand criteria, and a better trade-off between BPS and PRD is achieved for clinical investigations. If the end user wishes to receive the reconstructed signal at a certain PRD, then the parameters BPS, $T$, QL, and CC for the three experimental datasets can be chosen accordingly, as shown in Table 7.

From Table 7, it can be observed that average compression efficiencies [13] of 74.12%, 76.2%, and 79.2% are achieved for the three specified fidelity criteria, respectively. For telemedicine-based transmission and retrieval, one can ensure the diagnostic quality of the reconstructed EEGs through appropriate selection of the fidelity criteria while using bandwidth efficiently. The results in Table 7 further show that the proposed scheme yields results comparable with those of the near-lossless scheme reported in [13].

8. Conclusions

This paper discusses the quality on demand compression scheme for EEG signals using neural network predictors. A two-stage compression scheme with a predictor in the first stage and an entropy encoder in the second stage has been used. The error (residue) signal, which is the difference between the original and the predicted EEG signals, was first thresholded using various threshold levels, then quantized, and finally encoded using an arithmetic encoder. Three neural network models, namely, single-layer perceptron, multi-layer perceptron, and Elman network, were used, and the results were compared with linear predictors such as FIR filters and AR modeling. Experiments were carried out using EEG signals recorded under various physiological conditions. The fidelity of the reconstructed EEG signal was assessed quantitatively using parameters such as percent of root-mean-square difference (PRD), signal-to-noise ratio (SNR), cross correlation (CC), and power spectral density (PSD). It has been found from the experimental results that the single-layer perceptron yields the best results by preserving the diagnostic information at low PRD values.