Abstract

Blind source separation (BSS) has applications in the fields of data compression, feature recognition, speech, audio, and biosignal processing. Identification of ECG signal is one of the challenges in the biosignal processing. Proposed in this paper is a new method, which is the combination of related function relevance to estimated signal and negative entropy in fast independent component analysis (FastICA) as objective function, and the iterative formula is derived without any assumptions; then the independent components are found by maximizing the objective function. The improved algorithm shorthand for R-FastICA is applied to extract random mixed signals and ventricular late potential (VLP) signal from normal ECG signal; simultaneously the performance of R-FastICA algorithm is compared with traditional FastICA through simulation. Experimental results show that R-FastICA algorithm outperforms traditional FastICA with higher similarity coefficient and separation precision.

1. Introduction

Blind source separation (BSS) [1] has been applied successfully to extract mixed signals in different fields of data compression [2], feature recognition [3], speech, audio, and biosignal processing [4, 5] as a statistical signal method. Literature [2] showed that the compression ratio was higher through the ICA method than principal component analysis (PCA). The accuracy ratio of feature recognition was 92.1% based on the complex valued Independent Component Analysis in literature [3]. Literature [4, 5] showed that FastICA was an efficient method in the extraction of speech, audio, and biosignal. Neither the source signals nor the structure of mixed matrix is known [6].

The detection and analysis of VLP generally appearing in the end of QRS wave and extending to ST segment with a series of high frequency and low-rising weak irregular electrical signal are a kind of effective means to predict unexplained asphyxia, sudden cardiac deaths, and so forth [7]. At present, the analysis methods of VLP commonly have had a time domain method, frequency domain method, spectrum scale measurement analysis method, and so forth [8]. Time domain method is not easy to improve detection rate of VLP; frequency domain analysis is limited by frequency resolution; spectrum scale measurement analysis can overcome some limitations in time domain and frequency domain analysis, but it is easily influenced by the selection of analysis time and positioning of QRS wave terminal in extracting judgmental standard parameters of VLP [9].

To overcome the abovementioned limitation and improve the detection accuracy, it is necessary to put forward a new detection technology. Independent component analysis (ICA) as a branch of BSS is widely applied to this problem in recent years [1012]. Traditional FastICA algorithm has obtained several effects in extracting electromyographic and atrial fibrillation signal. VLP compared with normal ECG signal waveform has a relative independence, and ICA algorithm can accurately distinguish relatively independent component from the ECG; it can be used to identify VLP. In addition, many authors have conducted specific research to ICA algorithm. Literature [13] proposed an adaptive ICA algorithm based on artificial neural network, which reduced the complexity of obtaining the learning matrix and independent component; literature [14] designed a fast search algorithm directly based on kurtosis as a measure of non-Gauss; literature [15] analyzed the principle and method of independent component in serial estimation ICA model; the concept of kernel ICA was proposed in literature [16], where data was analyzed through related analysis combined with mutual information theory; it had obtained better separation effect.

R-FastICA is proposed in this paper, which is the combination of related function and negative entropy as objective function, and the iterative formula is derived; then the independent components are found by maximizing the objective function. The extracted performance of R-FastICA algorithm is compared with traditional FastICA through simulation of random mixed signals and ECG signal with VLP. Experimental results show that R-FastICA algorithm outperforms traditional FastICA with higher similarity coefficient and separation precision.

2. Theory

ICA firstly proposed by Pierre Comon in 1994 is a method for finding the statistical independent components from multidimensional statistical data [17]. The mathematical model without noises can be expressed as follows: where is a column vector of source signals and is column vector of observed signals. is a mixed matrix required that .

The goal is to extract independent source signals from mixed signals by finding separation matrix through some assumptions and constraints under the premise of unknown source signal and mixed matrix , which makes output an estimation to source signal . That is to say, where is the estimation of source signals.

Generally, the normal ECG and VLP signal can be thought of as statistical independence with each other; thus VLP signal will be extracted through FastICA algorithm [11].

The basic model is shown in Figure 1.

3. Methodology

3.1. R-FastICA Algorithm

Traditional FastICA method is to estimate source signals based on negative entropy.

In order to improve separation precisely, negative entropy combined with related function as objective function is proposed in this paper. The updating formula of R-FastICA algorithm is vector gradient derived by the negative entropy combined with related function. The basic idea of R-FastICA algorithm requires that extracted signals are not only independent but also have high precision. Related function is introduced and defined by following formula: where MSE represents the mean square error between source signal and estimated signal, defined as [18]

The MSE in type (3) is substituted by type (4), simplified as

Source signal is replaced by the average of estimated signal. It is defined by , where is an arbitrary integer less than 100 [19]. The type (5) can be simplified as

BSS model is and is the average of . It can be known that where is a column vector and is a row vector.

Combining type (6) and type (7), it can be got that where Similarly,

From the above, is a function based on and ; the vector gradient can be obtained by the bottom of related function:

Vector gradient is defined as

The gradient of can be calculated according to type (12): Similarly,

Type (15) is a gradient of related function to , which can be calculated combining type (13) with type (14):

In literature [10], the approximate calculation formula of negative entropy is

The objective function is composed by negative entropy and related function including the information between source signal and estimated signal in R-FastICA: where and are Gauss random variables with the same covariance (zero mean and unit variance) and is a nonlinear function selected by distribution form of source signals.

The vector gradient of objective function to is as follows:

Type (18) is a new updating formula. The improved algorithm can ensure that the estimated signals are independent and the precision is higher due to the fact that related function is relevant to estimated signal.

3.2. Assessment Method

A familiar measure of separation performance is the similarity coefficient defined as [20]

When , separation effect is ideal; when and are mutual independent, ; generally, similarity coefficient matrix is used to measure extracted performance.

The composite scattering plot [21] is a measure to describe corresponding relationship between the source signal and the extracted signal, where we can see that not only an extracted signal is the recovery of a source signal, but also the phase of source signal and extracted signal is the same or opposite.

4. Simulation

In order to indicate the performance of R-FastICA compared with traditional FastICA, the following simulations were conducted.

Taking random signals as an example in the first simulation, R-FastICA method was proved to be effective. In the second simulation, taking ECG signal with VLP as an example, the original ECG signal without noises was from MIT/BIH database and VLP signal was generated through stacking sine waves with different frequency and amplitude [22].

4.1. Source Separation of the Random Signals

In this simulation, the source signal was sinusoidal signal and was random noises, whose sampling number was 2000. Signals and were shown in Figure 2, as well as mixed signals and that were shown in Figure 3.

The mixed signals were extracted through R-FastICA and traditional FastICA in Figures 4 and 5, respectively.

In the experiment of extracting random signals, the comparison of similarity coefficient matrix between RFastICA and FastICA algorithm was shown in Table 1.

Extracted signal was the estimation of source signal and the phase was opposite in Figure 6 and the same in Figure 7. Extracted signal was the estimation of source signal and the phase was opposite in Figure 6 and the same in Figure 7.

From the above experiments, we could see that R-FastICA algorithm outperforms traditional FastICA with higher similarity coefficient and separation precision.

4.2. Source Separation of the VLP Signal

In this simulation, the source signal was VLP signal generated by program and was ECG signal from MIT/BIH database, whose sampling number was 1600. The source signals and , as well as mixed signals and , were shown in Figures 8 and 9, respectively.

The mixed signals were extracted through R-FastICA and traditional FastICA in Figures 10 and 11, respectively.

In the experiment of extracting VLP signal, the comparison of similarity coefficient matrix between RFastICA and FastICA algorithm was shown in Table 2.

Extracted signal was the estimation of source signal and the phase was the same in Figure 12 and the opposite in Figure 13. Extracted signal was the estimation of source signal and the phase was the same in Figures 12 and 13.

From the above experiments, the performance of R-FastICA is superior to the FastICA obviously with higher similarity coefficient and high separation precision.

5. Conclusion

In this paper, R-FastICA algorithm and FastICA algorithm were adapted to extract random signals and to separate VLP signal from ECG signal. We believed that our study produced two important results. Firstly, we proposed a new method through the combination of related function and negative entropy and separated independent components by maximizing new objective function in the experiments. On the other hand, the experiments showed that R-FastICA method outperformed traditional FastICA method with higher similarity coefficient and high separation precision.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This project is supported by The General Object of National Natural Science Foundation (no. 61371062), Youth Science Foundation Project of National Natural Science Foundation (no. 61303207), Ministry of Education in 2012 Colleges and Universities by the Specialized Research Fund for the Doctoral Program of Joint Funding Subject (no. 20121402120020), Shanxi Province Science and Technology Development Project, Industrial Parts (no. 20120321024-01), Shanxi International Science and Technology Cooperation Project (no. 2012081031), Science and Technology Activities Project of Study Abroad Returnees in Shanxi Province in 2012 (Funded by Shanxi province human resources and social security hall), and Research Project supported by Shanxi scholarship council of China (no. 2013-032).