Computational and Mathematical Methods in Medicine

Volume 2016 (2016), Article ID 5127978, 10 pages

http://dx.doi.org/10.1155/2016/5127978

## A Fetal Electrocardiogram Signal Extraction Algorithm Based on Fast One-Unit Independent Component Analysis with Reference

^{1}College of Information and Communication Engineering, Harbin Engineering University, Heilongjiang 150001, China^{2}College of Electrical and Information Engineering, Beihua University, Jilin 132012, China^{3}Collaborative Research Center, Meisei University, Tokyo 1918506, Japan

Received 10 July 2016; Accepted 8 August 2016

Academic Editor: Po-Hsiang Tsui

Copyright © 2016 Yanfei Jia and Xiaodong Yang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

Fetal electrocardiogram (FECG) extraction is very important procedure for fetal health assessment. In this article, we propose a fast one-unit independent component analysis with reference (ICA-R) that is suitable to extract the FECG. Most previous ICA-R algorithms only focused on how to optimize the cost function of the ICA-R and payed little attention to the improvement of cost function. They did not fully take advantage of the prior information about the desired signal to improve the ICA-R. In this paper, we first use the kurtosis information of the desired FECG signal to simplify the non-Gaussian measurement function and then construct a new cost function by directly using a nonquadratic function of the extracted signal to measure its non-Gaussianity. The new cost function does not involve the computation of the difference between the function of the Gaussian random vector and that of the extracted signal, which is time consuming. Centering and whitening are also used to preprocess the observed signal to further reduce the computation complexity. While the proposed method has the same error performance as other improved one-unit ICA-R methods, it actually has lower computation complexity than those other methods. Simulations are performed separately on artificial and real-world electrocardiogram signals.

#### 1. Introduction

The fetal electrocardiogram (FECG) contains much important information about the health and possible diseases of the fetus, which reflects the complete view of the heart activities. Additionally, it is more sensitive than color Doppler ultrasound in the case of fetal acidosis and anoxia. With the analysis of the FECG, the doctor can discover fetal hypoxemia, umbilical abnormality, and other fetal abnormality situations timely and take early effective actions to ensure the health of the fetus. However, the FECG signal is considerably weaker than the maternal electrocardiogram (MECG) and is often embedded in the noise, MECG, baseline wandering, power line interference, and so forth. Accordingly, it is quite difficult to extract the FECG signal and further reduce the diagnostic accuracy.

A number of methods have been reported for extracting the FECG signal, like filtering method [1], singular value decomposition [2], wavelet transform [3], independent component analysis (ICA) [4], and others [5]. Every method has its advantages and disadvantages [5, 6]. In this paper, we only consider the method based on ICA. The FECG extraction based on the ICA methods considers the extracting FECG signal as a blind source separation problem. They consider the unknown FECG signal, MECG, and other interference signals as source signals and the measured signals from the maternal abdomen and chest as mixed signals (measured signals). It actually can separate all source signals, including the FECG signal and MECG signal. Compared with the other methods, it has a simple structure and has been proved to be efficient in extracting FECG in many studies [4–9]. However, for extracting the FECG, the ICA method must first separate all the source signals from the measured signals and then artificially selects the FECG signal. When the number of measured signals is very large, the extraction of the FECG signal will consume plenty of time.

To overcome this problem, the independent component analysis with reference (ICA-R) is applied to extract the FECG [10–14], which separates the desired FECG signal by using some prior information about the desired FECG without separating all the source signals. Therefore, this FECG extraction method based on ICA-R is more efficient than those based on ICA. ICA-R was proposed and implemented by Lu and Rajapakse [15–17] by incorporating the prior information of the desired signal into the contrast function of the ICA [18]. To improve the ICA-R performance, some authors proposed improved ICA-R methods. For example, to reduce the computational complexity of ICA-R, Lin et al. [11] proposed a fast one-unit ICA-R by using prewhitening to deal with the observed signal. Compared with the original ICA-R, this method has shorter running time and the same error. Also, Huang and Mi [19] studied the gradient of inequality constraint function and corrected the gradient. This method has a higher success rate for extracting the desired signal. In addition, Li et al. [14] first divided the cost function of ICA-R into two parts that are the negentropy function and the closeness measure function and then directly used the first-order derivative of the negentropy contrast function to update the separating vector. This method has faster convergence rate and higher success rate to extract the desired signal than the previously reported method [19]. Moreover, Sun and Shang [20] studied the stability of ICA-R and proposed an initialization method of separating the matrix. This method has a higher accuracy and stability than the original ICA-R. Kavuri et al. [21] proposed a one-unit ICA-R by using evolutionary algorithm to optimize the cost function to find a global optimal solution. Although this method has a smaller error than the original ICA-R, it does consume a considerable amount of time. Additionally, Zhang et al. [22] proposed the ICA-R based on kurtosis and analyzed how to choose the reference signal. Wang [23] used the method that has been used in the fast ICA to propose a fixed-point ICA method. Rodriguez et al. [24] proposed a multiunit ICA-R method based on nonorthogonal decoupling of separated matrix. This method cannot be extended to one-unit ICA-R. Mi [25] proposed a strategy to detect future misconvergence to improve the probability of extracting the desired signal. Chen et al. [26] proposed an ICA-R that can be used in single channel by discrete wavelet transform.

The one-unit ICA-R is a special case of the multiunit ICA-R. It only separates one desired source signal every time. The only difference between the one-unit ICA-R and the multiunit ICA-R is that the former only extracts one desired signal, and the latter is based on the former and uses the orthogonalization method to extract multiple desired signals. If we use the deflationary orthogonalization method or symmetric orthogonalization method, which are widely used in ICA [18] to extract the signal in the one-unit ICA-R, the one-unit ICA-R can be easily extended to multiunit ICA-R. In the FECG extraction, we only want to extract the FECG with just one signal; thus, the one-unit ICA-R is enough to extract FECG.

Although the one-unit ICA-R and multiunit ICA-R have a little difference, they share the same cost function. Additionally, they both use the same non-Gaussian measurement function that is used in the classical ICA algorithm to measure the non-Gaussian component of the extracted signal. The negentropy approximation is needed to compute the difference between the function of the extracted signal and the function of a Gaussian vector that has the same mean and variance as the extracted signal. This will consume plenty of time. The classical ICA algorithm has no information about the source signals, except that the non-Gaussian measurements and statistics are independent. However, the ICA-R has prior information about the desired signal, but the previous methods did not use such information to reduce the computational complexity of the negentropy approximation. The aim of this paper is to reduce the computational complexity for the one-unit ICA-R by taking advantage of the prior information of the desired ECG to simplify the negentropy approximation.

This paper is organized as follows. Section 2 reviews the previously reported algorithms including the traditional ICA and one-unit ICA-R. Section 3 analyzes in detail the non-Gaussian measurement function that is used in the one-unit ICA-R cost function. In Section 4, we simplify the non-Gaussian measurement function based on the analysis in Section 3 and, then, derive an improved one-unit ICA-R with lower computation complexity. In Section 5, we test Mi’s method [25] and our proposed method on artificial ECG data and real-world ECG data to compare the performance of each method. Some conclusions are included in Section 6.

#### 2. Previous Algorithms

##### 2.1. Traditional ICA

The traditional ICA is a signal processing method for estimating the source signals from observed signals that are a mixture of source signals. The model of linear ICA is as follows:where is the observed signal, is the mixing matrix with the size (), and is the source signal. The aim of the ICA is to estimate the source signal from the observed signal by computing the separating matrix with size . The estimated source signal can be expressed as follows:where is the estimated source signal, which is the same as under ideal condition. However, under real condition, the shape of the waveforms of is close to , except for the amplitude and sequence of the waveforms. The source signals are considered as non-Gaussian and mutually statistically independent. Accordingly, it has strong non-Gaussian properties. The mixing signals have strong Gaussian characteristics according to the central limit theorem. When the estimated source signal is close to the source signal , it will have strong non-Gaussian properties. Thus, the ICA uses a non-Gaussian measurement function of the estimated source signals as a cost function [18]:where is a positive constant, is the estimated signal (extracted signal), is the nonquadratic function, and is a Gaussian variable with the same mean and variance as . By maximizing the cost function, we can obtain the separating matrix and recover the source signal.

##### 2.2. One-Unit ICA-R

The ICA can separate all the source signals from the observed signals. However, if we only want to extract some or one of the desired source signals, we must use ICA to separate all source signals and further select the desired source signal from the estimated signals. When the dimension of observed signals is very large, especially for some biomedical signals, it will take a long period of time to extract the desired signal. The one-unit ICA-R was proposed by Lu and Rajapakse [15] to reduce computational time and avoid additional operations to select the desired source signal from the estimated signals.

The one-unit ICA-R method combines prior information of the desired source signal into the ICA cost function (3) and constructs a new cost function. The cost function for the one-unit ICA-R is as follows [15]:where , is the separating vector, is the observed signals, is the extracted signal, is a positive constant, and is a Gaussian vector with the same mean and variance as . The inequality constraint term is , where is the reference signal constructed by the prior information of desired signal, is used to measure the closeness between the estimated signal and the reference signal , and is a threshold parameter to control the closeness level. A common and simple measure of closeness is the mean squared error (MSE) . is used to restrict the estimated signal to unit variance.

The above inequality constraints can be transformed into equality constraints by introducing the slack factor . Thus, the cost function of the one-unit ICA with reference to equality constraint can be expressed as follows:The augmented Lagrange multiplier method is used to optimize the above cost function with equality constraint and derive the augmented Lagrange function that can be expressed as follows:where and are the Lagrange multipliers for the constrains and , respectively, and is a scalar penalty parameter. A Newton-like learning algorithm is applied to optimize the above equation and get the updated separating vector. The updated formula is as follows [15]:where is the current iterative number, is the fixed learning rate, is the covariance matrix of the observed signals, and and are the first and second derivatives of , respectively, and can be expressed aswhere , and are the first and the second derivatives of with respect to , respectively, and and are the first and the second derivatives of with respect to , respectively.

The optimum multipliers and are updated by the following equations:

The same author Lu and Rajapakse proposed [17] another cost function for the one-unit ICA-R, as follows: The learning rule of the separating vector iswhereThe symbols of the parameters in (12) have the same meaning as those in (7). Many studies about the ICA-R method are based on the cost function (11) [19, 20, 25].

#### 3. Analysis of the Non-Gaussian Measurement Function in the One-Unit ICA-R Method

The non-Gaussian measurement function (5) that is also the cost function of the fast ICA can also be expressed aswhere is the absolute value of . Maximizing (14) equals maximizing (3). In ICA-R, is a Gaussian variable having zero mean and unit variance [17]. It actually has nothing to do with the separating vector and can be seen as a constant for . Accordingly, maximizing (14) equals maximizing or minimizing . We denote it by ; that is, where , is the source signals, is the mixing matrix, and is the separating vector. In the one-unit ICA-R, we only extract one desired signal. We consider that the successfully extracted signal equals ; hence, the stability point of is .

At the stability point, the first-order derivative of with respect to isThe , which means that the source signals with zero are mutually statistically independent, is used in (16). It is a fundamental assumption in ICA-R [17] and its second derivative is as follows:Adding a small fluctuation into , we obtainTaylor’s series expansion of at the point isIn the one-unit ICA-R, the extracted signal is considered to have unit variance. When the desired signal has unit variance, then . That is, . Thus, we obtain and its Taylor’s series at point zero is Substituting (20) in (19), we get As the components of are very small, we can neglect its higher order term in (20) and (21). From the last item in (21), we can obtain that when then is the minimum, and vice versa is the maximum.

#### 4. Improve One-Unit ICA-R Method

As the previous one-unit ICA-R methods involve the computation of the in every iteration, it is a time consuming procedure. If the computation of is avoided in , the implementation of the ICA algorithm and ICA-R algorithm can be further speeded up. We wish to incorporate the prior information of the desired signal into non-Gaussian measurement function to eliminate to avoid further computing . is a part of the cost function of the one-unit ICA-R. As a result, this will reduce computation complexity of the one-unit ICA-R.

##### 4.1. Simplification of the Non-Gaussian Measurement Function

Since the nonlinear function used in one-unit ICA-R is a positive even function, the terms and are both positive in (14), and has nothing to do with the separating vector and thus can be seen as a constant. Accordingly, when has a maximum value, this equals , which has a maximum value. On the contrary, when has a minimum value, this equals , which has maximum value.

Based on the analysis in Section 3, when , has maximum value. Therefore, when , we can determine that has a maximum value based on the analysis described in the first paragraph of this subsection. Based on the above analysis, when , then (14) can be simplified asAs has nothing to do with the separating vector, we can further simplify the non-Gaussian measurement function (22) as On the other hand, when the , the non-Gaussian measurement function can be simplified asTherefore, the negentropy approximation in (23) and (24) can be expressed aswhere .

In [16], the following function was proposed for extracting the super-Gaussian and sub-Gaussian signal, respectively.where . In [25], Mi has proved that (27) should be corrected as Accordingly, in this paper, we use (29) instead of (27). For simplicity, we set in (29). The first and second derivatives are and , respectively.

If we use (29) as a nonlinear function, in (25) can be expressed as follows: In (30), we used Taylor’s series expansion of , which is described asNeglecting the higher order term in (30) and considering that the desired signal has unit variance, which is a very common hypothesis in ICA and ICA-R, we can obtain the following:According to the kurtosis definition of a super-Gaussian signal, when is a super-Gaussian signal with unit variance, . Then in (32). This means that if the desired signal is super-Gaussian, we can directly simplify (25) as

##### 4.2. Improved One-Unit ICA-R for Extracting FECG

The one-unit ICA-R is used to extract one desired signal. The prior information that the desired signal is a super-Gaussian or a sub-Gaussian signal is easily obtained. The FECG and MECG are both super-Gaussian signals in general. Therefore, in the extraction of FECG, we can directly use (33) instead of (14) to obtain the non-Gaussian measurement of an extracted signal. Centering and whitening can reduce the computation complexity of one-unit ICA-R [11]. Thus, we use centering and whitening to process the observed signals. After preprocessing, the new cost function of the one-unit ICA-R for extracting FECG can be expressed aswhere is the extracted signal, is the centered and whitened signal, and is the separating vector. in (34) is different from in (12). is just a positive constant. Correspondingly, the augmented Lagrangian function for the problem in (34) is expressed as follows:To find the maximum of , Newton’s method is used to optimize . The learning rule of the separating vector is given by the following equation:where is the learning rate:

The main difference between the improved algorithm in (36) and the original algorithm in (12) is that in (36) is just a positive constant and in (12) is . In the original algorithm, we have to additionally compute in every iteration, which considerably consumes much more time compared with the improved algorithm.

The one-unit ICA-R uses Newton’s method to optimize cost function. Newton’s method is sensitive for the initial value. The initial separating vector directly affects the convergence of Newton’s method. However, the original algorithm randomly initializes the separating vector. To avoid random initialization, inspired by previous a report [20], we use the following expression to initialize the separating vector:where is the whitened reference signal, is the whitened signal, is the inversion of , and is the initialization separating vector. The closer to whitened desired signal is, the closer to the perfect separating vector will be. This can speed up the algorithm convergence rate and improve the probability of global convergence compared with random initialization.

Therefore, the proposed one-unit ICA-R algorithm for extracting FECG can be described as follows:(1)Center the observed signals.(2)Whiten the centered signal.(3)Choose a proper scalar penalty parameter , convergence threshold , parameter , and the leaning rate ; generally let , .(4)Use (38) to initialize separating vector .(5)Normalize .(6)Choose an initial value for .(7)Update the Lagrange multiplier by (9).(8)Update the separating vector by (36).(9)Normalize the separating vector by (10)Until , otherwise go back to step .

#### 5. Simulation and Discussion

To demonstrate the effectiveness of the proposed one-unit ICA-R algorithm, both artificial ECG data and real-word ECG data are used in the following two experiments. To quantitatively compare the performance of the proposed one-unit ICA-R and Mi’s one-unit ICA-R [25], we adopt the individual performance index (IPI) as an indicator function, as was used in a previous study [11]. It is defined as where denotes the absolute value operator, denotes element of the global system vector , is mixing matrix, is the whiten matrix, is the separating vector of mixed signals after whitening, is the global separating vector, is the number of mixing signals, and is to find the maximum valued among the elements in the vector . The extracted signal by proposed method can be described asFrom (41), it can be easily seen that if , , we will have the extracted signal , which is just rescaled signals of . It is obvious from (40) that we have and that IPI equals zero if and only if is a rescaled canonical base vector that only one element of vector is not zero. Therefore, the closer to a rescaled canonical base vector the global system vector , the nearer to zero the IPI and thus the better the performance of one-unit ICA-R method.

In the experiments, we select as the nonquadratic function and as the closeness measure function. Additionally, . We also use the corrected first derivative of as its first derivative. The corrected first derivative is given in [19] asIn the following experiments, we set , , and all equal to one and perform 1000 trials for each data set.

##### 5.1. Experiment with Artificial ECG Data

The artificial signals include the power line interference with 50 Hz, Gaussian noise, FECG, and MECG. The FECG and MECG are generated by using the ECG toolbox of Sameni [28]. The reference signals for the FECG and MECG are constructed with sign operation to roughly give the signs of the most data samples of the desired source signal [14, 15]. There are 5000 samples in the experiment. The mixing matrix is randomly generated asFour artificial source signals are depicted in Figure 1, which are considered independently of each other. s1 is the power line interference with sub-Gaussian distribution. s2 is the random Gaussian noise with Gaussian distribution. s3 and s4 are FECG and MECG with super-Gaussian distribution, respectively. The mixtures are shown as mixed signals in Figure 2. The FECG extracted by the proposed method and its reference are shown in Figure 3. The MECG extracted by the proposed method and its reference are shown in Figure 4. Comparisons of the extracted FECG and the extracted MECG are shown in Figures 3 and 4 with the desired signals s3 and s4 in Figure 1, respectively. It is evident that the waveforms of the extracted signals are most identical to the waveforms of the desired signals.