A New Method of Blind Source Separation Using Single-Channel ICA Based on Higher-Order Statistics

Lu, Guangkuo; Xiao, Manlin; Wei, Ping; Zhang, Huaguo

doi:https://doi.org/10.1155/2015/439264

Mathematical Problems in Engineering

On this page

Abstract Introduction Conclusion Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2015 | Article ID 439264 | https://doi.org/10.1155/2015/439264

A New Method of Blind Source Separation Using Single-Channel ICA Based on Higher-Order Statistics

Guangkuo Lu,¹Manlin Xiao,²Ping Wei,¹and Huaguo Zhang¹

Academic Editor: Carla Roque

Received01 Apr 2015

Accepted22 Jul 2015

Published11 Aug 2015

Abstract

Methods of utilizing independent component analysis (ICA) give little guidance about practical considerations for separating single-channel real-world data, in which most of them are nonlinear, nonstationary, and even chaotic in many fields. To solve this problem, a three-step method is provided in this paper. In the first step, the measured signal which is assumed to be piecewise higher order stationary time series is introduced and divided into a series of higher order stationary segments by applying a modified segmentation algorithm. Then the state space is reconstructed and the single-channel signal is transformed into a pseudo multiple input multiple output (MIMO) mode using a method of nonlinear analysis based on the high order statistics (HOS). In the last step, ICA is performed on the pseudo MIMO data to decompose the single channel recording into its underlying independent components (ICs) and the interested ICs are then extracted. Finally, the effectiveness and excellence of the higher order single-channel ICA (SCICA) method are validated with measured data throughout experiments. Also, the proposed method in this paper is proved to be more robust under different SNR and/or embedding dimension via explicit formulae and simulations.

1. Introduction

As one of the most attractive solutions for the blind source separation (BSS) problem, independent component analysis (ICA) has a strong practical background and wide applications in multiway data analysis such as biomedicine [1], image processing [2], telecommunications [3], geophysical research field [4, 5], and physics of musical instruments [6, 7], because it is a combination of informationism, optimal theory, probability, matrix theory, and mathematical statistics. Indeed, several papers recently have been written in which standard linear ICA, for instantaneous mixtures, was successfully applied to many natural environments as explosion-quakes and tremor at Stromboli and Erebus volcanoes [8, 9], to study acoustical and mechanical vibrational field in organ pipe [10]. Particularly, in intuitive way a segmentation combined with a ICA approach was already proposed to get information on the very long-period waves from water-level oscillations [11, 12], in which the ICA, the intertime occurrence, and the reconstruction of asymptotic dynamics are adopted after a preanalysis in the frequency domain. Obviously, ICA appears more appropriate in the investigation of nonlinear systems than the analyses based on the Fourier transform, even though several tidal behaviours have been pointed out by frequency-domain methods.

Generally, the number of sensors must be no less than that of the sources to acquire information to support the BSS work. Often in real cases, however, one has just a single measure of a certain specific physical variable, from which information on the underlying source mechanism has to be derived. In this case, the topic faced by the researchers is very important and difficult, that is, the extraction of characteristics from single experimental series, because of the lack of prior information. The method called SCBSS is proposed to exact the independent feature by using only one transducer. The methods employ ICA [13] to find the interested and independent feature from the decomposed signals based on oversampling [14], principal component analysis (PCA) [15], short time Fourier transform (STFT) [16], wavelet transform (WT) [17], empirical mode decomposition (EMD) [18, 19], and so forth. The SCBSS methods based on oversampling and PCA can be used to separate the single-channel signal, however, only when the source signals in the linear mixture are stationary and independent. The methods based on STFT and WT can solve the nonstationary problem whereas they do not work to separate a nonlinear time series, which is generated from most of the natural [20, 21] and artificial systems in some fields like wireless communication and radar and sonar engineering. The SCBSS method based on EMD can be directly used to separate these single-channel signals, which are nonlinear, nonstationary, and even chaotic. But this method will break down if any source signal is not an Intrinsic Mode Function (IMF). Particularly, if the mixture contains spike pulses or the source signals have different time of arrival (TOA), which introduce some spurious extrema into the mixture, the EMD algorithm suffers from the problem of IMF confusion, and then a number of phantom sources can appear in the decomposed signals. To solve these problems, the way of thinking needs to be changed for developing new methods.

In a dynamical embedding framework, the measured data can be assumed to be generated by the nonlinear interaction of just a few degrees of freedom, with additive noise, and suggests the existence of an unobservable deterministic generator of the observed data. Obviously, in this case the reconstructed phase space (RPS) can be used to uncover as much information as possible about the underlying generators based only on the measured data [22], and the ICA algorithm could be then performed on the embedding matrix to exact its underlying ICs in the SCICA method proposed by James and Lowe [23, 24]. However, the SCICA method can successfully separate the measured signal only when the time series is nonlinear and stationary. Also, the key of the SCICA algorithm is to change the one-dimension time series to equivalent multidimensions through the RPS method. In order to achieve a better result, a larger embedding dimension should be taken, which could greatly increase computational complexity. To overcome this shortcoming, Ma [25] has developed a novel method, in which the stationary segments are firstly gotten by the Bernaola-Galavan (BG) segmentation algorithm [26], then the embedding dimension is reduced by singular spectrum analysis (SSA) [27], and the ICs are finally generated by ICA. The computational complexity of Ma’s method is successfully decreased from that of SCICA while achieving better performance. Unfortunately, although the selection of a suitable window length in Ma’s method, which is crucial for the resolution of the SSA method to be computed by means of multiple autocorrelation, some subjective factors are introduced in the computational process. Essentially, SSA is a linear method based on the covariance matrix which reflects the linear relationship of the source signals and cannot reflect the intrinsic nonlinear relationship of them, although SSA-based method has been successfully applied in the field of signal processing for nonlinear dynamical systems. Particularly, the eigenvalues of the covariance matrix cannot be used to select a series of features to reconstruct the original time series if the signal to noise ratio (SNR) is too low or the embedding dimension is not correctly selected. Moreover, a multistage SSA algorithm [25] has been proposed to exact the feature signal under strong noise levels, which greatly increases the computational complexity.

In order to find a solution to the aforementioned problems, a modified method based on HOS is developed in this paper. Section 2 introduces signal model and the problem that needs to be solved in this paper. Section 3 contains the HOS-based SCICA method and simulations are carried out to verify the effectiveness of the method in Section 4. Finally, the conclusion of the paper is given in Section 5.

Notations. Hereinafter, bold uppercase letters denote matrices; bold lowercase letters stand for column vectors and lowercase letters represent scalars. Superscripts , , and denote transpose, absolute value, and Frobenius-2 norm, respectively. is the expectation operator. is the th entry of . denotes convolution. denotes real number domain. and denote the true value and the estimate of variable , respectively.

2. Problem Statement

2.1. Data Model

Generally, the observed single-channel signal could be modeled as a single-channel instantaneous linear mixture (SCILM) of unknown independent signal :where is the weight of source signal the th and is the zero-mean additive white Gaussian noise of unknown covariance. Obviously, ICA does not work when only one sensor could be employed. For the scenario, the key is to change the single time series into multidimensional time series before ICA will be used to separate the preprocessed signals. Based on different additional assumptions, the single-channel data can be reconstructed into different pseudo-MIMO models by different decomposition methods, such as PCA, STFT, WT, SSA, and EMD.

2.2. Single-Channel ICA

When the actual data is treated as a nonlinear time series with additive noise which is generated by the nonlinear interaction of just a few degrees of freedom, we can use the SCICA algorithm to solve the SCBSS problem. RPS is the first and foremost step, when the dynamic system theory is utilized to analyze a nonlinear time series. In [22], Takens shows that the map defined by is embedding, where the -dimensional state space , is a twice continuously differentiable diffeomorphism that describes the dynamics of the system and is a twice continuously differentiable function representing the observation of a single state variable. Generally, the embedding dimension must be large enough to capture the necessary information. Then for a nonlinear time series , the state of the unobservable system at time , is given bywhere is the lag, is the embedding dimension, and is the sampling interval.

Any approach to state space reconstruction uses the information in delay coordinates as a starting point. Obviously, Takens’ theorem allows us to reconstruct the unknown dynamical system that generates the measured time series by reconstructing a new state space based on the successive observations of the time series. It is indicated that the RPS of the nonlinear time series is the essential projection of the strange attractor on the axis of the space spanned by delay vectors. Therefore, each time series constructed by each delay vector can be regarded as a mixture of source signals. As shown in [23], the method based on RPS could be used to change the single-channel data into multidimensions time series. Then ICA can be used to span the embedding matrix with any ICs and to exact the feature.

2.3. Problem Statement

SCICA could separate a single-channel time series successfully if and only if this method satisfies the following conditions [23]:(1)The measured signal is stationary.(2)The phase state can be reconstructed perfectly.(3)Each time series constructed by RPS could be considered as a single-channel instantaneous linear mixture (SCILM) of source signals.(4)All the independent random processes must be bandlimited with disjoint spectral support.

Unfortunately, SCICA algorithms cannot be used directly for sources separation or extraction while the signal is nonstationary. Therefore, a nontrivial structure with nonstationarity of the actual signal with variable statistical property such as the mean and the variance is expected. The problem addressed in this paper is to segment a nonstationary time series, which consist of many segments with different statistical property, in such a way as to maximize the differences in the statistical property between adjacent segments. The BG algorithm in [26] is applied to divide the nonstationary data in [25]. However, an important assumption in BG algorithm that the variances of adjacent two segments are constant and the nonstationarity is only reflected by the difference of means of these two segments is not always true in a general sense. Therefore, the higher-order moments will be used for the nonstationary detection in this paper.

Takens’ theorem [22] shows that the unknown dynamical system can be reconstructed by recreating a new state space only when the Euclidean embedding dimension must satisfy that ( is the attractor dynamics). As shown in the proof of [22], the embedding dimension and the time lag could be selected arbitrarily, resulting in arbitrarily precise states, which is as good as any others. However, an important assumption of Takens’ theorem is that the recording data without noise must be infinite, which may not always be true in the actual case. The actual data is always finite and is added with the strong broadband noise, which can obscure states and deteriorate the good properties of RPS. Simulation results [28] show that RPS does not work while the embedding dimension is less than the requirement. Accordingly, the calculated complexity increases with the increase of the embedding dimension. Although several methods, such as false nearest neighbor [29], singular value decomposition (SVD) [30], autocorrelation [31], and mutual information [32], can be used to determine the embedding dimension and the time lag of the reconstruction, these methods are mainly based on the experiments. Therefore, the selection of the reconstruction properties is essential to solve the problem of this paper.

Assuming instantaneous linear mixing of the sources at the sensors, ICA performs a blind separation of statistical independent sources with techniques involving higher-order statistics. However, RPS, which reconstructs the nonlinear time series in the state phase based on the delay coordinates, is essentially a nonlinear transform and cannot change single-channel data into multiple instantaneous linear mixture. Therefore, SSA [27] is used to transform the decomposed signals based on the delay coordinates into the one based on the original coordinates. In [25], SSA based on an eigenvalue decomposition (EVD) of the so-called lagged-covariance matrix for determining the optical dimension of reconstruction is applied to decompose the short and noise time series into a Pseudo-MIMO model before BSS is used. However, SSA, which essentially is a linear method, cannot reflect the structure of nonlinear dependence. Furthermore, SSA is not robust to reconstructive lag, embedding dimension, and the effect of the additive noise. Therefore, based on the above analyses, HOS-based methods may be employed because of the robustness of the higher-order-cumulants to Gaussian noise and the nonlinear property.

3. Source Separation Using HOS-Based Single-Channel ICA

In this section, the actual data is assumed to be a stochastic process , where is deterministic, , and is an independent and identically distributed (IID) process. As shown in [33, 34] any process that satisfies the following assumptions can be referred to as a th-order quasistationary process:(1),(2),(3) is linear, time-invariant, and stable; that is, ,(4) exists and is finite, ,

where () is the time lag. Then, the fourth-order cumulants for a zero-mean, quasistationary (up to the fourth-order) signal is defined aswhere .

Furthermore, in this paper the actual data is considered as nonstationary signal, which is composed of many zero-mean, quasistationary (up to the fourth-order) segments with different higher-order statistical properties. Then the different statistical properties will be selected to segment the time series into several subsets by means of the BG algorithm [26], which is based on heuristic segmentation with different scales and is more effective in detecting the abrupt changes of nonlinear time series.

Considering a zero-mean, quasistationary (up to the fourth-order) subset of the actual data, which is generated by a nonlinear dynamical system, the information about the underlying generators is uncovered by employing RPS-based method. Using the mean time between peaks (MTBP) as the time window, the reconstructed parameters and are estimated simultaneously and the nonlinear time series is reconstructed by Takens’ embedding theory [22]. Then, the decomposition and reconstruction based on HOS are applied to reduce the dimension, weaken the noise, eliminate the nonlinear factors, and transform the phase space based on delay coordinates into the multiple instantaneous linear mixture. Finally, ICA is used to separate the decomposed sequences and extract the information from short and noisy time series. The modified SCICA strategy is illustrated in Figure 1, which will be further interpreted in the following sections in detail.

3.1. HOS-Based Segmentation of Nonstationary Time Series

Since SCICA does not work to the nonstationary signals which are the property indeed of the measured single-channel actual signals, a modified BG algorithm is necessary for developing HOS-based methods. The BG algorithm is designed to characterize the stationary durations of human heart beat time series in [26]. The calculated complexity of this algorithm is reduced by iteratively segmenting the time series into only two segments. A decision to cut the time series is made by evaluating a modified Student’s -test for the data in the two segments. However, the important assumption of this algorithm that the variances of both segments are kept constant and the nonstationarity is only reflected by the difference of means of the two segments may not always be true in a general sense as shown in Figure 2. Also, the BG algorithm does not work if the different scales are selected as shown in Table 1. In this paper, we suppose that the subsets composing the nonstationary signal are the stationary, real-valued, random processes with different means, and their moments up to order exist. Based on this assumption, a modified BG algorithm based on HOS is proposed in this section.

(a)

(b)

(c)

(d)

(e)

The original algorithm is modified as follows: After selecting a larger value as the minimum segment length, a sliding pointer is moved from left to right along the signal. At each position of the pointer, the time slots are computed as and are the mean of the subset of the signal to the left of the pointer and to the right, respectively. And, the statistical significance of is computed aswhich could not be expressed in a closed form. Then, a suitable approximation by means of Monte Carlo simulations in [26] is given:This significance needs to be checked whether it is larger than a selected threshold which usually is taken to be 0.5~0.95. If so, the signal is cut at this point into two subsequences; otherwise the signal remains undivided. This process is continued to be conducted for each of these two subsequences recursively until the signal is composed of many minimum subsets. In other words, the process stops when none of the significance of the possible cutting points is larger than . It needs to be noted that the process also stops if the length of subset is shorter than and we say that the signal has been segmented at the significance level . Then the time series is expressed aswhere is the mean of the subset . After selecting a smaller value as the new minimum length , a sliding pointer is moved again from left to right along each subset segmented on different means. At each position of the pointer the time spots need to be computed aswhere , , and are variances, skewnesses, and kurtosises of the subsets of signal on both sides of the pointer. Since the statistical significance of cannot be reflected by the change of , BG algorithm can be used to segment the signal into subsets with different features corresponding to the 1st–4th order moments of time series.

3.2. HOS-Based Dynamic Systems Analysis of Nonlinear Time Series

The zero-mean, quasistationary (up to the fourth-order) segments prepared for SCICA could be exacted by a modified BG algorithm, as discussed above. Unfortunately, SCICA cannot be directly used to recover all sources from the recorded mixtures, in particular scenarios that the useful sources cannot be perfectly reconstructed into a phase state and the mixing segment cannot be considered as a SCILM. Applied to the actual data, therefore the reconstruction parameters can be selected by a new method to ensure validity of the RPS, and the reconstructed phase state in the delay coordinate system can be transformed into a multipath instantaneous linear mixture, which can be solved by ICA.

3.2.1. RPS

The performance of RPS depends on two parameters, namely, the selection of the embedding dimension and the lag . The correct estimations of these two parameters make the phase state be reconstructed perfectly. Recently some studies [35, 36] prove the relationship of these two parameters, which are expressed aswhere is the time window.

Paper [35] shows that the nonlinear dynamics is successfully reconstructed in the phase space when the time window is kept constant and the other parameters are variable. Now we emphasize on how to use a simple way to estimate , which equals the mean period in some applications. Strictly speaking, chaotic system is nonperiodic. For the low dimensional chaotic system with pseudo period, the mean orbital period is approximately equal to mean oscillation period of the chaotic attractors but never meets the orbit period of phase space. Meanwhile, the literatures [36] point out that the mean orbital period and the MTBP of chaotic time series are equal in general. The fast Fourier transform (FFT) can be employed to get the main frequency , and the MTBP of the original time series is calculated to be the time window , that is, . Then the signal processed by RPS should provide a sufficient frequency coverage that can include all the frequency components related to the feature. Therefore, the following condition must be satisfied:where is the lowest frequency of interest. The sampling frequency meetsThe sufficient frequency coverage for RPS is selected based onThen,Therefore, the lag can be set as small as 1 while the embedding dimension is large enough based on Takens’ theory, which is based onIf the embedding dimension and the time lag are obtained, the initial time series is changed into the trajectory matrix :Particularly, paper [35] shows that the value of the time lag could be reflected by the autocorrelation of signal. If the degree of autocorrelation of signals becomes lower, the smaller lag can be selected. Thus the time lag cannot be too large in the RPS for the signal with low SNR. Although the signal of RPS may be with low SNR because of the large time window, the modified method based on HOS in this paper could be applied to reduce the white noise.

3.2.2. HOS-Based Coordinate Transformations

For some purposes, such as reducing the dimension of the reconstruction, it may be desirable to make a further coordinate transformation to a new coordinate system [37]: . It is shown that performing SVD for the trajectory matrix embedded by RPS is equivalent to apply EVD to the lagged-covariance matrix , which can be denoted asThenAfter the step of embedding, the EVD is applied to the lagged-covariance matrix , which is rewritten asIn this step, we select and the lagged-covariance matrix can be denoted asThen, a nonlinear transformation is defined asAlthough SVD can be used to construct a nonlinear transformation that (in a certain sense) provides the optimal coordinates in the RPS, unfortunately, singular spectrum obtained from some system data is not used for distinguishing the dynamic signal from the random noise. Furthermore, the autocorrelation-based methods cannot be robust enough for the embedding dimension and the lag with large dynamic range, which can reflect the trajectory matrix and the singular spectrum. Finally, the autocorrelation-based method, which is essentially a linear analysis technique, is difficult to reflect the intrinsic nonlinear correlation on space-time of nonlinear time series. Also, the observed data is composed of many zero-mean, quasistationary (up to the fourth-order) segments. Then to solve the above-mentioned problem, the algorithms based on HOS are considered to be used, which include the following advantages: (1) suppressing additive white and colored Gaussian noise of unknown power spectrum; (2) exacting information due to deviations from Gaussianity; and (3) detecting and characterizing nonlinear properties in signals as well as identifying nonlinear systems [38, 39].

Thus by applying the high-order cumulants theory, a modified method is provided. A fourth-order cumulant is defined as

Then, the elements from the 4th-order cumulants function are selected as the elements of . However the 4th-order cumulants function has three variables and cannot be directly used to SVD. To obtain a binary function, the 4th-order cumulants function is taken to slices. For example,where is the th-order moment function for a zero-mean, quasistationary (up to th-order) signal . Then, if the 4th-order cumulants function is taken to many different slices, the can be reexpressed as

Finally, the SVD method is applied to and the singular spectrum can be obtained, which can be used to reconstruct the original series without noise.

However, the different fourth-order cumulants slices functions of are not equal and their robustness for SNR, , and is different as shown in Figure 3. Thus the optimal splice corresponding to the mixed signal is needed to be selected. This paper examines how states are affected when the observational noise complicates the reconstruction problem for the actual data. When a -dimensional state is projected onto a -dimensional measurement with , the information is thrown away. However, if the uncertainty of the reconstructed state is much higher than that of the individual measurements, the noise is amplified. In that case, the change of the properties of the reconstruction and will result in the change of the trajectory matrix and even cause the change of the diagonal matrix and the new coordinate system as shown in Figure 3. This phenomenon is referred to as noise amplification, which can be defined aswhere and is the given noise level.

Obviously, the noise amplification depends on the measurement function, the method of reconstruction, and the dynamical system as proved in paper [40]. It means that the method of SCBSS for nonlinear time series is not robust for the different SNR and the different factors such as or/and . Thus the noise amplification has to be defined to compare different methods of state space reconstruction. This gives guidance for optimizing the parameters of a particular method, or for comparing two different methods.

Then we show that selecting the optimal slice is equivalent to minimizing the noise amplification , which can be estimated from the time series . For noise reduction, the true value is defined as . Since , the true noise amplification can be defined asObviously the true noise amplification can be used as evaluative criteria for the cumulants slices. Then, the smaller the value of , the better the robustness of the selected cumulants slice for SNR, , and .

As shown in the following experiments, there are several general motivations behind the use of the HOS-based method in signal processing.

(1) This method is used to distinguish the feature signal from the Gaussian noise. Clearly, consider signal , where is the zero-mean, deterministic signal, and is the Gaussian noise. When is quasistationary up to the fourth order, then the fourth-order mixed cumulant is given byBecause of more effectiveness for suppressing additive Gaussian noise and reflecting the intrinsic nonlinear correlation in nonlinear time series, the 4th-order cumulants will be used for conducting the singular spectrum analysis of nonlinear time series in this paper. Figure 4 illustrates the results of singular spectrum analysis obtained by a cross-correction method and a technique based on cross fourth-order cumulants. The signal of interest is assumed to be nonlinear and non-Gaussian whereas the additive noise is Gaussian. From Figure 4, it is apparent that the HOS-based coordinate transformation does suppress the effect of Gaussian noise and thus provides better singular spectrum, especially in low SNR.

(a)

(b)

(2) As shown in Figure 5, the method based on high-order moments is a more robust technique for autocorrelation-based method under different embedding dimensions, which is the greatest superiority for the nonlinear time series analysis.

(a)

(b)

(3) The th-order cumulants function of a non-Gaussian stationary random signal can be written as (for only)where is the th-order moment function of and is the th-order moment function of an equivalent Gaussian signal that has the same mean value and autocorrelation sequence as . The third motivation is based on the measured signals in the real-world, most of which are nonlinear and non-Gaussian and have nonzero higher-order moments. For the nonlinear system analysis, the HOS-based coordinate transformation can reflect the third-order correlation of the nonlinear time series while the autocorrelation-based methods only give expression to the linear correlation. Thus more information about the nonlinear feature of the nonlinear time series is exacted as shown in Figure 6.

(a)

(b)

The HOS-based method is used to process the trajectory matrix corresponding to the measured single-channel signal by means of RPS. A typical process of HOS-based optimal coordinate transformation has the following stages.

(1) Calculate the fourth-order correlation function and construct the Toeplitz matrix with the optimal slice .

(2) Apply SVD on the Toeplitz matrix :Sort the eigenvalues from the diagonal of and the eigenvectors in , and accordingly distinguish the feature components from the additive noise.

(3) Project the trajectory matrix on the subspaces :where the subspace spanned by the eigenvectors is . Then, the trajectory matrix can be written as

(4) Reconstruct the original series corresponding to the selected features based on the inverse Hankelization by antidiagonal averaging into a new time series of length , and obtain the time series from averaging of the corresponding diagonals of the matrix :where , . Then we obtain the pseudo-MIMO model: , where is the number of feature components, which need to be further separated by means of the ICA method.

3.3. Independent Component Analysis

Then, ICA [13] is chosen to represent the data in the pseudo-MIMO model in the assumption that (a) the measured data provided by the HOS-based method can be considered as a linear instantaneous mixing, and (b) any source of interest is independent of the other signals. ICA, which is a blind separation of statistically independent sources, performs as a means of identifying underlying components of the actual signals data. This method yields more useful results than other methods such as PCA as shown in [41].

In essence, ICA must find a separating or demixing matrix , which is sufficient to , where the components in the separated signals are as independent as possible. The general method based on the minimization of mutual information [42, 43] can be described aswhere is the whitening data, is the row vector of demixing matrix , is the iterative sequence number, indicates mean value, and is the score function, which can be obtained as gradient of entropy . can be estimated bywhere ,and is a smoothing parameter matrix. Then the estimation of the probability density isand the mutual information can be estimated aswhere . Therefore, the separated signals can be described aswhere is the separated independent data.

Performing ICA on the matrix processed by HOS-based method results in a set of ICs that forms the basis of the matrix. Particularly, since the data has been processed by HOS-based coordinate transformation, the ICs as many as measurement channels can be directly considered as all of the principal components in the data. Furthermore, the ICs of interest can be identified and exacted.

3.4. The Implementation of the Algorithm

Based on the previous sections, we can introduce a modified SC-ICA algorithm by the following implementation:(1)Detect and segment the measured signal into several high-order stationary subsets (with the modified BG segmentation algorithm) in time domain.(2)Determine the window (based on the time window method) and transform each subset into multidimensions time series (based on the RPS).(3)Perform the HOS-based coordinate transformation to change the trajectory matrix into the pseudo-MIMO model.(4)Separate the source signals with ICA.

4. Simulations

A single-channel observation of two sources is taken as an example. The sampling frequency is set to 1 GHz. The data consists of 30,000 samples and the signal is cut into a series of quasistationary (up to the fourth-order) subsets by means of the modified BG algorithm. A subset which consists of 4096 samples is taken as a time series under test. The power spectrum of the mixture of sources is shown in Figure 7. Note that just from the power spectrum someone may be fooled into believing that there are three distinct sources present. After calculating the minimum time window and determining the embedding dimension with a lag , the time series can be reconstructed into the multidimensions time series . Applying autocorrelation-based method to trajectory matrix results in the singular spectrum as shown in Figure 8. Figure 8 also shows that the number of singular values associated with the signal subspace is three. Obviously, the autocorrelation-based method does not work. On the contrary it is clear that the HOS-based method successfully recognizes the number of the sources shown in Figure 9. Then, the number of path is selected as six and the power spectrum of the mixture of sources by means of RPS and HOS-based coordinate transformations is shown in Figure 10. Obviously, the data from six channels can be regarded as multipath instantaneous linear mixture model. Finally, the two sources are successfully separated via ICA as shown in Figure 11, and the other ICs generated by ICA are the useless ones as shown in Figure 12.

5. Conclusion

Overall, this paper presents an approach for exacting information from single real-world data. The idea is firstly to segment the measured signal and then to form a pseudo-MIMO system by means of decomposing the observed segment into several signals using a representation method based on higher-order statistics (HOS). Finally the fixed-point FastICA algorithm is applied to estimate the source signals (independent components). The simulations show that the method is successful in isolating components from the single-channel data. Also, the methods based on fourth-order cumulants are more robust than those based on autocorrelation as the properties of the reconstruction are changed. Compared with the autocorrelation method, HOS-based SCICA is better for low SNR. At this stage, HOS will be more sensitive to the number of the used samples. This is a problem for all the HOS-based method when they are applied to the actual data, but it is not a problem for this method. Moreover, since the available techniques used in this paper can process the single-channel signal without depending on a priori, the method is a very powerful method that can isolate feature components in the actual data.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (61201282) and the Fundamental Research Funds for the Central Universities of China (no. ZYGX2013J016).

References

S. Takahashi, Y. Anzai, and Y. Sakurai, “A new approach to spike sorting for multi-neuronal activities recorded with a tetrode—how ICA can be practical,” Neuroscience Research, vol. 46, no. 3, pp. 265–272, 2003.
View at: Publisher Site | Google Scholar
A. Tonazzini, E. Salerno, and L. Bedini, “Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique,” International Journal on Document Analysis and Recognition, vol. 10, no. 1, pp. 17–25, 2007.
View at: Publisher Site | Google Scholar
T. Ristaniemi and J. Joutsensalo, “Learning algorithms for blind multiuser detection in CDMA downlink,” in Proceedings of the 9th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, vol. 3, pp. 1040–1044, September 1998.
View at: Google Scholar
L. H. Sibul, M. J. Roan, and J. Erling, “Deconvolution and signal extraction in geophysics and acoustics,” The Journal of the Acoustical Society of America, vol. 112, no. 5, p. 2389, 2002.
View at: Publisher Site | Google Scholar
L. T. Duarte, R. Lopes, J. H. Faccipieri et al., “Separation of reflection from diffraction events via the CRS technique and a blind source separation method based on sparsity maximization,” in Proceedings of the 13th International Congress of the Brazilian Geophysical Society, 2013.
View at: Google Scholar
B. Rivet, L. Girin, and C. Jutten, “Mixing audiovisual speech processing and blind source separation for the extraction of speech signals from convolutive mixtures,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 1–4, pp. 96–108, 2007.
View at: Publisher Site | Google Scholar
H.-M. Park, H.-Y. Jung, T.-W. Lee, and S.-Y. Lee, “Subband-based blind signal separation for noisy speech recognition,” Electronics Letters, vol. 35, no. 23, pp. 2011–2012, 1999.
View at: Publisher Site | Google Scholar
A. Ciaramella, E. De Lauro, S. De Martino, B. Di Lieto, M. Falanga, and R. Tagliaferri, “Characterization of Strombolian events by using independent component analysis,” Nonlinear Processes in Geophysics, vol. 11, no. 4, pp. 453–461, 2004.
View at: Publisher Site | Google Scholar
E. De Lauro, S. De Martino, M. Falanga, and M. Palo, “Decomposition of high-frequency seismic wavefield of the Strombolian-like explosions at Erebus volcano by independent component analysis,” Geophysical Journal International, vol. 177, no. 3, pp. 1399–1406, 2009.
View at: Publisher Site | Google Scholar
E. de Lauro, S. de Martino, E. Esposito, M. Falanga, and E. P. Tomasini, “Analogical model for mechanical vibrations in flue organ pipes inferred by independent component analysis,” The Journal of the Acoustical Society of America, vol. 122, no. 4, pp. 2413–2424, 2007.
View at: Publisher Site | Google Scholar
P. Capuano, E. De Lauro, S. De Martino, and M. Falanga, “Analysis of water level oscillations by using methods of nonlinear dynamics,” International Journal of Modern Physics B, vol. 23, no. 28-29, pp. 5530–5542, 2009.
View at: Publisher Site | Google Scholar
P. Capuano, E. De Lauro, S. De Martino, and M. Falanga, “Water-level oscillations in the Adriatic Sea as coherent self-oscillations inferred by independent component analysis,” Progress in Oceanography, vol. 91, no. 4, pp. 447–460, 2011.
View at: Publisher Site | Google Scholar
P. Comon, “Independent component analysis, a new concept?” Signal Processing, vol. 36, no. 3, pp. 287–314, 1994.
View at: Publisher Site | Google Scholar
E. S. Warner and I. K. Proudler, “Single-channel blind signal separation of filtered MPSK signals,” IEE Proceedings: Radar, Sonar and Navigation, vol. 150, no. 6, pp. 396–402, 2003.
View at: Publisher Site | Google Scholar
C. Servière and P. Fabry, “Principal component analysis and blind source separation of modulated sources for electro-mechanical systems diagnostic,” Mechanical Systems and Signal Processing, vol. 19, no. 6, pp. 1293–1311, 2005.
View at: Publisher Site | Google Scholar
D. Barry, D. Fitzgerald, E. Coyle, and B. Lawlor, “Drum source separation using percussive feature detection and spectral modulation,” in Proceedings of the IEE Irish Signals and Systems Conference, pp. 13–17, September 2005.
View at: Google Scholar
J. Taelman, S. Van Huffel, and A. Spaepen, “Wavelet-independent component analysis to remove electrocardiography contamination in surface electromyography,” IEEE Engineering in Medicine and Biology Magazine, vol. 1, pp. 682–685, 2007.
View at: Google Scholar
B. Mijović, M. de Vos, I. Gligorijević, J. Taelman, and S. van Huffel, “Source separation from single-channel recordings by combining empirical-mode decomposition and independent component analysis,” IEEE Transactions on Biomedical Engineering, vol. 57, no. 9, pp. 2188–2196, 2010.
View at: Publisher Site | Google Scholar
Y. Guo, S. Huang, and Y. Li, “Single-mixture source separation using dimensionality reduction of ensemble empirical mode decomposition and independent component analysis,” Circuits, Systems, and Signal Processing, vol. 31, no. 6, pp. 2047–2060, 2012.
View at: Publisher Site | Google Scholar | MathSciNet
H. D. I. Abarbanel, T. W. Frison, and L. S. Tsimring, “Obtaining order in a world of chaos,” IEEE Signal Processing Magazine, vol. 15, no. 3, pp. 49–65, 1998.
View at: Publisher Site | Google Scholar
S. Haykin and J. Principe, “Making sense of a complex world,” IEEE Signal Processing Magazine, vol. 15, no. 3, pp. 66–81, 1998.
View at: Publisher Site | Google Scholar
F. Takens, “Detecting strange attractors in turbulence,” in Dynamical Systems and Turbulence, Warwick 1980, pp. 366–381, 1981.
View at: Google Scholar
C. J. James and D. Lowe, “Single channel analysis of electromagnetic brain signals through ICA in a dynamical systems framework,” in Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, vol. 2, pp. 1974–1977, October 2001.
View at: Google Scholar
M. E. Davies and C. J. James, “Source separation using single channel ICA,” Signal Processing, vol. 87, no. 8, pp. 1819–1832, 2007.
View at: Publisher Site | Google Scholar
H.-G. Ma, Q.-B. Jiang, Z.-Q. Liu, G. Liu, and Z.-Y. Ma, “A novel blind source separation method for single-channel signal,” Signal Processing, vol. 90, no. 12, pp. 3232–3241, 2010.
View at: Publisher Site | Google Scholar | Zentralblatt MATH
P. Bernaola-Galván, P. C. Ivanov, L. A. N. Nunes Amaral, and H. E. Stanley, “Scale invariance in the nonstationarity of human heart rate,” Physical Review Letters, vol. 87, no. 16, pp. 168–170, 2001.
View at: Google Scholar
G. Tzagkarakis, M. Papadopouli, and P. Tsakalides, “Singular spectrum analysis of traffic workload in a large-scale wireless LAN,” in Proceedings of the 10th ACM Symposium on Modeling, Analysis, and Simulation of Wireless and Mobile Systems (MSWiM '07), pp. 99–108, October 2007.
View at: Publisher Site | Google Scholar
M. Casdagli, S. Eubank, J. D. Farmer, and J. Gibson, “State space reconstruction in the presence of noise,” Physica D: Nonlinear Phenomena, vol. 51, no. 1–3, pp. 52–98, 1991.
View at: Publisher Site | Google Scholar | MathSciNet
M. B. Kennel, R. Brown, and H. D. I. Abarbanel, “Determining embedding dimension for phase-space reconstruction using a geometrical construction,” Physical Review A, vol. 45, no. 6, pp. 3403–3411, 1992.
View at: Publisher Site | Google Scholar
D. S. Broomhead and G. P. King, “Extracting qualitative dynamics from experimental data,” Physica D: Nonlinear Phenomena, vol. 20, no. 2-3, pp. 217–236, 1986.
View at: Publisher Site | Google Scholar | MathSciNet
D. Kugiumtzis, “State space reconstruction parameters in the analysis of chaotic time series—the role of the time window length,” Physica D: Nonlinear Phenomena, vol. 95, no. 1, pp. 13–28, 1996.
View at: Publisher Site | Google Scholar
A. M. Fraser and H. L. Swinney, “Independent coordinates for strange attractors from mutual information,” Physical Review A, vol. 33, no. 2, pp. 1134–1140, 1986.
View at: Publisher Site | Google Scholar | MathSciNet
A. Swami and J. M. Mendel, “Cumulant-based approach to harmonic retrieval and related problems,” IEEE Transactions on Signal Processing, vol. 39, no. 5, pp. 1099–1109, 1991.
View at: Publisher Site | Google Scholar
J. M. M. Anderson, G. B. Giannakis, and A. Swami, “Harmonic retrieval using higher order statistics: a deterministic formulation,” IEEE Transactions on Signal Processing, vol. 43, no. 8, pp. 1880–1889, 1995.
View at: Publisher Site | Google Scholar
R. J. Povinelli, M. T. Johnson, A. C. Lindgren, F. M. Roberts, and J. Ye, “Statistical models of reconstructed phase spaces for signal classification,” IEEE Transactions on Signal Processing, vol. 54, no. 6 I, pp. 2178–2186, 2006.
View at: Publisher Site | Google Scholar
Z. Xie and K. Wang, “Selection of embedding parameters in phase space reconstruction,” in Proceedings of the IEEE 2nd International Conference on Intelligent Computing Technology and Automation (ICICTA '09), vol. 4, pp. 637–640, October 2009.
View at: Publisher Site | Google Scholar
A. M. Fraser, “Reconstructing attractors from scalar time series: a comparison of singular system and redundancy criteria,” Physica D: Nonlinear Phenomena, vol. 34, no. 3, pp. 391–404, 1989.
View at: Publisher Site | Google Scholar | MathSciNet
J. M. Mendel, “Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications,” Proceedings of the IEEE, vol. 79, no. 3, pp. 278–305, 1991.
View at: Publisher Site | Google Scholar
W. A. Porter and W. Liu, “Steering high order moment calculations from lower-dimensional spaces,” Information Sciences, vol. 80, no. 3-4, pp. 181–194, 1994.
View at: Publisher Site | Google Scholar | MathSciNet
M. Casdagli, S. Eubank, J. D. Farmer, and J. Gibson, “State space reconstruction in the presence of noise,” Physica D. Nonlinear Phenomena, vol. 51, no. 1–3, pp. 52–98, 1991.
View at: Publisher Site | Google Scholar | MathSciNet
K. Kobayashi, C. J. James, T. Nakahori, T. Akiyama, and J. Gotman, “Isolation of epileptiform discharges from unaveraged EEG by independent component analysis,” Clinical Neurophysiology, vol. 110, no. 10, pp. 1755–1763, 1999.
View at: Publisher Site | Google Scholar
D. T. Pham, “Mutual information approach to blind separation of stationary sources,” IEEE Transactions on Information Theory, vol. 48, no. 7, pp. 1935–1946, 2002.
View at: Publisher Site | Google Scholar | MathSciNet
R. Aichner, H. Buchner, F. Yan, and W. Kellermann, “A real-time blind source separation scheme and its application to reverberant and noisy acoustic environments,” Signal Processing, vol. 86, no. 6, pp. 1260–1277, 2006.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2015 Guangkuo Lu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

4261

Downloads

1676

Citations