EMD Method for Minimizing the Effect of Seasonal Trends in Detrended Cross-Correlation Analysis
Detrended cross-correlation analysis (DCCA) is a scaling method commonly used to estimate long-range power-law cross-correlation in nonstationary signals. Recent studies have reported signals superimposed with trends, which often lead to the complexity of the signals and the susceptibility of DCCA. This paper artificially generates long-range cross-correlated signals and systematically investigates the effect of seasonal trends. Specifically, for the crossovers raised by trends, we propose a smoothing algorithm based on empirical mode decomposition (EMD) method which decomposes underlying signals into several intrinsic mode functions (IMFs) and a residual trend. After the removal of slowly oscillating components and residual term, seasonal trends are eliminated.
Recently, the existence of cross-correlations in complex systems has been extensively studied. Many studies provide strong empirical evidence for the existence of cross-correlations between time series. For example, in seismology, Campillo and Paul discussed the degree of cross-correlation among noise signals taken at different antennas of detector arrays . In traffic engineering, Zebende and Filho found the existence of a cross-correlation between vehicles and passengers . In finance, Shen et al. estimated the risk on the basis of cross-correlations for different assets and investment portfolios [3, 4]. In meteorology, Podobnik et al. studied the cross-correlation in successive differences of air humidity and air temperature [5, 6]. Cross-correlation functions together with autocorrelation functions are commonly used to gain insight into the dynamics of natural systems. Unfortunately, many time series, exhibiting cross-correlations, of physical, hydrological, and biological are nonstationary. Consequently, statistical properties of these systems are difficult to study due to these nonstationarities.
For determining the scaling exponent of cross-correlational time series in the presence of nonstationarities, detrended cross-correlation analysis method was developed by Podobnik et al. [5–8], and its performance was systematically tested for the effect of nonstationarities. After that, numerous methods referring to a broad range of applications [9–11] were established to investigate cross-correlational signal in the presence of nonstationarities.
However, many noisy signals in real systems exhibit trends so that the scaling phenomenon obtained by DCCA method becomes intricate to reveal the intrinsic dynamics of the real systems due to many trends changing the scaling exponent for different range of scales. Specifically, seasonal trends usually lead to a change of cross-correlations in signals. A technique, developed by Hajian and Movahed, based on singular-value decomposition (SVD) of the trajectory matrix was applied on the method specifically to eliminate possible crossovers in cross-correlation exponents . Based on chaos theory and SVD, the CSVD technique was reported by Dong et al. to minimize the effect of sinusoidal trends in traffic time series and stock data .
In this work, the DCCA procedure is pieced together with empirical mode decomposition (EMD), which is a well-established and promising method to analyze nonlinear and nonstationary signals . EMD, first proposed by Huang et al., is an empirical method to analyze data, which decomposes raw signals into several intrinsic mode functions (IMFs) and a residual term [13–15]. The method was employed to cope with the crossover of scaling exponent in DFA [16, 17]. By subtracting a trend embedded in the original signal, an EMD-based DFA was proposed in , where in each segment EMD was applied rather than a global used form, which is a good attempt but with a disadvantage of enormous time consumption. For this reason, this paper modifies the EMD-based DFA and extends to EMD-based DCCA.
Here we first investigate the scaling characteristic of noisy signals with sinusoidal trends. And then, we obtain the scaling properties of the reconstructed data by applying the EMD-based method, and we compare the difference in the scaling results. The results indicate that sinusoidal trends embedded in original data are extracted and the intrinsic fluctuations are retrieved.
The organization of this paper is as follows. First, the DCCA method is given in Section 2. The EMD-based technique is proposed in Section 3. Section 4 is the demonstration of effectiveness of the proposed technique on artificial and aeroengine data corrupted with seasonal trend. Finally, important conclusions drawn from this study are provided in Section 5.
2. DCCA Method
Many physical and biological time series are often nonstationary or have means, variances, and covariances that change over time. Nonstationary behaviors can be trends, cycles, random walks, or combinations of the three.
DCCA is a new method to quantify the cross-correlations between two nonstationary time series. This method is an extension of detrended fluctuation analysis (DFA) method. The DCCA procedure consists of four steps. Both methods are based on random walk theory. For two nonstationary time series and , , the DCCA method is given as follows.
Step 1. Compute the profiles of underlying time series using where and are the mean respectively.
Step 2. Cut the profiles and into non-overlapping segments of equal length , respectively. In each segment , we calculate the local trend by a least-square fit of the data and obtain the difference between the original time series and the fits.
Step 3. Calculate the covariance of the residuals in each segment as follows: where and are the fitting polynomials in segment , respectively. Then the average over all segments to obtain the fluctuation function is as follows:
Step 4. Determine the power-law relationship between the detrended cross-correlation function and as follows: where the scaling exponent represents the degrees of the cross-correlation between the two time series and . If, however, the detrended cross-correlation oscillates around zero as a function of the time scale , there are no long-range cross-correlations. For time series , the standard DFA procedure is retrieved.
3. EMD-Based Method
The empirical mode decomposition (EMD) first proposed by Huang et al. is designed for the time-frequency analysis of real-world signals . By applying it, the signal can be decomposed into a number of oscillatory modes named intrinsic mode functions (IMFs). The IMFs are obtained directly from the data with no a priori assumptions regarding the data nature, making EMD suitable for the analysis of nonlinear and nonstationary signals. And then, these IMFs provide meaningful instantaneous frequency estimates through Hilbert-Huang transform. The time-frequency analysis via EMD has found a wide range of applications in signal processing and related fields [18–23].
Specifically, for EMD method, if denotes an input, then the output is a set of IMFs, denoted by , such that where the residual is a monotonic function. The IMFs are achieved by sifting out rapidly oscillating components (dominant modes) from the data by iteratively subtracting slowly oscillating components (less dominant modes). These slowly oscillating components are defined as the local trend of a signal. The sifting process is stopped when all the less dominant modes are extracted in this paper.
The procedure used for the extraction of IMFs from a signal is as follows.
Step 1. Identify all the extrema (maxima and minima) of the series .
Step 2. Obtain the signal envelope passing through the minima and maxima via cubic spline interpolation, respectively.
Step 3. Compute the local mean series by point averaging of the two envelopes
Step 4. Subtract the mean from the signal to obtain an IMF candidate .
Step 5. Check the properties of : if is not an IMF, replace with and repeat the procedure from Step 1 and if is an IMF, evaluate the residual .
As previously demonstrated , the IMFs and the residual obtained by using EMD are not always unique and could change as the stoppage criterion for the sifting process changes. Once the trend is determined, the corresponding detrending operation can be implemented. With this definition of trend, the variability of the data on various time scales can also be derived naturally .
As we expect to eliminate the trend from original signal without slowly oscillating components will be used as signal without trend for DCCA. Slowly oscillating components may keep the major energy when signals present strong trend as we will see in Section 4.
4. EMD-Based DCCA Method for Minimizing the Effect of Sinusoidal Trends
The effectiveness of the proposed technique in minimizing the sinusoidal trends will be discussed in this section. Specifically, we consider two types of time series, artificial and aeroengine time series, in this paper.
4.1. Example 1: Artificial Time Series
In this section, we employ two-component ARFIMA method [26–28] to generate artificial long-range power-law signals and and test EMD-based method in minimizing the effects of sinusoidal trends. In this model, the series is defined by where is a free parameter, are weights, is the Gamma function, and is an independent and identically distributed (i.i.d) Gaussian variable. By introducing a parameter , the method can efficiently generate signals with power-law correlations characterized by a priori known correlation exponent . Using the ARFIMA method, we generate two time series: with and with , and the cross-correlation exponent approximately is equal to their mean value.
First, we use ARFIMA method to produce correlated signals with and with and illustrate DFA and DCCA on the signals in Figure 1. The scaling exponents in DFA method correspondingly approach , and cross-correlation exponent approximately equals to their mean value are expected.
For reliable detection of the cross-correlations, it is essential to distinguish trends from the intrinsic fluctuations in data. Generally, trends embedded in measurements are of two types: polynomial and sinusoidal trends. Although the DCCA method eliminates the polynomial trends, the sinusoidal trends remain .
The generated data sets with sinusoidal trends, with the period , will be denoted as and . A conspicuous crossover (circles) is found in Figure 2, when we apply DFA on the original signals superimposed with a strong sinusoidal trend.
We, respectively, use EMD to decompose original signals with sinusoidal trends into several IMFs. The IMFs are achieved by sifting out rapidly oscillating components from the data, by subtracting slowly oscillating components as shown in Figures 3 and 4. These slowly oscillating components are related to the sinusoidal trend in signal. We obtain the corresponding filtered data using the EMD-based algorithm and apply DFA to reconstructed data. We find that the detrended function (triangles) absented crossover phenomenon is similar to that of the data sets without sinusoidal trend (plus signs); see Figure 2. The results in Figure 2 indicate the remarkable ability of EMD-based method in eliminating a sinusoidal trend in DFA.
Next, we apply DCCA on the original signals superimposed with sinusoidal trend. As shown in Figure 5, the scaling exponent by DCCA shows an identical result compared to DFA exponents in Figure 2. Specifically to the crossover in circles plot, we again use EMD-based method to the signals and compare the DCCA function of reconstructed data to DCCA function of original data. Similar scaling properties are illustrated in Figure 5, which indicates a good performance of EMD in elimination of a sinusoidal trend.
We note that crossovers caused by any sinusoidal trend in artificial time series can be solved by EMD. We also test the effect of real time series on the method and find the expected results in the next section.
In the following discussion, we study the effectiveness of the proposed method on correlated signals with seasonal trend. Given a seasonal trend, where and are the amplitude. The generated data sets with seasonal trends will be denoted as and . We apply DCCA on the new signals, which is illustrated in Figure 6. A crossover (circles) exists which is affected by the trend. When the EMD-based method is applied, the DCCA function of reconstructed data is close to the DCCA function of original data.
4.2. Example 2: Aeroengine Time Series
In this section, the effectiveness of the proposed method is analyzed by applying the aeroengine series. The sinusoidal trends superimposed on the exhaust gas temperature (EGT) and engine fan speeds (N1), respectively, denoted by and will be discussed. The aircraft data includes operational flight data from on-board flight data recorders. These recorders are part of the Aircraft Condition Monitoring System (ACMS) such as Smart ACMS Recorder (SAR) and Quick Access Recorder (QAR). The QAR data are employed because the data includes an extensive list of flight parameters recorded at specific sampling intervals which are set by the manufacturer.
The DCCA of the original aeroengine data, original data with the sinusoidal trend, and reconstructed data is shown in Figure 7. It can be seen that the original characteristics of the aeroengine data are retained by the proposed smoothing technique. Obviously, the detrended cross-correlation function of filtered data is similar to cross-correlation function obtained by the data without sinusoidal trend. Similar scaling properties are illustrated compared to Figure 2, which indicates a good performance of EMD-based method in elimination of a sinusoidal trend not only in artificial time series but also in aeroengine time series.
In the paper, we consider EMD-based method to eliminate sinusoidal trends which introduce crossovers in cross-correlation exponent in DFA and DCCA. The effectiveness of the proposed EMD-based method is applied to long-range power-law cross-correlated signals which are generated by two-component ARFIMA artificially. After the removal of slowly oscillating components, sinusoidal trend is successfully eliminated. EMD shows advantage in eliminating sinusoidal trend in artificial time series.
EMD together with DCCA method can be applied to real dynamics to reliably detect long-range cross-correlation where sinusoidal trend exists. In aeroengine dynamics, we apply EMD method by decomposing underlying signals into several intrinsic mode functions and present similar results with artificial time series. We note that EMD is of great advantage when dealing with sinusoidal trend. Therefore, we do believe that our smoothing filter algorithm may provide some help to minimize the effect of the trends introduced by the sinusoidal trends and facilitate a reliable extraction of the scaling exponent.
The financial support from the funds of the Fundamental Research Funds for the Central Universities under Grant no. ZXH 2012C002, the National Natural Science Foundation of China under Grant no. U1233201, and the Natural Science Foundation of Hebei Province (E2012402013) are gratefully acknowledged.
X. Zhao, P. Shang, and J. Huang, “Continuous detrended cross-correlation analysis on generalized Weierstrass function,” European Physical Journal B, vol. 86, pp. 58–63, 2013.View at: Google Scholar
N. E. Huang and S. Shen, Hilbert-Huang Transform and Its Applications, World Scientific, Singapore, 2005.View at: MathSciNet
G. Schlotthauer, M. E. Torres, H. L. Rufiner, and P. Flandrin, “EMD of Gaussian white noise: effects of signal length and sifting number on the statistical properties of intrinsic mode functions,” Advances in Adaptive Data Analysis, vol. 1, no. 4, pp. 517–527, 2009.View at: Publisher Site | Google Scholar
H. Wang, Z. Su, J. Cao, Y. Wang, and H. Zhang, “Empirical mode decomposition on surfaces,” Graphical Models, vol. 74, pp. 173–183, 2012.View at: Google Scholar
A. Roy and F. Doherty, “Overlay communications using empirical mode decomposition,” IEEE Systems Journal, vol. 5, pp. 121–128, 2011.View at: Google Scholar
K. Hu, P. C. Ivanov, Z. Chen, P. Carpena, and H. E. Stanley, “Effect of trends on detrended fluctuation analysis,” Physical Review E, vol. 64, no. 1, Article ID 011114, 19 pages, 2001.View at: Google Scholar