Transportation Modeling and ManagementView this Special Issue
Modeling Complex System Correlation Using Detrended Cross-Correlation Coefficient
The understanding of complex systems has become an area of active research for physicists because such systems exhibit interesting dynamical properties such as scale invariance, volatility correlation, heavy tails, and fractality. We here focus on traffic dynamic as an example of a complex system. By applying the detrended cross-correlation coefficient method to traffic time series, we find that the traffic fluctuation time series may exhibit cross-correlation characteristic. Further, we show that two traffic speed time series derived from adjacent sections exhibit much stronger cross-correlations than the two speed series derived from adjacent lanes. Similarly, we also demonstrate that the cross-correlation property between the traffic volume variables from two adjacent sections is stronger than the cross-correlation property between the volume variables of adjacent lanes.
Many diversified complex systems are composed of constituents that mutually interact in a complex fashion. The complexity of the mutual interaction, such as the output of each constituent which depends not only on its own past but also on the past values of other constituent outputs, can be additionally studied if memory is included. Such complex systems are characterized by both long-range correlations and long-range cross-correlations. A number of studies suggest the existence of these properties in diverse systems. Applying the random matrix theory, Stanley et al. demonstrated the cross-correlation properties between individual stocks traded in the Korean stock market . By analyzing 48 world financial indices, Wang et al. found the long-range power-law cross-correlations in the absolute values of returns . Podobnik et al. studied the cross-correlation in successive differences of air humidity and air temperature . Du et al. provided cross-correlation time delay model to improve earthquake relocation forecasts .
These studies provide strong empirical evidences for the existence of cross-correlations between the dynamics of natural systems. Pearson’s correlation coefficient (PCC), which is used to represent the linear correlation between two time series which are both assumed to be stationary [5, 6], is commonly used to gain insight into the dynamics of cross-correlations in time series. Nevertheless, in natural systems, the nonlinear and nonstationary characteristics are usually present [7, 8]. Therefore, PCC may not be suitable to describe the cross-correlations between time series that are nonlinear or nonstationary. To address the drawbacks of PCC, the detrended cross-correlation analysis (DCCA) method is employed in this paper.
The DCCA method, which is a modification of standard covariance analysis in which the global average is replaced by local trends [9, 10], was proposed by Podobnik and Stanleys. The performance of detrended cross-correlation analysis method was systematically tested for the effect of nonstationarities [9–11]. After that, numerous issues referring to a broad range of applications [12–15] were established to investigate cross-correlational signal in the presence of nonstationarities.
In analogy with the cross-correlation coefficient, Zebende recently introduced the detrended cross-correlation (DCCA) coefficient . One of the outstanding advantages of the nonlinear cross-correlation coefficient is that it can investigate the cross-correlations at different time scales [16, 17]. After that, Cao et al. adopted the DCCA coefficient to analyze and quantify cross-correlations between the Chinese exchange market and stock market . Vassoler et al. quantified the cross-correlations between time series of air temperature and relative humidity by DCCA coefficient . Podobnik et al. showed that the tendency of the Chinese stock market to follow the US stock market is extremely weak by using the DCCA coefficient . Wang et al. studied the statistical properties of the foreign exchange network at different time scales applying the DCCA coefficient .
Here, using the DCCA coefficient method, we model the traffic data collected on the Beijing Third Ring Road as the input data which can be readily observed from conventional point detectors. The preliminary test results demonstrate that the cross-correlation property between the traffic series from two adjacent sections is stronger than the cross-correlation property between the series of adjacent lanes and disjoint lanes. The scaling results suggest the feasibility of estimating cross-correlations in traffic variables using point detector data via the proposed approach.
The organization of this paper is as follows. In the next section, we present the dataset and DCCA coefficient method. In Section 3, we show the main empirical results and discussion. Finally, we draw some conclusions in Section 4.
2. Data and Methodology
2.1. The Dataset
Traffic systems have a number of parameters that can be measured. The speed and volume are employed in collecting and studying traffic data here. The data was observed on the Beijing Third Ring Road (BTRR) over a period of about 7 days, from 0:00 AM on March 21, 2011, to 23:30 PM on March 27, 2011. Figure 1 shows the time series plot of the speed data and volume data observed at the Beijing Third Ring Road.
The BTRR is a closed road system without any traffic-signal control. There are three main lanes as well as one or two auxiliary lanes related to on-and-off ramps for each direction. The data were downloaded from the Highway Performance Measurement Project (FPMP). The periodic time of detecting is 2 min and the distance between two adjacent detectors is about 500 m. For investigating the cross-correlations in traffic time series, we will analyze twelve datasets as follows (see Figure 2):(1): the speed series of Lane in Section ;(2): the volume series of Lane in Section .
2.2. DCCA Coefficient Method
DCCA coefficient method is an extension of detrended cross-correlation analysis (DCCA) and detrended fluctuation analysis (DFA) method, and both methods are based on random walk theory [6, 21, 22]. For two nonstationary time series and , , where is the length of data, the DCCA coefficient is given as follows.
Step 1. Compute the profiles of underlying time series using where and are the mean.
Step 2. Cut the profiles and into nonoverlapping segments of equal length , respectively. In each segment , we calculate the local trend by a least-square fit of the data and obtain the difference between the original time series and the fits.
Step 3. Calculate the covariance of the residuals in each segment: for each segment , and for each segment , . Here and are the fitting polynomials in segment , respectively. Then the averages over all segments to obtain the fluctuation function are as follows:
Step 4. For the two nonstationary time series and , the DCCA coefficient is defined as the ratio between the detrended covariance function of (4) and two detrended variance functions of (5) and (6): where ranges from −1 to 1 [6, 20]. A value of or implies that the two nonstationary time series and are completely cross-correlated or anti-cross-correlated, at the time scale , whereas a value of indicates that there is no cross-correlation between the two time series and [6, 19]. Obviously, the DCCA coefficient is a function of the different window size of data, which means that it can investigate the cross-correlations between two time series and at different window scales.
3. Empirical Results and Analysis
3.1. The Cross-Correlation of the Speed and Volume Series
For two nonstationary cross-correlated time series and , the power-law relationship exists. The scaling exponent represents the degrees of the cross-correlation between the two time series and . For time series , the DCCA fluctuate function reduces to the DFA fluctuate function .
In order to study the dynamics of the traffic time series over time, we first consider two time series, both of which can be considered as two outputs of traffic system: the traffic speed fluctuation series and the traffic volume fluctuation series . Here are the speeds of Lane 1 in Section 1 and are the volumes of Lane 1 in Section 1.
Figure 3 displays the DFA and DCCA curve obtained between traffic speed fluctuation series and the traffic volume fluctuation series . The curves exhibit obvious power-law behavior with DFA exponent , and the DCCA exponent , implying long-range autocorrelation and cross-correlations in traffic dynamics.
It is apparent that the traffic flow series can be characterized by a local variability of the DCCA coefficient as shown in Figure 4. The small fluctuations exhibited by the provide evidence that a more complex evolution dynamics characterizes the traffic flow.
3.2. The Cross-Correlation of the Speed Series
It is worth noticing the fact that, according to the definition of cross-correlation , each of the two variables at any time depends not only on its own past values but also on past values of the other variable.
Here, we firstly investigate the cross-correlations between two traffic speed fluctuation variables and , which are derived from two adjacent sections of a highway and simultaneously recorded every two minutes (see Figure 2). Figure 5 displays the DFA and DCCA curve for traffic speed fluctuation variables and . The curves also exhibit obvious power-law behavior with DFA exponent , and the DCCA exponent , implying long-range autocorrelation and cross-correlations in traffic speed time series.
The DCCA coefficient curve is given in Figure 6. We find that fluctuate around the value and show that the cross-correlated behavior between the time series and is very strong.
And then, we consider the case when two time series of variables and are derived from two adjacent lanes (see Figure 2). For convenience, we study the difference between the DCCA coefficient of the data from two adjacent sections of one lane and the data from two adjacent lanes by using the error function.
The error function is defined as , where is the DCCA coefficient of traffic speed fluctuation variables and and is the DCCA coefficient of traffic speed fluctuation variables and .
From Figure 7, we can see that the error function (circles) indicates that the cross-correlation of speed series between two adjacent lanes is weaker than the time series of two adjacent sections.
In addition, we also find that cross-correlation exists between the two time series of variables and , which are derived from Lane 1 and Lane 3 (see Figure 2). We employ the error function once again, where is the DCCA coefficient of traffic speed fluctuation variables and . For comparison, the error function is also plot in Figure 7 (filled dots). Obviously, the error function (filled dots) indicates that the cross-correlation between speed series from two disjoint lanes is weaker than the cross-correlation between the time series of two adjacent sections.
To analyze the statistical properties of the speed time series, we compute the value for and . The result indicates that the difference between two quantities is statistically significant. Similarly, the value of and also shows significant difference .
3.3. The Cross-Correlation of the Traffic Volume Series
Next, we investigate the cross-correlations between two traffic volume time series and (see Figure 2). The DFA curves in Figure 8 show that each of two volume time series and exhibits autocorrelated behavior by DFA exponent , . Figure 8 also illuminates that the cross-correlated behavior between and exists by DCCA exponent.
Figure 9 shows the DCCA coefficient of traffic volume fluctuation variables and . The DCCA coefficient fluctuates around the value and shows that the cross-correlations between and exists.
Further, we investigate the case when two time series of variables and are derived from two adjacent lanes (see Figure 2). The error function is employed once again. In Figure 10, we give the error function , where is the DCCA coefficient of traffic volume variables and and is the DCCA coefficient of traffic volume fluctuation variables and . The error function (circles) demonstrates that the cross-correlation of volume fluctuation series between two adjacent lanes is weaker than the time series of two adjacent sections.
For convenience, Figure 10 also shows the error function , where is the DCCA coefficient of traffic volume series and . Similarly, it is apparent that the cross-correlation of volume series between two disjoint lanes is weaker than the time series of two adjacent sections by direct observation of the error function (filled dots).
In the statistical analysis, the value for and is which indicates that the difference between two quantities is statistically significant. and of volume time series are also statistically significant based on permutation testing .
In the paper, we consider DCCA coefficients method to understand the complexity of traffic dynamic. The technique has been implemented on the time series of the original traffic variables from adjacent lanes and adjacent sections. For the traffic speed time series and volume time series, the DCCA coefficients fluctuate around the value and provide evidence that cross-correlation characteristic exists in traffic dynamic. Then, we apply DCCA coefficients method to study the cross-correlation between traffic speed series. We find that two traffic speed fluctuation parameters derived from adjacent sections exhibit much stronger correlation than the traffic parameters derived from adjacent lanes and disjoint lanes. Similarly, by applying DCCA coefficients method to traffic volume series, the cross-correlation property between the volume variables from two adjacent sections is stronger than the cross-correlation property between the volume variables of adjacent lanes and disjoint lanes.
The relationship of traffic series between two adjacent sections or lanes in China is investigated with the data from BTRR. The results that the traffic series between two adjacent sections or lanes exhibit cross-correlation are attributable to each of the two variables at any time depending not only on its own past values but also on past values of the other variable. Therefore, the findings presented here encourage us to think that this method reveals the relation in anomalous traffic conditions.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The financial support from the funds of the Fundamental Research Funds for the Central Universities under Grant no. 3122013C005 and the National Natural Science Foundation of China under Grant no. U1233201 is gratefully acknowledged.
D. Wang, B. Podobnik, D. Horvatić, and H. E. Stanley, “Quantifying and modeling long-range cross correlations in multiple time series with applications to world stock indices,” Physical Review E, vol. 83, Article ID 046121, 2011.View at: Google Scholar
G. Wang, C. Xie, Y. Chen, and S. Chen, “Statistical properties of the foreign exchange network at different time scales: evidence from detrended cross-correlation coefficient and minimum spanning tree,” Entropy, vol. 15, no. 5, pp. 1643–1662, 2013.View at: Publisher Site | Google Scholar | MathSciNet