Abstract

We use detrended fluctuation analysis (DFA) method to detect the long-range correlation and scaling properties of daily precipitation series of Beijing from 1973 to 2004 before and after adding diverse trends to the original series. The correlation and scaling properties of the original series are difficult to analyze due to existing crossovers. The effects of the coefficient and the power of the added trends on the scaling exponents and crossovers of the series are tested. A crossover is found to be independent of the added trends, which arises from the intrinsic periodic trend of the precipitation series. However, another crossover caused by the multifractal vanishes with the increasing power of added trends.

1. Introduction

Many physical and biological systems exhibit complex behavior characterized by long-range power-law correlations. Traditional approaches such as the power-spectrum and scaled-Hurst analysis are limited to quantify correlations in stationary signals. In recent years, detrended fluctuation analysis (DFA) has been established as an important tool for the detection of long-range correlations in time series with nonstationarities. DFA is a scaling analysis method providing a quantitative parameter, the scaling exponent , to represent the long-range autocorrelation properties of a signal. The advantages of DFA over many other methods are that it permits the detection of correlations in apparent nonstationary time series and also avoids the spurious detection of seemingly long-range correlations that are artifact of nonstationarity. DFA which is a nonparametric approach for data mining has been successfully applied to diverse fields of interest such as DNA, heart rate dynamics, neuron spiking, human gait, cloud structure, economical time series, and long-time weather records as well as [19]. Besides, many parameter models as well as relevant prediction also have been systematically explored such as in traffic flows with remarkable results [1014].

A fact exists that precipitation has a dramatic effect on agriculture and plays a significant role in human’s activities. The study of precipitation can be utilized for several purposes, including hydrological structure design, flood prevention, and so forth. Precipitation has been long analyzed by traditional statistics, and effective methods as well as prediction models have been developed in bulk to investigate its role [1517]. There also exist many investigations of scaling behaviors and multifractal characterization of the precipitation records [1820]. However, traditional time series analysis of precipitation always produces spurious results due to the highly nonstationary nature of precipitation signals. Matsoukas et al. [21] used detrended fluctuation analysis to quantify the correlation properties of precipitation time series but did not describe them in detail.

In the paper, we detect the long-range correlations of the daily precipitation series collected from 21 weather stations of Beijing through about 30 years and investigate their correlation properties together with the influence of added trends under the method of DFA. As external trends are the main components which affect the correlation properties of a time series, people are trying to eliminate them to gain proper insight into the records. However, in most cases it is difficult to distinguish the trends from the intrinsic fluctuations in data. We add diverse trends on the contrary to the original data and systematically analyze their effect on the correlation properties. The essence of adding the trends in the paper is a preprocessing as the trends will be the functions of original series.

The organization of this paper is as follows: in Section 2, we briefly introduce the DFA method. Section 3 is about the details of the precipitation data we used in this paper. In Section 4 we detect the correlation properties by calculating the scaling exponents and crossover times of the original series before and after adding correlated trends. We summarize in Section 5.

2. Methodology

Experimental series are often affected by nonstationarity and fractality [22]. To investigate the scaling behavior of fluctuations, external trends are expected to be well distinguished from the intrinsic fluctuations of the system. If trends exist in the data, Hurst rescaled-range analysis and other nondetrending methods might give spurious results [58]. Very often we never know the reasons for underlying trends in collected data and even worse the scales of the underlying trends. DFA is a well-established and robust method for determining the scaling behavior of noisy data in the presence of diverse trends [1721].

For a record , where denotes the length of record, the DFA procedure briefly involves the following four steps.

Step 1 1. We determine the profile , where is the mean of the record.

Step 2. We cut the profile into boxes of the same size . In each box, we fit the integrated time series by using a polynomial function, , which is regarded as the local trend. For order- DFA, order polynomial function is applied of the fitting approximation. We subtract the local trend in each box and get the detrended fluctuation function :

Step 3. In each box of size , we calculate the root mean square (rms) fluctuation :

Step 4. We repeat this procedure for different box sizes (different scales).
If a power-law relation exists between and , It indicates the presence of scaling property. The parameter , called the scaling exponent or fluctuation exponent, represents the correlation properties of the data. For correlation exponent , which is derived from the autocorrelation function, a similar approximation for is Comparing with (2.4), we find for . A brief certification of the relation of and is proposed in [23]. We can determine the correlation exponent by measuring the fluctuation exponent . If , there is no correlation (white noise); if , the data is anticorrelated; if , the data is long-range correlated.

3. Data Description

The precipitation data here is collected from 21 weather stations of Beijing from January1 1973 to December 31 2004, 11688 days, as illustrated in Figure 1. A vision processing based on coded structure light has been investigated to acquire 3D data which can be referred for further analysis if necessary [24, 25]. There may not be any precipitation record which is different from such as temperature series. There may be a little precipitation that it is not necessary for the weather stations to record in detail but adopting a word “minim’’ instead of a specific quantity. For convenience, we regard the quantity of the days without precipitation as 0 and the “minim’’ as 0.5. We treat the mean value of the 21 records every day of different stations as a new precipitation series for analysis.

4. Data Analysis

4.1. DFA of the Original Series

First, we detect the correlation behavior of the original series. To get more information, we use the DFA arranging from 1st to 5th order. The original series is a multifractal according to Figure 2(a) since the scaling exponents of each order- DFA change twice, that is, two crossovers. At small scales the deviations grow stronger with the increasing DFA order . To decrease the impact of the deviations on the calculating of scaling exponents , we ignore some small while fitting the curve.

Crossover times and are determined by the intersection of linear fits done on both sides of the crossovers. We choose the point at the intersection in scales as the crossover times , calculate the slope on both sides of to get , and exhibit , in Table 1.

and of each order- DFA divide the series into three different scaling segments. At the first segment where the time scales , corresponds to a long-range correlation behavior which indicates that a relatively large magnitude is likely to be followed by a large magnitude event. For the two segments on the two sides of the second crossover , scaling exponents changes from to which means that at large time scales , the series is anticorrelated. The scaling behavior of the original series at large scales is similar to the DFA of a sinusoidal series shown in Figure 2(b). Comparing Figure 2(a) with Figure 2(b), there exist some common properties at large scales; after a significant crossover, both scaling exponents turn rather small. It manifests that periodic trend dominates the scaling property at large scales after which is accordant to the investigations in [4]. It also can be referred that is highly possible to be dominated by the seasonal trend in the precipitation series.

4.2. DFA of the Series with Correlated Trend

The complex properties of the original series result from the crossover times and , so it is difficult to understand the scaling behavior and make a valid prediction. A crossover usually can arise from a change in the correlation properties of the series at different time scales, that is, multifractal, or can often arise from external trends in the data [26]. Diverse methods provide inspiration to produce discrete sequences and continuous functions [27, 28] for simulation. In most cases, people generate long-range correlated experimental data with modified Fourier filtering [29] or “ARFIMA’’ [30] method and superimpose diverse trends on them which are the function of time, like linear, sinusoidal, and power-law trend. The trends effects on the original series are tested, mainly including the crossover, complete with diverse detrending methods based on such as SVD [9], EMD [3133], Fourier-DFA [34], wavelet analysis [35], and “superposition rule” [4]. In real data, the type of trend is analyzed and proper detrending method is employed correspondingly. It is an attractive and logical direction in solving the crossover caused by trends and deriving a constant scaling exponent. However, at times the type of the trend is difficult to identify, and we note that the information of the series is not fully uncovered just by the DFA method. Here we firstly propose a new method to preprocess the data by adding a trend which is a function of the original series. Then we test whether and how correlated trends added to the original series will affect the correlation properties. A common power-law function is used, where is a coefficient and presents the power. We apply DFA method to the new series . It is apparent that share the same period of trend with but with different fluctuation magnitudes. As we will see later, dominated by periodic trend is independent of added trend , but rises from the multifractal will disappear. We make and variables, respectively, to operate our study as follow.

4.2.1. Effect of Power on DFA of

In this section, is a variable and is a constant 1.

(1) Is Positive Integer
Considering the capability of order- DFA in removing trend of th order, we give integer values ranging from 1 to 6 to take a whole view of effects of DFA on the series .
For every order- DFA in Figures 3(a) and 3(b), two crossovers exist while in (c) the crossovers only remain in DFA3, DFA4, and DFA5 together with their positions being much closer. For (d), (e), and (f), although the crossovers still exist, one cannot identify them without checking carefully. For each order- DFA in (d), (e), and (f), only one crossover exists, which is still marked by only for convenience. We illustrate the representative crossovers by arrows. It seems that crossovers in all six subfigures share an identical position. We apply the method in Section 4.1, calculate these crossovers and scaling exponents, and specify the crossover time scales in Table 2. Since the scaling behavior of DFA3, DFA4, and DFA5 in Figure 3(c) on both sides of is just the same, we calculate one scaling exponent for each of them before in Table 2.
The positions of crossover times after adding diverse trends are identical to that of the original series, which demonstrate that are independent of the power of the correlated trend. With the increasing of from 1 to 6, the values of all scaling exponents and before decrease to 0.5 while all values of after have the trend to increase to 0.5. To get a clearer view of this phenomenon, for and , we calculate a new scaling exponent before of each order- DFA by linear fit. They are specified, respectively in Table 3.
The decreasing trend of to 0.5 and the increasing trend of to 0.5 are shown in Table 3. With the increasing of the approaching pace of to 0.5 seems to be slower than that of .
vanishes with increasing of the trend but is independent of the added trend. It agrees well with the previous analysis that and , respectively arise from the multifractal and periodic trend in precipitation series.
The scaling behavior of is similar to that of original signal when , as . When , since the magnitude of is mostly larger than 1, the scaling properties of will play a vital role and the influence of can be negligible which also can be inferred from the scale of the fluctuation . In fact, Figures 3(b)3(f) demonstrate the fluctuations of as well. We note that the scaling properties of is also attractive, as it is the function of original series which can be treated a preprocessing.

(2) Is Number with Decimal and
The crossovers in Figure 3 are independent of power but are not which is, respectively, affected by the multifractal and periodic trend of original precipitation series. Figures 3(b), 3(c), and 3(d) show a “vanishing’’ process of , where remains by adding an order- trend while disappears by an order- trend. However, we also expect to track the whole process of vanishing with a decimal of . We test it in Figure 4. First we find some common properties in three pair figures that these significant processes all take place between to , , which are very close to . As we use DFA to 5th order, the “vanishing’’ process of just completes when the power of the trend , , is 3.9, close to . Thus we can make a hypothesis that if our DFA is order , there will be some significant process around each , , close to and we get the final stable property around , close to . It may be related to the fact that an (th DFA can eliminate an th order polynomial trend due to the integration in DFA algorithm.

(3)
As we can see from Figure 3(a), when , that the extra trend does not have any influence on the correlation properties of the series. And as most data in our original precipitation, series are larger than 1.0 (0.1 mm). We can guess that when , that is, the adding correlated trend is weaker than that of one above, the correlated trend does not affect the correlation properties either and we prove it in Table 4.
We find from Table 4 that the crossovers of are just the same as the original data. They are rather close to the original data as well by observing their scaling exponents. The trends are so weak that there is little influence of on the correlation properties of .

4.2.2. Effect of coefficient A on DFA of

In Section 4.2.1, we have studied the correlation properties of the series with correlated trend , where is a constant 1. And we find that the crossovers are independent of power of trend . So how about coefficient ?

In this section we vary to find the relation between and the correlation properties of . As the correlation properties for DFA of in Section 4.2.1 is the most complicated when , here we just test coefficient with .

There is a clear vision that the crossover times of (a), (b), (c), and (d) are just the same as Figure 3(b). And the scaling exponents seem to be also the same that we exhibit them in Table 5 to make it evident. The scaling exponents of Figures 5(a), 5(b), 5(c), and 5(d) are not exactly the same by values but they hold the same scaling behavior with Figure 3(b).When is large, the scaling exponents are rather close to the ones of in Figure 3(b). The crossovers in DFA result from the competition between the scaling of original series and the scaling of the trend. The case of is similar to that of which can be inferred by the following analysis. If the original series dominates the scaling behavior, apparently the symbol of can be negligible. When the added trends prevail, most values may change the signs, but the magnitude of fluctuation and the period of trend remain unchangeable. Of course, and should be excluded. So the correlation properties with correlated trend are independent of the coefficient .

4.3. DFA of the Series with Other Correlated Trends

Discussion of Section 2 has told that crossovers of the original data are independent of the trend which presents another question which is are they still independent of other correlated trends? In this section we test another common trend ( is the max of ) which is stronger than with the same method applied in Section 2. We test the effects of and on the correlation properties of , respectively, in Figures 6 and 7.

Although Figure 7 indicates that the correlation properties of series with correlated trend are dependent of which are not the same as they are for trend , we can see form Figures 6 and 7 that the crossover times are just identical to the ones of original series for each figure. So crossovers of the original data are independent of the correlated trend . It is an obvious conclusion that crossovers are independent of trend as well, because it is weaker than , not to mention . Thus we can see that crossovers of the original data are rather significant and they are so stable that it is necessary to take further investigations to gain insight into them.

5. Conclusion

In summary, DFA of the original series indicates the complex correlation properties by two obvious crossovers and three different scaling segments. The second crossover is proved to be independent of the adding correlated types of trend:, , and while the first crossover disappears with added trend. They are induced by different reasons while behave similarly if we just analyze original precipitation series. The paper also provides a method to distinguish the multifractal and trend effects on the scaling behavior. With the development of study on correlation, scaling exponents and crossovers will take more significant roles in providing foundation theories for precipitation series predictions based on correlations theories.

Acknowledgments

The financial supports from the funds of The National High Technology Research Development Program of China (863 Program) (2007AA11Z212), the China National Science (60772036) and MEDF (20070004002) are gratefully acknowledged.