Abstract

We develop a random financial time series model of stock market by one of statistical physics systems, the stochastic contact interacting system. Contact process is a continuous time Markov process; one interpretation of this model is as a model for the spread of an infection, where the epidemic spreading mimics the interplay of local infections and recovery of individuals. From this financial model, we study the statistical behaviors of return time series, and the corresponding behaviors of returns for Shanghai Stock Exchange Composite Index (SSECI) and Hang Seng Index (HSI) are also comparatively studied. Further, we investigate the Zipf distribution and multifractal phenomenon of returns and price changes. Zipf analysis and MF-DFA analysis are applied to investigate the natures of fluctuations for the stock market.

1. Introduction

The analysis of market index and return is an active topic to understand and model the distribution of financial price fluctuation, which has long been a focus of economic research. As the stock markets are becoming deregulated worldwide, the modelling of the dynamics of the forwards prices is becoming a key problem in the physical assets valuation, risk management, and derivatives pricing [16]. By applying the theory of interacting particle systems [711], some research has been made by applying the statistical physics systems to study the behaviors of fluctuations of price changes in the stock market and the corresponding valuation and hedging of contingent claims [1223]. The motivation of modelling stock price by using a contact model is to uncover the empirical laws in stock price and better understand the dynamics of financial systems. Stauffer [15] and Yu and Wang [21] developed financial price models by lattice percolation system and lattice-oriented percolation system [8, 9], respectively, the local interaction or influence among market participants in a stock market is constructed, and an open cluster of percolation is applied to define the cluster of investors sharing the same opinion about the market. In these financial models, the main assumption is that the stock price fluctuation is influenced by the information in a stock market, and the investors decide the investment opinions by other investors’ attitudes, so the investors investment attitudes of the stock market lead to the stock price fluctuation. Zhang and Wang [22] invented the finite-range contact particle system to model a stock price process for studying the behaviors of returns by statistical analysis and computer simulation. The epidemic spreading of the contact model is considered as the spreading of the investors investment attitudes towards the stock market, and we suppose that the investment attitudes are represented by the viruses of the contact model. These attitudes make the investors take buying stock positions, selling stock positions, or holding stock positions.

The contact process, a model for epidemic spreading in a continuous time Markov process, is one of interacting particle systems [8, 10, 11]. The contact process was introduced by Harris [24] and one interpretation of the contact process is often regarded as a crude model for the spread of a biological population or a disease. Healthy individuals become infected at a rate proportional to the number of infected neighbors, where infected individuals recover at a constant rate. To be specific, let denote the contact process on the configuration space ; if , the individual at the point is healthy and will be infected at a rate equal to times the number of infected neighbors; if , one individual at the point is thought of as being infected and recovers from its infection at rate one. In the present paper, we study the statistical behaviors of fluctuations of stock price changes by applying the contact system. We firstly construct the financial time series model of the stock market. Then we analyze the statistical behaviors of ensembles and specifics of returns for the financial model by Zipf analysis and multifractal detrended fluctuation analysis (MFDFA). Further, the statistical properties of returns of Shanghai Stock Exchange Composite Index (SSECI) and Hang Seng Index (HSI) are also studied for comparison between the actual time series and the simulated ones.

2. A Brief Description of Contact System

The contact system on with infection parameter is a continuous time Markov process on the configuration space [8, 10, 11]. A configuration is often identified with subsets of via . Individuals in are thought of as being infected, while the other individuals are regarded as being healthy. The transition rates for are given by(a) for all at rate 1,(b) for all at rate , where denote the cardinality of a finite set and is the minimal length of a path from to . More formally, the connection between the process and the rate function is made through the generator of . For functions on that depend on finitely many coordinates, the generator has the form where if and if , for . And is given by (see [10])

Let denote the state at time with the initial state , and let be the state of at time with the initial point . The most important concept of weak complete convergence for the contact model is given as , where is the survival probability and and are the biggest invariant measure and the smallest invariant measure, respectively, in the sense of the partial order [10]. The key characteristic of the contact process is that extinction and survival can be both occur, whether extinction occurs or survival occurs depends on the value of . There is a critical value , if , the contact process is said to become extinct or die out; that is, ; otherwise (for ) it is said to survive. For any finite lattice graph (where is a large positive integer) and for every finite and every constant , there exists [11]. Furthermore, we define the edge processes and on , . And let . For , (a.s. on ), where is a nondecreasing function of and . And if , for some positive , we have , then the process dies out exponentially fast.

3. Financial Price Model Form Contact System

In the following, we adopt the notations and settings of Sections 1-2. Consider a model of auctions for a stock in a stock market. Suppose that each trader can trade the stock several times at each day , but at most one unit number of the stock at each time. Let be the time length of trading time in each trading day; we denote the stock price at time in the th trading day by , where . Suppose that this stock consists of ( is large enough) investors, who are located in a line (similarly for -dimensional lattice ). At the beginning of trading in each day, suppose that the investor at the origin receives some news. We define a random variable for this investor; suppose that this investor takes buying positions (), selling positions (), or neutral positions () with probability , , or , respectively. Then this investor sends bullish, bearish, or neutral signal to his neighbors. According to the contact dynamic system, investors can affect each other or the news can be spread, which is assumed as the main factor of price fluctuations. Moreover, here the investors can change their buying positions or selling positions to neutral positions independently at a constant rate. More specifically, (i) when and if , we say that the investor at takes buying position at time , and this investor recovers to neutral position at rate 1; if , we think that the investor at takes neutral position at time , and this investor is changed to take buying position by his neighbors at rate . In this case, the more investors who take buying positions, the more possibility that stock price goes up. (ii) When and if , we say that the investor at takes selling position at time , and also this investor recovers to neutral position at rate 1; if , the investor is changed to take selling position by his neighbors at rate . (iii) When the initial random variable , the process is ignored; this means that the investors do not affect the fluctuation of the stock price.

For a fixed ( large enough), the aggregate excess demand for the asset at time is defined by where may depend on the trading days . From the above definitions and [3, 6], we define the formula of a discrete time stock price as follows: where , represents the depth parameter of the market. Then the stock price of the present paper is supposed to follow the form where is the initial stock price at time . The formula of the single-period stock logarithmic returns from to is given as follows:

In this paper, we analyze the logarithmic returns for the daily price changes. In the light of theory of the contact process and the above definitions, if , the virus will die out at last; namely, the influence on the stock price by the investors is limited. If , the virus will not die out; namely, the news will spread widely, so this will affect the investors’ positions and finally will affect the fluctuation of the stock price. The contact system is a statistical physics system, and it is comprised of a large number of interacting units, which has the similarity with the financial markets consisting of a large number of interacting “agents.” There are random behaviors ingrained in the contact model, and then we consider the financial market as a complicated evolving system. Modeling stock price by using a contact model has the motivation to uncover the empirical laws in real stock markets and understand better the dynamics of financial systems.

4. Zipf Analysis for Financial Model and Real Stock Markets

4.1. Zipf Analysis of Time Series

Zipf analysis, as a way for quantifying time series correlations, has been widely applied to the literature, stock market, computer science, network, economic management, and many other fields [2529]. The technique is based on translating a given time series into a sequence of symbols and counting the frequency of any word, that is, pattern of consecutive symbols. Ranking these words by their frequencies from the most common to the least common and plotting the logarithm of frequencies versus the logarithm of rank give us a Zipf plot, which was firstly introduced by George Kingsley Zipf, in order to study the statistical occurrences in different languages. For the lowest ranks, the plotted points usually appear to fall along a line. The gradient of the best line fit corresponds to the Zipf exponent , which characterizes correlations in time series. Let denote a set of observations on a random variable , the corresponding cumulative distribution function is , and assume that the observations are ordered from the largest to the smallest so that the index is the rank of . The Zipf plot of the sample is the graph of against . Because of the ranking, then and . Thus, the log of the rank is simply a transformation of cumulative distribution function. In studying English word occurrence frequency, Zipf’s law reveals that while only a few words are used very often, many or most are used rarely. It is found that if the words have the descending orders of frequency, the frequency of occurrence of each word and its symbol ranking has simple inverse relations; that is . Making a transformation, the above equation can be converted into , where is the frequency of the word whose rank is and is some positive constant. Plotting the graph by against , the graph is close to a line with the slope of .

Let denote the time series of daily closing stock prices, and let be the given integer time scale; then the -step of logarithmic price changes in a stock market is defined by where . Next we consider a new time series with a random environment which is derived from the original -step of logarithmic price changes of the model. In a real stock market, various kinds of information will affect the investing positions of the market participants, and the investing environment is also ceaselessly changing. So we introduce a random environment in the financial model as the following. Let be a nonnegative random variable on a probability space (with the probability distribution ), which is called a random threshold of the model. For example, can be a uniform on the interval , or can also be a random variable , where follows a normal distribution, and so forth. Then the new time series derived from the original stock prices is given as where , , and denote “price-up,” “price-stable,” and “price-down,” respectively. In this model, the random threshold represents the expected returns for the market investors. Then, for the different parameters and , we investigate the fluctuation behaviors of the time series .

Let , , and denote the frequencies of occurrences for price-up, price-stable, and price-down, respectively. Then the corresponding absolute frequencies of the time series for these case are given as follows: where and . In financial markets, the large fluctuation of daily price changes usually occurs with the small probability. Considering this property, the frequencies of occurrences defined in the above depend on the probability distribution . Next the corresponding relative frequencies of the time series are given as follows: In the above definitions of the relative frequencies, we omit the occurrences of stable-price; thus and measure the total occurrences of price rising and price falling, respectively.

4.2. Results of Empirical Research

In this section, we study the statistical properties of absolute frequencies and relative frequencies of price changes for various values of two parameters and . Both the actual data of SSECI and the simulation data of the model will be considered. We select the data for the daily closing prices of SSECI in the 9-year period from August 23, 2002, to March 9, 2011; the database is from Shanghai Stock Exchange; see http://www.sse.com.cn/. And we also consider the simulation data of the model which is derived from the contact system with the infection rate (with which the simulation data has similar fluctuations with the real stock market). For the actual data and the simulation data, we make the empirical research for the absolute frequency and the relative frequency. By the computer simulation [30], we compute the absolute frequencies and the relative frequencies for different values of and , and the corresponding plots are plotted in Figures 1 and 2. Figure 1 displays the empirical results of the actual data for SSECI, the horizontal axis indicates the random expected return , and the vertical axis indicates the absolute frequencies of the time series . Figures 1(a) and 1(b) exhibit that the price-up function and price-down function are decreasing functions when is increasing. And for two -steps , , such that , the curve of is over the curve of , and we have the similar results to the price-down function . But for the absolute frequency of price-stable, Figure 1(c) displays the opposite trend; that is, the function is increasing with increasing. Figure 1 shows that, for a given step , the absolute frequencies reach their inflection point as increases. In addition, when increases, the corresponding value of (where the inflection of the absolute frequencies occur) becomes larger. We also consider the Zipf distributions of the absolute frequencies of price-up , price-down , and price-stable . For different interval times , we compute and plot the corresponding Zipf distributions in Figure 1 (the smaller plots in (a), (b), and (c)); Figure 1 exhibits the power-law distributions for the absolute frequencies. In Figure 2, we obtain the similar empirical results of the simulation data for the financial model with the infection rate .

Figures 3 and 4 show the distributions of the relative frequencies for different time scales and different expected returns. In Figure 3, when , the relative frequencies are approximately equal to 0.5; when becomes larger, the relative frequencies depart from the value 0.5 rapidly. Note that the daily price fluctuation is limited in Chinese stock markets; that is, the changing limits of the daily returns (i.e., ) for stock prices and stock indices are between −10% and 10%. This means that we should let for the Chinese stock markets. According to the above discussion, the relative frequencies are near to 0.5 for ; then we can reach a conclusion that is a low risk expected return. If a market participant hopes to obtain a return which is larger than 0.1, he will face a high investing risk. In Figure 4, according to the simulation data of the model, we obtain the similar empirical results to those of SSECI in Figure 3.

Table 1 gives the values of the inflection points for the absolute frequencies and the relative frequencies. We can find that the inflection points are larger when is increasing, corresponding with the empirical results of Figures 14. Furthermore, according to the definitions of , , , , and , it can be easily known that the inflection points of , , and are interrelated. Suppose that is the inflection point of , then and , and is also inflection point of or . We also obtain that the inflection points of and are the same from the definitions, the values are related to inflection points of absolute frequencies, and all the estimates are displayed in Table 1.

5. MF-DFA Analysis for Financial Time Series

5.1. MF-DFA Analysis

Recently, some research work has been made for the multifractal characterization of nonstationary time series, which is based on a generalization of detrended fluctuation analysis (DFA): multifractal detrended fluctuation analysis (MF-DFA) [12, 13, 3134]. It has been concluded that MF-DFA should be recommended for a global detection of multifractal behavior [35, 36].

MF-DFA method can be summarized as follows [32]. (i) Starting with a correlated time series , where is the length of the series, the corresponding profile is determined by integration where denotes the averaging over the whole time series. (ii) The profile is divided into nonoverlapping windows of equal length . Since the record length does not need to be a multiple of the considered time scale , a short part at the end of the profile will remain in most cases. In order to have into account this part of the record, the same procedure is repeated, starting from the other end of the record. Thus, windows are obtained. (iii) Calculate the local trend for each of the segments by a leastsquare fit of the series and determine the variance for each segment . Here, is the fitting polynomial in segment . Linear, quadratic, cubic, or higher order polynomials can be used in the fitting procedure. In this part we denote the algorithm as MF-DFA-1. (iv) Averaging all subsets and obtaining the th order fluctuation function , (v) Determine the scaling behavior of the fluctuation functions by analyzing log-log plots versus for each value of . If the series are long-range power-law correlated, increases, for large values of , as a power-law , where is called the generalized Hurst exponent. The value of , which corresponds to the limit for , cannot be determined directly using the averaging procedure because of the diverging exponent. Instead, a logarithmic averaging procedure has to be employed, Note that cannot be defined for time series with fractal support, where diverges for .

For monofractal time series with compact support, is independent of , since the scaling behavior of the variance is identical for all segments , and the averaging procedure will give just this identical scaling behavior for all values of . Only if small and large fluctuations scale differently, there will be a significant dependence of on . For positive values of , describes the scaling behavior of the segments with large fluctuations. For negative values of , describe the scaling behavior of the segments with small fluctuations. Obviously, richer multifractality corresponds to higher variability of . Then, the multifractality degree can be quantified by . As large fluctuations are characterized by smaller scaling exponent than small fluctuations, for are larger than those for , and is positively defined. The analytical relation between the generalized Hurst exponent introduced previously and the scaling exponent defined by the standard partition function multifractal formalism is given as [32]

The singularity spectrum is another way to characterize the multifractality of the series. The parameter is the Holder exponent or singularity strength, while is the fractal dimension of the subset of the time series with singularities of strength equal to . The spectrum is related with via a Legendre transformation Then it is straightforward to relate these quantities to the generalized Hurst exponent An alternative quantifier for the multifractality degree is the width of the singularity spectrum [14, 22]. is the width of the multifractal spectrum, . can statistically represent the variation ranges under scaling invariance. While the variation of the stock price is larger, the width of the multifractal spectrum is wider and the is larger, and vice verse.

5.2. Multifractal Analysis of Real Market and Financial Model

In this part, we investigate the multifractal behaviors of time series for SSECI, HSI, and the simulation data of the financial model. We select the daily closing prices of SSECI from August 23, 2002, to March 9, 2011, and the daily closing prices of HSI from February 27, 2004, to February 17, 2012, and the total number of observed data is about 2000 for Shanghai Composite Index and Hang Seng Index, respectively. We also consider the financial model with infection rate , and the corresponding simulation data with the same time length of SSECI is selected. As mentioned above, the generalized Hurst exponent can be obtained by analyzing log-log plots of versus for each . To show this procedure, we plot the log-log graphs of versus for SSECI, HSI, and the simulation data in Figure 5. Table 2 displays the generalized Hurst exponents , via the MF-DFA-2 procedure. In this analysis, ranges from to with a step length of , the window lengths of , are between and with a step of , and is the length of the time series. From and the equation in the above part, we can obtain the corresponding singularity spectrum versus in Figure 7(b), and the values of and are exhibited in Table 2.

In Figure 7(a), we can see that there is a significant dependence of on for both the real data and the simulation data, for are larger than those for , and the values of are displayed in Table 2. Since richer multifractality corresponds to higher variability of , in Table 2, we find that HSI has the multifractality degree with , where SSECI is of , which indicate that there is multifractal phenomenon in both Hang Seng Index and Shanghai Composite Index, and the multifractality in SHI is richer. We also compute the multifractality degree for the simulation data of the model. From Figure 7(a) and Table 2, we can see that of the simulation data is changing with ; this shows that the simulation data has the multifractal characteristic. The similar empirical results can be found from the study of the corresponding singularity spectrum calculated from . Figure 7(b) gives the plot of via , and Table 2 displays the corresponding values. It is known that, when the volatility of stock prices is larger, the width of the multifractal spectrum is wider and the is larger, and vice verse. HSI has the multifractal spectrum with , while for SSECI and for the simulation data. This indicates that there is the multifractality in the real data and the simulation data, corresponding with the study of .

Evidence in many existed works shows that Chinese stock markets are multifractal. The empirical results [3739] have indicated that the long-range nonlinear correlations generate multifractality. To give further evidences, we will compare the multifractality between original series and randomly shuffled series. We shuffle the return time series by Zipf’s law, which is introduced in Section 4.1 of the present paper, and comparatively study the multifractality on the original return time series and the shuffled return time series (by Zipf’s law). For an original return time series , we shuffle this return time series into a new corresponding return time series according to the ranges of small and large fluctuations. In the details, let such that , is a subsequence of , and assume that the order of elements in follows the order of the corresponding elements in . The length of the subsequence is the number of elements in . Similarly, we consider the subsequence with , with , and with . In the above procedure, we divide the original return time series into four subsequences, and then we arrange these four sequences by their length from the longest to the shortest. Thus we invent the new time series, which is shuffled by the method (or idea) of Zipf’s law. For example, an original return time series of SSECI is plotted in Figure 6(a), and the shuffled return time series of SSECI by the above method is exhibited in Figure 6(b), that is, , , , and .

From the empirical research in Figures 7(a) and 7(b), they exhibit that there is the multifractility in the shuffled return time series. In Table 2, we find that the multifractality in shuffled time series is richer than that in the original ones, which is probably that the long-range dependence of small and large fluctuations in the series shuffled by Zipf’s law is stronger. From these results, it may suggest that Zipf’s law contributes the long-range dependence of small and large fluctuations of return time series.

6. Conclusion

In this present paper, we develop a random stock price model by the interacting contact process. We apply MF-DFA analysis and Zipf analysis to investigate the multifractal characteristic and Zipf distribution of returns and price changes for the financial model. Moreover, we also consider the corresponding behaviors of daily returns for Shanghai Composite Index and Hang Seng Index, and the comparisons of statistical behaviors of returns between the actual data and the simulation data are exhibited. We analyze and show the statistical properties of ensembles and specifics of returns by Zipf method for different values of the parameters. And the empirical research shows an evidence that there is the multifractal character for both the real market and the financial model by analyzing the Hurst exponent and the singularity spectrum of returns time series. We also show the multifractality of shuffled return time series by Zipf’s law. We find that the shuffled series have richer multifractality than the original series. These all indicate that there are multifractal phenomenon and Zipf distribution in both Chinese stock markets and the financial model, which also implies that the financial model of the present paper is reasonable for the real stock market to some extent.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The authors were supported in part by National Natural Science Foundation of China Grant nos. 71271026 and 10971010.