Abstract

On the basis of GARCH-RV-type model, we decomposed the realized volatility into continuous sample path variation and discontinuous jump variation, then proposed a new volatility model which we call the GARCH-type model with continuous and jump variation (GARCH-CJ-type model). By using the 5-minute high frequency data of HUSHEN 300 index in China, we estimated parameters of the GARCH-type model, the GARCH-RV-type model, and the GARCH-CJ-type model and compared the three types of models’ predictive power to the future volatility. The results show that the realized volatility and the continuous sample path variation have certain predictive power for future volatility, but the discontinuous jump variation does not have that kind of function. What is more, the GARCH-CJ-type model has a more power to predict the future volatility than the other two types of models. Therefore, the GARCH-CJ-type model is much more useful for the research on the capital assets pricing, the derivative security valuation, and so on.

1. Introduction

The research on asset volatility in financial market is the foundation of finance, such as capital assets pricing, financial derivatives pricing, and financial risk measurement. The premise of quantitative financial analysis is to accurately measure and predict asset volatility. Therefore, the measurement and prediction of asset volatility are a hotspot of research all the time.

To measure and predict asset volatility accurately, Engle [1], in view of “clustering” and “persistence” of volatility, proposed an autoregressive conditional heteroscedastic (ARCH) model; Bollerslev [2] built a generalized ARCH (GARCH) model based on the ARCH model. Then, GARCH model was extended; Nelson [3] found that the asset volatility is “asymmetric.” He modified the GARCH model and built an EGARCH model; Glosten et al. [4] also examined the “asymmetry” and built a TGARCH model (also called GJR model). The above models (called GARCH-style model in this paper) have been proved to have strong power to predict the future volatility of assets [5].

Admittedly, GARCH-type models have fairly strong predictive power, but there is room for improvement, as the accuracy pursuit for future volatility prediction is endless in financial operations, such as financial asset pricing, financial derivative pricing, and financial risk management. Therefore, it is necessary to improve the predictive power of the models. In order to perfect the accuracy of predictions, the realized volatility (RV) as an exogenous variable has been introduced by Koopman et al. [6] into the volatility equation of GARCH model. They built a GARCH-RV model and found that the GARCH-RV model has stronger predictive power than the GARCH model. Fuertes et al. and Frijns et al. [7, 8] also showed that the GARCH-RV model has stronger power to predict the asset volatility than the GARCH model. But in realistic financial markets, the asset volatility is a continuous process with some jump components. When Andersen et al. and Huang et al. [9, 10] studied the HAR-type RV model, they found that model built with continuous sample path variation and discontinuous jump variation that decomposed from RV has stronger power than the undecomposed HAR-RV model in measuring and predicting the asset volatility. For this reason, in studying the GARCH model with an introduction of an endogenous variable RV, it is more reasonable to decompose RV into and and introduce the two parts into the volatility equation of the GARCH model. On the basis of the GARCH model, this paper decomposes RV into two parts, and , and constructs a GRACH-CJ model in an attempt to further improve the predictive power for the future volatility. Similarly, this paper will also extend the EGARCH model and GJR model to EGARCH-RV model, GJR-RV model, EGARCH-CJ model, and GJR-CJ model. After that, we estimated parameters of the above models and compared their predictive power for the future volatility, respectively, to identify the volatility model with stronger power for the asset volatility measurement and prediction, using the 5-minute high frequency data of HUSHEN 300 index in China.

The remainder of the paper is organized as follows. The GARCH-CJ-type model construction will be introduced in Section 2. The empirical evidence and predictive power of the models will be presented in Section 3. The last part, Section 4, is the conclusion.

2. Model Construction

2.1. GARCH-CJ Model
2.1.1. GARCH-RV Model Construction

Stock return volatility cannot be observed directly but can be measured in the asset return series. The return volatility is “clustering” and “persistent.” The ARCH model proposed in Engle [1] can well capture the volatility clustering of the return series, but the model is rather complicated when the regression order gets bigger. On the basis of the ARCH model, Bollerslev [2] proposed the GARCH model to overcome the defect. GARCH(1,1) is expressed as follows: where is the return, denotes the conditional mean of based on all available information, is the volatility, is the white noise disturbance, and , , and are parameters to be estimated.

In order to improve the measurement of volatility and the accuracy of the prediction of the model, Koopman et al. [6] introduced the realized volatility (RV) as an exogenous variable into the volatility equation of GARCH(1,1) model to build a GARCH-RV model where is also a parameter to be estimated as , and is the realized volatility at period, which is defined according Martens [11] and Koopman et al. [6]. With overnight return variance, realized volatility can be expressed as where is the number of equally divided parts of a trading day; denotes the first return after the opening quotation at Day , ,   is the first closing price at Day , is the opening price at Day , is the second return after opening at Day , and , so expresses the th return at Day after opening, , and refer to overnight return variance, , and is the closing price in Day .

2.1.2. GARCH-CJ Model Construction

The real financial market reveals evidently nonlinear features [12] and the financial asset price volatility is not continuous but shows jump volatility, since the market is subject to the impact of some big information shocks and investors’ irrational factors. Andersen et al. [9] showed that it has more power to predict the future volatility by decomposing the realized volatility into continuous sample path variation and discontinuous jump variation. In order to improve the predictive power of the model, we will introduce the continuous sample path variation and the discontinuous jump variation decomposed from the realized volatility into model (2).

To decompose the realized volatility (RV), Barndorff-Nielsen and Shephard [13, 14] proposed Realized Bipower Variation (RBV); that is, where is a fix time interval, are constant (usually, 1 is given), and is the sample frequency within interval . According to Barndorff-Nielsen and Shephard’s research, when , the difference between and is equivalent to a consistent estimator for discontinuous jump variation

With a limited sample size, calculated from (5) may not always be nonnegative. In order for to be always nonnegative, we will treat in the following way:

In calculating discontinuous jump variation , sampling intraday data at unequal frequency will result in calculation error. In order to improve the calculation accuracy of , it is necessary to introduce some statistic to test the significance of . This paper adopts statistic proposed by Barndorff-Nielsen and Shephard [13, 14] based on the bipower variation theory to test . is expressed as follows: where

The classic RBV calculation is closely related to the sampling frequency of the intraday data. With the increase in the sampling frequency, the RBV estimate cannot converge to integral volatility because of the influence of factors, such as the market microstructure. So using as the robust estimator for is biased, and this paper adopts proposed by Andersen et al. [15] as a robust estimator instead. can be expressed as follows:

Accordingly, , the statistic for in (6), is replaced by , which is expressed as follows:

After replacing with and replacing with in formula (7), we calculate the statistic with (7) and get the estimator for discontinuous jump variation at the significance level:

Accordingly, the continuous sample path variation estimator is

In actual calculation, we need to select a suitable confidence level . Drawing on previous research, we choose 0.99 as the confidence level in this paper. In addition, with the test of statistic and relevant bipower variation theory, we can get the estimators for the continuous sample path variation and discontinuous jump variation of the log return volatility.

According to above RV decomposition method, we decompose of the model (2) into and . Here is the GARCH-CJ model

2.2. EGARCH-CJ Modeling Building

In view of the asymmetric effect of good and bad news on volatility, Nelson et al. [3] constructed an EGARCH model on the basis of the GARCH model. Later, researchers built more EGARCH-type models, among which a commonly used EGARCH(1,1) can be presented as

Using the method discussed in Section 2.1, we take the log of the last period’s realized volatility () and introduce the log value as an exogenous variable into EGARCH (1,1) and thus get EGARCH-RV

We decompose into and , take the log of and , and thus obtain the EGRACH-CJ model

2.3. GJR-CJ Model Construction

On the basis of the GARCH model, Glosten et al. [4] constructed a TGARCH model (also called GJR model) to introduce the leverage effect on volatility into the new model. GJR model (1,1) is where is the indicator variable of the negative

Similarly, using the method in Section 2.1, we introduce as an exogenous variable into the CJR(1,1) and construct the CJR-RV model:

We divide into and , and we get the CJR-CJ model

3. Empirical Study and Comparative Analysis of Models’ Predictive Power

3.1. Empirical Study
3.1.1. Samples and Their Statistics

For the empirical study, we take samples from the HUSHEN 300 index for Chinese stock market, and the data come from the Wind financial database. The time span of the samples covers from April 20, 2007, to April 20, 2012, including 1199 trading days. In the calculation of the realized volatility, the sampling frequency of intraday data has a great influence on the research results. On the one hand, too low sampling frequency cannot well capture the volatility information; on the other hand, too high sampling frequency will produce noise which will harm the results. Therefore, in accordance with previous research of other scholars, this paper uses the 5-minute high frequency data of HUSHEN 300 index. And we use the moving average interpolation method to make up for the missed data, which derives 58751 valid data, 49 pieces of transaction data for each day (including 1 overnight transaction data and 48 day transaction data). Variables needed in this paper are return, realized volatility, continuous sample path variation , and discontinuous jump variation . All are processed on Matlab 7.0 or Excel 2003.

Table 1 is the descriptive statistics of return , realized volatility , continuous sample path variation , and discontinuous jump variation and log realized volatility, log continuous sample path variation, and log discontinuous jump variation. We can see from Table 1 that series do not follow the normal distribution and are leptokurtic. This implies that China’s stock market has a big volatility. In addition, ADF test shows that all the series reject the null hypothesis of unit root at the 99% confidence level; it can be considered that all series are stationary and thus can be further used in model analysis.

3.1.2. Model Parameter Estimation and Analysis

In this paper, maximum likelihood method is adopted to estimate the model in Section 1. Because the setting of the initial value has a great influence on the result in the estimation process, this paper adopts an approximate value from multiple fitting (also satisfying that the likelihood score be the maximum) as the initial parameter value. Tables 2, 3, and 4 list the estimates for GARCH and other eight models under the assumptions of the residuals following Gaussian distribution and distribution. Comparing the log likelihood and the AIC value for GARCH, EGARCH, and CJR, we can see that the goodness of fit for the asymmetric EGARCH model and the CJR models is better than that for the GARCH model, which indicates that the influence of favorable and of unfavorable news is asymmetric on the market volatility in China’s stock market. In addition, comparing the log likelihood and the AIC value for those models, we can see that, with the assumption of a distribution for the residuals, the fitting performs better than with a Gaussian distribution assumption. This shows that the distribution of the return series is fat-tailed. Therefore, the assumption of a distribution for the residual error of the GARCH model is more reasonable.

From the analysis of Tables 24, the coefficients of or of newly added exogenous variable, , in the volatility equation of the GARCH-type models are significantly positive at 1% significance level, which shows that market volatility in China’s stock exhibits pronounced persistence and the last period volatility may serve as an indicator for the current period volatility. In addition, comparing the AIC values for the GARCH-RV-type model and the GARCH-type model, we can see that the fitting for the GARCH-RV model works better, which is consistent with Koopman et al. [6]. When it comes to the estimation results for this paper’s newly built GARCH-CJ model, the coefficients for () are significantly positive at the 1% significance level, and the coefficients for () are significant only when the residual error in the GARCH-CJ model and the CJR-CJ model is assumed to follow a Gaussian distribution, otherwise insignificant. Form this, we can know that, in China’s stock market, the lagged continuous sample path variation contains relatively more information for predicting the current volatility, while the lagged discontinuous jump variation contains relatively less information for forecasting. In addition, regardless of whether the residual error follows a Gaussian distribution or a distribution, the AIC value for the GARCH-CJ-type model is lower than the GARCH-RV-type and the GARCH-type models, which fully demonstrates that the fitting of the GARCH-CJ-type model has a better fitting effect.

3.2. The Comparison of Model Predictive Power
3.2.1. In-Sample Prediction

In order to confirm whether the GARCH-CJ-type model has more predictive power to future volatility than the GARCH-type model and the GARCH-RV-type model, this paper compares the predictive power of these three types of models using a loss function. We select Mean Absolute Error (MAE), Heteroskedastic adjusted Mean Absolute Error (HMAE), Root Mean Squared Error (RMSE), and Heteroskedastic adjusted Root Mean Squared Error (HRMSE) as 4 indexes to evaluate and analyze the performance of the volatility models. Generally, the smaller the four are, the stronger predictive power the corresponding model has to predict future volatility. The formulae for getting the values of MAE, HMAE, RMSE, and HRMSE are expressed in (21). Since volatility cannot be directly observed in the stock market, scholars ([6, 16, 17]) usually use the realized volatility () as a substitute for the volatility in Day . In this paper, is also used as the substitute where is the size of the predictive sample, and is the real volatility; that is, ; denotes the predicted volatility.

Table 5 lists the statistics of in-sample predictive power evaluation index values for the GARCH type model, the GARCH-RV type model, and the GARCH-CJ-type model when using lag 1 data to predict the current volatility. Comparing the value for each evaluation index, we can see that, except that the HRMSE value for the GARCH-RV-type model is greater than that for the GARCH-type model, the RMSE for the EGARCH-CJ-type model is larger than that for the EGARCH-RV-type model, all values for the GARCH-CJ-type model are smaller than that for the GARCH-RV-type model, and the value for the GARCH-RV type model is lesser than that for the GARCH-type model. Therefore, we can presume that in forecasting the in-sample volatility the GARCH-CJ-type model has a greater in-sample predictive power than the GARCH-RV-type model, and the GARCH-RV-type model has greater in-sample predictive power than the GARCH type model.

3.2.2. Out-of-Sample Prediction

Compared with the in-sample predictive power of the model, we are more concerned about the out-of-sample predictive power, since it has more practical value. In order to effectively evaluate out-of-sample predictive power, we divide the sample (April 20, 2007–April 20, 2012) into two parts. The first part (April 20, 2007–November 20, 2011) totals 1099 samples to be used for model estimation; the second part (November 21, 2011–2012, April 20) totals 100 samples to be used for prediction. As in the in-sample part for model estimation, we still use the loss function to compare the effectiveness of the prediction performed by the models. The results are shown in Table 6.

Comparing the value for each evaluation index, we can see that, except that the RMSE value for the GARCH-RV-type model is greater than that for the GARCH-type model in the case where both the models’ residuals are assumed to follow a distribution, the MAE value and the HRMSE value for the CJR-RV-type model are larger than that for the CJR type model in the case where both the models’ residuals are assumed to follow a distribution, all values for the GARCH-CJ-type model are smaller than that for the GARCH-RV-type model, and the value for the GARCH-RV-type model is lesser than that for the GARCH type model. So we can presume that, in terms of the out-of-sample predictive power for volatility, GARCH-CJ-type model works better than the GARCH-RV-type model and, in turn, the latter is superior to the GARCH-type model.

Combining the discussion in Section 3.2.1 with that in Section 3.2.2, we can see that among the above three types of volatility models the GARCH-CJ-type model performs the best in predicting future volatility. Therefore, it makes sense to introduce the realized volatility (RV) into the GARCH-type model and decompose it into continuous sample path variation () and discontinuous jump variation () to enhance the model’s predictive power for volatility.

4. Conclusions

This paper constructs GARCH-CJ model on the basis of the GARCH-RV model to obtain a volatility model that can better measure and predict asset volatility. And, in order to test the validity of the model, an empirical study is carried out using the 5-minute high frequency data of HUSHEN 300 index in China (April 20, 2007, to April 20, 2012), we estimate the parameters of the GARCH-type model, the GARCH-RV-type model, and the GARCH-CJ-type model and evaluate all models’ predictive power for future market volatility using a loss function (MAE, HMAE, RMSE, and HRMSE).

From the results of the estimated parameters, we can see that favorable and unfavorable news have an asymmetric impact on the market volatility in China’s stock market, and the distribution of the market return series is leptokurtic. At the same time, through the empirical results, we can draw some conclusions as follows.(1)The past continuous sample path variation has more predictive power for future volatility, but the past discontinuous jump variation has less information to predict.(2)The GARCH-CJ-type model has a much better fitting of the future volatility than other two types of models (the GARCH-type model and GARCH-RV-type model).(3)According to the comparison of the predictive power of the three types of models, the GARCH-RV model performs the better in predicting the future volatility than the GARCH-type models, which is consistent with Koopman, Fuertes et al., and Lehnert et al. [68].(4)The proposed GARCH-CJ-type model in this paper has a better ability to predict the future volatility than the other two types of models, which means the application of GARCH-CJ model is more reasonable in measuring and predicting volatility in financial practices such as capital asset pricing, financial derivatives pricing, and risk measures.

Although GARCH-CJ model has a greater power to predict the market volatility, it is still necessary to further increase the accuracy of measuring and predicting the market volatility. Therefore, the GARCH-CJ-type model, further improvement in the fitting, and predictive accuracy of the volatility models will be our emphasis for further research.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The Research is sponsored by National Science Fund for Distinguished Young Scholars (70825006), Natural Science Foundation of China (71171076), and Scientific Research Foundation for the Returned Overseas Chinese Scholars (2011 ).