Abstract

Model criticism is an important stage of model building and thus goodness of fit tests provides a set of tools for diagnostic checking of the fitted model. Several tests are suggested in literature for diagnostic checking. These tests use autocorrelation or partial autocorrelation in the residuals to criticize the adequacy of fitted model. The main idea underlying these portmanteau tests is to identify if there is any dependence structure which is yet unexplained by the fitted model. In this paper, we suggest mixed portmanteau tests based on autocorrelation and partial autocorrelation functions of the residuals. We derived the asymptotic distribution of the mixture test and studied its size and power using Monte Carlo simulations.

1. Introduction

In practice, there are many possible linear and nonlinear models for a problem under study, for example, autoregressive, moving average, mixed ARMA models, threshold autoregressive. Box and Jenkins [1] have described time series model building as a three-stage iterative procedure that consists of identification, estimation, and validation.

Time series models should be able to describe the dependence among the observations; see, for example, Li [2]. It is a well-discussed issue that in time series model criticism the residuals obtained from fitting a potential model to the observed time series play a vital role and can be used to detect departures from the underlying assumptions, [1, 2].

In particular, if the model is a good fit to the observed series then the residuals should behave somewhat like a white noise process. So, taking into account of the effect of estimation, the residuals obtained from a good fit should be approximately uncorrelated. While looking at the significance of residual autocorrelations, one approach is to test the significance of each individual residual autocorrelation which seems to be quite cumbersome. Another approach is to have some portmanteau test capable of testing the significance of the first, say, , residual autocorrelations [1, 2], an approach we now describe.

Since Box and Pierce [3] paper, the portmanteau test has become the vital part of time series diagnostic checking. Several modifications and versions of Box and Pierce [3] have been suggested in the literature; see, for example, Ljung and Box [4], McLeod and Li [5], Monte [6], Katayama [7], and Katayama [8]. These tests are capable of testing the significance of the autocorrelations (partial autocorrelations) up to a finite number of lags.

The residuals are very commonly used as a diagnostic tool to test the goodness of fit of models. In a time series context, if the fitted model is good then it should be able to explain the dependence pattern among successive observations. In other words, all the dependence in terms of autocorrelations and partial autocorrelations of the data generating process (DGP) should be explained by the fitted model so there should be no significant autocorrelation and partial autocorrelation in successive terms of the residuals.

In practice, the most popular way for diagnostic checking of a time series model is the portmanteau test, which tests whether any of the first autocorrelations of a time series are significantly different from zero. This type of test was first suggested by Box and Pierce [3], in which they studied the distribution of residual autocorrelations in ARIMA processes.

Based on the autocorrelations of the residuals obtained by fitting an ARMA model, Box and Pierce [3] suggested the following portmanteau test: where is the residual autocorrelation at lag . They suggested that , for moderate values of , and the fitted model is adequate, under the following conditions: (1) for ,(2) ,where are the weights in the MA representation. This approximation requires substitution of residuals, , for the error term, , in the model but this sort of substitution can be a serious underestimation of significance level in diagnostic checking [9]. Many other researchers have also questioned the distribution of (see, e.g., [10] and references therein). The choice of is an important issue.

Ljung and Box [4] suggested the use of the modified statistic They have shown that the modified portmanteau statistic has a finite sample distribution which is much closer to . Their results also show that is insensitive to the normality assumption of . As pointed out by many researchers, for example, Davies et al. [11], Ansley and Newbold [12], the true significance levels of tend to be much lower than predicted by the asymptotic theory and though the mean of is much closer to the asymptotic distribution, this corrected version of the portmanteau test has an inflated variance. But Ljung and Box [4] pointed out that approximate expression of variance given by Davies et al. [11] overestimates the variance of .

McLeod [10, Theorem 1] has shown that is approximately normal with mean and , where , is the identity matrix, and is the Fisher information matrix. The superscript stands for transposition of matrix. We noticed that approximation of by , especially when is small, is a source of bias in approximating the asymptotic distribution of portmanteau tests.

In practice, the optimal choice of is difficult as the use of the approximation and diagnostic checking require large values of which results in less power and unstable size of test, as noticed by Ljung [13] and Katayama [7]. Katayama [8] suggested a multiple portmanteau test to overcome this problem. His suggested test is based on several portmanteau tests for a range of small to medium values of . He showed using some numerical examples that his suggestion leads to a superior test. He also discussed the linkage between his suggested multiple test and the test due to Peña and Rodríguez [14]. He suggested a method based on some iterative procedure to approximate joint distribution of the multiple test as the computation of the distribution is very hard due to correlated elements.

The portmanteau tests and are based on the autocorrelations. Monti [6] suggested a portmanteau test where is the residual partial autocorrelation at lag . She showed that , analogously to , has an asymptotic null distribution and that is more powerful than especially when the order of the moving average component is understated. Chand and Kamal [15] had studied the size of Monti’s test.

Portmanteau tests, for example, Box-Pierce test, which are based on residuals autocorrelations, have good power against the understatement of autoregressive component while the tests using residuals partial autocorrelations have shown good power against the understatement of moving average term. A new test using both autocorrelation and partial autocorrelations is expected to perform better in both of these two scenarios and will be capable of picking up understatement in moving average and autoregressive components.

2. Materials and Methods

In this section, we give the algorithms for the Monte Carlo method used to compute the empirical size and power of the diagnostic tests defined in Section 2. For each Monte Carlo run, a sample time series is simulated under the model . For empirical size, is the null model while for the computation of power it is the alternative model. In both of the situations, we estimate the null model for the simulated sample time series and is calculated from the residuals, .

Algorithm 1 gives the bootstrap procedure used in our numerical study. From this algorithm, we obtain the bootstrap approximation of the distribution of the test. We will use this algorithm to compute empirical size in the following algorithm of our simulation study consisting of Monte Carlo runs.

Step  1. Generate bootstrap sample using resamples of residuals, ,
obtained by fitting the model under null hypothesis to the simulated time
series. We denoted these resampled residuals as .
Step  2. Fit the null model to the bootstrap sample and obtain residuals
as , where is the fitted series.
Step  3. Using the residuals, , obtained in Step  2 above, calculate
test-statistic . We denote it as .
Step  4. Repeat Step  1–3 for each of the bootstrap samples.

3. Results

Suppose we have an observed time series generated by an ARMA model given by where is the backward shift operator, is the autoregressive polynomial of order , and is the moving average polynomial of order . Let an appropriately identified model be fitted to the observed time series and let residuals be obtained. These residuals play an important role in the diagnostic checking of the fitted model. The residuals autocorrelation at lag can be defined as where is the series at lag of . Let be the vector of model parameters and let be the corresponding estimator of .

Let us define autocorrelation of error term, , that is, white noise at lag , as

The root mean square error of can be defined as

As the above term is of order , thus we can use Taylor series of first order to approximate around . We can write where

Take For an AR process at , that is, for an AR , , we can write

From (10) and summing over observed sample, we get Partially differentiating with respect to At Now As thus where

Using Bartlett’s formula, we can write

Using and the above results Consider the AR process

Multiplying both sides by and on taking expectation, we get . Without loss of generality, we can assume ; moreover it is considered a stationary process; therefore and . Moreover, for . Thus

Divide both sides by and using , we get Using this result and the structure of AR process, it implies

The AR process can be written in an equivalent moving average representation, that is, MA , given as where Using Box and Pierce [3, page. 1514] and the above results, we can write where , and thus using (8), we get

Using McLeod [10, Theorem 1], the asymptotic distribution of , where is normal with mean and covariance matrix ; that is, where for an ARMA process

The elements and are the coefficients of in the infinite expansion of inverse of AR and MA polynomials. For an AR process, the above matrix reduces to

Let be the partial autocorrelation at lag ; then using Monti [6, page 777, equation ], we have

According to Monti [6], the can be obtained using a singular linear transformation of the noise autocorrelations, that is, , given as Using (30), we can write

Ljung and Box [4], using (30), suggested a portmanteau test for diagnostic testing of the fitted ARMA model given as

Following the same lines and using the result in (35), Monti [6] suggested the following portmanteau test:

4. New Test

We suggest a new test , that is, the sum of Ljung and Box [4] and Monti [6], given as

It is obvious that this new statistic is the sum of two chi-square variables as discussed above in (36) and (37). Moreover, it can be noticed that both of these random variables will be positively correlated due to their structure of using residual autocorrelation and partial autocorrelation. We have discussed their correlational structure later in this report.

Using the result of Joarder and Omar [16, Theorem 3.1], we can write the probability density function of as where is the correlation between and :

Now the important issue is to estimate which is discussed in Section 4.1.

4.1. Estimation of

We have used Monte Carlo simulation to estimate . We have considered various choices of sample size and and obtained the following results. Empirical results suggest that, for any , we have where . Results are given in Table 1.

4.2. Empirical Size

The size of a test is helpful in assessing how reasonable is our assumption of the null distribution. We can compute the size when the sample is simulated under the null model. This is the probability of rejecting the null model when the null model is true model. The model under null hypothesis is considered an AR process . To see the effect of stationarity and near to unit-root problem, we have considered various choices of . The value of near to , for example, , is an example of near nonstationary process.

It can be seen from the results shown in Tables 2 and 3 that our new suggested mixture test, that is, , have close approximation of corresponding nominal sizes. These results show the good approximation of asymptotic distribution of our new suggested test.

4.3. Empirical Power

The power of a test is the probability of rejecting a false null hypothesis. For empirical power, as mentioned earlier, the sample is generated under the alternative model. Algorithms 2 and 3 state the Monte Carlo procedure we use to determine the empirical size and power of test.

Step  1. Obtain using Algorithm 1, for each of the bootstrap samples,
reject null model if , otherwise accept it.
Step  2. Determine the proportion of bootstrap samples, say , for which
the null hypothesis is rejected.
Step  3. Repeat Step  1-2 for each of the Monte Carlo runs.
Step  4. Empirical size, , is determined as the proportion of Monte Carlo
runs for which the , where is the level of significance,
           .

Step  1. Calculate th percentile, say , of the bootstrap distribution of
obtained using Algorithm 1.
Step  2. Reject null model if otherwise accept it.
Step  3. Repeat Step  1-2 for each of the Monte Carlo runs.
Step  4. Empirical power, , is determined as below,
           .

For power computation, we have simulated under an ARMA process and as alternative models, while AR process is the model specified in null hypothesis. Results are provided in Tables 4 and 5. These results show that our suggested mixed portmanteau tests have empirical power comparable with other tests. It can be noticed that our new test has shown power better than other considered tests when moving average part is understated and the moving average parameter is relatively small.

5. Conclusion

Portmanteau tests based on residuals autocorrelations and partial autocorrelations have their individual good properties. Our suggested mixture portmanteau tests have a good approximation of their asymptotic distribution. As it is important to study both the residuals autocorrelation and the partial autocorrelations for model diagnostic checking, so this new test is capable of doing it simultaneously instead of running separate diagnostic tests.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors thank the editor and the referees for their helpful comments and suggestions that greatly improved the previous version of this paper. The work of first author was supported by the Research Grant 5227-Est-I-2013 from the University of the Punjab, Lahore, Pakistan.