Abstract

Using numerical simulation, the finite-sample properties of threshold autoregressive (TAR) and momentum-threshold (MTAR) autoregressive-based unit root tests under both deterministic and consistent methods of threshold estimation are examined in the presence of generalised autoregressive conditional heteroskedasticity (GARCH). Previous research is extended by considering both the impact of alternative robust methods of covariance matrix estimation and the behaviour of the secondary tests of asymmetry associated with the TAR and MTAR models. The results obtained reveal many interesting features, in particular the distortionary effects of consistent-threshold estimation. In summary, the findings presented indicate that caution should be exercised when interpreting the results of these frequently employed threshold-based testing methods.

1. Introduction

Following the work of Engle [1], Bollerslev [2], and Taylor [3], the analysis of volatility has become a central feature of both empirical and theoretical finance. This in turn has led to the emergence of a large literature examining the properties and behaviour of unit root tests in the presence of volatility in its commonly considered form of generalised autoregressive conditional heteroskedasticity (GARCH). (The GARCH model has become a cornerstone of empirical research in economics and finance, receiving widespread application and proving to provide accurate forecasts (see [4]).) While some authors have considered joint estimation of unit root testing equations and GARCH (1,1) processes (see, inter alia, [57] and references contained therein), others have examined the size properties of unit root tests in the presence of neglected GARCH behaviour (see [810]). (Following the emergence of a number of studies considering the relevance and implications of heavy-tailed distributions in economic and financial time series data (see, inter alia, [1114]), a further related literature has emerged examining the properties of unit root tests when applied to series with heavy-tailed disturbances (see [15, 16]).) In this paper, this latter literature is extended by considering the properties of the threshold autoregressive (TAR) and momentum-threshold autoregressive (MTAR) unit root tests of Enders and Granger [17] when applied to time series with GARCH (1,1) disturbances. This issue clearly merits attention given the noted prevalence of GARCH behaviour in economic and financial data. While Cook [9] provides an initial examination of the properties of the TAR and MTAR unit root tests in the presence of GARCH, the current study develops this work in two crucial ways. First, the impact of alternative methods of variance-covariance estimation is considered. An analysis of this is of clear importance as while the covariance matrix estimators of White [18] and Newey and West [19] are routinely adopted by econometricians and time series analysts as a general correction for problems encountered with heteroskedasticity, their impact in the presently considered circumstances is unknown. Second, the subsequent tests of the hypothesis of symmetry associated with the TAR and MTAR tests are examined. Again, this is an important issue as the results of these subsequent tests of symmetry are often reported by investigators using Fisher’s 𝐹-distribution to reinforce their empirical analyses, despite the individual properties of the tests in these circumstances, and the applicability of the 𝐹-distribution, being unknown. (The use of the 𝐹-distribution to provide critical values for the symmetry tests is apparent in the empirical analysis conducted in the seminal work of Enders and Granger [17]. More recent studies employing the 𝐹-distribution include, inter alia, Payne and Mohammadi [20] and Thompson [21].) Indeed, it will be seen that the use of the 𝐹-distribution can generate misleading results for the symmetry tests, irrespective of whether GARCH is present. The present work addresses these issues using Monte Carlo simulation, considering the TAR and MTAR tests under both deterministic and consistent methods of threshold estimation.

This paper proceeds as follows. In the following section the TAR and MTAR tests are outlined. Using Monte Carlo methods, Section 3 considers the finite-sample distributions of the TAR and MTAR unit root tests under alternative methods of covariance matrix estimation and the empirical sizes of the symmetry tests under use of the 𝐹-distribution. The simulation analysis is extended in Section 4 to consider the behaviour of the TAR and MTAR unit root tests and the subsequent tests of symmetry, in the presence of GARCH. Section 5 concludes.

2. TAR and MTAR Unit Root Tests

Consider a time series process {𝑦𝑡}𝑇𝑡=0. Given this series, the familiar Dickey-Fuller [22] (DF) test in its simplest form can be expressed as the t-ratio of 𝜙 in the following regression: Δ𝑦𝑡=𝜙𝑦𝑡1+𝜀𝑡.(2.1) Comparison of the calculated test statistic with specifically derived nonstandard critical values allows examination of the unit root hypothesis 𝐻0𝜙=0 against an alternative of asymptotic stationarity (𝐻0𝜙<0). However it is apparent that (2.1) is an implicitly symmetric specification. To allow for the possibility of stationary asymmetric adjustment about an underlying attractor, Enders and Granger [17] draw upon the threshold autoregressive methods of Tong [23, 24]. Adopting this approach, (2.1) is generalised via the introduction of the Heaviside indicator function to partition the lagged level term 𝑦𝑡1 with a resulting testing equation given as Δ𝑦𝑡=𝐼𝑡𝜌1𝑦𝑡1+1𝐼𝑡𝜌2𝑦𝑡1+𝜉𝑡,(2.2) where the zero-one Heaviside indicator function (𝐼𝑡) can be specified as either 𝐼𝑡=1if𝑦𝑡10,0if𝑦𝑡1<0(2.3) or: 𝐼𝑡=1ifΔ𝑦𝑡10,0ifΔ𝑦𝑡1<0.(2.4) The combination of (2.2) and (2.3) results in a TAR model, while combination of (2.2) and (2.4) leads to an MTAR model. The testing equation (2.2) therefore provides the basis of the asymmetric unit root tests of Enders and Granger [17], with the speed of adjustment about the stationary attractor given by 𝜌1 when above equilibrium and 𝜌2 when below. The unit root hypothesis is then tested via the null 𝖧0𝜌1=𝜌2=0, with the resulting test statistic denoted as Φ when using (2.3) and Φ when using (2.4). Due to the nonstandard distributions of the Φ and Φ statistics, their use requires the derivation of appropriate critical values via Monte Carlo experimentation. To further examine potential asymmetric stationarity, investigators often conduct a further test of the symmetry hypothesis via the null 𝜌1=𝜌2 using a conventional 𝐹-statistic. The appropriateness of the 𝐹-distribution when conducting this subsequent test of symmetry will be questioned later in this paper.

The above analysis adopts an implicit assumption of zero attractor (𝑦𝑡=0). However, in practice it is more realistic to consider a nonzero attractor. In these circumstances, it is possible to derive the required threshold either deterministically or via consistent-threshold estimation. Under the first option, a deterministic threshold is imposed via regressing the original series {𝑦𝑡} upon a constant to derive a new series {̃𝑦𝑡}. (In this analysis, the intercept-only model is considered for the TAR and MTAR tests as it is this form alone which is available for the tests under consistent-threshold estimation.) The derived series {̃𝑦𝑡} is then employed in a modified version of (2.2) as given below: Δ̃𝑦𝑡=𝐼𝑡𝜌1̃𝑦𝑡1+1𝐼𝑡𝜌2̃𝑦𝑡1+𝜁𝑡.(2.5) In a similar manner, the specification of the Heaviside indicator function in (2.3) or (2.4) is modified by using the appropriate revised series {̃𝑦𝑡} rather than {𝑦𝑡}. The resulting TAR and MTAR tests are denoted as Φ𝜇 and Φ𝜇, respectively. Under consistent-threshold estimation, the above two-step procedure is replaced by an optimisation procedure to select the threshold. To illustrate this, consider the Φ𝜇 and Φ𝜇 tests above. To construct these tests statistics, prior to regression upon a constant term effectively, this demeans the series of interest, implying that the version of (2.2) actually estimated is given as Δ𝑦𝑡=𝐼𝑡𝜌1𝑦𝑡1𝑦+1𝐼𝑡𝜌2𝑦𝑡1𝑦+𝜉𝑡.(2.6)

However, as noted by Enders [25], in the presence of asymmetric adjustment 𝜌1 and 𝜌2 differ, and consequently the mean is a biased estimator of the threshold. To overcome this and obtain a consistent value of the threshold, the approach of Chan [26] is followed with a grid search procedure employed to select the threshold. The previously defined Heaviside indicator functions of (2.3) and (2.4) are then revised as follows: 𝐼𝑡=1if𝑦𝑡1𝜏,0if𝑦𝑡1𝐼<𝜏,(2.7)𝑡=1ifΔ𝑦𝑡1𝜏Δ,0ifΔ𝑦𝑡1<𝜏Δ,(2.8) with the thresholds 𝜏 and 𝜏Δ selected by grid search over a range of values of {𝑦𝑡} and {Δ𝑦𝑡} respectively. The appropriate asymmetric unit root testing equations are therefore given as Δ𝑦𝑡=𝐼𝑡𝜌1𝑦𝑡1+𝜓1𝐼𝑡𝜌2𝑦𝑡1𝜓+𝜂𝑡,𝜓=𝜏,𝜏Δ.(2.9) Application of the consistent-threshold versions of the TAR and MTAR tests therefore requires selection of the appropriate threshold value (𝜓). Considering TAR adjustment, the series of interest {𝑦𝑡} is reordered as 𝑦𝑜1<𝑦𝑜2<<𝑦𝑜𝑇, with the central 70% of observations (𝑦𝑜𝑖,𝑖=0.15𝑇,,0.85𝑇) within this range of values considered as potential thresholds. Each of these values (𝑦𝑜𝑖=𝜏) is employed in turn in the indicator function of (2.7) with (2.9) then estimated. The value 𝑦𝑜𝑖 delivering the minimum residual sum of squares (𝜂2𝑡) for (2.9) is then deemed to be the consistent estimate of the threshold (𝜏) with the resulting TAR asymmetric unit root test labelled Φ𝑐𝜇. Under MTAR adjustment, a similar approach is followed with the central 70% of observations from the reordered sequence {Δ𝑦𝑜1,Δ𝑦𝑜2,,Δ𝑦𝑜𝑇} now considered as potential thresholds. Again the selected threshold 𝜏Δ is that value minimising the residual sum of squares of (2.9) with (2.8) now employed as the appropriate indicator function. The resultant MTAR test under consistent-threshold estimation is labelled as Φ𝜇,𝑐. In the subsequent research of Cook and Manning [27], the use of consistent-threshold estimation was shown to substantially increase the power of both TAR and MTAR asymmetric unit root tests in the presence of asymmetrically stationary processes. In the subsequent sections of this paper, the behaviour of the above tests will be examined in the presence of GARCH, with particular attention paid to the role of alternative variance-covariance matrix estimators and the secondary test of symmetry.

3. Monte Carlo Experimentation I

3.1. Derivation of Critical Values for the Asymmetric Unit Root Tests

Before considering the properties of the Φ𝜇,Φ𝜇,Φ𝑐𝜇, and Φ𝜇,𝑐 tests and their associated subsequent tests of symmetry in the presence of GARCH, critical values for these tests are derived under the use of alternative variance-covariance estimators. In addition to considering the standard OLS covariance estimator, the covariance matrix estimators of White [18] and Newey-West [19] are also employed for the four tests. The use of White and Newey-West covariance matrix estimators is denoted by the addition of the subscripts 𝑤 and 𝑛, respectively, to the test statistic, while no additional subscript is employed when using the standard OLS covariance estimator. To obtain the required critical values, the following data generation process (DGP) is employed: 𝑦𝑡=𝑦𝑡1+𝜀𝑡𝑡=1,,𝑇,(3.1) where the innovation series {𝜀𝑡} is generated using pseudo i.i.d. 𝖭(0,1) random numbers from the NRND procedure in the EViews 6. The experiments are performed over 50,000 simulations for three sample sizes: 𝑇={200,400,800}. Critical values are reported in Table 1 for three levels of significance (𝛼=0.10,0.05,0.01). The consideration of four tests, three sample sizes, three covariance matrix estimators, and three levels of significance results in the calculation of (4×3×3×3=) 108 critical values. For all of the four tests examined, the results presented in Table 1 show that finite-sample critical values are clearly dependent upon the covariance matrix estimator employed. Although the impact of alternative covariance estimators is negligible asymptotically, its finite-sample impact is apparent and is noticeable for even the relatively large sample of 800 observations considered in the present experiments. In short, it can be seen that movement from the OLS covariance matrix estimator to the White estimator and then Newey-West estimator leads to inflation of the critical values of all tests.

3.2. Empirical Sizes of the Symmetry Tests

To consider the finite-sample sizes of the subsequent associated tests of symmetry (𝖧0𝜌1=𝜌2) associated with the Φ𝜇,Φ𝜇,Φ𝑐𝜇 and Φ𝜇,𝑐 tests under alternative methods of covariance matrix estimation, the DGP of the previous subsection is employed but with (3.1) respecified as follows: 𝑦𝑡=𝜌𝑦𝑡1+𝜀𝑡𝑡=1,,𝑇.(3.2) This modification is introduced to allow consideration of stationary processes via |𝜌|<1, as the symmetry test is valid for stationary processes only. To examine the finite-sample empirical sizes of the symmetry tests associated with the Φ𝜇,Φ𝜇,Φ𝑐𝜇 and Φ𝜇,𝑐 statistics, 𝜌=0.85 is employed in the following Monte Carlo analysis. The empirical rejection frequencies for the symmetry tests are derived at the 5% level of significance (𝛼=0.05) over 50,000 replications with the innovation series {𝜀𝑡} again generated as a standard normal variable using the NRND procedure.

The empirical rejection frequencies for the symmetry tests are reported in Table 2. From inspection of this table, it is clear that there are a number of interesting findings to consider. First, under deterministic estimation of the threshold, the symmetry test for the TAR model is substantially undersized irrespective of the covariance matrix estimator employed. In contrast to this, the symmetry test for the MTAR model with a deterministic threshold has near nominal size. Second, under consistent-threshold estimation, the symmetry tests for both the TAR and MTAR models are dramatically oversized, with the former exhibiting greater size distortion than the latter. Third, movement from the standard OLS covariance matrix estimator to the White and the Newey-West estimator results in an increase in empirical size irrespective of the test and method of threshold estimation employed.

4. Monte Carlo Experimentation II

4.1. Empirical Sizes of Asymmetric Unit Root Tests in the Presence of GARCH

To examine the properties of the above-mentioned asymmetric unit root tests in the presence of GARCH (1,1) errors, the following DGP is employed: 𝑦𝑡=𝑦𝑡1+𝑤𝑡𝑡=1,,𝑇,2𝑡=𝜙0+𝜙1𝑤2𝑡1+𝜙22𝑡1,𝑤𝑡=𝑡𝑣𝑡,𝑣𝑡𝑁(0,1).(4.1) This DGP closely follows that of Kim and Schmidt [8] and considers GARCH processes generated for a range of realistic values of {𝜙1,𝜙2} corresponding to near integration, with 𝜙0=1𝜙1𝜙2 in all cases. (In this paper the less empirically realistic, or relevant, cases of degenerate GARCH (𝜙0=0) and integrated GARCH (𝜙1+𝜙2=1) are not considered.) The precise values of the volatility parameter (𝜙1) and the degree of persistence (𝜙1+𝜙2) employed are based upon estimated values observed in empirical research, with the distinction between typical values of {𝜙1,𝜙2} observed at differing data frequencies noted (see [28, 29]). Throughout, the initial value of the conditional variance is set equal to one (0=1), with an additional, initial 400 observations of the generation of the GARCH process discarded prior to generation of the series of interest 𝑦𝑡 to remove the impact of the initial condition. The initial value of 𝑦𝑡 is set to zero (𝑦0=0) without loss of generality. The innovation series {𝑣𝑡} is generated using pseudo i.i.d. 𝖭(0,1) random numbers from the NRND procedure in EViews 6 with all experiments performed over 20,000 simulations for the previously considered sample sizes: 𝑇={200,400,800}. The empirical sizes of the alternative tests are derived as the rejection frequencies for the alternative tests at the 5% level of significance under the use of the OLS, White and Newey-West covariance estimators using the critical values derived in the previous section.

Considering the results presented in Table 3 for the TAR unit root test for the standard OLS covariance estimator, it can be seen that the presence of GARCH does result in size inflation, with this becoming more noticeable for larger values of the volatility parameter. From inspection of the results for {𝜙1,𝜙2}={0.15,0.84} and {𝜙1,𝜙2}={0.25,0.70} it is also apparent that the degree of near integration as well as the size of the volatility parameter influences size distortion, as empirical size is lower for the second pairing of parameters with a lower sum for the GARCH parameters, but a higher degree of volatility. While these results hold under both deterministic and consistent threshold estimation, it can be seen that greater size distortion results from the use of consistent-threshold estimation. Conversely, it is apparent that the use of either the White or Newey-West covariance covariance matrix estimators dramatically reduces oversizing for the tests to a near negligible level.

From inspection of the results for the MTAR unit root test in Table 4, it can be seen that the behaviour of the MTAR test closely mimics that of the TAR test. However, it is also apparent that under consistent-threshold estimation and use of the standard OLS covariance matrix estimator, the oversizing of the MTAR test is less than that of TAR test for the identical values of the GARCH parameters. However, despite this, empirical sizes more than twice the nominal size are obtained.

4.2. Empirical Sizes of the Symmetry Tests in the Presence of GARCH

In Tables 5 and 6, the empirical sizes of the symmetry tests associated with the TAR and MTAR unit root tests are reported. These results are obtained using the following DGP: 𝑦𝑡=𝜌𝑦𝑡1+𝑤𝑡𝑡=1,,𝑇,2𝑡=𝜙0+𝜙1𝑤2𝑡1+𝜙22𝑡1,𝑤𝑡=𝑡𝑣𝑡,𝑣𝑡𝑁(0,1),(4.2) where 𝜌=0.85 in (4.2). Aside from the consideration of stationary time series via the use of the AR(1) specification 𝑦𝑡=0.85𝑦𝑡1+𝑤𝑡, the Monte Carlo analysis conducted in this subsection follows that of the previous subsection in terms of generation and replication. While a range of results is reported in Tables 5 and 6, the findings can be summarised straightforwardly.

Considering the TAR model with a deterministically imposed threshold value, substantial undersizing is apparent. Conversely, when consistent-threshold estimation is employed, the resulting symmetry test is massively oversized, with the use of OLS covariance matrix inducing greater size inflation than the corrected covariance matrix estimators of White and Newey-West. Despite undersizing (oversizing) of the test with a deterministic (consistent) threshold being noted previously in the absence of GARCH, it is apparent that neglected GARCH behaviour does increase size distortion with size inflation being positively related to the degree of volatility as given by 𝜙1. Turning to the results for the MTAR test, under the use of a deterministic threshold and corrected covariance matrix estimator, the test has relatively good size, particularly when the White covariance matrix is used. However, when the standard OLS covariance is employed, substantial size distortion is exhibited for volatile GARCH processes. Considering the symmetry test for the MTAR model under consistent threshold estimation, size distortion is apparent irrespective of the covariance matrix estimator employed, although the corrected covariance estimators, and the White covariance matrix estimator in particular, do reduce this. The results obtained therefore indicate that use of the preferred method of consistent-threshold estimation results in substantial oversizing of the symmetry test for both the TAR and MTAR models with all covariance matrix estimators, leading to a spurious detection of asymmetric behaviour.

5. Concluding Remarks

The popularity of threshold-based unit root tests as a means of detecting asymmetric stationarity in time series processes has increased in recent years, with their analysis and application now widespread. This paper has examined the practically important issue of the finite-sample size properties of these TAR and MTAR tests in the presence of GARCH behaviour, with particular attention paid to the use of alternative covariance matrix estimators and the subsequent test of the symmetry hypothesis. The results of the Monte Carlo analysis undertaken show the TAR and MTAR tests to be oversized when examining the unit root hypothesis in the presence of GARCH, unless a corrected covariance matrix estimator (White or Newey-West) is employed. This result is particularly apparent when considering more highly volatile GARCH processes. Interestingly, it was seen also that the use of consistent-threshold estimation increases the degree of size distortion exhibited. However, it was the examination of the symmetry hypothesis which provided the more dramatic and important results. The most crucial result concerned the substantial oversizing of the symmetry test for the TAR and MTAR models irrespective of both the covariance matrix estimator employed and the presence of GARCH. This finding indicated that a spurious inference of asymmetric adjustment could be noted with a high degree of probability when applying these tests and strongly rejected the use of the 𝐹-distribution for this test as routinely employed by practitioners (see [17, 20, 21]). Under the use of a deterministic threshold it was found that the symmetry test under the TAR model was undersized in the absence of GARCH, while the test under the MTAR model had approximately correct size. In the presence of GARCH behaviour, the most noticeable finding concerned the subsequent oversizing of the symmetry test for the MTAR model, although this was noted for the model using the standard OLS, rather than corrected, covariance matrix estimator.

In summary, the results of the present paper indicate that the routine application of TAR and MTAR unit root tests to economic and financial time series can result in incorrect or spurious inferences. While the probability of a spurious inference concerning the unit root hypothesis is most likely to occur when a corrected covariance matrix estimator is employed in the examination of time series processes exhibiting GARCH behaviour, the symmetry test has been shown to exhibit spurious rejection under consistent-threshold estimation even in the absence of GARCH. The results obtained indicate that practitioners should exercise care when applying TAR and MTAR unit root tests to economic and financial time series, particularly when considering the secondary test of symmetry where use of the 𝐹-distribution is clearly inappropriate despite its routine use in empirical analyses. As result of the above findings, an obvious future avenue of research concerns the development of asymmetric unit root tests which capture and draw upon any GARCH behaviour exhibited by data. This work, which can be viewed as a development of the studies of Seo [5] and Cook [6], is the subject of ongoing research.