Nonlinear Problems: Mathematical Modeling, Analyzing, and Computing for FinanceView this Special Issue
Estimation of Nonlinear Dynamic Panel Data Models with Individual Effects
This paper suggests a generalized method of moments (GMM) based estimation for dynamic panel data models with individual specific fixed effects and threshold effects simultaneously. We extend Hansen’s (Hansen, 1999) original setup to models including endogenous regressors, specifically, lagged dependent variables. To address the problem of endogeneity of these nonlinear dynamic panel data models, we prove that the orthogonality conditions proposed by Arellano and Bond (1991) are valid. The threshold and slope parameters are estimated by GMM, and asymptotic distribution of the slope parameters is derived. Finite sample performance of the estimation is investigated through Monte Carlo simulations. It shows that the threshold and slope parameter can be estimated accurately and also the finite sample distribution of slope parameters is well approximated by the asymptotic distribution.
Since many economic relationships are dynamic and nonlinear, nonlinear/dynamic panel data models could obtain more information from data sources than traditional models [1, 2]. For example, many researchers suggest that economic growth is a nonlinear process [3–5] and a number of empirical analyses of economic growth entail dynamic econometric models [6–9], with lagged dependent variable among the regressors. However, few researchers consider the dynamic and nonlinear relationships simultaneously and the purpose of this paper is to combine these two factors in one model.
Many results exist in the theoretical literature concerning the estimation and inference for dynamic panel data models. Since the lagged dependent variables and the disturbance term are correlated due to the unobserved effects, standard least square methods could not obtain consistent estimators when the model is dynamic. To overcome this problem, Anderson and Hsiao  suggested that we difference the model first to get rid of the unobserved effects and then use instrumental variable (IV) estimation for the transformed model. Nevertheless, this IV estimation method leads to consistent but not necessarily efficient estimates of the parameters because it does not use all the available moment conditions. Arellano and Bond  proposed a generalized method of moments (GMM) procedure that is more efficient than the Anderson and Hsiao  estimator. This literature is generalized and extended by Arellano and Bover  and Blundell and Bond , which are called forward orthogonal deviation and system GMM, respectively. For the latest development of dynamic panel data models, see Baltagi  and Han and Phillips  for more details.
Several models could be chosen to describe the nonlinear relationship such as mixture models, switching models, smooth transition threshold models, and threshold models. In this paper threshold model is used because of wide applications in empirical researches. This model splits the sample into classes based on an observed variable—whether or not it exceeds some thresholds. In most situations, the complexity of the problem increases because the exact threshold is unknown and needed to be estimated. The estimation and inference are fairly well developed for linear models with exogenous regressors [15–17], in which only the nondynamic case is considered.
The dynamic panel threshold models have been used in empirical literature. Cheng et al.  examined the evidence on the conditional convergence growth theory, which extended dynamic panel data growth model to control both threshold effects and cross-section dependence. Chong et al.  studied the relationship between the depletion rate of foreign reserves and currency crises using threshold autoregressive model. Ho  applied a dynamic panel threshold model to examine whether the low-income countries catch up with the rich ones. Kremer et al.  considered a dynamic panel threshold model to study inflation thresholds for long-term economic growth. As Hansen’s model required that all regressors are exogenous, the method of Hansen  used in these papers to estimate the dynamic models may not be suitable due to the lagged dependent variables. So far, the theory of dynamic panel threshold model has not been available as we know except for Dang et al. . However, the validity of the instrumental matrices is not proved. This paper proposes an estimation method for dynamic panel threshold model and our analysis mainly relies on Hansen , Arellano and Bond , and Caner and Hansen . First, we prove that the orthogonality conditions considered in ordinary dynamic panel data models are also valid in nonlinear dynamic models. Second, we develop a GMM estimator of the threshold and slope parameters based on the above moment conditions.
The remainder of the paper proceeds as follows. Section 2 introduces the model and notations. Section 3 discusses the estimation for the threshold and slope coefficients. Section 4 reports a Monte Carlo simulation, and Section 5 concludes.
Consider a simple AR model without exogenous variables but with individual and threshold effects as shown in the following structural equation: where denotes cross-sections and denotes time. denotes the observable dependent variable; denotes the exogenous threshold variable; denotes the threshold parameter, which is assumed to be unknown and needs to be estimated; denotes a parameter that satisfies ; is the unobserved individual effect; is the idiosyncratic error, which is assumed to be independent and identically distributed (i.i.d) with mean zero and variance conditional on . is indicator function.
One can also write (1) in the form For simplicity, we assume that is observed and let . Alternatively, (1) can also be written compactly as where , .
In this section, we first consider a simple model without exogenous covariates and derive GMM based estimator for the threshold parameter and slope parameters . Then we extend the simple model to cases with strictly exogenous covariates.
3.1. Estimation of Threshold and Slope Parameters
In traditional dynamic panel data model, two methods are commonly used to remove individual effect . One is first-difference approach suggested by Arellano and Bond ; the other one is forward orthogonal deviation proposed by Arellano and Bover . We will utilize the first-difference approach in the following derivation, due to the fact that it is more convenient for computation.
First, we take first-difference for model (3) to get rid of the time invariant individual effects where denotes difference operator. If , that is, there is no threshold effect, then additional instruments can be obtained in dynamic panel data models if one utilizes the orthogonality conditions that exist between lagged values of and the disturbance according to Arellano and Bond . Here we prove that these orthogonality conditions are also valid in model (4) when .
For any given , we have either or . Consider the former one without loss of generality. Similarly, there must be two cases in the period , Then first difference yields Correspondingly, For any given , is a valid instrument (A valid instrument means that it should have two basic conditions: first, erogeneity, i.e., it should be independent of (or, at least, uncorrelated with) the disturbance term in the equation of interest; second, relevance, i.e., it should be correlated with the included endogenous explanatory variables for which it is supposed to serve as an instrument.) for both case 1 and case 2 since it is correlated with , that is, or , but not correlated with , that is, as long as ’s are serially uncorrelated. Given the autoregressive nature of the model and the assumption that there is no serial correlation in , it can be easily shown that are also valid instruments. Therefore, the orthogonality conditions are given by
Define then for each the moment conditions described above can be written as Note that is MA. Define ; then where Let be the GMM estimator with moment conditions given by (10). The GMM estimator can be written as where , .
Stacking over individuals, (13) can be written compactly as where , is Kronecker product and , , and .
In fact, this estimator is infeasible in empirical studies, since it depends on an unknown parameter . Therefore, our next step is to estimate from the regression residuals: We apply the estimator suggested by Chan  and Hansen [15, 17]; then can be estimated by where is the sum of squared errors.
Once is obtained, we substitute the true parameter with its estimate yielding the feasible GMM estimator of slope coefficient estimate:
According to Hansen , under the case of known , GMM estimator is efficient and asymptotically normal: where
Hansen  and Caner and Hansen  show that the dependence on the threshold estimate is not of first-order asymptotic importance, so inference on could proceed as if the estimated threshold parameter was the true parameter . Then, The estimated covariance matrix becomes (One can also consider Windmeijer’s bias-corrected estimator (Windmeijer ) for the robust VCE of two-step GMM estimators.) where and . Similarly, one can prove that converges in probability to as in Caner and Hansen .
3.2. Estimation of the Model with Exogenous Variables
Now we extend the results in the previous subsection to cases with strictly exogenous variables. Consider additional regressors in model (1): for ; . Since are strictly exogenous, they are valid instruments for the first differenced form of (22). Therefore, should be added to each diagonal element of in (9). Hence, the matrix of instruments is then the estimators of and slope coefficients can be obtained accordingly as in (16) and (17).
4. Monte Carlo Experiments
In this section Monte Carlo experiments are implemented to examine the finite sample performance of our estimator. For this purpose we consider the following design.
4.1. Simulation Design
The data generating process (DGP) is given by for and , where , , , and are mutually independent random variables. Let , , , and , and varies among and varies among . All results are based on 1,000 replications.
The computation of the threshold involves the minimization problem in (16), which can be reduced to searching for the values of that minimizes the sum of squared errors among all distinct values of in the sample. Obviously, there are at most distinct values of , and the minimum value of considered in the simulation is 1000. Thus, the searching could take a fair amount of time when the number of possible values is large. To reduce the computation load, we employ the method proposed by Hansen . Specifically, instead of searching over all values of , we limit it to some specific quantiles , which contain only 393 different values. However, this approach may not be as appealing as searching over all possible values of when the number of distinct value of is small.
4.2. Simulation Results
Tables 1 and 2 represent the 5%, 50%, and 95% quantiles of the simulation distribution of , and for varying among 10, 15, and 20 and varying among 100, 200, and 300.
Table 1 reports the results of , corresponding to the case when threshold is small. The estimates of the threshold perform fairly well for all cases considered, since the medians of are around the true value . As increases, the distribution of is becoming more and more concentrated around the true value. For example, when and , the length of the quantile range between 0.05 and 0.95 is 4.02, while, when , the length decreases to 0.95. The distribution of the slope coefficient estimator exhibits a little downward bias as it has been shown in some of the existing Monte Carlo studies for dynamic panel data models. For and , the median bias of is 0.06, but this bias is reduced as and/or increases; for example, this bias is only 0.01 for and . Similarly, the length of the quantile range between 0.05 and 0.95 for is getting smaller as increases, which means that the performance improves. The quantiles of the distribution of also performs well in all cases, although it is relatively weak in cases with small and small .
Table 2 presents the results for the case when threshold is big; that is, . Compared to the small threshold case, the performance of the distribution of is improved. The median bias of is zero for almost all cases, and the length of the quantile range between 0.05 and 0.95 is getting smaller as the threshold effect increases. Meanwhile, Table 2 reports similar results as Table 1 for the parameters of and . In Table 2, they also perform fairly well in the big threshold case.
Figure 1 displays kernel estimates of the distribution of the slope parameters and based on 1,000 replications with , , and small threshold (). The estimates are slightly biased downwards when is small or is small. This bias is common in dynamic panel data model as mentioned earlier. One could also use some bias-corrected methods to improve the finite sample properties of the estimators, which is beyond the scope of this paper. The estimates are gradually centered around the true values as and/or increases, which is consistent with the above analyses and confirms the validity of our proposed estimation procedure again.
Figure 2 shows the distribution of the same parameters as Figure 1 and based on the same number of replications and sample size but with bigger threshold (). In this case the same conclusion can be found as in Figure 1. In particular, the performance of the estimators in this case is better than that in the smaller threshold for all cases.
This paper extends the estimation of threshold models in nondynamic panels to dynamic panels and presents practical estimation methods for these econometric models with individual-specific effects and threshold effects. The foremost feature of these models is that they allow the econometrician to consider the dynamic and threshold relationships in economic system simultaneously. As mentioned in the introduction, many applications may have such relationships. Using the first-difference to eliminate the individual-specific effects, we prove that the orthogonality conditions proposed by Arellano and Bond  for nonthreshold models are also valid in our models. Then, we estimate the threshold and slope parameters by GMM. Monte Carlo simulations reveal that our method has very good finite sample performance.
There are several possible extensions to this work. The asymptotic properties of the threshold parameter would be an interesting topic. Also, testing for one or multiple thresholds is also worth studying, which is saved for future research.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The authors thank the editor and three anonymous referees for many constructive and helpful comments. This work was partially supported by the National Natural Science Foundation of China (Grant nos. 71301160 and 71301173), China Postdoctoral Science Foundation funded project (Grant nos. 2012M520419, 2012M520420, and 2013T60186), Beijing Planning Office of Philosophy and Social Science (13JGB018), and Program for Innovation Research in Central University of Finance and Economics.
C. X. Huang, C. L. Peng, X. H. Chen, and F. H. Wen, “Dynamics analysis of a class of delayed economic model,” Abstract and Applied Analysis, vol. 2013, Article ID 962738, 12 pages, 2013.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
F. Wen and Z. Dai, “Modified Yabe-Takano nonlinear conjugate gradient method,” Pacific Journal of Optimization, vol. 8, no. 2, pp. 347–360, 2012.View at: Google Scholar | Zentralblatt MATH | MathSciNet
O. Galor and D. N. Weil, “Population, technology, and growth: from malthusian stagnation to the demographic transition and beyond,” The American Economic Review, vol. 90, no. 4, pp. 806–828, 2000.View at: Google Scholar
A. Mas-colell and A. Razin, “A model of intersectioral migration and growth,” Oxford Economic Papers, vol. 25, no. 1, pp. 72–79, 1973.View at: Google Scholar
P. F. Peretto, “Industrial development, technological change, and long-run growth,” Journal of Development Economics, vol. 59, no. 2, pp. 389–417, 1999.View at: Publisher Site | Google Scholar
T. W. Anderson and C. Hsiao, “Estimation of dynamic models with error components,” Journal of the American Statistical Association, vol. 76, no. 375, pp. 598–606, 1981.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
A. Ciarreta and A. Zarraga, “Economic growth-electricity consumption causality in 12 European countries: a dynamic panel data approach,” Energy Policy, vol. 38, no. 7, pp. 3790–3796, 2010.View at: Publisher Site | Google Scholar
T. S. Eicher and T. Schreiber, “Structural policies and growth: time series evidence from a natural experiment,” Journal of Development Economics, vol. 91, no. 1, pp. 169–179, 2010.View at: Publisher Site | Google Scholar
B.-N. Huang, M. J. Hwang, and C. W. Yang, “Causal relationship between energy consumption and GDP growth revisited: a dynamic panel data approach,” Ecological Economics, vol. 67, no. 1, pp. 41–54, 2008.View at: Publisher Site | Google Scholar
M. Arellano and S. Bond, “Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations,” The Review of Economic Studies, vol. 58, pp. 277–297, 1991.View at: Google Scholar
M. Arellano and O. Bover, “Another look at the instrumental variable estimation of error-components models,” Journal of Econometrics, vol. 68, no. 1, pp. 29–51, 1995.View at: Google Scholar
R. Blundell and S. Bond, “Initial conditions and moment restrictions in dynamic panel data models,” Journal of Econometrics, vol. 87, no. 1, pp. 115–143, 1998.View at: Google Scholar
B. H. Baltagi, Econometric Analysis of Panel Data, John Wiley & Sons, Chichester, UK, 2008.
C. Han and P. C. B. Phillips, “GMM estimation for dynamic panels with fixed effects and strong instruments at unity,” Econometric Theory, vol. 26, no. 1, pp. 119–151, 2010.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
B. E. Hansen, “Sample splitting and threshold estimation,” Econometrica, vol. 68, no. 3, pp. 575–603, 2000.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
M. Caner and B. E. Hansen, “Instrumental variable estimation of a threshold model,” Econometric Theory, vol. 20, no. 5, pp. 813–843, 2004.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
B. E. Hansen, “Threshold effects in non-dynamic panels: estimation, testing, and inference,” Journal of Econometrics, vol. 93, no. 2, pp. 345–368, 1999.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
J. Cheng, C. Lin, and C. Wang, “Estimation of growth convergence using common correlated effects approaches,” Working Paper, 2009.View at: Google Scholar
T. T. L. Chong, Q. He, and M. J. Hinich, “The nonlinear dynamics of foreign reserves and currency crises,” Studies in Nonlinear Dynamics & Econometrics, vol. 12, no. 4, article 2, 2008.View at: Publisher Site | Google Scholar | MathSciNet
T.-W. Ho, “Income thresholds and growth convergence: a panel data approach,” Manchester School, vol. 74, no. 2, pp. 170–189, 2006.View at: Publisher Site | Google Scholar
S. Kremer, A. Bick, and D. Nautz, “Inflation and growth: new evidence from a dynamic panel threshold analysis,” Empirical Economics, vol. 44, pp. 861–878, 2013.View at: Publisher Site | Google Scholar
V. A. Dang, M. Kim, and Y. Shin, “Asymmetric capital structure adjustments: new evidence from dynamic panel threshold models,” Journal of Empirical Finance, vol. 19, no. 4, pp. 465–482, 2012.View at: Publisher Site | Google Scholar
K. S. Chan, “Consistency and limiting distribution of the least squares estimator of a threshold autoregressive model,” The Annals of Statistics, vol. 21, no. 1, pp. 520–533, 1993.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
L. P. Hansen, “Large sample properties of generalized method of moments estimators,” Econometrica, vol. 50, no. 4, pp. 1029–1054, 1982.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
F. Windmeijer, “A finite sample correction for the variance of linear efficient two-step GMM estimators,” Journal of Econometrics, vol. 126, no. 1, pp. 25–51, 2005.View at: Publisher Site | Google Scholar | MathSciNet