Averaged and Integrated Estimations of Varying-Coefficient Regression Models with Dependent Observations

Fan, Guo-Liang; Xu, Hong-Xia

doi:https://doi.org/10.1155/2019/7146793

Discrete Dynamics in Nature and Society

On this page

Abstract Introduction Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2019 | Article ID 7146793 | https://doi.org/10.1155/2019/7146793

Averaged and Integrated Estimations of Varying-Coefficient Regression Models with Dependent Observations

Guo-Liang Fan¹and Hong-Xia Xu²

Academic Editor: Rigoberto Medina

Received06 Nov 2018

Accepted31 Jan 2019

Published04 Jul 2019

Abstract

In practical applications, lots of data such as sequentially collected economic data often exhibit some evident dependence. This paper studies the varying-coefficient regression models with different smoothing variables when the data form a stationary -mixing sequence. Both the averaged and integrated estimators of coefficient functions are proposed. The asymptotic normalities of the proposed averaged and integrated estimators are also established.

1. Introduction

Regression analysis is one of the most mature and widely applied branches of statistics. Various regression models have been studied by many authors; e.g., Liang and Fan [1] studied Berry-Esseen type bounds of estimators in a semiparametric model with linear process errors; Fan, Liang, and Xu [2] considered empirical likelihood confidence regions for heteroscedastic partial linear model and so on. Recently, varying models which were proposed by Hastie and Tibshirani [3] and Chen and Tsay [4] have received more and more attention. This is mainly because varying model offer a flexible but parsimonious alternative to nonparametric model and has been used in many contexts such as it has been successfully applied to multidimensional nonparametric regression, generalized linear model, time series analysis, longitudinal and functional data analysis, and time-varying model in finance. A nice feature of varying coefficients is to allow appreciable flexibility on the structure of fitted models without suffering from the “curse of dimensionality”.

Consider the following varying-coefficient model with different smoothing variables in different coefficient functions:where is a random vector, and ; are some measurable functions from to ; is a measurable function from to and is independent of with mean zero and variance 1.

Most of the work focused on the case where all coefficient functions share a single smoothing variable in a model, i.e. in model (1). In particular, the asymptotic normalities of the estimators of these coefficient functions by local polynomial technique are obtained, such as Shen et al. [5], Wang and Lin [6], and Fan, Liang, and Zhu [7]. However, the varying-coefficient regression models on which different coefficient functions share different variables have been given less attention because the local polynomial technique is not adequate. To solve this difficulty, when , which come from , are independent and identically distributed (i.i.d.), Zhang and Li [8] employed the local linear technique to obtain an initial value for the estimate of each coefficient function in model (1), and, then, the integrated estimate of each coefficient function is defined by integrating its initial value on these variables which the coefficient function does not share; Zhang and Li [9] used the local linear technique to give an initial value for every estimate of the coefficient function in model (1). Then the averaged estimate of each coefficient function is defined by averaging its initial value on these variables which the coefficient function does not share; Yang [10] proposed estimators for this model under random right censoring case by using mean-preserving transformation and established their asymptotic properties. The estimation procedure is based on the profiling and the smooth backfitting techniques.

However, the above related papers need the i.i.d. assumption for the data. The independence assumptions are not always valid in applications, especially for sequentially collected economic data, which often exhibit evident dependence. In the sequel, are assumed to form a strong stationary -mixing sequence. Fan et al. [11] studied the varying-coefficient errors-in-variables models when the data form a stationary -mixing sequence of random variables. Recall that a sequence is said to be -mixing if the -mixing coefficient converges to zero as , where denotes the -algebra generated by with . Among various mixing conditions used in the literature, -mixing is reasonably weak and has many practical applications. In fact, under very mild assumptions linear autoregressive and more generally bilinear time series models are -mixing with mixing coefficients decaying exponentially; i.e., for some ; see Doukhan [12], page 99, for more details.

The paper is organized as follows. In Section 2, we list some assumptions and present the averaged and integrated estimators of the coefficient functions, as well as the asymptotical normalities for these estimators. Proofs of the main results and the preliminary lemmas are provided in Section 3. Concluding remarks are given in Section 4.

2. Methodology and Main Results

Suppose that is a sample from model (1), are strong stationary -mixing sequence, and that for every , has a Lipschitz continuous second derivative. Then can be approximate locally by a linear function, , at a neighborhood of which is in the support of . We minimizewith respect to and , where and are the th entries of the th observation and , respectively; , is a bounded, nonnegative, compactly supported symmetric about zero and Lipschitz continuous density function; is a sequence of positive numbers and called a bandwidth.

Let , be the first entries of the minimizers of (3); then, it follows from the least squares theory thatwhere is a unite vector with at its th position and is an matrix with as its th row, with . is called as a initial value of the estimate of the coefficient function .

The averaged estimate of the coefficient function is defined as (see [9])

Next, we define the integrated estimate of the coefficient function. Let be a deterministic weight function with , for each , where . We allow for both discrete and continuous and integrals should be interpreted in the Stieltjes sense. Let be the density of with respect to either Lebesgue or a counting measure. Suppose that the support of is contained within that of . Then, the integrated estimate of the coefficient function is defined as (see [8])

In order to formulate the main results, we need the following assumptions.(A1)(i)The joint density of and the marginal density of , , are compactly supported, bounded, Lipschitz continuous, and bounded away from zero by a constant.(ii)The conditional probabilities satisfy for different .(A2)The conditional expects and are Lipschitz continuous, or , .(A3), for some .(A4)(i), for some .(ii)There exist positive integers and such that for sufficiently large , , and , for some .

Remark 1. Conditions (A1)(i) and (A2) have been assumed in Zhang and Li [8, 9]. (A1)(ii) holds naturally when are i.i.d. (A3) has been used by Fan et al. [2]. Assumption (A4)(i) is a mild condition. In the particular case of exponential decay, for some , we have that for sufficient large , and hence assumption (A3) is satisfied. The technical condition (A4) is very easy to be satisfied. For example, when and , choosing , , and , we have and .

Theorem 2. Suppose that (A1)–(A4) hold and the bandwidths are such that , and for each , , and , , , is a constant; thenwhere , , , , and its inverse , is the marginal density of . In particular, when is a constant , .

Theorem 3. Under the conditions of Theorem 2, we havewhere . In particular, when is a constant , .

3. Proof of the Main Results

In this section, we list some preliminary Lemmas. Let be a stationary -mixing sequence of random variables with mixing coefficients . In the sequel, let denote positive constant whose value may vary at each occurrence. In case of no loss of generalization and expositional purpose, we shall work with the special case .

Lemma 4 (see [13]). Let be -mixing random variables measurable with respect to the -algebra , respectively, with and for . Then where denotes -field generated by , is the mixing coefficient.

Lemma 5 (see [14], Corollary A.2, p. 278). Suppose that and are random variables such that , where . Then

Lemma 6 (see [15], Theorem 4.1). Let and . Assume that for some and . If and . then, for any , there exists such that

Lemma 7 (see [16], Lemma 2.3). Assume , for some Let be satisfied. Moreover, let for some , , where , for , and Then holds with

Lemma 8. If assumptions (A1)–(A4) hold and , , then in the sense that each element converges, where

Proof. Let then can be presented Note that is a strong stationary sequence of -mixing random variables; then we haveSet . It is easy to see Which, combing with Lemma 7, (A4) and , , yield that, for ,Conjoining (17) and (19), we get the result of this lemma.

Proof of Theorem 2. Note that and can be represented that Since where . Then we can obtain that where Next, we establish thatIn order to prove (24), we need only to show In fact, according to (A1)(ii), (A3), and Lemma 5, we haveThus, (24) holds.
Then similar to the proof of Theorem in Zhang and Li [9], we have Hence, in order to prove Theorem 2, we need only to show thatDenote . Then can be represented as In order to prove (28), we need only to showLet be defined as follows: where , Then Hence, in order to prove (30), it suffices to show thatRelation (33) implies that and are asymptotically negligible, (35) shows that the summands in are asymptotically independent, and (34) and (36) are the standard Lindeberg-Feller conditions for asymptotic normality of under independence.
We first establish (33). Obviously Note thatwhich yields that from (A4). Since to prove and , it suffices to show thatNext, let (specified below) be a sequence of integers such that and . Put We writeIt is easy to see that Hence,On the other hand, it follows from Lemma 5 that andThen we have Therefore, by choosing and in view of (A4), we obtain thatThus, (40) is verified from (42), (44), (46), and (48).
As to , by (A4), (38), (40), and Lemma 6, we have Next we establish (34). Obviously From (A4), (38), and Lemma 6, we have On the other hand, It is easy to get that and Thus, (34) is verified.
As to (35), according to Lemma 4, we have which tends to zero by (A4).
As to (36), by (A4), (46), and Lemma 6, for any and , we have Hence, (36) is shown. This completes the proof.

Proof of Theorem 3. Note thatwhere . Then following the method of the proof of Theorem 2.1 and Theorem 2.1 in Zhang and Li [8], we can obtain this Theorem.

4. Conclusion Remarks

In regression analysis, -mixing has received a lot of attentions such as Shen and Xie [17], Fan et al. [11], Chesneau et al. [18], and Guo and Liu [19]. This is because -mixing is reasonably weak among various mixing conditions and has many practical applications. We, in this paper, focus on the varying-coefficient regression models with different smoothing variables and extend the results of Zhang and Li [8] and Zhang and Li [9] from i.i.d. assumptions to -mixing setting. We employed the local linear technique to obtain an initial value for the estimate of each coefficient function in model (1), and, then, both averaged and integrated estimates of each coefficient function are defined by averaging and integrating its initial value on these variables which the coefficient function does not share. Under certain regularity conditions, the asymptotic properties of the proposed two estimators are established. There are some other interesting issues to be further studied; for example, the data come from survival data, such as censored data, missing data, or left truncated data. These extensions greatly expand the scope of the applicability of our methods.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by National Natural Science Foundation of China (11401006).

References

H. Y. Liang and G. L. Fan, “Berry-Esseen type bounds of estimators in a semiparametric model with linear process errors,” Journal of Multivariate Analysis, vol. 100, pp. 1–15, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
G. L. Fan, H. Y. Liang, J. F. Wang, and H. X. Xu, “Asymptotic properties for LS estimators in EV regression model with dependent errors,” AStA Advances in Statistical Analysis. A Journal of the German Statistical Society, vol. 94, pp. 89–103, 2010.
View at: Publisher Site | Google Scholar | MathSciNet
T. Hastie and R. Tibshirani, “Varying-coefficient models,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 55, pp. 757–796, 1993.
View at: Google Scholar | MathSciNet
R. Chen and R. Tsay, “Functional-coefficient autoregressive models,” Journal of the American Statistical Association, vol. 88, pp. 298–308, 1993.
View at: Google Scholar | MathSciNet
S. L. Shen, J. L. Cui, C. L. Mei, and C. W. Wang, “Estimation and inference of semi-varying coefficient models with heteroscedastic errors,” Journal of Multivariate Analysis, vol. 124, pp. 70–93, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
K. N. Wang and L. Lin, “Robust structure identification and variable selection in partial linear varying coefficient models,” Journal of Statistical Planning and Inference, vol. 174, pp. 153–168, 2016.
View at: Publisher Site | Google Scholar | MathSciNet
G. L. Fan, H. Y. Liang, and L. X. Zhu, “Penalized profile least squares-based statistical inference for varying coefficient partially linear errors-in-variables models,” Science China Mathematics, vol. 61, pp. 1677–1694, 2018.
View at: Publisher Site | Google Scholar | MathSciNet
R. Q. Zhang and G. R. Li, “Integrated estimation of functional-coefficient regression models with different smoothing variables,” Communications in Statistics - Theory and Methods, vol. 35, pp. 1039–1100, 2006.
View at: Publisher Site | Google Scholar | MathSciNet
R. Q. Zhang and G. R. Li, “Averaged estimation of functional-coefficient regression models with different smoothing variables,” Statistics & Probability Letters, vol. 77, no. 4, pp. 455–461, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
S. J. Yang, “Estimation for semiparametric varying coefficient models with different smoothing variables under random right censoring,” Journal of the Korean Statistical Society, vol. 47, pp. 161–171, 2018.
View at: Publisher Site | Google Scholar | MathSciNet
G. L. Fan, H. Y. Liang, and Z. S. Huang, “Statistical inference for partially time-varying coefficient errors-in-variables models,” Journal of Statistical Planning and Inference, vol. 143, pp. 505–519, 2013.
View at: Publisher Site | Google Scholar | MathSciNet
P. Doukhan, Mixing: Properties and Examples, vol. 85 of Lecture Notes in Statistics, Springer-Verlag, Berlin, Germany, 1994.
View at: Publisher Site | MathSciNet
V. A. Volkonskii and Y. A. Rozanov, “Some limit theorems for random functions,” Theory of Probability and Its Applications, vol. 4, pp. 178–197, 1959.
View at: Publisher Site | Google Scholar | MathSciNet
P. Hall and C. C. Heyde, Martingale Limit Theory and Its Application, Academic Press, New York, NY, USA, 1980.
View at: MathSciNet
Q. M. Shao and H. Yu, “Weak convergence for weighted empirical processes of dependent sequences,” Annals of Probability, vol. 24, pp. 2098–2127, 1996.
View at: Publisher Site | Google Scholar | MathSciNet
E. Liebscher, “Strong convergence of sums of α-mixing random variables with applications to density estimation,” Stochastic Processes and Their Applications, vol. 65, no. 1, pp. 69–80, 1996.
View at: Publisher Site | Google Scholar | MathSciNet
J. Shen and Y. Xie, “Strong consistency of the internal estimator of nonparametric regression with dependent data,” Statistics & Probability Letters, vol. 83, pp. 1915–1925, 2013.
View at: Publisher Site | Google Scholar | MathSciNet
C. Chesneau, J. Fadili, and B. Maillot, “Adaptive estimation of an additive regression function from weakly dependent data,” Journal of Multivariate Analysis, vol. 133, pp. 77–94, 2015.
View at: Publisher Site | Google Scholar | MathSciNet
H. Guo and Y. Liu, “Regression estimation under strong mixing data,” Annals of the Institute of Statistical Mathematics, pp. 1–24, 2018.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2019 Guo-Liang Fan and Hong-Xia Xu. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

398

Downloads

614

Citations