Research Article | Open Access

Guo-Liang Fan, Hong-Xia Xu, "Averaged and Integrated Estimations of Varying-Coefficient Regression Models with Dependent Observations", *Discrete Dynamics in Nature and Society*, vol. 2019, Article ID 7146793, 10 pages, 2019. https://doi.org/10.1155/2019/7146793

# Averaged and Integrated Estimations of Varying-Coefficient Regression Models with Dependent Observations

**Academic Editor:**Rigoberto Medina

#### Abstract

In practical applications, lots of data such as sequentially collected economic data often exhibit some evident dependence. This paper studies the varying-coefficient regression models with different smoothing variables when the data form a stationary -mixing sequence. Both the averaged and integrated estimators of coefficient functions are proposed. The asymptotic normalities of the proposed averaged and integrated estimators are also established.

#### 1. Introduction

Regression analysis is one of the most mature and widely applied branches of statistics. Various regression models have been studied by many authors; e.g., Liang and Fan [1] studied Berry-Esseen type bounds of estimators in a semiparametric model with linear process errors; Fan, Liang, and Xu [2] considered empirical likelihood confidence regions for heteroscedastic partial linear model and so on. Recently, varying models which were proposed by Hastie and Tibshirani [3] and Chen and Tsay [4] have received more and more attention. This is mainly because varying model offer a flexible but parsimonious alternative to nonparametric model and has been used in many contexts such as it has been successfully applied to multidimensional nonparametric regression, generalized linear model, time series analysis, longitudinal and functional data analysis, and time-varying model in finance. A nice feature of varying coefficients is to allow appreciable flexibility on the structure of fitted models without suffering from the “curse of dimensionality”.

Consider the following varying-coefficient model with different smoothing variables in different coefficient functions:where is a random vector, and ; are some measurable functions from to ; is a measurable function from to and is independent of with mean zero and variance 1.

Most of the work focused on the case where all coefficient functions share a single smoothing variable in a model, i.e. in model (1). In particular, the asymptotic normalities of the estimators of these coefficient functions by local polynomial technique are obtained, such as Shen et al. [5], Wang and Lin [6], and Fan, Liang, and Zhu [7]. However, the varying-coefficient regression models on which different coefficient functions share different variables have been given less attention because the local polynomial technique is not adequate. To solve this difficulty, when , which come from , are independent and identically distributed (i.i.d.), Zhang and Li [8] employed the local linear technique to obtain an initial value for the estimate of each coefficient function in model (1), and, then, the integrated estimate of each coefficient function is defined by integrating its initial value on these variables which the coefficient function does not share; Zhang and Li [9] used the local linear technique to give an initial value for every estimate of the coefficient function in model (1). Then the averaged estimate of each coefficient function is defined by averaging its initial value on these variables which the coefficient function does not share; Yang [10] proposed estimators for this model under random right censoring case by using mean-preserving transformation and established their asymptotic properties. The estimation procedure is based on the profiling and the smooth backfitting techniques.

However, the above related papers need the i.i.d. assumption for the data. The independence assumptions are not always valid in applications, especially for sequentially collected economic data, which often exhibit evident dependence. In the sequel, are assumed to form a strong stationary -mixing sequence. Fan et al. [11] studied the varying-coefficient errors-in-variables models when the data form a stationary -mixing sequence of random variables. Recall that a sequence is said to be -mixing if the -mixing coefficient converges to zero as , where denotes the -algebra generated by with . Among various mixing conditions used in the literature, -mixing is reasonably weak and has many practical applications. In fact, under very mild assumptions linear autoregressive and more generally bilinear time series models are -mixing with mixing coefficients decaying exponentially; i.e., for some ; see Doukhan [12], page 99, for more details.

The paper is organized as follows. In Section 2, we list some assumptions and present the averaged and integrated estimators of the coefficient functions, as well as the asymptotical normalities for these estimators. Proofs of the main results and the preliminary lemmas are provided in Section 3. Concluding remarks are given in Section 4.

#### 2. Methodology and Main Results

Suppose that is a sample from model (1), are strong stationary -mixing sequence, and that for every , has a Lipschitz continuous second derivative. Then can be approximate locally by a linear function, , at a neighborhood of which is in the support of . We minimizewith respect to and , where and are the th entries of the th observation and , respectively; , is a bounded, nonnegative, compactly supported symmetric about zero and Lipschitz continuous density function; is a sequence of positive numbers and called a bandwidth.

Let , be the first entries of the minimizers of (3); then, it follows from the least squares theory thatwhere is a unite vector with at its th position and is an matrix with as its th row, with . is called as a initial value of the estimate of the coefficient function .

The averaged estimate of the coefficient function is defined as (see [9])

Next, we define the integrated estimate of the coefficient function. Let be a deterministic weight function with , for each , where . We allow for both discrete and continuous and integrals should be interpreted in the Stieltjes sense. Let be the density of with respect to either Lebesgue or a counting measure. Suppose that the support of is contained within that of . Then, the integrated estimate of the coefficient function is defined as (see [8])

In order to formulate the main results, we need the following assumptions.(A1)(i)The joint density of and the marginal density of , , are compactly supported, bounded, Lipschitz continuous, and bounded away from zero by a constant.(ii)The conditional probabilities satisfy for different .(A2)The conditional expects and are Lipschitz continuous, or , .(A3), for some .(A4)(i), for some .(ii)There exist positive integers and such that for sufficiently large , , and , for some .

*Remark 1. *Conditions (A1)(i) and (A2) have been assumed in Zhang and Li [8, 9]. (A1)(ii) holds naturally when are i.i.d. (A3) has been used by Fan et al. [2]. Assumption (A4)(i) is a mild condition. In the particular case of exponential decay, for some , we have that for sufficient large , and hence assumption (A3) is satisfied. The technical condition (A4) is very easy to be satisfied. For example, when and , choosing , , and , we have and .

Theorem 2. *Suppose that (A1)–(A4) hold and the bandwidths are such that , and for each , , and , , , is a constant; thenwhere , , , , and its inverse , is the marginal density of . In particular, when is a constant , .*

Theorem 3. *Under the conditions of Theorem 2, we havewhere . In particular, when is a constant , .*

#### 3. Proof of the Main Results

In this section, we list some preliminary Lemmas. Let be a stationary -mixing sequence of random variables with mixing coefficients . In the sequel, let denote positive constant whose value may vary at each occurrence. In case of no loss of generalization and expositional purpose, we shall work with the special case .

Lemma 4 (see [13]). *Let be -mixing random variables measurable with respect to the -algebra , respectively, with and for . Then where denotes -field generated by , is the mixing coefficient.*

Lemma 5 (see [14], Corollary A.2, p. 278). *Suppose that and are random variables such that , where . Then *

Lemma 6 (see [15], Theorem 4.1). *Let and . Assume that for some and . If and . then, for any , there exists such that *

Lemma 7 (see [16], Lemma 2.3). *Assume , for some Let be satisfied. Moreover, let for some , , where , for , and Then holds with *

Lemma 8. *If assumptions (A1)–(A4) hold and , , then in the sense that each element converges, where*

*Proof. *Let then can be presented Note that is a strong stationary sequence of -mixing random variables; then we haveSet . It is easy to see Which, combing with Lemma 7, (A4) and , , yield that, for ,Conjoining (17) and (19), we get the result of this lemma.

*Proof of Theorem 2. *Note that and can be represented that Since where . Then we can obtain that where Next, we establish thatIn order to prove (24), we need only to show In fact, according to (A1)(ii), (A3), and Lemma 5, we haveThus, (24) holds.

Then similar to the proof of Theorem in Zhang and Li [9], we have Hence, in order to prove Theorem 2, we need only to show thatDenote . Then can be represented as In order to prove (28), we need only to showLet be defined as follows: where , Then Hence, in order to prove (30), it suffices to show thatRelation (33) implies that and are asymptotically negligible, (35) shows that the summands in are asymptotically independent, and (34) and (36) are the standard Lindeberg-Feller conditions for asymptotic normality of under independence.

We first establish (33). Obviously