Abstract

The original stochastic differential equations (OSDEs) and forward-backward stochastic differential equations (FBSDEs) are often used to model complex dynamic process that arise in financial, ecological, and many other areas. The main difference between OSDEs and FBSDEs is that the latter is designed to depend on a terminal condition, which is a key factor in some financial and ecological circumstances. It is interesting but challenging to estimate FBSDE parameters from noisy data and the terminal condition. However, to the best of our knowledge, the terminal-dependent statistical inference for such a model has not been explored in the existing literature. We proposed a nonparametric terminal control variables estimation method to address this problem. The reason why we use the terminal control variables is that the newly proposed inference procedures inherit the terminal-dependent characteristic. Through this new proposed method, the estimators of the functional coefficients of the FBSDEs model are obtained. The asymptotic properties of the estimators are also discussed. Simulation studies show that the proposed method gives satisfying estimates for the FBSDE parameters from noisy data and the terminal condition. A simulation is performed to test the feasibility of our method.

1. Introduction

Since 1973, when the world’s first options exchange opened in Chicago, a large number of new financial products have been introduced to meet the customer’s demands from the derivative markets. In the same year, Black and Scholes [1] provided their celebrated formula for option pricing and Merton [2] proposed a general equilibrium model for security prices. Since then, modern finance has adopted stochastic differential equations as its basic instruments for portfolio management, asset pricing, risk management, and so on. Among these models, the backward stochastic differential equations (BSDEs for short) are a desirable choice for hedging and pricing an option. Its general form is as follows: where is the generator, is a Brownian motion, and is a -valued Borel function as the terminal condition. Usually the terminal condition is designed as a random variable with given distribution. If meets certain conditions, the BSDE has a unique solution.

In terms of the backward equation, within a complete market, it serves to characterize the dynamic value of replicating portfolio with a final wealth and a special quantity that depends on the hedging portfolio. In particular, while the generator consists of diffusion process, the corresponding equation is proved to be a forward-backward stochastic differential equation (FBSDE), which can be expressed as where satisfies the following ordinary stochastic differential equation (OSDE): Compared to the OSDE that contains an initial condition, the solution of the FBSDE is affected by the terminal condition . As is well known, there exist a number of parametric and nonparametric methods to deal with estimation and test for the OSDE. However, these methods cannot be directly employed to infer the BSDE and FBSDE because the two models are related to a terminal condition. Forward-backward stochastic differential equations are used in biology systems, mathematical finance, insurance, real estate, multiagent, and network control. See Antonelli [3], Wang et al. [4], Zhang and Li [5], and so on.

For the FBSDE defined above, the statistical inference was investigated initially by Su and Lin [6] and Chen and Lin [7]. Furthermore, by financial and ecological problems, a relevant statistical model was proposed by Lin et al. [8]. However, they did not take the terminal condition into account in the inference procedure. In the framework of the FBSDE mentioned above, the terminal condition is additional, which is not nested into the equation. Thus, there is an essential difficulty to use the terminal condition to refine the inference procedure.

As a result, their methods fail to cover the full problems given in the FBSDE. Zhang and Lin [9] proposed two terminal-dependent estimation methods via terminal control variable for the integral form models of FBSDE. However, they only considered the parametric form of the generator in their paper.

This paper intends to explore the method to fulfill the terminal-dependent inference: quasi-instrumental variable methods. It is worth mentioning that the key point of our method is the use of the terminal condition information rather than neglecting it. This change leads to a completely new work among the existing researches. The key technique in our method is the use of quasi-instrumental variable which is similar but not the same as instrumental variable (IV). It is known that IV is widely employed in applied econometrics to achieve identification and carry out estimation and inference in the model containing endogenous explanatory variables or panel data; see Hsiao [10] for an overview of the relevant statistical inference and econometric interpretation and see Hall and Horowitz [11] for recent work on nonparametric instrumental variable estimation.

Through the backward equation (2) of FBSDE, we get a regression model. To use the terminal condition information, we put the terminal condition as a quasi-instrumental variable and introduce it into our model. However, when a constraint is appended artificially, the original model may change to be biased in the sense of , because the constraint condition influences the increase trend of wealth so that may deviate from the original center zero; in other words, due to the constraint, the trajectory of may departure from the original expectation so that cannot be regarded as error. Therefore, some problems arise naturally, including how to correct the bias of the model and how to construct the constraint-dependent estimation. To solve these problems, we will use remodeling method to draw terminal condition into differential equation, similar but not the same as IV, called quasi-instrumental variable methods; in other words, the terminal condition enters into the equation as a control variable. This remodeling method takes advantage of the terminal information naturally, and the estimator performs quite well.

We use the nonparametric form of the generator in model (2) because the correct FBSDEs model for a specific topic can neither be provided automatically by financial market nor be derived from theory of mathematical finance, and in lack of prior information about the structure of a model, nonparametric inference can provide a flexible as well as robust description of a data-generating process. Even in some cases when parametric models are available, nonparametric methods are still employed to avoid the model misspecification that may lead to large errors in option pricing and other problems from financial market. So we adopt the nonparametric form that can endow the model (2) with flexibility and robustness.

Note that is usually unobservable and cannot be completely specified in the financial market. The problems of interest are therefore to give both proper estimations of the generator and the process based on the observed data and the terminal expectation .

The remainder of the paper is organized as follows. In Section 2, the FBSDE is rebuilt as a nonparametric model that contains the terminal condition as a quasi-instrumental variable. Consequently, a terminal-dependent estimation procedure is proposed. Next we discuss the asymptotic properties of the newly proposed estimations in Section 3. Simulation study is proposed in Section 4 to illustrate our methods. The proofs of the theorems are presented in Appendix.

2. Model and Method

In this section, we propose a nonparametric estimator with the help of quasi-instrumental variable.

2.1. Model and Its Statistical Version

We begin the following original model by combining (2)-(3): where is the standard Brownian motion and is a -valued Borel function. Here the generator is a function of , , , and . For the FBSDEs model (4), only one of the backward components, , and the forward components, , can be observed. Another backward component is totally unobservable. Furthermore, the adapted process and terminal condition could be indicated as a function of .

In this section, we present the statistical structure of FBSDEs by taking advantage of quasi-instrumental variable and obtain the consistent asymptotically normal estimators of and based on observed data and the terminal condition .

2.2. Remodeling for Model (4)

To construct terminal-dependent estimation for the generator and process , the key technique is how to plug the terminal condition into the equation. When is plugged into the model, we call it the quasi-IV, similar but not the same as IV. Evidently, the property of Brownian motion shows that , but , which means drawing the terminal control directly into the equation as the condition should not be encouraged at the cost of model bias. Rewriting the first equation of (4) enables us to construct an unbiased model: where , , and . The newly defined model (5), together with the second equation in (4), can be thought of as a quasi-IV FBSDE. Because the equation in (5) contains the terminal condition , we can construct the terminal-dependent estimation. From the above definitions, we see that, by bias correction, the original model changes to be an additive nonparametric model with nonparametric components and . It shows that when terminal condition is regarded as a quasi-IV and then appended to the model, the result model is unbiased and changes to be nonparametric additive model.

2.3. Estimation for

Before estimating the model function and the generator , we need to estimate firstly because is unobservable and it will be seen that the estimators of the model function and the generator depend on . Since the distribution of is supposed to be known, let for be a sample of . Suppose that, for each terminal data and equally spaced time points , we record the observed time series data: At any time point , , denoting and satisfying the initial condition , is a determined function of . As was shown by Su and Lin [6] and Chen and Lin [7], we can adopt a difference-based method to approximate as It shows that the numerical approximation error to converges to zero at rate of order .

For each , if depends on only via variable , by (7) and N-W kernel estimation method, we estimate at by Otherwise, we estimate at by where and , are regular kernel functions, with and being the corresponding bandwidths.

2.4. Estimation for

After plugging the estimator into model (5), we still need to consider inference of . As we all know, the nonparametric function in (5) can be acquired as . We note that is a higher order infinitesimal of when tends to zero. Under this situation, if is ignored, then It implies that we can use ordinary nonparametric method to estimate function . For example, we use the N-W ordinary nonparametric method to estimate valued at : where and are regular kernel functions, with and being the corresponding bandwidths.

2.5. Estimation for Generator

As was shown in the nonparametric instrumental variables estimator of Hall and Horowitz [11] (hereinafter HH), we can adopt a nonparametric quasi-instrumental variables estimation to estimate the generator . So in the section we summarize the HH estimator of in the model: Since and are the consistent estimator of and , respectively, we use them instead of and in the above model and we get Because is function of and , for simplicity of presentation, we denote . Thus, the model becomes

Let , , , , and ; the model becomes

It is assumed that the support of is contained in . This assumption can always be satisfied by, if necessary, carrying out monotone increasing transformations of , , and . For example, one can replace , , and by , , and , where is the normal distribution function. We take to be a vector, where and are scalars, and are supported on , and is supported on . The model is where , for , are independent and identically distributed as . Thus, and are endogenous and exogenous explanatory variables, respectively. Data , for , are observed.

Let denote the density of , write for the density of , and, for each , and put Define the operator on by It may be proved that, for each for which exists, where denotes the expectation with respect to the distribution of conditional on . In this formulation, denoted the result of applying to the function and evaluating the resulting function at .

To construct an estimator of , given and and , let , put analogously for and , let , and define where is a function from to a real line. Then the estimator of is

3. Asymptotic Results

In this section, we study the asymptotic properties of our proposed estimators. All proofs are presented in Appendix.

3.1. Asymptotic results of

To give the asymptotic results of , we need the following conditions.(a) are -mixing dependent; namely, the -mixing coefficients satisfy as , where  with .(b) (a. s.) uniformly for , where is a positive constant and .(c)The continuous kernel function is symmetric about 0, with a support of interval , and

Condition (a) is commonly used for weakly dependent process; see, for example, Kolmogorov and Rozanov [12], Bradley and Bryc [13], Lu and Lin [14], and Su and Lin [6]. Condition (b) is also reasonable because, as is shown by (10), can be regarded as the deviation between the adjacent two observations. Condition (c) is standard for nonparametric kernel function.

Theorem 1. Besides conditions (a), (b), and (c), let be an observation sequence on a stationary -mixing Markov process with the -mixing coefficients satisfying for . Furthermore, have a common and probability density , and for each interior point in the support of , , , the functions and have continuous two derivatives in neighborhood of . As , such that , , and , then where .

The asymptotic result in Theorem 1 is standard for nonparametric kernel estimator and here undersmoothing is used to eliminate asymptotic bias.

3.2. Asymptotic results of

This section gives conditions under which the HH estimator of the generator is asymptotically distributed as . The following additional notations are used.

Define , , and . Then, . Define . Write Define . It follows from a triangular array version of the Lindeberg-Levy central limit theorem that as . Therefore, if .

Assumption 2. The data , , , are independently and identically distributed as , where is supported on and .

Assumption 3. The distribution of has a density with respect to Lebesgue measure. Moreover, is times differentiable with respect to any combination of its arguments, where derivatives at the boundary of are defined as one sided derivatives. The derivatives are bounded in absolute value by . In addition, is times differentiable on with derivatives at 0 and 1 defined as one sided. The derivatives of are bounded in absolute value by . In addition, and , and for some finite constant .

Assumption 4. The constants and satisfy , , and . Moreover, , , and for all . In addition, there are finite strictly positive constants, and , such that for all .

Assumption 5. The tuning parameters and satisfy and , where .

Assumption 6. denotes a generalized kernel function, with the properties if or , for all if , else if . For each , is supported on , where is a compact interval not depending on . Moreover,

Assumption 7. Consider and .

Theorem 8. Let Assumptions 2–7 hold. Then holds except, possibly, on a set of values whose Lebesgue is 0.

Corollary 9. Let Assumptions 2–7 hold. And if is replaced with the consistent estimator, where . This yields the Studentized statistic . Then holds except, possibly, on a set of values whose Lebesgue is 0.

As was shown in the remark given in the previous section, even the conditional mean of error of the model is nonzero, and the newly proposed estimation is consistent because of the mixing dependency; for details see the proof of Theorem 8. Furthermore, because of the terminal condition, the asymptotic variance is larger than that without the use of the terminal condition.

4. Simulation Studies

In this section, we investigate the finite-sample behaviors by simulation.

Example 10. We consider a simple FBSDE as where is Geometric Brownian motion for modeling stock price satisfying while the riskless asset is the same as formula (31); .

Firstly, let , , , , , and . Obviously, . We adopt Epanechnikov kernel defined by , where is the indicator function. For bandwidth selection, various data-driven techniques have been developed, such as cross-validation, the plug-in method, and the empirical bias method. However, these useful tools require additional computation intensiveness. In our simulation, we simply apply the rule of thumb bandwidth selector. For bandwidth selection, bandwidth . The values of the tuning parameters are , , . Figure 1 presents the estimated curves for diffusion and drift by one simulation.

Example 11. According to the theory of mathematical finance, we represent a European call option by the following FBSDEs model: Here and are the price processes of the stock and the option, respectively, and is the striking price at the expiration date . follows the geometric Brownian motion as

We use the Euler scheme to generate the price series of the stock as where is an i.i.d. series with standard normality.

The price series by Black Scholes formula is part of the solution of the FBSDEs above at discrete time points; that is, which, together with gives us data generating formulae, where is a cumulative normal function, and

We produce the true curve of the drift coefficient by We first apply formulas (21) and (11) to estimate and , respectively. We adopt Epanechnikov kernel defined by , where is the indicator function. For bandwidth selection, we simply apply the rule of thumb bandwidth selector: to implement the estimation.

Let , , , , , , and . The bandwidth parameters and are used for estimation of and , respectively. The values of the tuning parameters are , , and . To see the performance of our estimation method, the simulated and the estimated curves of the two coefficients of the backward equation are displayed in Figures 2 and 3.

Appendix

A. Proofs

Proof of Theorem 1. Denote . By the Taylor expansion and formula (8), we have Furthermore, From the conditions of Markov process and -mixing coefficient, Note that , where , . Thus and furthermore To our interest, both the conditional expectation and variance are independent on , so the condition could be erased.
From Lemma A.1 of Politis and Romano [15] and the relation between the -mixing condition and the -mixing condition (e.g., Theorem of Lu and Lin [14]), we can ensure that is a -mixing dependent process and the mixing coefficient, denoted by , satisfies where is a positive constant. Finally, we use the central limit theorems for -mixing dependent process (e.g., Theorem of Lu and Lin [14]) to complete this proof.

Proof of Theorem 8. Theorem 8 follows from proving that and except, possibly, if belongs to a set of Lebesgue measure 0. The first result is established in Lemma A.1, and the second is established in Lemma A.2. Throughout this Appendix, “for almost every ” means “for every except, possibly, a set of Lebesgue measure 0.” We make repeated use of the fact that if for some , then for almost every .

Lemma A.1 (asymptotic normality of ). Let Assumptions 2–7 hold. Then for almost every .

Proof. Define , Then . by a triangular array version of the Lindeberg-Levy central limit theorem. The proof of the triangular-array version of the theorem is identical to the proof of the ordinary Lindeberg-Levy theorem. The lemma follows if we can prove that for and almost every .
Assumption 7 and arguments like those leading to (6.2) of HH [11] show that It follows from the Cauchy-Schwartz inequality, , and that Therefore, it follows from Assumptions 5 and 7 that for almost every . Now consider . Define the operator . Then Therefore, the Cauchy-Schwartz inequality gives HH show that Therefore, it follows from Assumptions 5 and 7 that for almost every . Finally, some algebra shows that Therefore, for almost every follows from (A.11) and .

Lemma A.2 (asymptotic negligibility of ). Let Assumptions 2–7 hold. Then for almost every .

Proof. Define Redefine Then . Arguments identical to those used to derive (6.2) and (6.3) of HH [11] show that and Therefore, it follows from Assumptions 5 and 7 that for almost every .
Now consider . Define and . HH show that and . Therefore, it follows from Assumptions 5 and 7 that for almost every . Now consider . Write where and . It follows from (A.11)-(A.16) and (A.20) that for almost every .
To analyze , define Define . HH show that Define Then HH show that See (6.11), (6.13), (6.14), and (6.15) of HH [11]. Moreover, See the arguments leading to (6.24) in HH [11] and the analogous result for their equation (6.24) in HH [11] and the analogous result for their quantity . Combining (A.25)–(A.30) with Assumptions 5 and 7 yields the result that Now consider . Standard calculations for kernel estimators show that Therefore, But Therefore, it follows, by combining Assumption 7 and equations (A.11), (A.17), and (A.34), that where for almost every . Combining this result with (A.21) and (A.31) gives for almost every .
Now consider . HH show that Therefore, it follows from (A.19) and (A.30) that for almost every .
Now combine (A.19), (A.37), and (A.39) to obtain for almost every .The lemma follows by combining this result with (A.16).
This completes the proof.

Conflict of Interests

The author declares that there is no conflict of interests regarding the publication of this paper.