Abstract

In this paper, the delayed doubly stochastic linear quadratic optimal control problem is discussed. It deduces the expression of the optimal control for the general delayed doubly stochastic control system which contained time delay both in the state variable and in the control variable at the same time and proves its uniqueness by using the classical parallelogram rule. The paper is concerned with the generalized matrix value Riccati equation for a special delayed doubly stochastic linear quadratic control system and aims to give the expression of optimal control and value function by the solution of the Riccati equation.

1. Introduction

As is known to all, the stochastic differential equation and stochastic analysis have developed rapidly. The theory of the stochastic differential equation is widely used in economy, biology, physics, financial mathematics, and other fields. The latest research on the insurance model was given in [13]. The social optimal mean field control problem was discussed in [4]. In order to provide a probabilistic interpretation for the solution of a kind of partial differential equations, Pardoux and Peng [5] first introduced the backward doubly stochastic differential equations and proved the existence and uniqueness of this kind of differential. Then, people began to study doubly stochastic differential equations. Zhu and Shi [6, 7] were concerned with a class of partial information control problems for backward doubly stochastic systems and gave the maximum principle and its applications for the system. Recently, Shi and Zhu [8] studied a type of forward-backward doubly stochastic differential equations driven by Brownian motions and the Poisson process and applied the result to backward doubly stochastic linear quadratic nonzero sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of this kind of equation. With the deepening of research, people gradually realized that many problems are not only affected by the current situation, but also by their past history. This kind of problem is called the delay problem. The equation describing this kind of problem is called the delay equation. Due to the fact that time delay widely exists in the practical systems, it will cause the change in system performance. Therefore, it can increase the control difficulty of the system. Delayed problems have become the focus of scholar’s research studies. Chen and Wu [9] considered the delayed backward system and obtained the maximum principle for these problems. Wu and Wang [10] studied the optimal control problem of the backward stochastic differential delay equation under partial information. Lv et al. [11] considered the maximum principle for optimal control of anticipated forward-backward stochastic delayed systems with regime switching. Wang and Wu [12] were concerned with the optimal control problems of forward-backward delay systems involving impulse controls and established the stochastic maximum principle for this kind of systems. Yu [13] investigated the maximum principle for stochastic optimal control problems of delay systems with random coefficients involving both continuous and impulse controls.

Linear quadratic (LQ) optimal control problem is the theoretical basis for many problems. When delay variables exist in the doubly stochastic control system, the LQ problem becomes more complex and interesting. Chen and Wu [14] considered the LQ problem with delay in which the state depended on the past time but not the control in the system. Tang and Wu [15] were concerned with the linear stochastic system with Lévy processes. Huang et al. [16] were concerned with one kind of delayed forward-backward linear quadratic stochastic control problems and derived the explicit form of the optimal control. However, to our best knowledge, there is little work on the doubly stochastic LQ problem with delay. Based on the abundant literature, we want to discuss the delayed doubly stochastic LQ problem. When the LQ control system contains time delay, some important characteristic changes have taken place in research. The system contains a delayed doubly stochastic differential equation and a new kind of equation called the anticipated backward doubly stochastic differential equation which was discussed in [17, 18]. Inspired by the idea of the maximum principle for the delayed doubly stochastic control system [19, 20], we studied the general LQ system in which both the state variable and the control variable contain time delay at the same time. As is known to all, it is the key to find out the feedback control of the LQ problem. We deduce the explicit expression of the optimal control for the delayed doubly stochastic LQ problem. We consider the matrix Riccati equation for a class of the LQ problem. We deduced the solution of the LQ system by the solution corresponding to the Riccati equation, which was introduced originally by Peng [21]. We hope that our research can better describe the optimal feedback control of the delayed doubly stochastic LQ problem.

The rest of our paper is organized as follows. First, we introduce preliminary results and some necessary notations. In Section 3, we give the explicit expression of optimal control and prove its uniqueness by using the classical parallelogram rule. And then, we discuss a special kind of the control system, in which the time delay is contained only in the control variables. We try to introduce the generalized matrix value Riccati equation corresponding to the system. And then, we use the solution of the Riccati equation to show the optimal control for the delayed doubly stochastic LQ problem. At the same time, we indicate the objective function by the solution of the Riccati equation and the initial value of the state variable.

2. Preliminaries

First, let us introduce the common notations in this paper. Let be a probability space. Assume and be two mutually independent standard Brownian motions defined on , with values, respectively, in and in . Note the integral with respect to as the forward Itô’s integral and as the backward Itô’s integral. Let denote the class of null sets of . We define , , and for each . We denote the set of all classes of () measurable stochastic process satisfying . Similarly, denotes the set of continuous -dimensional measurable stochastic process satisfying . denotes the conditional expectation under filtration . denotes the scalar product, and in the superscripts means the transpose of the matrix.

In this paper, we mainly investigate the delayed doubly stochastic linear quadratic control system:where the notation , , and .

Remark 1. In this delayed doubly stochastic control system, the state and control variables contain time delay at the same time. Time delay exists all the time in the system. But, we do nothing before the initial time. So, we give the assumption that when time belongs to the interval before the control intervenes.

The cost functional is written as

For a convex subset , we define as follows:

Our optimal control problem can be stated as minimizing the cost functional over . For optimal control satisfyingthe corresponding is called an optimal triple.

The corresponding adjoint equation becomes

We assume that the following conditions hold:

(A1) Assume that the coefficient matrices , , , , , and are the matrix process with a proper dimension(A2) The random matrix : is the nonnegatively bounded symmetric adapt matrix(A3) All the functions of are bounded, are symmetric nonnegative definite, and is symmetric uniformly positive definite

3. Main Results

Theorem 1. The function, is the unique optimal control for the delayed doubly stochastic linear quadratic optimal control problem, where is the solution of the following system:

Proof. The existence and uniqueness of the solution for equation (1) can be guaranteed by Theorem 3.1 in [20] under the assumptions (A1)–(A3). Equation (5) is an anticipated backward doubly stochastic differential equation. Its existence and uniqueness can be deduced by Theorem 3.2 in [18]. Now, we prove that is the optimal control. For all , let and be the trajectory of the system corresponding to and , respectively.
Then,From the definitions of , , , and , we know , , and are symmetric nonnegative definite and is symmetric uniformly positive definite. So, we haveApplying the Itô–Doeblin formula toand paying attention to the initial condition and the terminal condition, we haveIn fact, we haveSo, we haveThen,So from the definition of , we havefor any . This shows that is the optimal control.
Next, we will prove the uniqueness. Assume that and are both optimal controls. and are the trajectories corresponding to and , respectively. Equation (5) is a new type of the anticipated backward doubly stochastic differential equation. The existence and uniqueness of the solution for the equation can be guaranteed by Theorem 3.2 in [18]. By the uniqueness of the solution of the equation, we know that is the trajectory corresponding to . From the definition of , , , and , we know .
Then,From the definition of , we have .
We complete the proof of Theorem 1.

Next, we study a special class of the delayed doubly stochastic LQ problem. We discuss the case that only the control contained the delayed variable, and the initial value of the state variable is deterministic. The delayed system can be written as

We also consider the optimal control problem (4) under the classical quadratic index (4).

From Theorem 1, we can deduce the optimal control directly:

Peng [21] gave the solution of a kind of stochastic Hamiltonian systems by the Riccati equation corresponding to the system. Following this way, we want to use the solution of the Riccati equation to introduce the optimal control for the delayed doubly stochastic LQ problem. First, we give the generalized matrix Riccati equation:

Theorem 2. Let the assumptions (A1)–(A3) be satisfied. If the Riccati equation has a solution , then system (17) has a unique solution:where is solved by

Proof. We apply Itô’s formula to and compare the coefficient with , then we can deduce the conclusion directly. Next, we will prove the uniqueness.
Assume that is a solution of system (17), with the conditions and .
We set , , and . By differentiating , we haveSubstituting the first and the second equality of the Riccati equation (19) into (22), we haveOn the other hand, from the Riccati equation (19), we haveThen, equality (23) can be written asWe denote , then we haveNow, we set , that is, , then equation (26) can be written asThis is a linear backward doubly stochastic differential equation. This is a unique solution , that is, . So, we have , , and .
We complete the proof of Theorem 2.

Theorem 3. Let the assumptions (A1)–(A3) be satisfied. Assume that satisfies the generalized Riccati equation, then the optimal control of the delayed doubly stochastic linear quadratic optimal control problem has the following form:

Proof. From the assumption that is the solution of the Riccati equation (19) set , , and . Applying Itô’s formula to , we haveFrom the definition of the matrix Riccati equation (19), we haveThat is,So, we know that the solution of system (17) satisfied the formula , , and . Then, the optimal control can be written as (28).
Next, we deduce the objective function by the solution of the Riccati equation and the initial value of the state variable.
Applying Itô’s formula to and taking expectation, we haveSubstituting equalities (33) and (34) into the function , we can deduceWe complete the proof of Theorem 3.

4. Applications

Case 1. For the general delayed doubly stochastic LQ system (1), when the coefficients arethis is the system without time delay. The optimal control deduced by Theorem 1 is consistent with the results in [22].

Case 2. When in system (1), this is the case that the system does not contain time delay variables in control variables. The optimal control deduced by Theorem 1 isThis is consistent with the result in [20]. From the expression of the optimal control, the factors affecting the optimal control are the same as that in [22].

Case 3. For the delayed doubly stochastic LQ system (17), when the optimal control deduced from Theorem 3 isthe optimal control is derived from the solution of the matrix Riccati equation. This is consistent with the result in [21].
In this paper, we are mainly concerned about the delayed doubly stochastic LQ system in which all the variables contained time delay. We deduced the explicit form of the optimal control for the general case. The results of this paper extend our previous work and enrich the content of the delayed doubly stochastic LQ problem. But, from the conclusion of the article, there are still some contents that need further study. We only discuss the matrix-valued Riccati equation corresponding to the special cases. For a more general Riccati equation of the delayed doubly stochastic LQ system, we will work hard to increase the research on this part in our future work.

Data Availability

No data were used to support this study.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank Professor Yuecai Han for valuable suggestions and guidance on this article. This work was supported by science and technology research projects of the Jilin Province Education Department (JJKH20200235KJ) and the National Natural Science Foundation of China (grant number 11871244).