#### Abstract

We study a general risk model and derive both the precommitted strategy and the time-consistent strategy under the mean-variance criterion. A Lagrange method is proposed to derive the precommitted investment strategy. From a game-theoretic perspective, we find the time-consistent investment strategy by solving the extended Hamilton-Jacobi-Bellman equations. Comparing the two strategies, we find that a company following the time-consistent strategy has to give up some current utility in order to keep a consistent level of satisfaction over the whole time horizon. Furthermore, we analyze, theoretically and numerically, the effect of the parameters on these two optimal strategies and the corresponding value functions.

#### 1. Introduction

Two risk models are of particular importance: the Cramér-Lundberg model (the C-L risk model) and the dual risk model. The C-L risk model describes the surplus process of an insurance company. The insurer has two opposing cash flows: incoming premiums and outgoing claims. It collects premiums from the insured at a rate $c$ and pays claims, which arrive according to a Poisson process $N(t)$ with intensity $\lambda$; the claim sizes $Y_i$ are independent and identically distributed (i.i.d.) nonnegative random variables with mean $\mu$. The surplus of an insurer that starts with initial surplus $u$ is described as follows:

$$U(t) = u + ct - \sum_{i=1}^{N(t)} Y_i. \tag{1}$$

The dual risk model is so called in contrast to the C-L risk model and its applications to insurance. It describes the surplus process of a company engaged in research and development. This company also has two opposing cash flows. Positive incomes or profits arrive according to a Poisson process $N(t)$ with intensity $\lambda$, and the profits $Y_i$ are i.i.d. nonnegative random variables with mean $\mu$; $c$ is the rate of expenses of the company. Thus the surplus of the company satisfies the following equation:

$$U(t) = u - ct + \sum_{i=1}^{N(t)} Y_i. \tag{2}$$

There are many possible interpretations of the dual model. The surplus can be viewed as the capital of a company (e.g., a petroleum or pharmaceutical company) engaged in research and development, where costs are certain and gains of random size occur at random instants.
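The two surplus processes can be compared numerically. The following is a minimal sketch, not part of the original analysis: it assumes exponentially distributed jump sizes, and the function names and parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np

def compound_poisson_sum(rng, lam, t, mean_jump, n_paths):
    """Sample sum_{i <= N(t)} Y_i with N(t) ~ Poisson(lam * t).

    Assumption: jump sizes Y_i are exponential with the given mean.
    """
    counts = rng.poisson(lam * t, size=n_paths)
    totals = np.zeros(n_paths)
    positive = counts > 0
    # A sum of k i.i.d. exponentials is Gamma(k, scale); k = 0 is an empty sum.
    totals[positive] = rng.gamma(shape=counts[positive], scale=mean_jump)
    return totals

def cl_surplus(rng, u, c, lam, mean_claim, t, n_paths):
    """C-L model: premiums accrue at rate c, claims are subtracted."""
    return u + c * t - compound_poisson_sum(rng, lam, t, mean_claim, n_paths)

def dual_surplus(rng, u, c, lam, mean_gain, t, n_paths):
    """Dual model: expenses accrue at rate c, random gains are added."""
    return u - c * t + compound_poisson_sum(rng, lam, t, mean_gain, n_paths)
```

In both models the expected surplus is linear in time: the sample mean of `cl_surplus` should be close to `u + (c - lam * mean_claim) * t`, and that of `dual_surplus` close to `u + (lam * mean_gain - c) * t`.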

Combining the C-L risk model and the dual risk model yields a general risk model, given by the following equation:

$$U(t) = u + ct + \sum_{i=1}^{N(t)} Y_i, \tag{3}$$

where $c \in \mathbb{R}$, $N(t)$ is a Poisson process with intensity $\lambda$, and the $Y_i$ are i.i.d. two-sided random variables with probability density function $f(y)$. The C-L risk model and the dual risk model are both special cases. This model can also be interpreted as the surplus of a company engaged in research and development, where success or failure can cause a large profit or a large loss, respectively.

To our knowledge, mean-variance (MV) analysis of the general risk model has not yet appeared in the literature. Mean-variance analysis for optimal asset allocation is an important result of financial economics. Markowitz [1] proposed the mean-variance approach, which is viewed as the foundation of modern finance theory. Since then, a large number of papers have been published on this topic, and the single-period case has been treated by many scholars. In a multiperiod framework and in continuous time, however, the MV optimal portfolio problem is time-inconsistent, which means that the Bellman Optimality Principle does not hold.

Two basic approaches are used in the literature to deal with time-inconsistency in optimal control problems. One is to study the precommitted problem, where “optimal” is interpreted as “optimal from the point of view of time zero” and the decision maker commits to following, at all later times, the policy chosen at the initial time. Zhou and Li [2] and Li and Ng [3] made important contributions in dealing with time-inconsistency by employing an embedding technique. The other is to take the time-inconsistency more seriously and study the problem within a game-theoretic framework. One possible interpretation of the time-inconsistency is that our preferences change in a temporally inconsistent way as time goes by, so that the MV problem can be viewed as a game whose players are the future incarnations of our own preferences. The game-theoretic approach then looks for Nash equilibrium points. For further references, see Björk and Murgoci [4], Ekeland and Lazrak [5], and Kryger and Steffensen [6].

Recently, many scholars have paid attention to mean-variance analysis for risk models. Bäuerle [7] investigated a reinsurance problem and measured deviations from a certain predefined benchmark. Bai and Zhang [8] studied the optimal reinsurance-investment (no-shorting) strategy for the mean-variance problem under two risk models: a classical risk model and a diffusion risk model. Other scholars have discussed the reinsurance-investment problem within the game-theoretic framework. Li et al. [9] and Zeng et al. [10] investigated the time-consistent investment and reinsurance strategy when the prices of risky assets follow Heston’s SV process and a jump-diffusion process, respectively.

In this paper, we are concerned with the optimal investment problem for the general risk model under the mean-variance criterion. Our study contributes to the literature in three ways. Firstly, we study the general risk model under the mean-variance criterion and derive both the precommitted strategy and the time-consistent strategy. Secondly, we propose a simple technique (the Lagrange technique) to deal with the precommitted investment problem. Our method differs from the embedding technique proposed by Zhou and Li [2]. They showed that the nonstandard problem (MV) can be embedded into a class of auxiliary stochastic linear-quadratic (LQ) problems; the optimal strategy was derived by solving the LQ problem, and the efficient frontier for the original portfolio selection problem was then calculated in closed form. In contrast, with the Lagrange technique we propose, the precommitted strategy and the efficient frontier are derived together in the process of solving problem (MV). Thirdly, we investigate the effect of the parameters on the investment strategies and the corresponding value functions, both theoretically and numerically. The comparisons of the value functions and the efficient frontiers show that the company under the time-consistent strategy has to give up some current utility in order to keep a consistent level of satisfaction over the whole time horizon.

The rest of this paper is organized as follows. Section 2 describes the model and formulates the problem under the mean-variance criterion. Sections 3 and 4 derive the precommitted investment strategy and the time-consistent strategy for problem (MV), respectively. Section 5 presents numerical analysis of our results, and Section 6 concludes.

#### 2. Problem Formulation

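In this section, we start with a filtered complete probability space $(\Omega, \mathcal{F}, \{\mathcal{F}_t\}_{t\in[0,T]}, P)$, where $T$ represents the time horizon and $\mathcal{F}_t$ stands for the information available at time $t$. Recall the general risk model; namely, the surplus process of a company is given by

$$U(t) = u + ct + \sigma_0 W_0(t) + \sum_{i=1}^{N(t)} Y_i, \tag{4}$$

where $N(t)$ is a Poisson process with intensity $\lambda$, $Y_i$ is the size of the $i$th income or profit, which is two-sided with probability density function $f(y)$ and first and second moments $\mu_1$ and $\mu_2$, and $\sigma_0 W_0(t)$, with $W_0$ a standard Brownian motion, represents the uncertainty of the income. Furthermore, assume that the expected increase of the surplus per unit time satisfies the positive loading condition $c + \lambda\mu_1 > 0$.

The financial market consists of a risk-free asset and $n$ risky assets. The company is allowed to invest its surplus in this financial market. The total amount of money invested in the $i$th risky asset at time $t$ is denoted by $\pi_i(t)$. The price of the risk-free asset satisfies

$$dS_0(t) = r(t)S_0(t)\,dt, \tag{5}$$

and the price of the $i$th risky asset satisfies the following stochastic differential equation:

$$dS_i(t) = S_i(t)\Big[b_i(t)\,dt + \sum_{j=1}^{n}\sigma_{ij}(t)\,dW_j(t)\Big], \tag{6}$$

where $r(t)$ is the risk-free rate, $b_i(t)$ is the appreciation rate of the $i$th risky asset, and the functions $r(t)$, $b_i(t)$, and $\sigma_{ij}(t)$ are all positive, continuous, and bounded. $W(t) = (W_1(t),\dots,W_n(t))^\top$ is an $n$-dimensional standard Brownian motion which is independent of $W_0(t)$ and $\sum_{i=1}^{N(t)} Y_i$. Here the superscript “$\top$” denotes the transpose of a matrix or a vector and $\pi(t) = (\pi_1(t),\dots,\pi_n(t))^\top$. Let $X^\pi(t)$ denote the resulting surplus process after incorporating strategy $\pi$ into (4). The dynamics of $X^\pi(t)$ can be described as follows:

$$dX^\pi(t) = \big[r(t)X^\pi(t) + \pi(t)^\top B(t) + c\big]\,dt + \pi(t)^\top\sigma(t)\,dW(t) + \sigma_0\,dW_0(t) + d\sum_{i=1}^{N(t)} Y_i, \tag{7}$$

where $B(t) = (b_1(t)-r(t),\dots,b_n(t)-r(t))^\top$ and $\sigma(t) = (\sigma_{ij}(t))_{n\times n}$. Furthermore, denote $\Sigma(t) = \sigma(t)\sigma(t)^\top$ and assume that $\Sigma(t)$ is invertible for all $t\in[0,T]$.

For $(t,x)\in[0,T]\times\mathbb{R}$, denote by $C^{1,2}$ the set of functions $\phi(t,x)$ such that $\phi(\cdot,x)$ is once continuously differentiable on $[0,T]$ and $\phi(t,\cdot)$ is twice continuously differentiable on $\mathbb{R}$.

For $\phi\in C^{1,2}$, the infinitesimal operator of the surplus process $X^\pi(t)$ is given by the following equation:

$$\mathcal{A}^\pi\phi(t,x) = \phi_t + \big[r(t)x + \pi^\top B(t) + c\big]\phi_x + \tfrac{1}{2}\big[\pi^\top\Sigma(t)\pi + \sigma_0^2\big]\phi_{xx} + \lambda E\big[\phi(t,x+Y) - \phi(t,x)\big]. \tag{8}$$

Next, we give the definition of an admissible strategy for the general risk process.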

*Definition 1 (admissible strategy). *A strategy = is said to be admissible if it satisfies the following conditions:(1) is an -adapted process;(2) satisfies the integrability condition, almost surely, for all ;(3)SDE (7) has a unique solution corresponding to .

In addition, let denote the set of all admissible strategies with respect to initial condition . The objective is to find, among all admissible strategies, an investment strategy that maximizes the expected terminal wealth and minimizes the variance of the terminal wealth. By biobjective optimization theory, this can be replaced by a single objective: find a strategy that maximizes the expected terminal wealth minus the variance of the terminal wealth, weighted by risk aversion. The objective is therefore to maximize the following function: where is a prespecified risk aversion coefficient, . Because the mean-variance criterion lacks the iterated-expectation property, this problem is time-inconsistent in the sense that the Bellman Optimality Principle does not hold. It can be reduced to a tractable problem by techniques such as the Lagrange technique and the game-theoretic technique. In the following two sections, the optimal investment strategies and the value functions for problem (MV) are derived explicitly in the general risk model.
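The failure of the iterated-expectation property can be read off from the law of total variance, a standard identity. In the display below, $X(T)$ for the terminal wealth and $\mathcal{F}_t$ for the information at time $t$ are notation assumed for illustration.

```latex
\begin{align*}
E\big[E[X(T)\mid\mathcal{F}_t]\big] &= E[X(T)],\\
\operatorname{Var}[X(T)] &= E\big[\operatorname{Var}[X(T)\mid\mathcal{F}_t]\big]
  + \operatorname{Var}\big[E[X(T)\mid\mathcal{F}_t]\big].
\end{align*}
```

The mean iterates, but the variance picks up the extra term $\operatorname{Var}\big[E[X(T)\mid\mathcal{F}_t]\big]$. The time-zero mean-variance objective is therefore not the expectation of the time-$t$ objective, so a strategy that is optimal when judged from time $t$ need not be optimal when judged from time zero.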

#### 3. Optimal Precommitted Investment Strategy for Problem (MV)

This section derives the precommitted investment strategy for problem (MV). We first state the main idea of solving problem (MV).

Let be fixed and consider the following problem with a constrained expectation: Adding the terminal condition to the objective function, problem is equivalent to the following problem: Denote the value function for problem () by ; the value function for problem then satisfies . Problem () can be solved by introducing a Lagrange multiplier , and for we define a quadratic utility problem Denote the value function for problem () by . Duality theory implies that the value function for problem satisfies The precommitted investment strategy for problem (MV) is derived in the following three steps.
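The three-step idea can be illustrated on a one-period toy problem. The sketch below is a static analogue and not the continuous-time derivation of this section. It assumes a single risky asset with mean excess return `m` and return variance `s2`, a terminal wealth `a + pi * excess_return` with `a = x0 * (1 + r)`, and the objective "mean minus `gamma / 2` times variance"; all of these names and the factor `1/2` are assumptions made for illustration.

```python
def inner_minimizer(d, beta, a, m, s2):
    """Step 1: minimize E[(X - d)^2] + 2*beta*(E[X] - d) over pi.

    Completing the square gives E[(X - k)^2] - beta^2 with k = d - beta.
    """
    k = d - beta
    pi = m * (k - a) / (m**2 + s2)
    value = (k - a) ** 2 * s2 / (m**2 + s2) - beta**2
    return pi, value

def dual_multiplier(d, a, m, s2):
    """Step 2: the multiplier that maximizes the (concave) inner value in beta."""
    return -(d - a) * s2 / m**2

def min_variance(d, a, m, s2):
    """Minimal variance, and the strategy attaining it, for a target mean d."""
    beta = dual_multiplier(d, a, m, s2)
    return inner_minimizer(d, beta, a, m, s2)

def solve_mv(x0, r, m, s2, gamma):
    """Step 3: choose the target mean d maximizing d - (gamma / 2) * Var(d)."""
    a = x0 * (1.0 + r)
    d = a + m**2 / (gamma * s2)
    pi, var = min_variance(d, a, m, s2)
    return pi, d, var
```

For example, `solve_mv(1.0, 0.03, 0.05, 0.04, 2.0)` returns a strategy equal to `m / (gamma * s2)`. In this toy setting the strategy and the mean-variance pair `(d, var)` come out of the same computation, which mirrors the remark that the strategy and the efficient frontier are obtained together.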

*Step 1. *We calculate the optimal investment strategy and the value function by solving the related Hamilton-Jacobi-Bellman (HJB) equation for problem (). By standard arguments, as in Fleming and Soner [11], it is not hard to derive the following verification theorem for the HJB equation.

Lemma 2 (verification theorem). *If there exist a real function and , which satisfy the following HJB equation:
**
then and is the optimal investment strategy.*

Now, we will solve the HJB equation in Lemma 2. Assume that there exists a real function which satisfies the boundary condition (15). By virtue of the infinitesimal operator (8), (14) can be rewritten as Since both the structure of (17) and the boundary conditions of are quadratic in , we can conjecture that Obviously, Substituting (18)-(19) into (17), we have Differentiating the function in the left bracket of (20) with respect to and setting the derivative to zero, we get thus, Inserting (22) into (20), we have where Letting the coefficients of and and the constant coefficient be equal to 0 in (23), we have The solutions of the ordinary differential equation (25) are as follows: Substituting (26) and (27) into (22), we have According to the argument above, optimal investment strategy and the value function for problem () are given by the following theorem.

Theorem 3. *For problem (), optimal investment strategy is given by
**
and the value function is given by
*

*Step 2. *By virtue of , we can solve problem . Differentiating with respect to , we have
Setting the derivative to zero yields that
Furthermore,
Therefore is the point which minimizes according to the extreme value theory. By inserting (33) into (30)-(31), optimal investment strategy and the value function for problem () are given by the following theorem.

Theorem 4. *For problem (), optimal investment strategy is given by
**
and the value function is given by
*

*Step 3. *Problem (MV) can be finally solved by virtue of the relationship of and . Differentiating at yields that
From the extreme value theory, the optimal expected terminal wealth does exist and satisfies . By a simple calculation, we have
Therefore, inserting (38) into (35) and (36), optimal investment strategy and the value function for problem (MV) can be derived explicitly and they are given by the following theorem.

Theorem 5. *For problem , optimal precommitted investment strategy is given by
**
and the value function is given by
*

*Remark 6. *The efficient frontier at initial state can be derived. By Theorem 5 and the definition of , we have
So the efficient frontier at initial state is as follows:
This efficient frontier is not a straight line but a hyperbola in the mean-standard deviation plane.

*Remark 7. *The precommitted investment strategy is stochastically dependent on the current wealth which means that is a stochastic process which satisfies the following stochastic differential equation:
Hence all the parameters jointly affect the precommitted investment strategy, and their effect on can only be analyzed by numerical simulation.

*Remark 8. *When all the parameters are constants and , the optimal precommitted investment strategy, the corresponding value function, and the efficient frontier are given by the following equations:

#### 4. Optimal Time-Consistent Investment Strategy for Problem (MV)

In this section, we derive the optimal time-consistent investment strategy and the equilibrium value function for problem (MV) by solving the extended HJB equations.

Firstly, define problem () and denote the value function for problem () by . Because this objective function is nonlinear in the expectation of the terminal surplus, problem () is time-inconsistent in the sense that the Bellman Optimality Principle does not hold. To deal with this time-inconsistent problem, we view the investment problem as a noncooperative game with one player for each time and look for an equilibrium strategy that is also an equilibrium for any time . The definitions of the equilibrium strategy and the verification theorem for problem () are similar to those in Björk and Murgoci [4] or Zeng et al. [10].

*Definition 9 (equilibrium strategy). *For any fixed chosen initial state , consider an admissible strategy . Choose two fixed real numbers and and define the following strategy:
If
then is called an equilibrium strategy, and the corresponding equilibrium value function is defined by

It is easy to see that the equilibrium strategy is time-consistent. Hence the equilibrium strategy is called the optimal time-consistent strategy.

Lemma 10 (verification theorem). *If there exist two real functions and , satisfying the following extended HJB equations:
**
where
**
then , , and is optimal time-consistent strategy.*

Next, we will find the solution to the extended HJB equations. By using the infinitesimal generator (8), we can rewrite the extended HJB equations in Lemma 10 as where is determined below.

On the one hand, differentiating the function in the left bracket of (56) with respect to and setting the derivative to zero, we get On the other hand, since the structure of (56) and (57) and the boundary conditions of and given by (52) and (54) are linear in , we can guess that Thus, the partial derivatives of the functions and are easily calculated: Substituting (60) into (58) yields Inserting (59)–(61) into (56)-(57), we have where Because (62) holds for , which means that the coefficient of and the constant coefficient are equal to 0, we have The solutions of this system of ordinary differential equations are given as follows: By inserting (65) and (67) into (61), the optimal time-consistent strategy is given by the following equation: Based on the argument above, the explicit expressions for and are obtained. Taking the initial time for problem () to be 0, the value function for problem (MV) is given by the following theorem.

Theorem 11. *For problem (MV), optimal time-consistent strategy is given by (69) and the equilibrium value function is given by the following equation:
**
Furthermore,
*

*Remark 12. *By virtue of (70) and (71), the relationship between the expectation and the variance of the terminal wealth is derived:

Equation (72) also shows this efficient frontier is a hyperbola in the mean-standard deviation plane.

*Remark 13. *This time-consistent investment strategy is independent of the current wealth, which means is a deterministic function of . The parameters of the surplus process have no impact on the optimal strategy; the risk aversion coefficient and the coefficients of the financial market jointly determine it.

*Remark 14. *When all the parameters are constants and , the optimal time-consistent strategy , the equilibrium value function , and the efficient frontier are given by

#### 5. Numerical Analysis

In the first two subsections, we study the effect of the parameters on the optimal strategies (the precommitted strategy and the time-consistent strategy) and the corresponding value functions, and we provide numerical examples to illustrate the effects. In the third, we compare the precommitment results with the time-consistent ones numerically. For convenience but without loss of generality, all the parameters involved are constants and . The optimal investment strategies, the corresponding value functions, and the efficient frontiers under the two perspectives are given by Remarks 8 and 14. For the following numerical illustrations, unless otherwise stated, the basic parameters are given by , , , , , , , , , , and .

##### 5.1. Analysis of Optimal Precommitted Strategy and the Corresponding Value Function

In this subsection, we will work on numerical analysis of the precommitted strategy and the value function.

Firstly, we show how the coefficients involved affect the precommitted strategy. Since the precommitted investment strategy depends stochastically on the current wealth, we explore the effect of the parameters of the financial market and of the risk aversion by stochastic simulation. Because the precommitted strategy is a stochastic process, we investigate the effect of different parameters along the same sample trajectory. In order to model the trajectory, we assume that is a Poisson process with intensity and the profits or the incomes are double exponentially distributed with parameters , , and ; namely, its probability density function is given by . From (44), we can see that the optimal precommitted investment strategy increases when the current wealth decreases; namely, if the current wealth is large enough, the company should invest less money in the risky asset. In order to simulate the general risk model, assume , , and ; Figure 1 shows how the coefficients involved affect the optimal precommitted investment strategy for the general risk model with diffusion. In order to simulate the C-L risk model, assume , , , and ; Figure 2 shows how the coefficients involved affect the optimal precommitted investment strategy for the C-L risk model with diffusion. From Figures 1 and 2, we conclude that all the parameters jointly affect the precommitted strategy and that its relation to any single parameter is complex: increasing one parameter changes both the deterministic part of the precommitted strategy and the current wealth, so the sign of the change in their difference is uncertain.

Figure 1: The effect of the parameters on the optimal precommitted strategy for the general risk model with diffusion, panels (a)-(f).

Figure 2: The effect of the parameters on the optimal precommitted strategy for the C-L risk model with diffusion, panels (a)-(f).
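A sample trajectory of the surplus with double exponential jumps can be generated as in the following sketch. It rests on assumptions that should be read as such: the density is taken in the common form $p\,\eta_1 e^{-\eta_1 y}$ for $y \ge 0$ and $(1-p)\,\eta_2 e^{\eta_2 y}$ for $y < 0$, the surplus is taken to be drift plus an independent Brownian term plus compound Poisson jumps, and the names `p`, `eta1`, `eta2`, and `sigma0` are hypothetical.

```python
import numpy as np

def sample_double_exponential(rng, p, eta1, eta2, size):
    """Jumps: upward Exp(rate eta1) with probability p, else downward Exp(rate eta2)."""
    up = rng.random(size) < p
    magnitudes = np.where(
        up,
        rng.exponential(1.0 / eta1, size),
        rng.exponential(1.0 / eta2, size),
    )
    return np.where(up, magnitudes, -magnitudes)

def jump_moments(p, eta1, eta2):
    """First and second moments of the assumed double exponential density."""
    mu1 = p / eta1 - (1.0 - p) / eta2
    mu2 = 2.0 * p / eta1**2 + 2.0 * (1.0 - p) / eta2**2
    return mu1, mu2

def simulate_surplus_path(rng, u, c, sigma0, lam, p, eta1, eta2, horizon, n_steps):
    """One surplus path on a regular grid; increments over each step are exact."""
    dt = horizon / n_steps
    path = np.empty(n_steps + 1)
    path[0] = u
    for k in range(n_steps):
        n_jumps = rng.poisson(lam * dt)
        jumps = sample_double_exponential(rng, p, eta1, eta2, n_jumps).sum()
        diffusion = sigma0 * np.sqrt(dt) * rng.standard_normal()
        path[k + 1] = path[k] + c * dt + diffusion + jumps
    return path
```

Reusing one generator seed while changing a single market parameter reproduces the "same sample trajectory" comparison described above, since the surplus path itself does not depend on the market parameters.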

Secondly, we show how the coefficients involved affect the value function. For convenience, introduce the notation and we can show that and for all by an elementary calculation. From (45), we can conclude the following findings.

(1) As the risk aversion coefficient increases, the optimal mean-variance utility decreases. As the intensity of the profit jumps decreases (respectively, increases), the optimal mean-variance utility also decreases, according to the sign of the derivative with respect to the intensity; see Figure 3(a).

(2) When the appreciation rate increases or the volatility of the market’s risky asset decreases, the optimal mean-variance utility increases; see Figure 3(b).

(3) When the expectation of the size of each income increases or the second moment of the size of each income decreases, the optimal mean-variance utility increases; see Figure 3(c).

(4) The value function is decreasing in the parameter that measures the uncertainty of the profit; namely, when the uncertainty of the profit increases, the optimal mean-variance utility decreases; see Figure 3(d).

Figure 3: The effect of the parameters on the value function under the precommitted strategy, panels (a)-(d).

##### 5.2. Analysis of Optimal Time-Consistent Strategy and the Equilibrium Value Function

In this subsection, we will work on numerical analysis of the time-consistent strategy and the equilibrium value function.

Firstly, we examine how the coefficients involved affect the optimal time-consistent investment strategy. From (73) it is easy to see that the strategy is increasing in time, which means the company will invest more money in the risky asset as time goes by. We also obtain the following findings.

(1) The strategy is decreasing in the risk aversion coefficient: the more the company dislikes risk, the less it invests in the risky asset; see Figure 4(a).

(2) The strategy is decreasing in the risk-free rate: the smaller the risk-free rate is, the more the company invests in the risky asset; see Figure 4(b).

(3) The strategy is increasing in the appreciation rate: when the appreciation rate increases, the company should invest more money in the risky asset; see Figure 4(c).

(4) The strategy is decreasing in the volatility: when the volatility of the risky asset increases, the company should invest more money in the risk-free asset; see Figure 4(d).

Figure 4: The effect of the parameters on the optimal time-consistent strategy, panels (a)-(d).
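The monotonicity findings above can be checked on the classical equilibrium strategy for a market with one risky asset and constant coefficients. The sketch assumes the objective "mean minus `gamma / 2` times variance" and the names `b` (appreciation rate), `r`, `sigma`, and `gamma`. It is offered as an illustration consistent with the qualitative statements of this subsection, not as a transcription of (73).

```python
import math

def time_consistent_strategy(t, horizon, b, r, sigma, gamma):
    """Classical equilibrium amount in the risky asset (constant coefficients).

    Assumed form: (b - r) / (gamma * sigma^2) * exp(-r * (horizon - t)).
    It depends on time only, not on current wealth or on surplus parameters.
    """
    return (b - r) / (gamma * sigma**2) * math.exp(-r * (horizon - t))
```

With `b > r`, the function is increasing in `t` and `b` and decreasing in `gamma`, `r`, and `sigma`, matching findings (1)-(4).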

Secondly, we show how the coefficients involved affect the equilibrium value function. From (74), we can draw the following conclusions, which are illustrated in Figure 5. The parameters , , , , , , and have a similar effect on the equilibrium value function as on the value function with precommitment discussed in Section 5.1.

Figure 5: The effect of the parameters on the equilibrium value function, panels (a)-(d).

##### 5.3. Comparisons between the Precommitted Strategy and the Time-Consistent Strategy

In this subsection, we compare the optimal investment strategy, the corresponding value function, and the efficient frontier under the precommitted framework with the ones under the time-consistent framework.

Firstly, we compare the precommitted strategy with the time-consistent strategy. The time-consistent strategy is a deterministic function of time, whereas the precommitted strategy depends on the current wealth. We again assume that is a Poisson process with intensity and are double exponentially distributed with probability density function . Here, fix the parameters , , , and . We investigate the precommitted investment strategy using Monte Carlo methods: we simulate 2000 sample paths of the precommitted investment strategy and calculate their average. Figure 6(a) illustrates that the average of the precommitted investment strategy is larger than the time-consistent investment strategy, which means that the company following the time-consistent strategy invests more conservatively over the long term than the one following the precommitted strategy.

**(a) The comparison of optimal strategies**

**(b) The comparison of the value functions**

**(c) The comparison of different efficient frontiers**
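The Monte Carlo comparison of the two strategies can be sketched in a simplified setting. The code below drops the surplus terms entirely and uses the classical pure-diffusion forms of the two strategies for one risky asset with constant coefficients and the objective "mean minus `gamma / 2` times variance". These forms, the Euler scheme, and all names are assumptions made for illustration, so the numbers will not reproduce Figure 6(a).

```python
import numpy as np

def precommitted_strategy(t, x, x0, horizon, b, r, sigma, gamma):
    """Classical precommitted amount (pure diffusion): depends on wealth x."""
    theta2 = ((b - r) / sigma) ** 2
    target = x0 * np.exp(r * t) + np.exp(theta2 * horizon - r * (horizon - t)) / gamma
    return (b - r) / sigma**2 * (target - x)

def time_consistent_strategy(t, horizon, b, r, sigma, gamma):
    """Classical equilibrium amount: a deterministic function of time."""
    return (b - r) / (gamma * sigma**2) * np.exp(-r * (horizon - t))

def average_precommitted_path(rng, x0, horizon, b, r, sigma, gamma,
                              n_paths=2000, n_steps=250):
    """Average the precommitted strategy over simulated wealth paths (Euler)."""
    dt = horizon / n_steps
    times = np.linspace(0.0, horizon, n_steps + 1)
    wealth = np.full(n_paths, float(x0))
    averages = np.empty(n_steps + 1)
    for k, t in enumerate(times):
        pi = precommitted_strategy(t, wealth, x0, horizon, b, r, sigma, gamma)
        averages[k] = pi.mean()
        if k < n_steps:
            dw = np.sqrt(dt) * rng.standard_normal(n_paths)
            wealth = wealth + (r * wealth + pi * (b - r)) * dt + pi * sigma * dw
    return times, averages
```

In this simplified setting the expected precommitted amount equals the time-consistent amount multiplied by `exp(theta2 * (horizon - t))`, so the averaged precommitted path lies above the time-consistent one before the horizon, in line with the qualitative conclusion drawn from Figure 6(a).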

Secondly, we compare the optimal value function with the equilibrium value function . By a simple calculation, we can conclude that which means that the company with the time-consistent strategy has to give up the chance to attain greater current utility in order to ensure a consistent return for the whole time horizon as is illustrated in Figure 6(b).
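The ordering of the two value functions can be illustrated in the classical pure-diffusion special case, that is, without the surplus terms that appear in (45) and (74). The closed forms below assume one risky asset, constant coefficients, the objective "mean minus `gamma / 2` times variance", and evaluation at time zero; the function names are hypothetical.

```python
import math

def precommitted_value(x0, horizon, b, r, sigma, gamma):
    """Time-zero mean-variance value under the precommitted strategy."""
    theta2 = ((b - r) / sigma) ** 2
    growth = x0 * math.exp(r * horizon)
    return growth + (math.exp(theta2 * horizon) - 1.0) / (2.0 * gamma)

def equilibrium_value(x0, horizon, b, r, sigma, gamma):
    """Time-zero mean-variance value under the time-consistent strategy."""
    theta2 = ((b - r) / sigma) ** 2
    growth = x0 * math.exp(r * horizon)
    return growth + theta2 * horizon / (2.0 * gamma)
```

Since `exp(z) - 1 >= z` for every `z >= 0`, `precommitted_value` is never below `equilibrium_value`, and the gap widens as the horizon or the squared market price of risk grows.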

Thirdly, we compare the efficient frontiers derived from the two perspectives. From (46) and (75) we can see that the efficient frontiers are not straight lines under either perspective. The efficient frontier under the time-consistent strategy is never above the efficient frontier under the precommitted strategy, as illustrated in Figure 6(c).

Judging from the comparison of the value functions and the efficient frontiers, the precommitted strategy appears superior to the time-consistent strategy, but we cannot conclude that it is better than the strategy from the game-theoretic framework, because the latter is time-consistent and lets the company ensure a consistent return over the whole time horizon. Meanwhile, the precommitted strategy is a globally optimal strategy that maximizes the company’s mean-variance utility only at . Furthermore, the time-consistent strategy is a suboptimal strategy for all and the investment problem for all can be viewed as a noncooperative game with one player for each time t. The time-consistent strategy puts this entire system in equilibrium and is suboptimal for the player . Correspondingly, the precommitted strategy is a globally optimal strategy for player and does not put the entire system in equilibrium.

#### 6. Conclusion

In this paper, we have investigated the optimal investment strategy for a general risk model under the mean-variance criterion. The precommitted strategy is derived by the Lagrange method, and the time-consistent strategy is calculated via the approach based on time-consistent equilibrium controls. We have also analyzed, theoretically and numerically, the effect of the parameters on the optimal investment strategies and the corresponding value functions. The value function and the efficient frontier under the precommitted strategy dominate those under the time-consistent strategy. Nevertheless, we cannot conclude that the precommitted strategy is better than the one from the game-theoretic framework, because the company under the time-consistent strategy gives up some current utility in order to keep a consistent level of satisfaction over the whole time horizon, whereas the precommitted strategy is a globally optimal strategy that maximizes the company’s mean-variance utility only at the initial time.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

#### Acknowledgments

This research is supported by the National Natural Science Foundation of China (Grant nos. 11201335 and 71071111) and the Research Project of the Social Science and Humanity on Young Fund of the Ministry of Education (Grant no. 11YJC910007). The authors would like to thank anonymous reviewers for very helpful suggestions which improved this paper greatly.