Long-Memory Processes and ApplicationsView this Special Issue
Research Article | Open Access
The Optimal Portfolio Selection Model under -Expectation
This paper solves the optimal portfolio selection model under the framework of the prospect theory proposed by Kahneman and Tversky in the 1970s with decision rule replaced by the -expectation introduced by Peng. This model was established in the general continuous time setting and firstly adopted the -expectation to replace Choquet expectation adopted in the work of Jin and Zhou, 2008. Using different S-shaped utility functions and -functions to represent the investors' different uncertainty attitudes towards losses and gains makes the model not only more realistic but also more difficult to deal with. Although the models are mathematically complicated and sophisticated, the optimal solution turns out to be surprisingly simple, the payoff of a portfolio of two binary claims. Also I give the economic meaning of my model and the comparison with that one in the work of Jin and Zhou, 2008.
In the area of optimal financial portfolio selection, the expected utility maximization theory (EUT), developed by Neumann and Morgenstern , has been the most important decision rule for a long time. During the past twenty years, portfolio choice theory has been developed to both discrete time and continuous time models in dynamic setting. Markowitz  laid down the basement for modern financial portfolio selection theory by his pioneer work on single-period mean-variance portfolio selection. Li and Ng  extended the Markowitz model to the dynamic setting. El-Karoui et al.  considered a portfolio-consumption model where the objective is to optimize the recursive utility of consumption and terminal wealth, and they adopted the terminal perturbation method to solve this problem. Bielecki et al.  employed the dual approach to deal with a continuous time portfolio selection model without negativity constraint on wealth process. Pliska had earlier introduced this approach for discrete time models in . In , Ji and Zhou firstly used the terminal perturbation method and dual approach together, and they reformulated FBSDE controlled system as a backward system by taking the terminal condition of the forward state as a control variable. By applying Ekeland’s variational principle, they could deal with additional constraint without convexity assumption on the coefficients of the backward approach to continuous time mean-variance portfolio selection problem in a complete market. Until now, this terminal perturbation method has been widely used in many control problems; see [8–16]. As for these problems, there have been essentially two classical approaches developed to solve in the utility model: one is the stochastic control or the dynamic programming approach, firstly developed by Merton [17, 18], which transforms the problem into solving a partial differential equation called the Hamilton-Jacob-Bellman (HJB) equation. The other one was developed by Harrison and Kreps  called the martingale approach.
Although the optimal portfolio selection models under the expected utility theory have been well solved, the risk preference measure or the expected utility theory has some basic tenets which has systematically violated the reality. In other words, the EUT cannot be able to describe the way people make decision in the real world clearly and precisely. Firstly, for example, EUT has an underlying assumption that decision makers are rational and uniformly risk averse when facing uncertainties. But in the real world, people are risk averse on gains and risk taking on losses and appear significantly more sensitive to losses than to gains. Secondly EUT thinks that everyone is able to objectively evaluate probabilities, but the fact is that people usually overweight small probabilities and underweight large probabilities. Thirdly, in EUT, investors evaluate wealth according to final asset position, but evidence shows that people evaluate assets on gains and losses, not on final wealth position. The difference between the theory and the practice leads to many paradoxes and puzzles that the EUT fails to explain, including the famous Allais paradoxes, Ellsberg paradoxes, and the equity premium puzzle. Hence, many alternative preference measures have been put forth. For example, goal reaching, dual theory of choice, and Lope’s SP/A model. However, these new models which have successfully solved some paradoxes and puzzles would create new ones.
In 1970s, Kahneman and Tversky proposed the prospect theory (PT) for decision making under uncertainty , incorporating human emotions and psychology into their theory. The key elements of this Nobel-prize-winning theory are as follows.(1)A reference point in wealth which defines gains and losses.(2)Value function, being similar with utility function, which is concave for gains, convex for losses, and steeper for losses than for gains.(3)Nonlinear probability distortion, that is, a transformation of the probability scale, enlarging small probabilities and diminishing large probabilities.
The three points above have given PT the power to describe a man’s risk attitude and emotions more clearly. So the model under the prospect theory is closer to the reality and the research on it is very interesting and important.
In this framework, owing to the discontinuity and nonglobal convexity of the -shaped value function, Lagrange method cannot be used. Worse still, the coupling of these two ill-behaved features greatly amplifies the difficulties of the problem. Berkelaar et al.  did some research on continuous setting, but they neglected the probability distortion, which is the main difficulty for the problem. Jin et al. [22, 23] studied the continuous model, with both the -shaped value function and the nonlinear probability distortion. The probability distortions involved the nonlinear Choquet expectation instead of the conventional linear expectation. Using Choquet expectation, the thought in theory of Kahneman and Tversky is described.
This paper extends the thought of prospect theory to another nonlinear case. I replace the Choquet expectation in  by another nonlinear expectation, -expectation. Mathematically -expectation and Choquet integral are two different ways to describe nonlinear case, where -expectation is more nonlinear in some sense. It is shown that they coincide only when they become linear expectation; see (Chen and Selum ). Although Choquet expectation has many applications in statistics, economics, and finance, it is difficult to define conditional Choquet expectation in terms of Choquet expectation, while it is easy to define conditional expectation via -expectation. Then there are some important applications for -expectation in various areas especially in finance. For example, the ambiguity in financial model can be described by the -expectation; see Chen and Epstein . The -probabilities and -expectation have also been found to have intimate connection with the rapidly developed dynamic risk measure theory. Choquet et al.  showed that in dynamic setting, risk measures could be formulated via the -expectation.
This paper firstly adopts -expectation and -probability to describe an ambiguous environment. The only difference from prospect theory framework in Jin and Zhou  is the decision rule when cost constraint is linear. In prospect theory, there is only a reference or “real” probability in the world, where an agent has a distortion to this probability to describe his attitude. Here, the economic meaning to use -expectation instead of Choquet integral is that an agent faces an ambiguous world, where there may be a set of priors. So ambiguous attitude replaces the probability distortion. Actually the model I build is a general form, so it can feature many other cases when specific -function is chosen, which can be searched in the future.
This paper is organized as follows. Section 2 gives out the background of the problem and the optimal portfolio selection model under the prospect theory and -expectation. In Section 3, the original model is divided into three subproblems owing to the discontinuity, nonconvexity maximization problem. Also this section proves the equivalence between the original problem and the three subproblems. Section 4 solves out the subproblems with the perturbation method and Ekeland’s variational principle. Section 5 analyzes the form of the optimal solution under two simple but fundamental examples, gives out the economic meaning under the model, and compares our model with the one in Jin and Zhou . The final part presents some concluding comments.
2. The Model under Prospect Theory and -Expectation
In this paper is a fixed terminal time, and is a standard -dimensional Brownian motion defined on a complete filtration probability space . The information structure is given by a filtration , which is generated by and augmented by all the -null sets. According to Karatzas and Shreve , they define a complete capital market in continuous setting: given kinds of risky assets, for example, kinds of stocks whose price processes are written as and given a riskless asset, for example, the bank account, whose price process is written as . These processes of kinds of assets satisfy the equations below: where and , , show the appreciation rate and the disperse rate of risky assets, respectively. They are all -progressively measurable and satisfy and is the interest rate which is an adapted progressively measurable random process satisfying . Accordingly, in a complete market the total wealth process which is replicated with a portfolio of the assets can be represented by backward stochastic differential equations (BSDE) introduced in  where is the terminal wealth at , , and . Here, is the value process replicated by constructing a self-financing portfolio with kinds of risky assets whose nonsingular volatility matrix is . This wealth process means that there are no transaction costs including price effects in the market. In fact, in a standard complete market, it is possible to construct a portfolio which attains as final wealth the amount , as in (3). Then, the dynamics of the value of the replicating portfolio are given by a BSDE with linear function , with (or in fact ) corresponding to the hedging portfolio. So, the existence of solution restricted to square-integrable ones of (3) should be guaranteed. In , Pardoux and Peng got the existence and uniqueness of solutions under some conditions which will be showed later, and because of the uniqueness of solutions, there is only one price as well as hedging portfolio so that valuation of the terminal wealth (e.g., contingent claim) is well possessed without arbitrage.
Furthermore, the can be expended to the nonlinear form. For example, people can allocate part of the capital to buy the call option, or people can assume the rates between loan and deposit are different. An interesting example of the nonlinear wealth equation is the optimal portfolio choice for large investor that is considered in . A large investor’s portfolio choices can affect the securities’ price process. The impact of the investor’s position on price is specified exogenously and the price may rise because of size or because of other agents in the market believing that the large investor has superior information. In , the respective asset price and are described by the following equations: where , , are functions describing the effect of the wealth and the strategies possessed by the large investor. The corresponding wealth process is governed by where is a volatility process.
In this wealth equation, the function is nonlinear about and , where and is -valued. So clearly I can break through the limit of the linear wealth equation; then I can incorporate more situations which are closer to the real market.
As I see, BSDE can form more generalized kinds of financial models. In fact, BSDE can define generalized stochastic differential utility (see ) and model drift uncertainty by using a dynamic nonlinear expectation called -expectation (see ). The key point is that in BSDE the time consistency related to dynamic model is kept because of the property of solutions. Generally, I need several assumptions about BSDE for getting its solution.
Assumption 1. Let be a function satisfying the following:: is uniformly Lipschitz with ;: is continuous about and ;:;:.
I can say that for any given and satisfying , the solution of the backward stochastic differential equation satisfying is formed by a pair of uniquely adapted solutions ; see .
Then I will give the definition of -expectation and dynamic pricing mechanism correspondence to a BSDE.
Definition 3. Given , and , satisfies , , and . Considering the BSDE (3) above with respect to function and terminal claim , I will define the dynamic pricing mechanism from  by Peng as follows: Peng proves that the dynamic pricing mechanism has the following properties:(1) for all ;(2); (3) then one can have ;(4).
Attention. is not the same with -expectation. I could say that if a function satisfies the properties , , and , then it is a kind of dynamic pricing mechanism. Through a testation in  people could know whether a dynamic pricing mechanism is a -expectation. Obviously a dynamic pricing mechanism has weaker assumption compared with the -expectation.
In this paper, I consider an agent with an endowment . Without loss of generality, I assume the psychological reference point at terminal which serves as a base point to distinguish gains from losses is zero (for details, see remarks in Jin and Zhou ). As a result, means gains while represents losses. For convenience, this paper adopted the backward method [7, 10] and transformed the limit of the initial capital into the control of form of the dynamic pricing mechanism: . Next, I give two utility functions and , both of which map from to measure the gains and losses, respectively. The technical assumptions on these utility functions, which will be imposed throughout this paper, are summarized as follows.
Assumption 4. are strictly concave, increasing functions and satisfy and are continuously differential, with their derivatives bounded.
Note that, under Assumption 4, when , then .
Now I give out the contingent claim a value reflecting the investor’s risk attitudes and psychological emotions different from the prospect theory. It can be written as , where , that is, the terminal contingent claim value, and are, respectively, the positive part and the negative part of the contingent claim:
Here I replace Choquet integrals in  with -expectations: , , given two backward stochastic differential equations generating functions as below satisfying and they can be nonlinear form so as to generate the nonlinear expectations: I consider and as a kind of measure or evaluation on the utility of gains and losses of terminal time. This form will greatly enrich the risk measure methods. Then, I give . So, the problem under prospect theory with -expectation’s decision rule can be introduced as follows: Here, is the solution of BSDE (3) with respect to function and terminal variable and actually it is a cost constraint.
Note the following. (1)I did not assume that wealth function satisfies , which will limit the scope of . For example, the large investor case will be omitted by the assumption . So I could not replace with the form of -expectation. However, when satisfies the assumption , it will incorporate some nonlinear cases, for example, the wealth equation which allows borrowing money. This makes my model expand to a larger scope that is important both in the theory meaning and in the practical meaning.(2)This value has some good mathematical properties compared with the rule in  (see Chen and Selum ). (1) has dynamic consistency which is important in practice, while the Choquet integral does not. (2) The probability distortion and in  is where and are two differentiable and strictly increasing functions, satisfying and . In my model, when has a special form, for example, does not depend on and is linear in (see Lemma 12), and , here is the -probability. Obviously this equivalent probability is on the event sets adapted with and have the similar effect as the distortion of probability , but it cannot be represented by a in .
Theorem 5. For the original Problem (10), the constraint is equivalent with .
Proof. For any given , if I could prove that the constraint is superior to the constraint , then I could prove the equivalence. For any given investment portfolio selection , with the given capital , through the wealth equation I could get at terminal time. With the same portfolio , the random part has the same form which could be omitted. Since the original capital , and by the comparison theorem of SDE, we could get . Then I can get the conclusion.
Lemma 6. satisfies that , if and only if the generating function satisfies .
So throughout this paper I assume that satisfies and satisfy .
3. The Analysis of the Model Structure
This section divides the original problem (10) into three subproblems, and I give out the proof that the solution of problem (15) is exactly the solution of problem (10), which means that I can translate the focus on the original problem (10) into the treatment of subproblems (13), (14), and (15). Because the subproblems can be treated beautifully using the method proposed by Ji and Zhou  and Ji and Peng , the original problems can be solved at the same time. At first, for any given , we have the following.
In order to get the equivalence between the original problem (10) and its three subproblems, I need to be linear. Under the linear cost constraint, I give the following theorem.
Theorem 7. Let be linear with . Given , define and . Then is the optimal solution of problem (10) if and only if is the optimal solution of problem (15); also and are, respectively, the optimal solutions of problems (13) and (14) with respect to .
Proof. (1) Consider
So, is a feasible solution for the original problem (10). Then, I have . So for any feasible solution of problem (10) written as , I define and . Then I get and , so ≤ ≤ , which means is the optimal solution of the original problem (10).
(2) Assume is the optimal solution of the original problem (10); then definitely and . If I assume the inequality is strict, then there exists which is a feasible solution for problem (13) with , so that . So I can define which is a feasible solution for problem (10), and ; this convicts with the optimality of , so is an optimal solution of subproblem (13). Similarly I can prove that is also the optimal solution of subproblem (14). So, and . For any feasible I only need to prove the following: There are three different cases in the following.(1)When , , and , so Attention: owing to the , so ,which contributes to the fourth equality above.(2)When , then , and since , so I could get .(3)When , for any , problems (13), (14) have nonempty feasible solution space for any , so, for any given , I can find , respectively, feasible for (13) and (14), and also I can get , . Since is a feasible solution for problem (10), then ≤ = .
Theorem 7 gives out the equivalence between the original problem and the three subproblems. In the next section, I will address separately the maximum and minimum control problems under -expectation environment.
4. The Treatment of the Model
In this section, I deal with the optimal portfolio problem. To get the optimal portfolio of problem (10), I need additional assumption.
and are both continuously differentiable function with , and their derivatives are uniformly bounded.
Then, I give the necessary form of solution in problem (13) as the theorem below.
Theorem 8. Assume that and also satisfy , for any given , where , and satisfies Assumption 4; if is the optimal solution of problem (13) with these parameters, then has the following form: where is the subset of . are respective solutions to the following stochastic differential equations: where and , when . If , then let , , and when , let , . Functions , , , and are respective derivatives of , with respect to or , and and are respective solutions of BSDE (3) and (9) with terminal random variables and .
Proof. Here I adopt the terminal variation method and Ekeland’s variational principle to get the form that optimal solution of problem (13) needs to satisfy. First I define the state constraint as follows: For each and , I define: . Let be the solution to (3) and (9) corresponding to , and let be the solutions to the following variational equations: and define
Lemma 9. , for ;
, for .
Proof. I only need to prove the lemma at the case of . From the variational equations and (3), (9) I can get the following equations: Let Then, A standard estimation on the above BSDE yields where is a constant. However the Lebesgue dominated convergence theorem implies that The desired result follows by applying Gronwall’s inequality.
Let be metric in naturally introduced by its norm and introduce a mapping which has the form as follows:
Lemma 10. There must exist , with , , and such that the following variational inequality holds:
Proof. It can be easily checked that and , . This leads to . Thus by Ekeland’s variational principle, there exists such that
Because is the optimal solution of problem (13), it is easy to see that . For any and , I introduce the following notation:
Then, and .
Consider the following variational equation: where corresponds to . As for Lemma 9 we can get the following: This leads to the following expansions: Now I consider two cases.
Case 1. , so there exists such that − . Consider I define So, , where .
Case 2. . There exists such that − , so Also I define that , . I can get the following: At last, , , and for both cases. There exists a converging subsequence of with the limit . Since , we have , so . On the other hand, it is easy to check that , .
Now, I apply Ito’s lemma to and then have .
So by Lemma 10
Since the above is true for any , so I can have , , .
Owing to , so , and Then the proof is completed.
For problem (14) I can solve it in the similar way.
Theorem 11. Assume that and also satisfy , for any given , where , and satisfies Assumption 4; if is the optimal solution of problem (14) with these parameters, then has the following form: where , , and . is a subset of . and are respective solutions of BSDE (3) and (9) with terminal random variables and . are, respectively, the solutions of the following stochastic differential equations:
5. The Economic Explanation and Model Comparison
In this section, I try to illustrate the difference between my model and that in  and the economic explanation. Because is a linear form, the cost constraint is the same form; then the only difference is the contingent claim’s value: I use the -expectation to replace the Choquet integral. The relation between two nonlinear expectations is searched in .
Lemma 12. Suppose satisfies . Define for a given event . Then, can be represented as a Choquet integral for any ; that is, if and only if there exists a function such that is of the following form:
This lemma implies the -expectation coincides to the Choquet integral only when it becomes a classic linear expectation. Note that this kind of Choquet integral represented by -probability is different from the form in . Here, when is a linear form, the original probability is transformed to an equivalent probability, . Actually, -expectation and Choquet integral are two parallel ways to search nonlinear expectation and nonadditive probability. Besides, it has been proved that the -expectation derived in the BSDE framework is more nonlinear in some sense.
The economic motive in model  is from prospect theory founded by Kahneman and Tversky. The key point is that an agent considers the world differently when facing gains and losses. In the world, there is a reference probability or I could say it is a “real” probability. Because of the emotion and psychology of an agent, the assumption of rational behavior is slack; then this leads to a probability distortion. The distortion makes the probability of events in agent’s thought nonadditive. So, a Choquet integral is used to express the expectation. Moreover, the agent treats the model differently when facing gains and losses, so there are two utilities and probability distortions.
In my model, I accept the idea that an agent treats the world differently when facing gains and losses; that is, there is a change in agents’ thought (even I could rationally consider that the agent is absolutely different between facing gains and losses). The reason I use -expectation to replace the Choquet integral is that I try to describe another economic background. Here, I consider an ambiguous setting, a complicated world. In an ambiguous world, there is a set of priors instead of one reference probability. So the ambiguous environment replaces the probability distortion environment in . Also, I use and to describe two sets of priors in the agent’s thought, which reflects two uncertainty attitude to gains and losses. The uncertainty under -expectation framework is considered as drift uncertainty. Recently, Epstein and Ji consider the volatility uncertainty case; see [32, 33].
Then, I will give two fundamental examples to describe some economic explanations.
Example 13. For simplicity I will define that , , which means that investors adopted the same risk attitude as the market. Now the optimal model can be transformed to
For given , it could be divided into three parts as the method above.
Subproblem Three This is also the problem posed by Jin and Zhou, when . For subproblem one by method in , we could solve this problem as follows: however, by my method above I could know that they have the similar forms with my conclusion in Theorem 8 when using to replace , , and . This leads to a hint that the two methods have the similar effect on solving the conventional cases. This means that the special forms of -expectation could also be solved by the method in .
Example 14. Fixing , , can be any form which could demonstrate all the wealth movement forms. So the optimal model can be the following forms: Here show the risk attitudes facing earning and losses by defining By the lemma above people can know that these two kinds of -expectations can be replaced by the special form of Choquet expectations. In this case, the probability distortion functions , have different forms owing to the difference between and . The probability distortion reflects the psychology of the investors, so and reflect the different psychologies when facing gains and losses.
In view of the form (53), it can be seen that the optimal strategy should deliver a wealth in good states and a shortfall in bad states . To realize this goal, the investor should buy a contingent claim with the payoff at the cost . Since , he needs to sell a contingent claim at the price of to finance the shortfall. Given the investors’ risk attitudes and their special -shaped utility, the investor should try his best to get optimized with the initial , so he has to consider how to allocate the limited money on the call option and the put option. This also means that the investor not only invests in stocks but also takes leverage to gamble on the good state of the market. As a result, the ratio of reflects the risk attitudes of the investors in some extent. Also in the optimal solution, reflects the risk probability ratio between the investors and the market.
Finally, I consider the nonlinear case of , which means ambiguous environment. Consider a -expectation with a function; when is sublinear in , is a sublinear expectation and it could be proved that there is a nonempty convex closed set of probabilities absolutely continuous to original probability, , such that . Also, if is superlinear, there is a set of probabilities, , such that . Choosing specific , with corresponding conditions, the contingent claim’s value can be expressed as . This form features an agent who is extremely pessimistic or cautious about his judgment for gains and losses. Note that it is reasonable that he has two sets of priors, and , for two kinds of environment as well as two kinds of mood. But if , the sets and coincide.
From the above linear and nonlinear cases, I give the detailed introduction about the similar and different points between the probability distortion used in  and -expectation used in my model. The key point is that only when there is no probability distortion considered in  and there is only one probability (no ambiguous belief) in my model like Example 13, both of the problems convert to the conventional optimal problem, and, of course, they are the same problem. Otherwise, the model in  and my model consider two different economic situations as mentioned above; that is, one is about probability distortion and the other considers the ambiguous case. From the mathematical viewpoint, Choquet integral and -expectations are two different and parallel ways to search nonlinear expectation. The last point I want to say is that my model is a general form, because can be any nonlinear case satisfying the conditions of Theorems 8 or 11. So my model can be applied in other economic cases by choosing other -functions in the future that cannot be interpreted now.
The optimal portfolio selection problem under the conventional expected utility theory has been well researched. There are also some classical methods to be adopted. Owing to the bad performance in realistic market, some new decision rules have been proposed by many economists. The famous prospect theory provided by Kahneman and Tversky has been considered as the most acceptable rule for its three points. Under the framework of this theory, the optimal model becomes difficult for the -shaped utility function and the nonlinear probability distortion. The previous research usually neglected one or two points of the PT theory, which will greatly simplify the problem. My paper replaces the nonlinear Choquet expectation in  by the nonlinear -expectation, so the nonlinear probability distortion could be replaced by the function. Also I adopt the -shaped utility function and different -functions to show the different uncertainty attitudes towards gains and losses. If the wealth movement equations can be expanded to the nonlinear cases, it will incorporate many famous cases, for example, the larger investor case, the borrowing rate different from the lending rate case, and the different risk premium for short and long position case. In , they only research the classical linear case. Unfortunately, I can only simplify the problem in linear wealth movement equations. So, how to divide the original problem (10) under nonlinear wealth function case (nonlinear cost constraint) needs further research. Attention that for the decision rule, I use the -expectation, but the wealth equation adopted the raised by Peng. This dynamic pricing mechanism has some similar properties with the -expectation, but they are different. could incorporate more cases than the -expectation.
Using some techniques, for example, dividing the original problem into three subproblems, Ekeland’s variational principle, and the terminal perturbation method, I could get the necessary form of the optimal solution in our model. At last, I provide the economic meaning for my model.
Conflict of Interests
The author declares that there is no conflict of interests regarding the publication of this paper.
- V. J. Neumann and O. Morgenstern, Theory of Games and Economic Theory, Princeton University Press, Princeton, NJ, USA, 1944.
- H. Markowitz, “Portfolio selection,” The Journal of Finance, vol. 7, no. 1, pp. 77–91, 1952.
- D. Li and W. L. Ng, “Optimal dynamic portfolio selection: multiperiod mean-variance formulation,” Mathematical Finance, vol. 10, no. 3, pp. 387–406, 2000.