Abstract

Portfolio optimization with respect to different risk measures is of interest to both practitioners and academics. For there to be a well-defined optimal portfolio, it is important that the risk measure be coherent and quasiconvex with respect to the proportion invested in risky assets. In this paper we investigate one such measure—conditional capital at risk—and find the optimal strategies under this measure, in the Black-Scholes continuous time setting, with time dependent coefficients.

1. Introduction

The choice of risk measure has a significant effect on portfolio investment decisions. Downside risk measures—that focus attention on the downside tail of the distribution of portfolio returns—have received considerable attention in the financial world. Value at risk is probably the most famous among these measures, having featured heavily in various regulatory frameworks. It can be defined for a random variable and a confidence level by , where is the -quantile of (see e.g., [1, Equation ]) (Another common definition is that the VaR of a loss distribution is the smallest number such that . This is equivalent to the definition given here if we define the loss of the portfolio to be given by , and identify .). A closely-related downside risk measure is capital at risk (CaR), defined in [2] (see also [3, 4]) as the difference between the riskless investment and the quantile .

Quantile-based risk measures such as and suffer from several shortcomings. First, while they measure the best of the worst outcomes at the confidence level, they do not answer the question of how severe the loss can be. Also, one of the most important concerns is that these measures are not in general subadditive; that is, when used to measure risk, they do not always satisfy the notion that the diversification should not create more risk [5]. Finally, as illustrated in [6], can exhibit multiple local extrema.

These issues were addressed in the widely cited article by Artzner at al. [7], where the authors define coherent risk measures by four conditions that such measures should satisfy. The article motivated a number of authors [5, 813] to propose and investigate different types of coherent risk measures, all of which are tail mean-based risk measures.

One such measure, that does not suffer from the critical shortcomings of VaR and CaR, is conditional capital at risk . This is defined (in [14]) as the difference between the riskless investment and the conditional expected wealth, under the condition that the wealth is smaller than the corresponding quantile, for a given risk level. As such, this measure provides an indication of the likely severity of the loss in the event that the loss exceeds a given quantile. In this paper we prove that is strongly quasiconvex as a function of the portfolio, which is an essential property for optimization. We investigate conditional capital at risk in a multiasset Black-Scholes setting, in continuous time, and with time-dependent coefficients. We generalize and extend the optimization approach of Emmer at al. (see [2, 14]) to the continuous-time setting.

The outline of this paper is as follows. In Section 2, we give the notation and define the portfolio process and . Section 3 provides the proof that is a coherent risk measure and that it satisfies the property of strong quasiconvexity. In Section 4, we derive an analytical solution for the minimal problem, up to a scalar constant which has to be evaluated numerically. Section 5 is devoted to the derivation of an analytical strategy for the maximal expected wealth, subject to constrained . Section 6 provides some numerical examples, and Section 7 concludes this paper.

2. Preliminaries

We introduce the following notation. The -dimensional column vector with each component equal to is denoted by , the Euclidean norm of a matrix or vector by , and the space of -valued, square-integrable functions defined on by , , or just . The natural inner product of this space is denoted by , and the corresponding norm by .

We work under the following assumptions.

Assumption. (i) The securities are perfectly divisible.
(ii) Negative positions in securities are possible.
(iii) Rebalancing of the holdings does not lead to transaction costs.

Assumption. (i) assets are traded continuously over a finite horizon .
(ii) of these assets are stocks that follow the generalized Black-Scholes dynamics: where , , are independent standard Brownian motions.
(iii) One of the assets is a bond, whose price , evolves according to the differential equation: where is the interest rate of the bond. Throughout this work, we assume that borrowing in the bond is unconstrained.
(iv) The volatility matrix (with th element ), its inverse , the drift vector , and the interest rate are deterministic, Borel measurable, bounded functions over , so that they belong to the appropriate spaces.
(v) satisfies the nondegeneracy condition: where is a given constant.

We note that, under the above assumptions, the market is complete.

At any time , shares are held in the asset , leading to the wealth . The -dimensional vector-valued function is called the trading strategy. We denote the fraction of the wealth invested into the risky asset by and call the portfolio. The fraction held in the bond is . Under the assumption that the trading strategy is self-financing, the wealth process follows the dynamics where is the initial wealth, and the risk premium vector is defined by

Proceeding as in [3], to ensure a minimal tractability of the optimization problems which we solve through the following sections, we restrict our attention in this work to the class of portfolios which are Borel measurable, deterministic, and bounded over . Such portfolios are called admissible. Note that for an admissible portfolio , (2.5) is guaranteed to have a strong solution (see [15, Theorem ]). Note that, by allowing for time-dependent coefficients, this generalises the class of portfolios considered in [14].

Under condition (2.3) the market price of risk is uniquely defined by It will be shown throughout this work that the magnitude of the -norm of the market price of risk is the determining criterion for optimal investment strategies, which turn out to be the weighted averages of the bond and Merton's portfolio defined by This result is an illustration of the mutual fund theorem in complete markets.

We now recall the definitions of the quantiles and quantile-based risk measures for .

Definition 2.3. Suppose that is the cumulative distribution function of the wealth , at time . For a risk level , the -quantile of is defined as The tail mean or expected shortfall of the wealth process , which we denote by , is the expected value of conditional on , that is, The conditional capital at risk, which we denote by , is defined to be the difference between the riskless investment and the tail mean, that is, where is defined as

Remark 2.4. Note that, with CCaR defined in this way, we get the following.(i)An investment in a riskless asset corresponds to zero CCaR.(ii)An increase in the tail mean corresponds to a decrease in CCaR. Thus, negative CCaR corresponds to the case when the tail mean is above the riskless return, that is, a desired result, while positive CCaR corresponds to the case when the tail mean is below the riskless return which we try to reverse by minimizing CCaR.

It was shown in [3] that the -quantile of the wealth process can be written as where, for a given risk level , denotes the corresponding -quantile of the standard normal distribution (note that this is negative when ). In the following, we let and denote the density and cumulative distribution functions of the standard normal random variable. We have the following proposition.

Proposition 2.5. The tail mean of the wealth process solving (2.5) can be expressed as and the expected value is given by

The proof is given in the appendix. From Proposition 2.5 we get the following corollary.

Corollary 2.6. The conditional capital at risk of the solution to (2.5) can be written as

Throughout the paper we will use the following function: which transforms the expression for conditional capital at risk into

We now turn our attention to investigating the properties of CCaR.

3. Coherency and Quasiconvexity of CCaR

The notion of coherent risk measures has been widely discussed in the recent literature (see [5, 713] e. g.). Below we recall the definition of coherency, confirm that is a coherent risk measure, and prove that it is quasiconvex.

Definition 3.1. Let be the set of real-valued random processes on a complete probability space , with its natural filtration , satisfying for all , and for . Thenis a coherent risk measure if it satisfies the following properties.(1)Subadditivity: is subadditive if, for all random processes ,, (2)Positive Homogeneity: is positive homogeneous if, for all and constant , (3)Monotonicity: is monotone if, for all ,, such that almost everywhere, and , (4)Translation Invariance: is translation invariant if, for all and ,

To prove that of a wealth process is a coherent risk measure we refer to [8, Definition ], where the expected shortfall of a random process , with the corresponding -quantile , is defined as and is shown to be coherent [8, Proposition ]. We should note that the definition of coherency used in [8] involves translation invariance in the sense that , for all and .

In order to relate the result in [8] to the coherency of CCaR in the sense used here, we first note that, if is a random process with a continuous probability distribution, we have , so that (with a slight abuse of notation) . If we consider the shifting of the portfolio value by an amount , we have and so we have the following result.

Corollary 3.2. Conditional capital at risk of a wealth process , at time , for a risk level , is a coherent risk measure.

Remark 3.3. (i) While properties 2 and 3 are quite natural, subadditivity has the obvious, yet important consequence that diversification does not create more risk.
(ii) Properties 1 and 2 together are very important as they guarantee the convexity of the risk measure which is essential for optimization.
(iii) In our definition of a coherent risk measure we say that a risk measure is translation invariant, if , for all , and . While in article [8], It is said that arisk measure is translation invariant, if , for all and It is our belief that the definition of the translation invariance, as given in this paper, corresponds more with the intuition behind the notion of translation invariance than the definition given in [8].

Remark 3.4. In the above remarks we investigated the impact of diversification to the portfolio risk, as measured by . We saw that the axiom of subadditivity requires that the risk of the sum of two risky processes, say two stock price processes, be less than or equal to the sum of the individual risks associated with these stock price processes. This means that diversification does not create more risk, as measured by , and that, as long as we diversify, we expect risk reduction.
However, if the portfolio consists of all market assets, the diversification is completed. Then the risk can be further reduced by optimization, that is, by rebalancing the positions across these assets. We then look at the risk measure as a function of the portfolio , and prove that it is strongly quasiconvex in which further implies the uniqueness of the corresponding optimization problems' solutions.

We now turn to the notion of strong quasiconvexity, which we note has not been discussed in the context of portfolio optimization in any of the quoted references except in [3, 4]. Its usefulness lies in its role in establishing the existence of unique solutions to portfolio optimization problems.

We first recall (see [16, Definition ]) that a function is said to be strongly quasiconvex if In the following theorem we prove that has this important property, when viewed as a function of the portfolio weights.

Theorem 3.5. For all distinct , (where the set has a positive Lebesgue measure), and for all

Proof of Theorem 3.5. We suppose, without loss of generality, that in which case, from (2.18), we have Let . We claim that which is equivalent to the statement For ease of notation, we define the function Since , (3.10) is equivalent to while (3.12) becomes Clearly, from (3.14), the left-hand side of (3.15) is greater than and the theorem will be proved if we can establish that this is nonnegative. In order to do this, we make use of the following lemma, the proof of which is given in the appendix.Lemma 3.6. The function is decreasing and strictly concave for all .Since , one can make use of Lemma 3.6 to establish the following: The proof is complete.

This theorem has an immediate, important consequence. Namely, from [16, Theorem ], if a function is strongly quasiconvex, then its local minimum is its unique global minimum. Therefore, the following corollary is true.

Corollary 3.7. If has a local minimum at , then is its unique global minimum.

This guarantees that minimization of with respect to gives the true, global minimum. To illustrate strong quasiconvexity, we give the plot of for the following example (where the parameters were chosen to represent rather extreme conditions, for the purposes of illustration).

Example 3.8. We consider a market consisting of one stock following the and the bond with the constant interest rate . The graph of is given in Figure 1.

4. Minimal Conditional Capital at Risk

The first portfolio selection problem we consider is to minimize risk as measured by , that is, to determine its minimal value over all . The problem can be stated as Using (2.18), problem (4.1) transforms into Since is a constant, problem (4.2) is equivalent to

We now introduce the fundamental dimension reduction procedure, used throughout this work. Following [2, 14], we project the optimization problems considered in this paper onto the family of surfaces , and note that .

We denote by the restriction of to , so that Taking into account the definition of , problem (4.3) can be stated as We deal with this problem in two stages. First, fixing reduces the problem to If denotes the unique maximising portfolio for this problem, then (4.5) can be solved through the one-dimensional problem: It remains to solve the subproblem (4.6). Since is fixed, we see from (4.4) that (4.6) is equivalent to the problem the solution of which is given by the following proposition (the proof of which is given in [6, Proposition ]).

Proposition 4.1. For a fixed , the solution of problem (4.6) is attained by where denotes the market price of risk, defined in (2.7).

We are now ready to solve problem (4.3), and we have the following theorem.

Theorem 4.2. Let be the market price of risk (2.7). attains its maximum when where, if , is defined to be the unique solution of the equation and, if , . The corresponding minimum conditional capital at risk is (which is if ), and the expected wealth is

Proof of Theorem 4.2. Using the definition of and substituting (4.9) into (4.5) allows us to rewrite (4.5) as If we define the function we get
We see that has the same form as , where is defined in Lemma 3.6, with , instead of , so that . Since is a decreasing function of for, we have two cases.(i)If , that is, , then the equation has a unique solution which we denote by . Since is a concave function, it reaches its maximum at .(ii)If , that is, , then for all , and so (4.17) has no solution. This implies that is decreasing for all , so that the optimal solution to problem (4.14) is .
To complete the proof of Theorem 4.2, note that (4.9), with , is the optimal solution of problem (4.3). One can then write leading to

Remark 4.3. (i) Note that, if , we can deduce by using (A.12) that Therefore, the condition , which has to be satisfied for investing into stocks under conditional capital at risk, is stronger than the condition . The latter condition is sufficient in order to include stocks into the optimal strategy using capital at risk as a risk measure. Otherwise stated, conditional capital at risk is a more conservative risk measure than capital at risk, which is consistent with its definition.
(ii) Increasing the time horizon leads to increasing the norm of the market price of risk, so that, in case (ii), the optimal strategy changes from a pure bond strategy to a mixed bond-stocks strategy. In case (i), increasing the norm leads to increasing the expected wealth and decreasing conditional capital at risk .
(iii) As was noted in the preliminary remarks, the optimal portfolio is a weighted average of Merton's portfolio and the bond, which is an illustration of the two-fund separation theorem.

We note that the solution provided in Theorem 4.2 is not an explicit analytical solution, but it is expressed in terms of the solution of the one-dimensional equation (4.11), whose solution can be easily computed. Analytical upper and lower bounds for are given in the following lemma, whose proof is given in the appendix.

Lemma 4.4. The unique solution of (4.17) satisfies the inequality where

Remark 4.5. Note that, for , that is, at the 95% confidence level, , (4.21) approximates the result from [14], which states However, at a higher confidence level, that is, , (4.21) gives a better approximation for , that is, a smaller interval to which belongs.

5. Portfolio Optimization with Respect to Conditional Capital at Risk

We now turn to the problem of maximizing wealth subject to constrained , that is, Using (2.18), the above problem can be written in the form which is equivalent to Since , from its definition, is smaller than total wealth, to avoid trivial cases we only consider such that Under condition (5.4), problem (5.3) can be written as where Note that condition (5.4) guarantees that is well defined. Using the dimension reduction procedure of [2, 14] (see Section 4), we can write problem (5.5) as a one-parameter optimization problem: The solution to (5.7) is given in the following theorem.

Theorem 5.1. Suppose that the constant risk level satisfies the following condition: where is defined in Theorem 4.2. Then the optimal solution to problem (5.7) is where is defined in (5.6), and is the unique solution of the equation The corresponding expected wealth is equal to with the corresponding conditional capital at risk: The efficient frontier is given by the following curve, whose first component is an increasing function of defined implicitly through (5.6), (5.10), and (5.11):

Proof of Theorem 5.1. Proposition 4.1 states that the problem has the optimal solution Substituting (5.15) into (5.7) transforms it into the problem Using (5.10), the above problem can be written as Clearly, the problem achieves its optimal solution for maximal for which the constraint is satisfied. The solution to this problem is given in the following lemma, whose proof is given in the appendix.Lemma 5.2. Under condition (5.8), the equation has a maximal solution . Furthermore, , for .We substitute into (5.15) to get the optimal portfolio which yields Finally, to prove that the expected wealth is an increasing function of the risk constant , we rewrite (5.10) in the form which defines as an implicit function of . Differentiating the above, we get so that Since , and since , it follows that . From Lemma 5.2 it follows that , for all , and hence for , so that . Thus, is an increasing function of . From (5.11), we see that the expected wealth is an increasing function of , and consequently of . This completes the proof of the theorem.

Remark 5.3. If a risk-averse investor decides to take the minimal risk which, in the first case, means taking for the risk constant implying then the unique solution of the equation is .
In the second case, for a risk averse investor, the minimal risk constant is , which implies , leading to the equation so that the unique solution of the equation is .
Therefore, in both cases, choosing the minimal risk constant leads to the optimal strategies coinciding with the strategies from the previous chapter, in which minimal was determined.

6. Applications

To illustrate the results from previous sections, we give some numerical examples. We recall that the system of SDEs that models the stocks' prices is and that the stocks' returns variance-covariance matrix, which we denote by , is equal to . We also recall that can be decomposed as where is the stocks' returns correlation matrix, and is a diagonal matrix with the entries equal to the stocks' returns standard deviations. Therefore, from (6.2) we get Although, theoretically, we assume that the vector of independent Brownian motions is -adapted, that is, known at time , it is a common practice that we only observe or, equivalently, and , but not . From (6.3) we see that this leads to a nonunique decomposition of into the product . Despite that fact, the Euclidean norm, and consequently, the -norm of the market price of risk, is uniquely determined by or, in the terms of the standard deviation and the correlation matrix, as

To keep the exposition simple we consider the cases where the interest rate , as well as the volatility matrix, is constant, and the number of stocks is . In all numerical computations and the corresponding plots we use an annual time scale for the drifts, standard deviations, and the correlation matrix of the stocks. We model time dependency in the drift by assuming that it oscillates periodically around a constant level . In order to capture cycles in the economy or in the dynamics of the stocks we model the drifts as We note that the above model for stocks' drifts was already used in [17], as the amplitude and frequency coefficients and allow a high degree of flexibility in adjusting the shape of this time dependency. We also note that, when modeling real market data, it is quite easy to deal with the above functional form and estimate these two parameters by maximum likelihood techniques, rather than detrending the data.

We now look at four special cases with the following characteristics.(i)We let , with ; that is, the economic cycles of all three stocks are the same. We consider , , , with , which corresponds to a deviation around the constant values .(ii)We assume that the interest rate is and numerically explore the sensitivity of the optimal strategies with respect to and .(iii)We assume that the stocks' returns have constant standard deviations given as follows:

To emphasize the importance of the diversification effect and, consequently, of the market price of risk, for the optimal strategies, in Examples 6.2, 6.3, and 6.4 we keep the same drift and volatility coefficients and change the correlation matrix.

Example 6.1. We assume that , , and , and that the correlation matrix is

In Figure 2, we show (a) the stocks' drifts over a ten year period, with daily granularity, and (b) the optimal strategy, under minimal .

In this example, we see the expected result, that is, stock 1, which has the largest constant part in the drift, and the smallest volatility is present in the optimal portfolio in the highest percentage. However, to assume that just drifts and volatilities determine the optimal strategy would be misleading, as the following examples show.

Example 6.2. We assume that , , , and that the correlation matrix is the same as in Example 6.1.

The stocks' drifts and portfolio weights are given in Figure 3.

We see that, despite the fact that stock 1 has the smallest constant part of the drift, it is still present in the optimal portfolio in the highest percentage, due to its high negative correlations with both stocks 2 and 3.

Example 6.3. In this example we assume that , , and ; that is, the stocks have the same constant parts of the drifts as in Example 6.2 but a different correlation matrix given by

The optimal strategy, under minimal , is given in Figure 4.

In this example, stocks 2 and 3, which are highly negatively correlated, are present in the optimal portfolio in percentages above 400%, while stock 1 and the bond are being borrowed.

Example 6.4. In this example, we assume again that the stocks have the same constant parts of the drifts as in Examples 6.2 and 6.3, while the correlation matrix is

The criterion for investing into stocks, under the minimal , is . For , , so that we must have , in order to start investing into stocks. In the case of low correlation coefficients, such as in the above example, the market price of risk increases very slowly, so that it takes the time horizon of 34 years to invest into stocks with the risk level of 0.05. For , , so that we must have to start investing into stocks. If we choose the risk level to be 0.01, we have to assume the very long time horizon, of 57 years, in order to include stocks into the optimal strategy, under minimal . The relation between and can be found in Table 1.

Efficient Frontiers
Figure 5 shows the efficient frontier for Example 6.1 created using Theorem 5.1. The theorem states that the expected wealth is an increasing function of , which is bounded above by
In order to avoid extremely risky strategies as , and for the sake of more transparency of the graphs, we restrict to the interval . We note that the efficient frontiers for the other three examples are of the same exponential form, so that we omit their graphs.
The graph given in Figure 5 illustrates the fact that the efficient frontiers are an increasing function of the risk constant which bounds .

Time-CCaR-Expected Wealth Surfaces
In Figure 6 we give a three-dimensional plot of the expected wealth as a function of time and the risk constant . From Theorem 5.1 we know that the expected wealth increases when the risk constant increases. We recall that the expected wealth is also an increasing function of the time horizon . The following figure illustrates these facts for Example 6.1. In order to avoid extremely risky strategies, and to get a better representation, we restrict the upper bound for to the interval .

7. Conclusion

In this work we investigated continuous time portfolio selection under the notion of conditional capital at risk, within the Black-Scholes asset pricing paradigm, with time dependent coefficients. We showed that conditional capital at risk is a coherent risk measure and proved that it satisfies an important property of strong quasiconvexity. Based on an idea from [14], generalized in [3], we introduced the fundamental dimension reduction procedure which transforms -dimensional optimization problems into one-dimensional problems, within the class of admissible portfolios, which are Borel measurable, bounded, and deterministic. We further developed optimal strategies for portfolio optimization under constrained conditional capital at risk. It is important to emphasize that we considered time dependent portfolios, where the methods developed in [14] no longer work. We illustrated the two-fund separation theorem, by showing that all optimal strategies are the weighted averages of Merton's portfolio and the bond, and the weights depend on the investor's risk tolerance only. Finding optimal strategies requires solving nonlinear equations which include the cumulative distribution function of the normal random variable, so that the weights can be only found numerically. We provide several numerical examples which illustrate the importance of diversification, given by the correlation matrix. The correlation matrix significantly impacts the magnitude of the norm of the market price of risk, which, in turn, is the determining criterion for investment strategies.

Appendix

Proof of Proposition 2.5. Let Then the differential equation of the wealth process given by (2.5) can be written as For convenience, set . Applying the multidimensional version of Itô's Lemma, it can be shown (see the proof of Proposition of [3]) that follows the dynamics The -quantile of is equal to so that the -quantile of is equal to . We will further simplify the notation by introducing From where is the corresponding indicator function, and from Bayes' theorem, we obtain Choose . We will evaluate (A.7) using the characteristics of the distribution of , for fixed . From (A.3) we have that is a normal random variable with the parameters Taking into account (A.7) and (A.8), we can write Using the standard integration techniques, and taking into account that , we get If we substitute the original notation from (A.8) and (A.1), we get (2.10). This completes the proof of Proposition 2.5.

Proof of Lemma 3.6. Let . The function is clearly decreasing. To prove that it is strictly concave, we differentiate twice to get To evaluate , we use the following standard inequality. For , Applying the above, we get that which means that is concave. To prove that is strictly concave we need to show that If we define the function it is an easy exercise to prove that , for all ; that is, the function is strictly concave for all , which ends the proof of Lemma 3.6.

Proof of Lemma 4.4. We prove that the optimal solution of (4.17) satisfies (4.21). From (4.17) we have that We use again the standard inequality (A.12). From inequality (A.12), with , we have The right-hand side of the inequality implies From the left-hand side of the inequality we have Since inequality (A.18) implies The fact that and (A.20) give the desired result, so that Lemma 4.4 is proved.

Proof of Lemma 5.2. In this proof we find a maximal solution of the equation Since , defined by (4.15), we will apply some arguments from the proof of Theorem 4.2.(i)Suppose that , which means that . Then the unique maximum of is achieved at . From condition (5.8) (i) we have that Hence Therefore, . We further distinguish the following two subcases.(a)Suppose that . Then (A.21) implies that . From the proof of Theorem 4.2, for , , is concave, and since , it follows that the equation has at least one solution, with the bigger solution .(b)For , . Using the same arguments as in case (a), we get that (A.21) has a unique solution .(ii)Suppose , that is, . We recall that in this case . The definition of yields From condition (5.8) (ii), we have that , that is, . Using , and the proof of Theorem 4.2, we have Therefore, for all , with , so that has a unique solution . This completes the proof of Lemma 5.2.

Acknowledgments

This research was partially supported by the National Science and Engineering Research Council of Canada and the Network Centre of Excellence, Mathematics of Information Technology and Complex Systems.