Abstract

Based on the risk control of conditional value-at-risk, distributionally robust return-risk optimization models with box constraints of random vector are proposed. They describe uncertainty in both the distribution form and moments (mean and covariance matrix of random vector). It is difficult to solve them directly. Using the conic duality theory and the minimax theorem, the models are reformulated as semidefinite programming problems, which can be solved by interior point algorithms in polynomial time. An important theoretical basis is therefore provided for applications of the models. Moreover, an application of the models to a practical example of portfolio selection is considered, and the example is evaluated using a historical data set of four stocks. Numerical results show that proposed methods are robust and the investment strategy is safe.

1. Introduction

The mean-variance portfolio model developed by Markowitz [1] is considered as the beginning of the financial investment research. The return and risk of a portfolio were measured by the expected value and the variance of the random portfolio return in this model. The risk measures “value-at-risk” (VaR) and “conditional VaR” (CVaR) have been studied extensively in recent literature of risk management and portfolio selection. However, when return distributions are not elliptical, the variance and VaR fail for risk evaluations. Artzner et al. [2], Mausser and Rosen [3], Rockafellar and Uryasev [4], and Uryasev and Rockafellar [5] also showed that CVaR has the most desirable properties. For example, it satisfies subadditivity property and looks deeply into the tail of a distribution. To cope with nonnormal elliptical distributions, CVaR may be compared with the widely accepted VaR risk performance measure. Based on the risk control of CVaR, Lin and Gong [6] established a mean-CVaR model on the basis of the mean-variance model and under normal distribution condition. Assuming that security rates of return have a multivariate normal distribution, Alexander and Baptista [7] analyzed the portfolio selection implications arising from imposing a VaR or a CVaR constraint on the mean-variance model. They showed that a CVaR constraint dominates a VaR constraint as a risk management tool when a risk-free security is present. Based on CVaR risk measure, Andersson et al. [8] examined a new approach for credit risk optimization. In this approach, the credit risk distribution was generated by Monte Carlo simulations, so it was not necessary to assume that the credit risk distribution is known. The optimization problem was solved by linear programming. However, generated several thousands of scenarios may lead to the linear programming with huge number of variables and constraints thus decreasing the computational efficiency of the problem. To overcome this difficulty, nondifferential optimization techniques and linear programming duality theory were explored in [9] and [10], respectively.

Robust optimization refers to finding a strategy whose behavior under the worst possible realizations of the uncertain inputs is optimized, to deal with uncertain data in optimization models. Robust optimization models have been designed and discussed extensively; see, for example, Ben-Tal et al. [11] and Bertsimas et al. [12] and the references therein. The research on robust optimization in both theory and application was surveyed in [12]. Ogryczak [13] showed that for box uncertainty the robust model of the mean is essentially a CVaR, and also the robust model of the CVaR itself is a CVaR with appropriately redefined probabilities. Based on the worst-case CVaR, Gao and Bian [14] gave an objective model considering the overall investment period’s risk level and derived a dynamic optimal portfolio model. Using vector autoregressive method and Monte Carlo simulation, the concrete steps for solving the optimal investment strategy were presented. Zhu and Fukushima [15] considered the worst-case CVaR in the situation where only partial information on the underlying probability distribution was available. The minimization of the worst-case CVaR under mixture distribution uncertainty, box uncertainty, and ellipsoidal uncertainty was investigated. Its application to robust portfolio optimization was given, and the corresponding problems were cast as linear programming and second-order cone programming. Tong and Liu [16] presented a worst-case CVaR index under the knowing part information of random variable, and three profit-risk robust portfolio models were proposed. The models were restated as linear programming problems for the case of box discrete distribution of random variable and the linear loss function. Bertsimas et al. [17] proposed a semidefinite optimization model for a class of minimax two-stage stochastic linear optimization problems with risk aversion. The distribution of second-stage random variables belongs to an uncertainty set of multivariate distributions with known first- and second-order moments. Zymler et al. [18] developed tractable semidefinite programming based approximations for distributionally robust individual and joint chance constraints, assuming that only the first- and second-order moments as well as the support of the uncertain parameters are given. They demonstrated that the worst-case CVaR can be computed efficiently for some classes of constraint functions. And the worst-case CVaR approximation for joint chance constraints was studied. Delage and Ye [19] proposed a model that describes uncertainty in both the distribution form and moments of random variable. Its application to portfolio selection that maximizes expected utility obtained from the single-step return of investment portfolio was given, and the problem was reformulated as a semidefinite programming problem when random variable belongs to or an ellipsoidal set.

In this paper, based on the risk control of CVaR, we propose distributionally robust return-risk optimization models, with box constraints of a random vector and the uncertainty in both the distribution form and moments of a random vector. The models are restated equivalently as semidefinite programming problems, which is computationally tractable both practically and theoretically. Moreover, an application of the models to a practical example of portfolio selection and corresponding numerical results are given.

The present paper is organized as follows. After this introduction in Section 2 distributionally robust return-risk optimization models are presented. Using the conic duality theory and the minimax theorem, the models are reformulated as semidefinite programming problems. In Section 3 we apply the models to a portfolio selection problem, and the problem is evaluated using a historical data set. Numerical results show that our proposed methods are robust and the obtained investment strategy is safe. Finally Section 4 is devoted to research perspective.

2. Distributionally Robust Return-Risk Optimization Models

In this section, we propose distributionally robust return-risk optimization models with first- and second-order moments uncertainty and box constraints of random vector. Their equivalent semidefinite programming formulations are presented.

Let be the decision variable, let be the random vector, and let be the probability distribution of the random vector . Suppose that a loss function is defined as . From [4], the CVaR performance function can be written as where is a confidence level,

In many economic and financial planning models, which depend on uncertain parameters (for example future interest rates, future demands for a product, or future commodity prices), their probability distribution is usually unknown. In such situations, using historical data of uncertain parameters, lower and upper bounds on the parameters and empirical estimates of the mean and covariance matrix of the parameters can usually be obtained. We use them to define a distributional uncertainty set below. Distributional set taking into account moments’ uncertainty in random vector and box constraints of random vector is as follows: where is the set of all probability measures on the measurable space , is the Borel -algebra on , and are lower and upper bounds on the random vector , , and are estimates of the mean and covariance matrix of the random vector , respectively, and the constants and quantify one’s confidence in and , respectively. Same as inequalities in [19], the second inequality in assumes that the mean of lies in an ellipsoid of size centered at the estimate , and the last inequality in assumes that the “centered second-moment matrix” of lies in a positive semidefinite cone defined with a matrix inequality.

Based on the distributional set and classical portfolio optimization models, we set up the following distributionally robust return-risk optimization models.

Model 1. Maximize the worst-case expected return with constraint including worst-case CVaR: where is a convex set and the upper bound on CVaR is given constant.

Model 2. Minimize the worst-case CVaR with constraint including the worst-case expected return: where the lower bound on expected return is given constant.

Model 3. Maximize the objective function including worst-case expected return and CVaR: where is a risk-aversion constant used to trade off the reward (expected return) and risk (CVaR).

Remark 1. The objective function of Model 3 represents a risk-adjusted expected return function. In the models above, should be given at first. Model 2 with bounded return is more common than Model 1 with bounded risk, because it is not so easy to expect an investor to specify maximum risk aversion levels with confidence. See, for example, Mansini et al. [20] and the references therein. The consideration to robust versions of the Sharpe ratio model [21] without requiring bounds on reward or risk will be included in our future work.
Because it is not easy to solve the models numerically, we give their tractable formulations by the following theorems. In the theorems, we use the following notation: the scalar product of any two symmetric matrices and is defined as , where “” denotes the trace (sum of diagonal elements) of a matrix.

Theorem 2. Model 1 can be reformulated as the following semidefinite programming problem: where , , , , , , , , and are variables.

Proof. From and Theorem 4 in [19], we have that, for any given , the problem can be transformed into the following problem: where , , , , , and are variables.
Similarly to the above proof, for any given , we have that can be transformed into the following problem: where , , , , , , and are variables.
In (8), for any if and only if Because , the convex set has an interior point and objective function in the left-hand side of (10) is bounded below on , and from the conic duality theorem, we know that (10) can be transformed into the following problem: where , the left-hand side of (11), is dual problem of the left-hand side of (10) with the same optimal value. Similarly to the above proof, for any if and only if where . Similarly, for any if and only if where , and .
From (8) and (11), we have that can be transformed into the following problem: where
From (9), (12), and (13), we have that if and only if the set (denoted by ) is nonempty, where .
From equality (20), (15), and (16), we have that Model 1 can be restated equivalently as the following problem:
Because the matrix inequality implies that , the matrix inequality in is removed. Similarly, the matrix inequality in is removed. So our proof is complete.

Theorem 3. Model 2 can be reformulated as the following semidefinite programming problem:where , , , , , , , , and are variables.

Proof. The proof is similar to that of Theorem 2, so it is omitted.

Theorem 4. Model 3 can be reformulated as the following semidefinite programming problem: where , and , and are variables.

Proof. From equality where , we have that where . Hence, where ; interchanging of the “min" and “max" operators in the last equality is obtained by using the minimax theorem ([22]). Hence, Similarly to the proof of Theorem 2, we can prove that Model 3 can be reformulated as the problem (21). Therefore the proof of the theorem is concluded.

3. Application in Portfolio Selection

In this section, we apply the models discussed in the previous section to solve portfolio problem and compare the behavior of the solutions obtained by our models and the solutions obtained by the model in [23] and the model using nominal data. The equivalent semidefinite programming problems for our models are solved by interior point methods [24].

In the following application, decision variable means portfolio weights and random vector means daily logarithmic return of the stocks. For Example 2, we evaluated our portfolio optimization problem using a historical data set of four stocks (China Mingsheng Banking (600016), Sinopec Group (600028), Sany Heavy Machinery (600031), and China Unicom (600050)) over a horizon of 10 years (2003–2012), obtained from CSMAR Solution. The historical data set we choose consisted of daily closing price of each stock of Shanghai Stock Exchange. Daily logarithmic returns were calculated by daily closing price. The parameters and are assigned as the estimates of the mean and covariance matrix of daily logarithmic return.

Example 1 (see [23]). Let the mean and covariance matrix of daily return of seven stocks be as follows:

Example 2. The empirical estimates and of the mean and covariance matrix of daily logarithmic return calculated by the historical data of the four stocks are as follows:

The results for Examples 1 and 2 are listed in Tables 1 and 2, respectively. The tables give the expected return (ER) , standard deviation (SD) , worst-case expected return, and worst-case CVaR ((WER,WCVaR), it is the risk-adjusted expected return function value (i.e., objective value) for Model 3 in tables), where is a global solution of models.

From tables and figures, we arrive at the following conclusion.

(1) We compare the models with the model in [23] and (26). The byproduct-solution pair , obtained by our models, is not worse than that obtained by the model in [23] and (26). Our model sometimes gives a slightly better result. That is, the investment decision with highest return is obtained. And our models give worst-case return and risk. So our model may be a better choice.

(2) Usually, the estimated mean and the covariance matrix of a return vector of the stocks based on the market data are subjected to errors. That is, there are errors between nominal data and true data. To analyze the robustness of the solution, for Example 1, let the nominal data be slightly perturbed. That is, let be the exact mean and covariance matrix, where “diag()" denotes a diagonal matrix whose th diagonal element is . Using the data to solve the models, the solution pair obtained by model in [23] and (26) is (0.1068, 0.0611) and (0.1200, 0.3899), respectively. Hence, using results obtained by model in [23] and (26) in Table 1, investors tend to overestimate return and underestimate risk. From , , and , we have that distribution of random vector with mean and covariance matrix lies in the set . So our proposed methods are robust and the investment strategy is safe.

(3) By solving Models 1 and 3 for different values of and , one can generate a sequence of optimal portfolios on the efficient frontier. From Figures 1 and 2, we observe that it ranges from the portfolio with the smallest overall variance to the portfolio with the highest expected return; that is, efficient frontier is monotonically increasing. This is consistent with the reality.

4. Conclusion

We proposed distributionally robust return-risk optimization models based on the risk measure CVaR, with box constraints of random vector and uncertainty in both the distribution form and moments (mean and covariance matrix) of random vector. The models are reformulated as computationally tractable semidefinite programming problems. And the scale of semidefinite programming problems is independent of the number of sample points. Moreover, an application of the models to a practical example of portfolio selection with historical stock data shows that proposed methods are robust and the investment strategy obtained by them is safe. The models are suitable for investors with high security demands.

Conflict of Interests

The authors declare that they do not have any conflict of interests in their submitted paper.

Acknowledgments

The authors are grateful to anonymous reviewers for many helpful suggestions. Their suggestions led to an improved version of this paper. The work was supported by the National Natural Science Foundation of China (11301050, 11171051, 91230103, 71172136, and 71301017).