Statistical Estimation of Portfolios for Dependent Financial Returns
View this Special IssueResearch Article  Open Access
Optimal Portfolios with EndofPeriod Target
Abstract
We study the estimation of optimal portfolios for a Reserve Fund with an endofperiod target and when the returns of the assets that constitute the Reserve Fund portfolio follow two specifications. In the first one, assets are split into short memory (bonds) and long memory (equity), and the optimality of the portfolio is based on maximizing the Sharpe ratio. In the second, returns follow a conditional heteroskedasticity autoregressive nonlinear model, and we study when the distribution of the innovation vector is heavytailed stable. For this specification, we consider appropriate estimation methods, which include bootstrap and empirical likelihood.
1. Introduction
The Government Pension Investment Fund (GPIF) of Japan was established in April 1st 2006 as an independent administrative institution with the mission of managing and investing the Reserve Fund of the employees’ pension insurance and the national pension (http://www.gpif.go.jp/ for more information) [1]. It is the world’s largest pension fund ($1.4 trillions in assets under management as of December 2009), and it has a mission of managing and investing the Reserve Funds in safe and efficient investment with a longterm perspective. Business management targets to be achieved by GPIF are set by the Minister of Health, Labour, and Welfare based on the law on the general rules of independent administrative agencies. In the actuarial science, “required Reserve Fund” for pension insurance has been investigated for a long time. The traditional approach focuses on the expected value of future obligations and interest rate. Then, the investment strategy is determined for exceeding the expected value of interest rate. Recently, solvency for the insurer is defined in terms of random values of future obligations (e.g., Olivieri and Pitacco [2]). In this paper, we assume that the Reserve Fund is defined in terms of the random interest rate and the expected future obligations. Then, we propose optimal portfolios by optimizing the randomized Reserve Fund.
The GPIF invests in a portfolio of domestic and international stocks and bonds. In this paper, we consider the optimal portfolio problem of the Reserve Fund under two econometric specifications for the asset’s returns.
First, we select the optimal portfolio weights based on the maximization of the Sharpe ratio under three different functional forms for the portfolio mean and variance, two of them depending on the Reserve Fund at the endofperiod target (about 100 years). Following the asset structure of the GPIF, we split the assets into cash and domestic and foreign bonds on one hand and domestic and foreign equity on the other. The first type of assets are assumed to be short memory, while the second type are long memory. To obtain the optimal portfolio weights, we rely on bootstrap. For the short memory returns, we use wild bootstrap (WB). Early work focuses on providing first and secondorder theoretical justification for the wild bootstrap in the classical linear regression model (see, e.g., [3]). Gonçalves and Kilian [4] show that WB is applicable for the linear regression model with conditional heteroscedastic such as stationary ARCH, GARCH, and stochastic volatility effects. For the long memory returns, we apply sieve bootstrap (SB). Bühlmann [5] establishes consistency of the autoregressive sieve bootstrap. Assuming that the long memory process can be written as AR() and MA() processes, we estimate the long memory parameter by means of the Whittle’s approximate likelihood [6]. Given this estimator, the residuals are computed and resampled for the construction of the bootstrap samples, from which the optimal portfolio estimated weights are obtained. We study the usefulness of these procedures with an application to the GPIF assets.
Second, we consider the case when the returns are time dependent and follow a heavytailed. It is known that one of the stylized facts of financial returns are heavy tails. It is, therefore, reasonable to use the stable distribution, instead of the Gaussian, since it allows for skewness and fat tails. We couple this distribution with the conditional heteroskedasticity autoregressive nonlinear (CHARN) model that nests many wellknown time series models, such as ARMA and ARCH. We estimate the parameters and the optimal portfolio by means of empirical likelihood.
The paper is organized as follows. Section 2 sets the Reserve Fund portfolio problem. Section 3 focuses on the first part, that is, estimation in terms of the Sharpe ratio and discusses the bootstrap procedure. Section 4 covers the CHARN model under stable innovations and the estimation by means of empirical likelihood. Section 5 concludes.
2. Reserve Funds Portfolio with EndofPeriod Target
Let be the price of the th asset at time (), and let be its logreturn. Time runs from 0 to . The paper, we consider that today is and is the endofperiod target. Hence the past and present observations run for , and the future until the endofperiod target for . The price can be written as where is the initial price. Let denote the Reserve Fund on asset at time and be defined by where denotes the maintenance cost at time . By recursion, can be written as where .
We gather the Reserve Funds in the vector . Let be a portfolio form by the Reserve Funds, which depend on the vector of weights . The portfolio Reserve Fund can be expressed as a function of all past returns
We are interested in maximizing at the endofperiod target
It depends on the future returns, the maintenance cost, and the portfolio weights. While the first two are assumed to be constant from to (the constant return can be seen as the average return over the periods), we focus on the optimality of the weights that we denote by .
3. SharpeRatioBased Optimal Portfolios
In the first specification, the estimation of the optimal portfolio weights is based on the maximization of the Sharpe ratio: under different functional forms of the expectation and the risk of the portfolio. We propose three functional forms, two of them depending on the Reserve Fund. The first one is the traditional based on the returns: where and are the expectation and the covariance matrix of the returns at the endofperiod target. The second form for the portfolio expectation and risk is based on the vector of Reserve Funds: where and indicate the mean and covariance of the Reserve Funds at time . Last, we consider the case where the portfolio risk depends on the lower partial moments of the Reserve Funds at the endofperiod target: where is a given value.
Standard portfolio management rules are based on a meanvariance approach, for which risk is measured by the standard deviation of the future portfolio value. However, the variance often does not provide a correct assessment of risk under dependency and nonGaussianity. To overcome this problem, various optimization models have been proposed such as meansemivariance model, meanabsolute deviation model, meanvarianceskewness model, mean(C)VaR model, and meanlower partial moment model. The meanlower partial moment model is an appropriate model for reducing the influence of heavy tails.
The returns are split into  and dimensional vectors and , where and stand for short and long memory, respectively. The short memory returns correspond to cash and domestic and foreign bonds, which we generically denote by bonds. The long memory returns correspond to domestic and foreign equity, which we denote as equity.
Cash and bonds follow the nonlinear model where is a vector of constants, is a positive definite matrixvalued measurable function, and are i.i.d. random vectors with mean and covariance matrix . By contrast, equity returns follow a long memory nonlinear model where with , and are i.i.d. random vectors with mean and covariance matrix .
We estimate the optimal portfolio weights by means of bootstrap. Let the superindexes and denote the bootstrapped samples for the bonds and equity, respectively, and the total number of bootstrapped samples. In the sequel, we show the bootstrap procedure for both types of assets.
Bootstrap Procedure for
Step 1. Generate the i.i.d. sequences for and from .Step 2. Let , where . Generate the i.i.d. sequences for and from the empirical distribution of .Step 3. Compute for and as
where denotes the cellwise product.
Bootstrap Procedure for
Step 1. Estimate from the observed returns by means of Whittle’s approximate likelihood:
where
Step 2. Compute for ,
Step 3. Generate for and from the empirical distribution of .Step 4. Generate for and as
We gather and into for and . The bootstrapped Reserve Funds And the bootstrapped Reserve Fund portfolio is Finally, the estimated portfolio weights that give the optimal portfolio are where and may take any of the three forms introduced earlier but be evaluated in the bootstrapped returns or Reserve Funds.
3.1. An Illustration
We consider monthly logreturns from January 31 1971 to October 31 2009 (466 observations) of the five types of assets considered earlier: domestic bond (DB), domestic equity (DE), foreign bond (FB), foreign equity (FE), and cash (cash). Cash and bonds are gathered in the shortmemory panel and follow (3.5). Equities are gathered into the longmemory panel and follow (3.6). Figure 1 shows the five assets. Cash is virtually constant, and equities are significantly more volatile than bonds and with averages that are slightly higher than those of bonds.
We estimate the optimal portfolio weights, , and , corresponding to the three forms for the expectation and risk of the Sharpe ratio, and we compute the trajectory of the optimal Reserve Fund for . Because of liquidity reasons, the portfolio weight for cash is kept constant to 5%. The target period is fixed to 100 years, and the maintenance cost is based on the 2004 Pension Reform.
Table 1 shows the estimated optimal portfolio weights for the three different choices of the portfolio expectation and risk. The weight of domestic bonds is very high and clearly dominates over the other assets. Domestic bonds are low risk and medium return, which is in contrast with equity that shows higher return but also higher risk, and with foreign bonds that show low return and risk. Therefore, in a sense, domestic bonds are a compromise between the characteristic of the four equities and bonds.

Figure 2 shows the trajectory of the future Reserve Fund for different values of the yearly return (assumed to be constant from to ) ranging from 2.7% to 3.7%. Since the investment term is extremely long, 100 years, the Reserve Fund is quite sensitive to the choice of the yearly return. In the 2004 Pension Reform, authorities assumed a yearly return of the portfolio of 3.2%, which corresponds to the middle thick line of the figure.
4. Optimal Portfolio with TimeDependent Returns and Heavy Tails
In this section, we consider the second scenario where returns follow a dependent model with stable innovations. The theory of portfolio choice is mostly based on the assumption that investors maximize their expected utility. The most wellknown utility is the Markowitz’s meanvariance function that is optimal under Gaussianity. However, it is widely acknowledged that financial returns show fat tails and, frequently, skewness. Moreover, the variance may not always be the best risk measure. Since the purpose of GPIF is to avoid making a big loss at a certain point in future, risk measures that summarize the probability that the Reserve Fund is below the prescribed level at a certain future point, such as value at risk (VaR), are more appropriate [7]. In addition, the traditional meanvariance approach considers that returns are i.i.d., which is not realistic as past information may help to explain today’s distribution of returns.
We need a specification that allows for heavy tails and skewness and time dependencies. This calls for a general model with location and scale that are a function of past observations and with innovations that are stable distributed. The locationscale model for the returns is the conditional heteroscedastic autoregressive nonlinear (CHARN), which is fairly general and it nests important models such as ARMA and ARCH.
Estimation of the parameters in a stable framework is not straightforward since the density does not have a closed form (Maximum likelihood is feasible for the i.i.d. univariate case thanks to the STABLE packages developed by John Nolan—see Nolan [8] and the website http://academic2.american.edu/~jpnolan/stable/stable.html. For more complicated cases, including dynamics, maximum likelihood is a quite complex task.). We rely on empirical likelihood, which is one of the nonparametric methods, as it has been already studied in this context [9]. Once the parameters are estimated, we simulate samples of the returns, which are used to compute the Reserve Fund at the endofperiod target, and estimate the optimal portfolio weights by means of minimizing the empirical VaR of the Reserve Fund at time .
Suppose that the vector of returns follows the following CHARN model: where is a vectorvalued measurable function with a parameter and is a positive definite matrixvalued measurable function with a parameter . Each element of the vector of innovations is standardized stable distributed: and ’s are independent with respect to both and . We set , where and .
The stable distribution is often represented by its characteristic function: where is a location parameter, is a scale parameter, is a skewness parameter, and is a characteristic exponent that captures the tail thickness of the distribution: the smaller the the heavier the tail. The distributions with correspond to the Gaussian. The existence of moments is given by : moments of order higher than do not exist, with the case of being an exception, for which all moments exist.
The lack of important moments may, in principle, render estimation by the method of moments difficult. However, instead of matching moments, it is fairly simple to match the theoretical and empirical characteristic function evaluated at a grid of frequencies [9]. Let be the residual of the CHARN model. If the parameters and are the true ones, the residuals should be independently and identically distributed to . So the aim is to find the estimated parameters such that the residuals are i.i.d. and stable distributed, meaning that their probability law is expressed by the above characteristic function. Or, in other words, estimate the parameters by matching the empirical and theoretical characteristic functions and minimizing their distance. Let be the number of frequencies at which we evaluate the characteristic function: . That makes, in principle, a system of matching equations. But since the characteristic function can be split into the real and imaginary parts, , we double the dimension of the system by matching these parts. Let and be the real and imaginary parts of the theoretical characteristic function, and and the empirical counterparts. The estimating functions are for each , and gather them into the vector The number of frequencies and the frequencies themselves are chosen arbitrary. Feuerverger and McDunnough [10] show that the asymptotic variance can be made arbitrarily close to the CramérRao lower bound if the number of frequencies is sufficiently large and the grid is sufficiently fine and extended. Similarly, Yu [11, Section 2.1] argues that, from the viewpoint of the minimum asymptotic variance, many and fine frequencies are the appropriate. However, Carrasco and Florens [12] show that too fine frequencies lead to a singular asymptotic variance matrix and we cannot calculate its inverse.
Given the estimating functions (4.5), the natural estimator is constructed by GMM: where is a weighting matrix defining the metric (its optimal choice is typically the inverse of the covariance matrix of ) and the expectations are replaced by sample moments. GMM estimator can be generalized to the empirical likelihood estimator, which was originally proposed by Owen [13] as nonparametric methods of inference based on a datadriven likelihood ratio function (see also [14], for a review and applications). It produces a better variance estimate in one step, while, in general, the optimal GMM requires a preliminary step and a preliminary estimation of an optimal matrix. We define the empirical likelihood ratio function for as where and the maximum empirical likelihood estimator is Qin and Lawless [15] show that this estimator is consistent, asymptotically Gaussian, and with covariance matrix , where
Once the parameters are estimated, we compute the optimal portfolio weights and the portfolio Reserve Fund at the endofperiod target. Because of a notational conflict, the weights are now denoted by . And, for simplicity, we assume that there is no maintenance cost, so (2.5) simplifies to The procedure to estimate the optimal portfolio weights is as follows.
Step 1. For each asset , we simulate the innovation process based on the maximum empirical likelihood estimator .
Step 2. We calculate the predicted logreturns for and based on the estimators and the simulated obtained in Step 1.
Step 3. For a given portfolio weight , we calculate the predicted values of fund at time , , with (4.10).
Step 4. We repeat Step 1–Step 3 times and save . Then we calculate the proportion that the predicted values fail below the prescribed level , that is,
Step 5. Minimize with respect to : .
4.1. An Illustration
In this section, we apply the above procedure to real financial data. We consider the same monthly logreturns data in Section 3.1. domestic bond (DB), domestic equity (DE), foreign bond (FB), and foreign equity (FE) are assumed to follow the following ARCH(1) model: respectively. Here and . Cash is virtually constant so we assume the logreturn of cash as 0, permanently. Set the present Reserve Fund and the target period is fixed to half years.
Table 2 shows the estimated optimal portfolio weights for the different prescribed level . The weights of domestic and foreign bonds tend to be high when is small. Small implies that we want to avoid the loss. On the contrary, the weights of equities become higher when is larger. Large implies that we do not want to miss the chance of big gain. This result seems to be natural because bonds are lower risk (less volatile) than equities.

5. Conclusions
In this paper, we study the estimation of optimal portfolios for a Reserve Fund with an endofperiod target in two different settings. In the first setting, one assets are split into short memory (bonds) and long memory (equity), and the optimality of the portfolio is based on maximizing the Sharpe ratio. The simulation result shows that the portfolio weight of domestic bonds is quite high. The reason is that the investment term is extremely long (100 years). Because the investment risk for the Reserve Fund is exponentially amplified year by year, the portfolio selection problem for the Reserve Fund is quite sensitive to the yearbased portfolio risk. In the second setting, returns follow a conditional heteroskedasticity autoregressive nonlinear model, and we study when the distribution of the innovation vector is heavytailed stable. Simulation studies show that we prefer the bonds when we want to avoid the big loss in the future. The result seems to be natural because the bonds are less volatile than the equities.
Acknowledgments
This paper was supported by Government Pension Investment Fund (GPIF). The authors thank all the related people of GPIF, especially Dr. Takashi Yamashita.
References
 GPIF Home Page, http://www.gpif.go.jp/eng/index.html.
 A. Olivieri and E. Pitacco, “Solvency requirements for life annuities,” in Proceedings of the Proceedings of the Actuarial Approach for Financial Risks (AFIR '00), Colloquium, 2000. View at: Google Scholar
 C.F. J. Wu, “Jackknife, bootstrap and other resampling methods in regression analysis,” The Annals of Statistics, vol. 14, no. 4, pp. 1261–1350, 1986. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 S. Gonçalves and L. Kilian, “Bootstrapping autoregressions with conditional heteroskedasticity of unknown form,” Journal of Econometrics, vol. 123, no. 1, pp. 89–120, 2004. View at: Publisher Site  Google Scholar
 P. Bühlmann, “Sieve bootstrap for time series,” Bernoulli, vol. 3, no. 2, pp. 123–148, 1997. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 J. Beran, Statistics for LongMemory Processes, vol. 61 of Monographs on Statistics and Applied Probability, Chapman and Hall, New York, NY, USA, 1994.
 T. Shiohama, M Hallin, D. Veredas, and M. Taniguchi, “Dynamic portfolio optimization using generalized conditional heteroskedasticity factor models,” ECARES WP 201030, 2010. View at: Google Scholar
 J. P. Nolan, “Numerical calculation of stable densities and distribution functions,” Communications in Statistics—Stochastic Models, vol. 13, no. 4, pp. 759–774, 1997. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 H. Ogata, “Empirical likelihood estimation for a class of stable processes,” Mimeo, 2009. View at: Google Scholar
 A. Feuerverger and P. McDunnough, “On the efficiency of empirical characteristic function procedures,” Journal of the Royal Statistical Society B, vol. 43, no. 1, pp. 20–27, 1981. View at: Google Scholar  Zentralblatt MATH
 J. Yu, “Empirical characteristic function estimation and its applications,” Econometric Reviews, vol. 23, no. 2, pp. 93–123, 2004. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 M. Carrasco and J. Florens, “Efficient GMM estimation using the empirical characteristic function,” IDEI Working Papers 140, Institut d’Economie Industrielle (IDEI), Toulouse, France, 2002. View at: Google Scholar
 A. B. Owen, “Empirical likelihood ratio confidence intervals for a single functional,” Biometrika, vol. 75, no. 2, pp. 237–249, 1988. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 A. B. Owen, Empirical Likelihood, ChapmanHall/CRC, Boca Raton, Fla, USA, 2001.
 J. Qin and J. Lawless, “Empirical likelihood and general estimating equations,” The Annals of Statistics, vol. 22, no. 1, pp. 300–325, 1994. View at: Publisher Site  Google Scholar  Zentralblatt MATH
Copyright
Copyright © 2012 Hiroshi Shiraishi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.