Abstract

A Value-at-Risk (VaR) forecast may be calculated for the case of a random loss alone and/or of a random loss that depends on another random loss. In both cases, the VaR forecast is obtained by employing its (conditional) probability distribution of loss data, specifically the quantile of loss distribution. In practice, we have an estimative VaR forecast in which the distribution parameter vector is replaced by its estimator. In this paper, the quantile-based estimative VaR forecast for dependent random losses is explored through a simulation approach. It is found that the estimative VaR forecast is more accurate when a copula is employed. Furthermore, the stronger the dependence of a random loss to the target loss, in linear correlation, the larger/smaller the conditional mean/variance. In any dependence measure, generally, stronger and negative dependence gives a higher forecast. When there is a tail dependence, the use of upper and lower tail dependence provides a better forecast instead of the single correlation coefficient.

1. Introduction

A Value-at-Risk (VaR) forecast is crucial as a main reference for banking and insurance industries in assessing their financial risk performance as well as in allocating their capital. The VaR forecast may also be used as an alarm for such industries before they reach the worst risk. It is, therefore, a must for a risk manager to have accurate and reliable forecast of VaR.

Forecasting VaR, in general, requires the quantile of the loss distribution (see, e.g., [14], for review of VaR forecast); later, we name this as the quantile-based VaR forecast which will be described in Section 2 through simulation. In practice, however, we may obtain VaR of loss data by simply taking the quantile of its ordered data (known as the historical simulation method) without knowing its estimate of probability function or density. This approach, however, may not be satisfied and has the possibility of high forecast error. Other approaches include variance-covariance and Monte Carlo methods with some restrictions such as normality distribution assumption and accurate parameter estimate. In fact, there are still efforts by authors to develop the best risk forecast models.

A VaR forecast model may typically be developed in the direction of proposing the best VaR forecast technique along with its accuracy. For example, we may forecast VaR by a technique that substitutes model parameters by their estimates. The resulting VaR forecast, called an estimative VaR forecast, has coverage probability accuracy bounded to . To have the best VaR forecast, one then can modify the calculation to yield the so called improved VaR forecast (e.g., [57]). This improved VaR forecast is more accurate and has coverage property bounded to . An alternative direction of having the best VaR forecast is by incorporating statistical properties of loss distribution in the VaR forecast technique such as high kurtosis, volatility clustering, and heavy-tailed distribution, e.g., [8, 9], or in relation to extreme observations as in [10]. It is hoped that the resulting VaR forecast is not only having the best accuracy but also capturing the empirical moment as well as distributional properties.

In this paper, we aim to explore the VaR forecast in a different direction for a random loss that depends on another random loss, i.e., forecasting VaR for dependent losses. In particular, we are interested in finding the effect of dependence measures, such as unconditional correlation and tail dependence, to the estimative VaR forecast. There have been some works related to the dependence and VaR forecast. For instance, Ku and Wang [11] have used the MGARCH model with dynamic conditional correlation (DCC), instead of constant correlation, in backtesting the VaR forecast through forecast failures and average deviations. Meanwhile, Santos and Alves [10] considered dependence of excesses and duration by using S&P 500 index and found that the linear correlation was weak. In addition, Messaoud and Aloui [12] calculated the VaR and CoVaR forecast based on the Clayton copula which is in fact less than those based on the /normal copula since the Clayton copula has tail correlation. Multivariate copula shows more observations in the tails, a function of correlation.

To begin with, we review, through examples and simulation, the quantile-based VaR forecast for a random loss alone and/or a random loss that depends on another random loss, in Section 2. We visualize such estimative VaR forecasts along with their coverage probability via distribution function and probability function. It is important to note that the use of a copula to obtain bivariate distribution function is unavoidable due to nonnormality or nonidenticality of marginal loss distributions.

In Section 3, we present the conditional estimative VaR forecast that is explicitly formed as the conditional mean and variance; we take the case symmetrical distributions of normal and random samples. The effect of some dependence measures such as unconditional correlation and tail dependence is shown. Furthermore, we describe, in Section 4, dependent random losses for a certain stochastic process, namely, Autoregressive Conditional Heteroscedasticity (ARCH); note that the Generalized ARCH model, known as GARCH, has been also the interest for authors, including [2], in finding the VaR forecast with a copula. For this stochastic process-type dependent random loss case, we treat it as from (i) a single stochastic process and (ii) two dependent stochastic processes. Finally, Section 5 considers the dependent random losses with upper tail dependence coefficient. We show and compare simulated data to find the estimative VaR forecast by involving the tail dependence coefficient in comparison to a single correlation coefficient or two correlation coefficients. To illustrate our proposed method on involving dependence for the VaR forecast, we simulate such calculations for real data of the S&P 500 index and the Dow Jones index; this is described in Section 6. The discussion is given in Section 7.

2. Motivating Examples of Quantile-Based Value-at-Risk Forecast

From a statistical perspective, the Value-at-Risk (VaR) forecast is an application of the concept of (upper) forecasting limit for future observations, given a collection of random variables for losses. Suppose that a random loss has distribution function , where is the distribution parameter. Provided that the inverse of distribution function exists, we obtain VaR of at the level of significance : for all ; for more general random loss, we may use the VaR forecast formula as , see, e.g., [3, 13]. For symmetrical distributions such as normal and , in (1) consists of the mean and variance of . Furthermore, in practice, parameter must be estimated from data. The estimative VaR forecast, or , is obtained by substituting parameter with its estimate as in [5, 6]; the common estimators used are the maximum likelihood and least square. As mentioned early in this section, VaR forecast calculation is actually the (upper) forecasting limit of when the available data are . We may assume that the distribution of is identical to , for . Note that forecasting VaR may be carried out for the -step-ahead forecast.

We illustrate, in Figure 1, the VaR forecast of normal (Figures 1(a)1(d)) and (Figures 1(e)1(h)) distributions. The red dot mark in the figure shows the “position” of the VaR forecast (to -axis) and its corresponding () (to -axis). The first row (Figures 1(a) and 1(e)) is obtained by taking the quantile of such distributions. The next three rows explain the VaR forecast calculated from some approaches of historical simulation (Figures 1(b) and 1(f)), Monte Carlo (Figures 1(c) and 1(g)), and variance-covariance (Figures 1(d) and 1(h)). It may be observed that the VaR forecast obtained from the variance-covariance method has an interesting feature: a lower value than other methods (for normal distribution) and a higher value than others for distribution (see Table 1 for the exact number). The corresponding VaR forecasts calculated from the probability function are shown in Figures 2(a)2(d) for normal distribution and Figures 2(e)2(h) for distribution. The visualization of the probability function-based VaR forecast instead of distribution function may be easier for some readers as the coverage probability () for such VaR forecasts is easy to see in the area under the line (see, for instance, [3]), whilst the sample quantile for normal and distributions along with their bands may be found in [14]. We may conclude that the quantile-based estimative VaR forecasts obtained directly from (normal and ) loss distributions are more accurate than other methods.

The choice of normal and distributions for the VaR forecast illustration above came from the fact that both are symmetrical distribution and location-scale family (thus, they are comparable) with certain features of heavy tail for distribution. The estimative VaR forecast of normal distribution with parameter is where and denote the distribution function and -quantile of the standard normal, respectively. Meanwhile, the VaR forecast for distribution with degrees of freedom , , which has mean and variance , for , is , where denotes distribution function of standard . The class of (i) (a)symmetrical, (ii) heavy-tailed, and/or (iii) extreme distributions in the VaR forecast, typically for return innovation, has also been discussed by authors, e.g., [810, 13].

2.1. The Example of Two Dependent Random Losses

We consider now two random losses where one depends on another random loss. For example, observe bivariate normal distribution simulated data (second and third rows) with several correlation coefficients: 0.1-0.5-0.9 (first-second-third columns) in Figure 3. To compare with, a univariate normal distribution simulated data is given (Figure 3(a)). It is interesting to learn a quite different probability function of univariate and bivariate normal data and perhaps the effect of dependence.

Suppose that two random losses and are dependent on the parameter vector, for their joint distribution function, and dependence measure . We assume that their dependence may be a Pearson linear correlation, a rank correlation, and the coefficient of tail dependence, where the last two are copula-based dependence measures [3]. Our aim is to find the VaR forecast of , given , such that it has conditional coverage probability equal to , i.e., for all ; in other words, the VaR forecast (3) is obtained by calculating the inverse of conditional distribution function of , . It is, therefore, important to derive a joint distribution function of , , in order to obtain conditional distribution function . It is easy to have bivariate normal or distribution. However, when the distributions of and are neither normal nor identical, we may need a copula to obtain such joint distribution function.

We briefly describe a copula as follows. Let and denote the distribution function for and , respectively. Defining and , each has uniform distribution in . According to Sklar’s theorem, a two-dimensional copula is the distribution function , where is the joint distribution function of () (see, e.g., [12, 15]). The copula will be beneficial not only for reducing parameters (particularly in an explicit copula) but also for relaxing dependence models and understanding their dependence more easily [3]. For example, the class of the Archimedean copula, namely, Gumbel and Clayton copulas, are which reduce the parameter vector of bivariate normal () to only one parameter . The Gumbel copula is more appropriate for data with upper tail dependence whilst the Clayton copula represents phenomena of lower tail dependence. Meanwhile, the implicit copulas of Gaussian and are not in the closed form and are represented as and , respectively, where denotes joint distribution function and is the distribution function of standard . In contrast to the Gumbel and Clayton copulas, the Gaussian copula does not have tail dependence. On the other hand, the copula has both upper and lower tail dependence and is suitable for financial return data [2].

In Figure 4, we present the estimative VaR forecast (in the probability function-based visualization) for normal (left column) and (right column) distributed data; in Figures 4(a) and 4(d), such forecasts are computed by using the bivariate normal, , and , , distribution functions. The estimative VaR forecast (in the red dot mark) obtained from Gaussian and copulas (member of implicit copula) is shown in Figures 4(b) and 4(e), respectively; in Figures 4(c) and 4(e), the use of the Clayton copula (which is a member of the Archimedean copula) is presented. As in Table 2 (for the exact number), we find a quite similar result of the estimative VaR forecast computed by bivariate normal () and the corresponding Gaussian () copula.

3. Correlation Dependence and Conditional VaR Forecast: The Case of Symmetrical Distribution

3.1. Normal Distribution

Let and be two dependent random losses assumed normally distributed. The mean and variance vectors are () and (), respectively; thus, the parameter vector is . The conditional distribution of , given , is normal with conditional mean and its corresponding conditional variance where is the Pearson correlation coefficient of linear relationship between and [16]. The estimative VaR forecast of , based on the standard deviation premium principle, at the level of significance is given by where denotes the -quantile of standard normal. When and are independent (), we will obtain the well-known estimative VaR forecast for a normal random loss : , whereas when (there is strong linear dependence between and ), we have the estimative VaR forecast of random loss , given , as just. Note that by taking the first derivative of the estimative VaR forecast with respect to , i.e., and normalize it (equalized to zero), we obtain the value of that maximizes the estimative VaR forecast that is

It is interesting to observe the impact of dependence measure to the behavior of the conditional estimative VaR forecast (see Figure 5). The stronger the dependence of to (in the positive direction), the larger the conditional mean of , given , whereas the large value of may reduce the weight of conditional variance of which means the smaller conditional variance. In comparison to , other dependence measure of is presented, and it is observed that the higher value of (than ) will give a similar estimative VaR forecast, for negative dependence, whilst positive dependence of will give a higher estimative VaR forecast than .

3.2. Distribution

We consider distribution for both random losses of and . For simplicity, we take and the degrees of freedom .

The bivariate probability function of () is defined as for , where [17]. To obtain conditional probability function , given , , we need to divide the above probability function by the probability function of , .

The resulting conditional probability function is where its conditional mean and variance do not have explicit form and thus must be obtained numerically. In Figure 6, the effect of correlation to the estimative VaR forecast is illustrated. As tends to have a positive strong correlation, the estimative VaR forecast is lower. It is typical of heavy-tailed distributed data, commonly in financial data, to reduce the effect on a random loss by another (dependent) random loss.

3.3. Joint Distribution of Normal and

In practice, the two random losses and may have nonidentical (marginal) distributions. For example, may be normally distributed whilst has distribution or vice versa. Thus, the use of a copula is unavoidable to obtain the joint distribution function of such nonidentical losses, and the conditional distribution function can be derived.

Let and be the marginal distribution function of and , respectively. As we mention in Section 2, the joint distribution function of () can be represented, through copula with dependence measure , as where and . Note that the choice of is because of its invariant property under nonlinear monotonic transformation; in other words, it is copula-based dependence measure [3]. From [18], by taking the first derivative of with respect to , we can obtain the conditional distribution function of , given , i.e.,

Here, the function is the conditional distribution function of , given . For Archimedean, Gaussian, and copulas, based on [19], each function is given in Table 3.

Based on the conditional distribution function defined above, the conditional VaR of , given , at a level of significance can be derived from and we obtain

Because the parameter of the copula can be expressed as the function of dependence measure (see the third column of Table 3), it is clear that in (16) consists and so the conditional VaR of .

If is normally distributed whilst has distribution, we get the conditional estimative VaR forecast:

If has distribution whilst is normally distributed, the conditional estimative VaR forecast can be expressed as

For both cases, the effect of dependence to the conditional estimative VaR forecast is illustrated in Figure 7, by using Gumbel, Clayton, Gaussian, and copulas. The stronger the dependence of random loss to others (in negative or positive sign), the lower the conditional estimative VaR forecast. It happens when Clayton, Gaussian, and copulas are used. If we use the Clayton copula, the conditional estimative VaR forecast always decreases as the value of increases.

4. The Case of Risk Model with Volatility: ARCH

In statistical modeling, random losses may be represented as a stochastic process denoted as asset returns: , where is white noise with zero mean and unit variance, e.g., [1]. Suppose that the available data are . Thus, the estimative VaR forecast for , given observations up to time , , is where denotes the one-step-ahead volatility forecast; is the dependence measure between and (with as the parameter vector of an asset return model), for ; and is ()-quantile of distribution of .

Unlike in Section 3, now the dependence is between and in which the literatures show that no or little dependence occurs. Note that the VaR forecast formula (19) does not include conditional mean since this has zero value. As for the volatility process, we consider an ARCH(1) model; its forecast is given by which can be observed as conditional only on the last observation, , since the Markov property applies to the ARCH model. Table 4 and its corresponding Figure 8 show the conditional coverage probability of the estimative VaR forecast.

4.1. Two Dependent Stochastic Processes

We may also consider the case of stochastic processes and which are dependent with certain dependence measures such as Kendall’s , where is the parameter vector of the model. In other words, we aim to find the estimative VaR forecast of , given : with nonzero conditional mean and time-varying conditional variance. For simplicity, we take an ARCH(1) model for both processes. Provided an ARCH(1)-normal model, is normally distributed with mean and variance .

Now, the conditional mean of , given , with dependence measure is given as where (see [20]); its corresponding conditional variance is

Note that the choice of instead of is mainly due to the possibility of nonlinear dependence among two processes. In Figure 9, we observe the coverage probability of the estimative VaR forecast of two ARCH(1) processes. The simulation result shows quite close conditional coverage properties (red line) to the true coverage.

5. Estimative VaR Forecast of Two Losses with Tail Dependence

We have so far consider the calculation of the estimative VaR forecast as an upper prediction limit with a certain level of significance; the parameter of distribution or model is typically estimated by the maximum likelihood (ML) method. The accuracy of such VaR forecast may be computed through coverage probability that takes parameter variability into account (and the ML method is the common approach). Some authors, e.g., [57, 21], have done the effect of parameter variability into coverage probability accuracy for independent observations and heteroscedastic processes. We propose here to observe the effect of dependence measure for the case of two random losses, in particular when they have tail dependence.

To gain more tail dependence effect on VaR forecast computation, we first briefly show the linear correlation behavior for whole data. We have usually assumed that a single correlation coefficient is appropriate to cover dependence behavior for such data. Figure 10 explains several types of data with a single correlation line (that means single correlation coefficient, , 0.5, and −0.8). We, of course, hope that such correlation line will represent whole data, i.e., each observation is close to the line. However, there is possibility that a single correlation line or coefficient may not be appropriate (see Figure 10(b)).

Now, we observe another direction of dependence behavior for the data and thus the effect on calculating the estimative VaR forecast. Specifically, we rely on tail distribution for the dependence among losses; single dependence value but only for some part of data. The proposed idea is motivated by the fact that two random losses may have dependence only in part of the data, say in the tail of its distribution. To illustrate, we describe in Figure 11 in which two losses have used different dependence measures. As before, the key point here is the joint distribution in which we most likely have to rely on a copula instead of the classic joint distribution function; the Clayton copula may capture the lower tail dependence whilst the Gumbel copula shows the distribution to have upper tail dependence.

Suppose that two random losses, and , are dependent with parameter vector . McNeil et al. [3] and Lai et al. [15], for instance, have a formal definition of upper TDC between and as provided that the limit exists. The tail coefficient is ranging from zero to one, i.e., . Note that and are the inverse of distribution function for and , respectively. When the coefficient is zero, i.e., , then we would say that two losses are asymptotically independent.

The upper TDC formula involving the Gumbel copula, for example, will be defined as where denotes the surviving copula and is the parameter of the Gumbel copula. Figure 11 describes Gumbel distributed data against linear correlation coefficient and tail dependence. We show the behavior of linear correlation in “adapting” (to adjust) data behavior. In Figure 11(a), there is only one correlation coefficient for each figure (, top; , bottom), whereas there are two correlation coefficients for each figure in Figure 11(b) to adjust data generated from the Gumbel copula. It may be explained as follows (for example, Figure 11(b), bottom figure): data generated with upper tail dependence coefficient may have similar behavior with composite data with a lower correlation coefficient and upper correlation coefficient ; in other words, these lower and upper correlation coefficients may be more appropriate than a single correlation coefficient.

The estimative VaR forecasts of , given , along with correlation, rank, and upper tail dependence coefficient are shown in Table 5; its backtesting (Table 6) is carried out via Correct-VaR (e.g., see [22]). As the correlation coefficient (and the rank and upper tail dependence coefficients) is higher, so is the estimative VaR forecast for a single random loss. The accuracy, however, may not be accurate. In comparison, the Correct-VaR for dependent random loss is quite accurate and close to . The stronger the dependence of random loss to others (in negative or positive signs), the better the coverage properties of their estimative VaR forecasts.

6. Illustration of VaR Forecast for Real Data

We carry out numerical study on a data set that contains negative daily log-returns of Standard & Poor’s 500 (or S&P 500) and Dow Jones stock market indexes for the time period 2015–2019. The data set is obtained from Yahoo Finance. First, we assume that the negative daily log-returns are i.i.d. observations. The goodness-of-fit of the data can be determined visually in Figure 12. We can see that each data set fits to distribution.

Since the marginal distributions are from the same family (), we consider bivariate for their joint distribution. As comparison, the scatter plot in Figure 13 shows that the copula can also be used because the transformed data have both upper and lower tail dependence. The (conditional) estimative VaR forecasts of the negative daily log-return of the S&P 500 index, given the negative daily log-return of the Dow Jones index, are summarized in Table 7. As we can see, for each level of significance, the use of the copula implies the larger conditional estimative VaR forecast than the forecast through the bivariate distribution.

Now, Figure 14 shows that the negative daily log-return data have volatility clustering and time-varying properties. Hence, we also assume that the data follow a heteroscedastic process; here, the ARCH(1) process is considered. The (conditional) estimative VaR forecasts are summarized in Table 8.

Note that all the (conditional) estimative VaR forecast above have been evaluated through its conditional coverage probability, as shown in the bracket, or Correct-VaR. Such forecast accuracy is computed by the method as in [5, 6, 23]. The best or more appropriate VaR forecast is determined by the closeness of coverage probability with the target value of . The use of the copula, in Table 7, suggests larger VaR forecast with the consequence of less accuracy.

7. Discussion

The estimative VaR may be calculated based on the quantile of its distribution. In addition, the effect of dependence (such as Pearson’s correlation, Kendall’s tau, or tail dependence) occurs for the VaR forecast of two dependent random losses or stochastic process. Furthermore, the use of a copula is significant due to the fact that the classical joint distribution function may not be explicitly formulated. However, it has consequence of reducing forecast accuracy. As for the efficiency of the VaR forecast, it may be applied when we consider several models such as ARCH/GARCH with different assumptions of innovation: normal (N), skewed normal (SN), skewed (ST), generalized error distribution (GED), etc.

Kuan et al. [24] pointed out the expectile-based VaR forecast, say eVaR. This considers the magnitude of losses instead of just the probability of losses. There is a possibility that the measure of dependence is incorporated to the eVaR forecast. It is of course because the tail dependence and corresponding copula play an important role.

Data Availability

S&P 500 index and Dow Jones index data have been used.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

We are grateful for financial support of “HIbah Berbasis Kompetensi” from the Ministry of Research, Technology and Higher Education of the Republic of Indonesia, 2017-2018. We are indebted to Arief Hakim for providing joint distribution of normal and and VaR forecast calculation of real data.