Abstract

This paper proposes an innovative semiparametric nonlinear fuzzy-EGARCH-ANN model to solve the problem of accurate modeling for forecasting stock market volatility. This model has been developed by a combination of the FIS, ANN, and EGARCH models. Because the proposed model is highly nonlinear and gradient-based parameter estimation methods might not give global optimal parameters for highly nonlinear models, the study has decided to use evolutionary algorithms instead. In particular, a differential evolution (DE) algorithm is suggested to solve the parameter estimation problem of the proposed model. After this, the semiparametric nonlinear fuzzy-EGARCH-ANN model has been developed mathematically from the three models mentioned before, and the study has simulated data by it. After the simulation, parameter estimation of the proposed model using a differential evolution algorithm on the simulated data is done. Finally, it is seen that the proposed model is good in capturing the volatility clustering and leverage effects of highly nonlinear and complicated financial time series data that were overlooked by the EGARCH model.

1. Introduction

Modeling and forecasting volatility of the stock market have gained great attention from researchers in academia as well as in financial markets due to its wide range of applications [1]. Investment decisions in financial markets powerfully depend on the forecast of expected returns and volatility of the assets. Volatility delivers a measure of fluctuation in a financial security price around its expected value [1]. Expected market return is correlated to predictable stock market volatility [1]. Because of the usefulness of volatility prediction, a lot of time series models have been established for financial data with time-varying variance. But all these models are still deficient in several aspects [2]. An important task in applied research is to decide which of the many possible volatility models one should employ in any given situation. The work by [3, 4] attempts to provide some guidance in this respect by evaluating the modeling ability of various statistical procedures. Unfortunately, their results suggest that none of the popular employed models are able to adequately capture asymmetric effects and volatility clustering, at least for the particular stock market indices and time periods [2].

Even though GARCH family models play a successful role in modeling and forecasting volatility, they fail to capture the asymmetric volatility [5]. This is because the volatility of stock prices is significantly affected by negative shocks to returns rather than positive shocks. Though many financial time series observations have a nonlinear dependence structure, a linear correlation structure is usually assumed among the time series data [1]. However, conventional time series models produce forecasts based on some strict statistical assumptions about data distributions, and, therefore, they are not very appropriate to forecast financial data sets [6]. Also, due to their complexities, nonlinear models are in very limited use today. Considering these difficulties, there is currently a demand for more flexible models.

In the evolution of time series models, researchers have made wide efforts to take advantage of artificial intelligence to process a broad amount of information and to forecast financial markets leading to an increase in the investment return. To increase the predictive power of the financial time series models, the traditional time series models are hybridized with the neural networks for volatility prediction of different benchmark financial data sets [79]. The attempt for the hybrid system is to outperform the forecast results and overcome the shortcomings by extracting input variables from statistical methods and include them in the ANN learning process. Regardless of the high ability to deal with the problem of volatility forecasting, ANN drawbacks include its “black box” nature, more computational burden, proneness to overfitting, and the empirical nature of model development. Again, to overcome the drawbacks of neural network models and to tackle the uncertainties that exist in precise forecasting of stock market volatility, several fuzzy time series models have been evolved [1].

The hybrid fuzzy time series models proposed in references [10, 11] have shown important enhancements in forecasting stock market volatility, outperforming the traditional time series models, neural networks, other hybrid models, etc. Adaptive neurofuzzy information system (ANFIS) is a different popular hybrid model used in volatility forecasting [1215]. The ANFIS model gives the advantage of combining the rules in the rule base of fuzzy theory to describe the complex relationships between the variables and the learning capability of the neural network to amend the membership functions and the rule base [1]. Furthermore, the ANFIS model offers a superior approach of using the low-level learning, the computational power of the neural network, and the high-level human-like thinking and reasoning of the fuzzy system [1].

Although there have been numerous studies on volatility model evaluation for stock markets, no optimal model exists to provide an accurate forecast of volatility indices [5]. Thus, the motivation of this study is to increase the ability of the time series model in forecasting return volatility by integrating it with a suitable machine learning approach [1]. Therefore, the proposal of numerous hybrid time series models seems like a potential device for volatility modeling and forecasting. Hence, the study has developed a new model by combining the EGARCH model with the ANN and FIS to handle the issue of not simulating stock fluctuations with volatility clustering and asymmetric effects overlooked by the EGARCH model.

The paper is organized into seven sections. Not together from the introduction in Section 1, GARCH family models are presented in Section 2. In Section 3, detail of the EGARCH-ANN model is given. In Section 4, a new fuzzy-EGARCH-ANN model is presented to handle the stochastic nature of stock market volatility. A differential evolution approach is given in Section 5 to update the weights of the fuzzy-EGARCH-ANN model, and Section 6 provides the simulation study of the stock market volatility using the newly proposed model. Concluding remarks are given in Section 7.

2. GARCH Family Models

Among a number of time series models, the GARCH models proposed by Bollerslev appear to be the most successful and popular form for modeling and forecasting the conditional variance of the return of volatility [16, 17]. The GARCH (p, q) model considers the current conditional variance dependent on the past conditional variances as well as the past squared innovations [18]. Let be the daily closing stock price at time and be the continuously compounded rate of stock returns from time to . Now, unpredictable component of is given by [19]. The GARCH (p, q) model can be written as [5]where , and is the white noise with and . In this model, is the conditional variance of . In addition to the nonnegativity of the parameters here, there is parameter restrictionto ensure the positivity of the conditional variance.

GARCH models’ simplicity and ability to capture persistence of volatility explain its empirical and theoretical appeal [5]. However, despite the success of the GARCH model, it fails to capture the asymmetric volatility. This limitation has been managed somehow by introducing more flexible volatility treatments by accommodating the asymmetric responses of volatility to positive and negative shocks. Two simple classes of models that capture the asymmetric behavior of returns are the GJR-GRACH model proposed by Glosten and the exponential GARCH (EGARCH) introduced by Nelson [5]. The GJR-GARCH model allows the conditional variance to respond differently to past negative and positive innovations. The GJR-GARCH (p, q) model is given by [1]where else .

At this point, the variable distinguishes negative shocks from positive shocks, so that asymmetric effects on the data are captured by , , , and parameters with the constrains, and

The exponential GARCH (EGARCH) model provides an alternative asymmetric model by considering the leverage effects of a price change on the conditional variance. That means a large price decline can have a bigger impact on volatility than a large price increase. The EGARCH (p, q) model can be represented aswhere is a sequence of i.i.d random variables, , and , with no parameter restrictions. Finally, the use of the log form allows the parameters to be negative without the conditional variance becoming negative [1].

Although the GJR-GARCH and EGARCH models include the asymmetric responses of volatility to positive and negative shocks, they do not simulate stock fluctuations with volatility clustering accurately. This fact can lead to poor adequacy and forecasting ability. Therefore, the proposal of several hybrid time series models appears as a potential tool for volatility modeling and forecasting [1]. In this paper, the basic idea is to propose a unified framework of a computing model that adopts the merits of both ANN and FIS with an efficient statistical time series model like EGARCH.

3. EGARCH-ANN Model

As mentioned in the previous section, ANFIS is also a hybrid time series model for volatility prediction. ANNs have been broadly applied to answer many hard problems in different areas, including pattern recognition, signal processing, and language learning, as mentioned in [20, 21]. Since the pioneer work of White [22], there have also been abundant applications of ANN in economics and finance [1]. Unfortunately, the ANN literature is not easy to enter, so it is hard for applied economists to understand why ANN works and how it can be applied appropriately [22, 23].

As in [24], the study prefers to consider a network with three layers called an input layer, a hidden layer, and an output layer. The input layer contains input nodes, and the output layer contains output nodes in such a way that each nodes of the input layer corresponds to a particular input variable, and each nodes of the output layer corresponds to a particular output variable, respectively. There are total of hidden nodes in the hidden layer which are connected to all input and output nodes. The strengths of such connections are assigned by parameters which are unknown and are called the network connection weights. In particular, represents the vector of the connection weights between the hidden node and all input nodes, and represents the vector of the connection weights between the output node and all hidden nodes.

Formally, let us say that the input vector is . The input nodes receives the information and send to all hidden nodes, weighted by the connection weights between the input and hidden nodes. The information is then transformed from each input nodes by the activation function to each hidden node. That is, the hidden node receives and transforms it to where is the bias term in the hidden node. The information generated by all hidden nodes is further passed to the output nodes, again weighted by the corresponding connection weights, and transformed by the activation function in each output node. Hence, the output node receives and transforms it into the network output:where is the bias term in the output node and the output is used to describe or predict the behavior of the target . As far as model specifications are concerned, the building blocks of an ANN model are the activation functions and . Different choices of the activation functions result in different network models. But, this study has chosen the most common activation functions. That is, and . In addition to this, the study has considered only the output layer of one hidden unit.

From the given lessons learned from the existing literature and the above preliminary concepts [19, 2427], the hybrid model from the EGARCH model and ANN is given by EGARCH-ANN (, q, m, s):wherein which is the asymmetric function of and , where and are the in-sample mean and the in-sample variance, respectively; both and the elements of are going to be generated by a uniform random number generator, and they lie between −1 and 1 [2, 5]. After having the values of , q, m, and s in hand, the estimation method for EGARCH-ANN (, q, m, s) can be MLE based on the preferred distribution of . This EGARCH-ANN hybrid model can handle the leverage which is overlooked by the EGARCH model by the help of ANN’s universal approximation nature [2]. That is, because the ANN component can add the ability to handle the nonlinear relationship of the past return innovations and its future volatility to the EGARCH model [28]. In other words, this hybrid model is good in modeling and forecasting volatility than that of the EGARCH model because the ANN component helps it to handle the leverage effect stylized fact of volatility overlooked by the EGARCH model [1, 2].

In the same way, one among many means of combining GARCH-type models with FIS to build a hybrid model is the fuzzy-GARCH-type hybrid models [20]. That is, just by applying FIS on GARCH-type models. In other words, it is just building fuzzy-GARCH-type hybrid models by applying a fuzzy inference system on GARCH-type models [28]. For the same reason mentioned before, now the study wants to focus only on the fuzzy-EGARCH hybrid model among other fuzzy-GARCH-type hybrid models [1]. In this model, the fuzzy component gives the hybrid model the ability to simulate stock price variations with volatility clustering. And also, in this model, leverage effects of a price change on volatility are captured by the EGARCH model.

4. Proposed Fuzzy-EGARCH-ANN Model

From the given lessons learned from the existing literature and some other preliminary concepts of FIS [2932], the study has developed a hybrid model from FIS, and the EGARCH-ANN model by the same methodology of R. Dash and D. K. Dash [1] has been applied to build the fuzzy-EGARCH model. In a similar fashion to the formation of the fuzzy-EGARCH hybrid model, the study has considered the fuzzy approach as it provides the capability to simulate stock fluctuations with volatility clustering, whereas the asymmetry and leverage effects of price change on the conditional variance is captured by the EGARCH-ANN model. This fuzzy-EGARCH-ANN model is described by a collection of fuzzy rules in the form of If-Then statements in order to describe the stock fluctuations with volatility clustering overlooked by the EGARCH-ANN model via fuzzy rules and the asymmetric responses of volatility to positive and negative shocks via an EGARCH-ANN model.

The rule of the fuzzy system for the EGARCH-ANN model is described as follows.

: if is and is , thenwherein which is the input vector with at instance , is the fuzzy set to describe the stock market return and volatility for , is the number of rules, and is the premise variable. In addition, the study has assumed that the distribution of residual series follows either the Gaussian normal or Student’s t-distribution. That is, if it is not following the Gaussian normal distribution, then it follows the Student’s t-distribution directly.

In this fuzzy-EGARCH-ANN hybrid model, the premise variables include both the previous value of the stock market return and the previous volatility of the stock market return. The then part clearly expresses a local input and output relation by considering the EGARCH-ANN model parameters. Like in the previous EGARCH-ANN hybrid model, this fuzzy-EGARCH-ANN hybrid model is not only an asymmetric function of but also an asymmetric function of . Wang and Mendel [33] stated that a fuzzy system with the Gaussian membership function has been shown to be a universal approximation of any nonlinear function on a compact set.

The final output of this fuzzy-EGARCH-ANN model is the weighted average output of each individual rule output and is obtained by using FIS fundamental steps [1] as follows.

Step 1 (fuzzification layer). Using the Gaussian membership function, the grade of membership of the input in is found as follows:where is the center and is the spread of rule membership function corresponding to the premise variable.

Step 2 (firing strength layer). The firing strength of the rule by assuming the product T-norm of the antecedent fuzzy sets is found as

Step 3 (normalization layer). Finding the normalized firing strengths, i.e., the ratio of the rule’s firing strength to the sum of all rule’s firing strengths:

Step 4 (consequent and defuzzification layer). Combining the normalized firing strengths and the consequent corresponding rule to produce the model output as the weighted average of each individual rule:where is the output of the rule. That is,This implies thatThe collection of the R rules assembles the fuzzy-EGARCH-ANN model as a combination of R local models. Every local model contributes a value to the overall output of the fuzzy-EGARCH-ANN model that is proportional to the normalized degree of the firing strength of each rule. As a result, the exponential of this weighted average value gives the predicted stock market return volatility using the newly proposed model.
Finally, Figure 1 explains the general framework of the proposed model. Figure 2 explains the architecture of the proposed model for p = 1, q = 1, m = 1, s = 1, and R = 3. Moreover, for are the EGARCH-ANN local models.In order to select the optimum value of , , , and for the fuzzy-EGARCH-ANN model, first the study obtains five different randomly produced sets of (connection weight between the input and hidden layers including the bias term). This is what makes the model a semiparametric model. Then, for each of these five random sets and for each for , it estimates the parameters of the EGARCH-ANN model using the maximum likelihood estimation method [2]. As a result, one can have now an optimal value for , and of EGARCH-ANN [2]. The grid more encompasses the usual range of models employed in the literature since for each model and index, the data always select a specification well within the inferior of the search grid, and the study believes that this grid search does not unduly constrain the parameter space [2].
The number of fuzzy rules and its respective cluster center and spreads are going to be determined from the data sample using the subtractive clustering (SC) algorithm, proposed by Chiu in 1994 [34]. SC uses the data points as candidate prototype cluster centers. The capability of a point to be a cluster center is evaluated through its potential, a measure of the spatial proximity between a particular point and all other data points [35]:where denotes the potential of the data point, is a positive constant which defines the zone of influence of the cluster, and N is the number of data points. The value of the potential is higher for a data point that is surrounded by a large number of close data points. Therefore, it is reasonable to establish such a point to be the center of a cluster. The potential of all other data points is reduced by an amount proportional to the potential of the chosen point and inversely proportional to the distance to this center. The next center is found also as the data point with the highest (after this subtraction) potential. The procedure is repeated until the potential of all data points is reduced below a certain threshold [35].
After having the optimum values for , and in hand, the parameters of the fuzzy-EGARCH-ANN model can be estimated by minimizing a mean-squared error criterion, which is reasonable to obtain success [25]. Therefore, the objective function of this optimization is defined aswhere is the parameter vector of the fuzzy-EGARCH-ANN hybrid model, represents the realized volatility of the stock market (actual) data sample for i = 1, 2, …, N, N is the number of data points, and is the volatility of the stock market data estimated by the fuzzy-EGARCH-ANN model.

5. Fuzzy-EGARCH-ANN Model Parameter Estimation with DE

The design of the fuzzy-EGARCH-ANN model includes the determination of the unknown parameters, namely, the parameters of the consequent parts of the fuzzy rules by minimizing the prediction error. Now, the corresponding optimization problem can be written aswhere D = R (1 + 2p + q + s). Such an optimization problem is usually called an unconstrained optimization problem. It is unconstrained because the parameter space of the EGARCH model is unconstrained. That is, the EGARCH model does not need restriction of its parameters unlike other GARCH-type models. This is one of the advantages of the EGARCH model give us, and that is why the study has selected it. Clearly, the objective function is a highly nonlinear function of X. Simulation analysis showed that conventional gradient search methodologies produce poor estimates or even are not capable to find the global minimum of E(X) [1]. Therefore, to estimate parameters the fuzzy-EGARCH-ANN model without suffering from local optimum, the study has proposed a differential evolution (DE) algorithm.

Differential evolution (DE) is a population-based stochastic function optimizer, which uses a rather greedy and less stochastic approach for problem solving in comparison to classical evolutionary algorithms, such as genetic algorithms, evolutionary programming, and PSO [36]. DE combines simple arithmetical operators with the classical operators of recombination, mutation, and selection to evolve from a randomly generated starting population to a final solution. DE also incorporates an efficient way of the self-adapting mutation using small populations. It is able to reproduce the same results consistently over many trials, unlike PSO which is more dependent on the randomized initialization of individuals. DE has the advantage of not depending on the choice of initial values since any point in the parameter space has no null probability to be chosen [36].

The optimization process is conducted by means of three main operations: mutation, crossover, and selection. In each generation, individuals of the current population become target vectors. For each target vector, the mutation operation produces a mutant vector, by adding the weighted difference between two randomly chosen vectors to a third vector. The crossover operation generates a new vector, called a trial vector, by mixing the parameters of the mutant vector with those of the target vector. If the trial vector obtains a better fitness value than the target vector, then the trial vector replaces the target vector in the next generation.

A number of mutation strategies like rand1, rand2, best1, best2, and current to best are available in the literature [1]. In this application, we have taken the current to best mutation strategy. Like other evolutionary algorithms, the performance of DE algorithms depends on the settings of the control parameters: scaling factor F and crossover rate CR. As the optimal settings of these parameters are problem dependent, it is often necessary to tune the control parameters in order to achieve the desired results [37]. In this paper, instead of using a fixed crossover rate CR and scaling factor F, the JADE [38] approach has been used to update these iteratively. Initially, the crossover rate and the scaling factor associated with each individual are generated according to a normal (Cauchy) distribution with means [38]. At the end of each generation, the values of and are updated according to the pair that resulted in the generation of the successful trial vector in that generation [38]. As the search progresses, and should gradually approach the optimal values for the given problem [38]. Furthermore, step-by-step self-explanatory summary of the algorithm that the study has used is given in Table 1. For your understanding, the software used throughout this study is MATLAB R2018a.

6. Simulation Study

In this section, first of all, the study has simulated data by the fuzzy-EGARCH-ANN model with two different error distribution types, namely, the Gaussian and the Student’s t-distribution. Then, the study has estimated back the parameters of the fuzzy-EGARCH-ANN model with the corresponding identified error distribution types. That is, the study has simulated data for a certain given parameters (including the connection weights of the ANN component) with a given return shocks distributions in the next two Sections 6.1 and 6.2.

6.1. Return Innovation with Gaussian Distribution

For , , , , sample size = 1200, Gaussian distribution type, and the parameters given in Table 2, the study has simulated data by the fuzzy-EGARCH-ANN model. As a result, the obtained basic statistical properties of these data are given in Table 3.

Furthermore, Figures 37 are the corresponding figures of the simulated stock price series, return series, true volatility series, residual series, and histogram of the simulated residual series, respectively.

Now, by using the first 1000 (that is, for T = 1000) simulated data points, the study has obtained the following results. For p and q in the grid search specified in the previous section, all of EGARCH (, q) models are not fit at all. The study has identified that the residual series follows the Gaussian normal distribution. Then, by using this distribution type, the study has obtained that the model is the best-fitted model to the selected data. Furthermore, its fitness summary is presented in Table 4.

Now, to proceed to the parameter estimation of the fuzzy-EGARCH-ANN model, only the number of rules (R) and the realized volatility are remaining to be determined. For our case, as mentioned earlier, R is going to be determined by the subtractive clustering algorithm (SCA), and the realized volatility at time is the square root of the square of residual series at time . As a result, the study has obtained R = 3, and the graph of the realized volatility is given in Figure 8.

Therefore, for , , , and , by JADE with archive, the estimated parameter vector is given in Table 5. In addition, the corresponding mean-squared forecast error (MSFE) value is .

Figure 9 presents a summary and analysis of the MSFE that has been obtained by JADE with archive during the parameter estimation of the proposed model.

Finally, Figures 10 and 11 present the summary of estimated volatility by the proposed model for T = 1000 and validation for T = 1200, respectively. Here, the validation data set is the last 200 data points.

Note. The mean-squared forecast error (MSFE) for this validation is 4.7030e04, which is clearly less than that of the mean-squared forecast error (MSFE) obtained during the parameter estimation time. Therefore, the model has fitted the data very well. As a result, the study has obtained the forecast values for different periods as given in Table 6.

As one can see from the above forecasted values of volatility, the newly proposed model fuzzy-EGARCH-ANN has given us almost nearly the same values to the data simulated by itself. In the next section, the same thing with the same procedure was done except that Student’s t-distribution is the distribution of the return innovation series. Moreover, in this section and the next section, the realized conditional variance series is the square root of the square of the residual series.

6.2. Return Innovation with Student’s t-Distribution

For , , , , sample size , the Student’s t-distribution type, degree of freedom , and the parameters given in Table 7, the study has simulated data by the fuzzy-EGARCH-ANN model. As a result, the obtained basic statistical properties of these data are given in Table 8.

Furthermore, Figures 1216 are the corresponding figures of the simulated stock price series, return series, true volatility series, residual series, and histogram of the simulated residual series, respectively.

Now, by using the first 800 (that is, for T = 800) simulated data points, the study has obtained the following results. Since the study has been identified as the residual series which follows Student’s t-distribution, then by using this distribution type, the model is obtained as the best-fitted model to the selected data. Moreover, its fitness summary is presented in Table 9.

To proceed to the parameter estimation of the fuzzy-EGARCH-ANN model, only number of rules (R) and the realized volatility are remaining to be determined. For our case, as mentioned earlier, R is going to be determined by the subtractive clustering algorithm (SCA), and the realized volatility at time is the square root of the square of residual series at time . R = 3 is obtained, and the graph of the realized volatility is given in Figure 17.

Therefore, for , , , and , by JADE with archive , the estimated parameter vector is given in Table 10. In addition, the corresponding mean-squared forecast errors (MSFE) value is .

Figure 18 presents a summary and analysis of the MSFE obtained by JADE with archive during the parameter estimation of the proposed model.

Finally, Figures 19 and 20 present a summary of estimated volatility by the proposed model for T = 800 and validation for T = 1000, respectively. Here, the validation data set is the last 200 data points.

Note. The mean-squared forecast error (MSFE) for this validation is 2.225849991650711e04, which is very near to that of the mean-squared forecast error (MSFE) obtained during the parameter estimation time. Therefore, the model has fitted the data very well. As a result, the forecasted values have been obtained for different periods as given in Table 11.

Here also, as one can see from the above forecasted values of volatility, the newly proposed model fuzzy-EGARCH-ANN has given us almost nearly the same values to the data simulated by itself. In the next paper, we the authors of this research article are going to publish the model calibration of the proposed model on some well-known financial time series data.

7. Conclusions

From the previous section, as one can see from the above simulation study results, the proposed semiparametric nonlinear fuzzy-EGARCH-ANN model has fitted the simulated data very well. Clearly, one can see from Table 6 that the proposed model is good to forecast only one ahead forecast. This problem is the inherited problem from the EGARCH model. That is, the EGARCH model is good only for one ahead volatility forecasting. Therefore, the proposed model is very good in both capturing the leverage effect that was overlooked by the EGARCH model and volatility clustering that was overlooked by the EGARCH model. Hence, for both cases of return innovation distributions types, the proposed model is good only for one ahead period volatility forecasting.

Data Availability

The simulated financial time series data (Gaussian distribution-type simulated data and Student’s t-distribution-type simulated data) used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Methodology was done by Geleta Mohammed and Jane Aduda and Ananda Kube conceptualized and supervised the study. Finally, all the authors have read and approved the final manuscript.

Acknowledgments

Geleta T. Mohammed has paid the article processing charge from his monthly stipend.