Research Article | Open Access
Xinyu Wang, Kegui Chen, Xueping Tan, "Forecasting the Direction of Short-Term Crude Oil Price Changes with Genetic-Fuzzy Information Distribution", Mathematical Problems in Engineering, vol. 2018, Article ID 3868923, 12 pages, 2018. https://doi.org/10.1155/2018/3868923
Forecasting the Direction of Short-Term Crude Oil Price Changes with Genetic-Fuzzy Information Distribution
This paper proposes a novel approach to the directional forecasting problem of short-term oil price changes. In this approach, the short-term oil price series is associated with incomplete fuzzy information, and a new fused genetic-fuzzy information distribution method is developed to process such a fuzzy incomplete information set; then a feasible coding method of multidimensional information controlling points is adopted to fit genetic-fuzzy information distribution to time series forecasting. Using the crude oil spot prices of West Texas Intermediate (WTI) and Brent as sample data, the empirical analysis results demonstrate that the novel fused genetic-fuzzy information distribution method statistically outperforms the benchmark of logistic regression model in prediction accuracy. The results indicate that this new approach is effective in direction accuracy.
It is well documented that the oil price has strong connection with the business cycle, macroeconomics variables, global economic conditions, and policy uncertainty [1–3]. In addition, oil price contributes a crucial risk factor to explain cross-sectional asset prices in equity market and derivative market . Therefore, government and financial institutions such as US Energy Information Administration, Bank of Canada, and Deutsche Bank regularly publish the short-term forecasts of the crude oil price; and oil forecasting price has attracted a growing large number of literature among economists (see ), statistics (see [3, 6–8]), econometricians (see [9–11]), engineers (see [12–15]), policy makers (see ), and market players (see ).
The oil price, or equivalently, the return of oil price can be decomposed as a product of two components: the direction of price change (or the sign of log return) and change magnitude (the absolute value of the return). While the forecasting models of variance have been widely developed in statistics and econometrics like Generalized AutoRegressive Conditional Heteroskedasticity (GARCH) models, however, the magnitude of directional change is less understood and less developed in literature except for a few remarkable exceptions. Several methods have been utilized in literature for forecasting of oil price; for example, Ghaffari developed a soft computing approach to predict the daily variation of the West Texas Intermediate (WTI) crude oil price, and adopted the direction prediction accuracy ratio to prove its effectiveness (see ). Murat investigated whether there was a causal relationship between the crack spread futures and the spot oil markets in a vector error correction framework and found that the crack spread futures could be a good predictor of oil price movements (see ). Shin applied semisupervised learning to forecast the crude oil price prediction of WTI from January 1992 to June 2008 (see ). Tang et al.  proposed an ensemble learning paradigm coupling complementary ensemble empirical mode decomposition (CEEMD) and extended extreme learning machine (EELM) to enhance the prediction accuracy for crude oil price, and indicated the model’s superiority. Yu et al.  built a decomposition-and-ensemble forecasting model with extended extreme learning machine for crude oil price prediction and showed high accuracy, time saving, and robustness. Wang et al.  used a Markov switching multifractal (MSM) volatility model to forecast crude oil return volatility. Similar to stock market, options market and short-term load forecasting problem (see [21, 22]), the world crude oil market is also an important commodity trading market with a large number of investors participating in trading. There is considerable amount of literature devoted to forecasting crude oil price, such as nonlinear models (see [23–25]), econometric models (see [9–11]), and directional forecasting models (see [26–31]).
Several other researchers have analyzed the directional forecasting problem in other contexts of economics. The directional accuracy as a reasonable utility-based measure of forecasting performance was also advocated by Engel and Hamilton . In equity market, Pierdzioch et al.  found that an asymmetric loss function often (but not always) makes forecasts look rational. Christoffersen and Diebold  found the asset return signs and asset return volatilities were closely interrelated, and volatility dependence was entirely consistent with the direction of the market. And lastly, Merino and Albacete  constructed a congruent econometric model with financial and fundamental variables and analyzed the relative weight of the variables to explain the short-term oil price forecast. Thomakos and Wang  studied the directional predictions of financial returns and the probability of returns exceeding a threshold. Ahn et al.  employed an intelligent approach for directional forecasting in the options market, and showed the reasonably strong performance with empirical study. Rulke and Pierdzioch  used ROC-techniques (receiver operating characteristic) to study the directional accuracy of forecasts with respect to directional changes of exchange rates. This paper focuses on the short-term oil price change directional forecasting as an application of the fuzzy information distribution theory.
Through the literature review, most of the studies originated from modeling the oil price series as a complete information structure, and less studies have perceived the fuzziness of oil price information. This paper contributes to literature by presenting a new approach to forecast the directional change of oil price. In our approach, the crude oil price series is associated with an incomplete and inaccurate date set due to possible missing samples or noise information. To deal with incomplete data, a fuzzy framework for incomplete data is developed to explore the true hidden data generation process. Huang (see [37–39]) gave an introduction to fuzzy information distribution method. This paper provides some important extensions for methodical perfection especially for directional forecasting. The fuzzy information distribution method emphasizes that there exists certain transition trend of price series from incompleteness to completeness, and that each sample point has the trend to be evolved into multiple points; thus, the oil price time series corresponds to an incomplete fuzzy information set. Each oil price sample should not be regarded as an independent point. Instead, it should be regarded as the fuzzy information that has certain influence area around it but with variable degrees. Since the long-term oil price distribution and fluctuation patterns may have high instability or structural change, it is intended to implement fuzzy modeling, fuzzy inference and fuzzy pattern recognition on the fluctuation of the short-term oil price series and to reveal the intrinsic nonlinearity of short-term oil price series.
In this paper, a new fused genetic-fuzzy information distribution method is proposed to predict the direction of the short-term crude oil price changes in a fuzzy incomplete information setting. The adjustable weighted sum of the reciprocal of directional accuracy and root of mean square error (RMSE) are set as the fitness function to identify the solution with the least RMSE under the same directional accuracy. The numbers of information controlling points and lagged order of returns in fuzzy information distribution are optimized by the presented genetic algorithm. A coding algorithm of multidimensional information controlling points are further introduced to fit genetic-fuzzy information distribution to time series forecasting, with the crude oil spot prices of WTI and Brent as sample data, an empirical analysis on the oil price changes are adopted for following the presented approach. We select logistic regression as a benchmark model to compare the directional forecasting accuracy.
The rest of the paper is organized as follows. In Section 2, the new genetic-fuzzy information distribution model is proposed and analyzed; the empirical analysis is presented in Section 3. Finally the conclusions are given in Section 4.
2. A Genetic-Fuzzy Information Distribution Model
The fuzzy information distribution theory has been considerably successful in processing the fuzziness of information, especially when observed information is incomplete or inaccurate. To understand its essential characteristics, the genetic-fuzzy information distribution model is explained next in details.
In fuzzy information distribution method, the objective is to construct a fuzzy inference from to , where contains a set of independent variables and is a dependent variable. The range of each variable is discretized into some information controlling points, so the space of is discretized into a multidimensional grid. Each sample supplies a unit of information. Because of the characteristics of incomplete samples, each sample has the latent trend to be evolved into other unobserved neighbour samples. Therefore, an information distribution function is adopted to distribute one unit of sample information over its neighbour information controlling points. Eventually, the information of all samples is rearranged over the crossing points in the multidimensional grid using multidimensional information distribution function. In the end, the obtained information distribution structure in the multidimensional grid reflects the empirical knowledge learned from samples and to form a fuzzy inference mechanism based on the empirical knowledge. However, as far as the authors are concerned, obtaining the optimal empirical knowledge is not resolved.
For the purpose of directional forecasting, the fuzzy information distribution theory is extended in several important theoretical aspects. (1) The fuzzy information distribution is fused and a new fuzzy forecasting model is developed, a genetic algorithm by using the weighted sum of the reciprocal of direction accuracy and root of mean square error (RMSE) are developed as the fitness function in the genetic algorithm. The role of genetic algorithm is to search the “optimal parameters” for enhancing the quality of fuzzy reasoning, which has been missed in the field of fuzzy information distribution theory and its applications. (2) It demonstrates that there is no sample information loss throughout the multi-dimension linear information process. (3) In order to fit fuzzy information distribution to the oil price time series analysis, a coding algorithm of multidimensional information controlling points is adopted, which can maintain the temporal structure of time series data particularly for large sample applications.
2.1. One-Dimensional Linear Information Distribution
Let denote a set of observations with -dimension input indexes and one-dimension output index . Let be the universe of the th input index, and be the universe of , that is, .
Without loss of generality, the output variable is taken as an example to introduce one-dimensional linear information distribution. Let , where represents a discrete universe and consists of the important nodes of the output index . These nodes are called information controlling points, which can be characterized according to expert experiences, while a simple approach is to choose information controlling points in the universe of equidistance. According to the transition property of sample points, the information distribution process is a fuzzy partition process of sample information on .
Let , , and , and the rest of information controlling points are chosen in equidistantly, that is,
For any one sample , each information controlling point leads to an information gain by an one-dimensional linear information distribution function,where is called step length of controlling points.
Similarly, for input variables, let , be the discrete universe of with equal step length, and is the number of information controlling points of . Then the one-dimension linear information distribution of the input index iswhere is also evenly spaced.
2.2. Multidimensional Information Distribution Matrix
A multidimensional sample, , offers one unit of information. The multidimensional linear distribution functions are used to distribute it into the multidimensional information controlling points, which is denoted as .
The multidimensional linear distribution function is equal to the product of all one-dimensional linear distribution functions:as long as , ; otherwise, .
In practical application of fuzzy information distribution, lots of multidimensional information controlling points like , are generated. The information gains on all multidimensional information control points, obtained from each sample , should be accumulated. After doing it, the information structure stored in the multidimensional information controlling points is one kind of empirical knowledge learned from samples, and serves as the basis of fuzzy inference. Therefore, a valid encoding rule is provided to identify all multidimensional information controlling points next.
To maintain the temporal structure of time series data, a coding method with digit number in base is proposed to record the multidimensional information controlling points, in which . That is to say, the multidimensional information controlling point is coded as , a digit number in base . In practical computation process, need to be converted into a decimal number:
By using this method, dynamic non-repetitive coding according to the number of information controlling points of each index are conveniently realized. The coding method enables us to keep the logical temporal structure of time series data, and therefore, facilitate the application of fuzzy information distribution in time series analysis.
Through multidimensional information distribution, all information controlling points with dimensions are generated, but it cannot be directly used to make fuzzy inference. In order to clearly express the two-dimensional information conversion relationship from the inputs to the output, the coding method is adopted to integrate all inputs into one new input. More specifically, let , , the multidimensional information controlling point set can be encoded as
Finally, the two-dimensional information distribution matrix from to is formed under all observed samples, which is written as
As shown by the next result, the whole information gains on all multidimensional information controlling points from sample information are preserved; in other words, there is no sample information loss through the multidimensional linear information process.
2.3. The Fuzzy Relationship Matrix
The distribution matrix is the basis and starting point of fuzzy inference. Two types of fuzzy relationship matrixes can be generated from . The first one is to construct a fuzzy relationship matrix based on fuzzy concept. According to the theory of the factor space proposed by Huang  and Jun and Kang , when is a general factor space, an element corresponds to a fuzzy concept. Supposing that is a set of state space with fuzzy concept, any element in is a fuzzy concept. can be regarded as ’s domain; hence the fuzzy membership of about can be constructed from , yielding a fuzzy relation matrix , that is,
Alternatively, the set-valued statistics and conditional falling shadow formula based the falling shadow theory are used by Jun and Kang , to generate another fuzzy relation matrix . For discrete , the conditional falling shadows of on are obtained,
The fuzzy relation matrix is
Both these two approaches will be used in our model below. We now move on to discuss fuzzy inference process and use it to deal with incomplete information data set.
2.4. Fuzzy Inference
A fuzzy inference process is, in essence, a fuzzy transform from an input fuzzy set defined on to an output fuzzy set defined on . Given a fuzzy relation matrix , can be obtained by fuzzy operating like,
Suppose that is the input vector of a new sample; our aim is to infer out the possible output. The sample information on the space of are distributed, and then the information gains on controlling points are equivalent to the fuzzy membership of on multidimensional information controlling points, which constitutes the fuzzy set on ,
For fuzzy inference on in last section, the classical max-min inference method are applied,
For inference for , we know that is formed by the conditional falling shadow formula, so the inference using total falling shadow formula is proper; we obtain
Moreover, two defuzzification methods are adopted to transform the fuzzy set into clear forecast values. The first one is the maximum membership principle, and we set the information controlling point with the maximum membership degree as the final recognition result,
Another one is the weighted average of the information controlling points with the membership degree as weights, that is,
At last, to simplify notations of four fuzzy inference and defuzzification methods stated in (13)-(16), the following notations are used: is inference with the defuzzification of maximum membership degree; is inference with the defuzzification of weighted average, is inference with the defuzzification of maximum membership degree; is inference with the defuzzification of weighted average. In empirical applications, the forecasting accuracy usually relies on the proper fuzzy inference and defuzzification method, which is described in next section.
2.5. Parameters Estimation
2.5.1. Error Evaluation
In fuzzy information distribution process, the key optimization parameters are: the numbers of input indexes , the number of input information controlling points and the number of output information controlling points . How to select parameter appropriately affects the prediction precision.
The genetic algorithm is applied to optimize the parameter combinations. For this purpose, and are defined as the in-sample directional prediction accuracy with the maximum membership degree inference for and . Let and be the out-of-sample directional prediction accuracy with the maximum membership degree inference for and , respectively. are the out-of-sample directional prediction accuracy with the weighted average inference for and . Similarly, and denote the out-of-sample root mean squared errors (RMSE) with the maximum membership degree inference for and respectively; are the out-of-sample RMSE with the weighted average inference for and .
Since the directional change of the crude oil spot price is investigated, the logarithmic returns are used as , where is the close prices at day . Let , be the forecasted returns and the out-of-sample predicted direction accuracy isEvidently, mean that and have the same sign. The prediction error of RMSE is defined by:
2.5.2. Fitness Function in Genetic Algorithm
In the fuzzy information distribution theory, the numbers of information controlling points of input and output indexes are the vital parameters of information distribution model, which directly affect the precision of the model. To reduce the number of estimated parameter, we consider a special case where each input index has the same number of controlling points, i.e., . The lagged orders of explanatory variables in time series data applications are also an important parameter to be optimized. Generally, the input index number , the number of input information points , and the number of output information point are three core parameters; however, there exist no literatures to study the trade-off for the best inference. Then, a fusion approach of combining fuzzy information distribution with genetic algorithm is proposed to obtain the optimal parameters. Two fuzzy relation matrixes and and two defuzzification modes comprise four fuzzy inferences modes. In the construction of the fitness function of genetic algorithm, the reciprocal of out-of-sample directional accuracy and out-of-sample RMSEs are synthesized under four inference methods by weighted average method,where , is the weight parameter to reflect a preference between directional accuracy and RMSE. One advantage of the fitness function is to integrate the predicted effects of four fuzzy inference modes, and the optimal parameters smooth out the differences of four predicted effects.
2.5.3. Pareto Solutions and Forecasting
There are three steps in finding Pareto solution.
First, an empirical weight set such as are constructed, which reflects the latent preference of decision-makers. Then the optimal solution set, written as are derived, by a genetic algorithm.
Second, for each at given , in order to distinguish the best fuzzy inference and defuzzification method from , w the fitness values are decomposed into four predicted combinations of directional accuracy and RMSEs, so predicted combinations of directional accuracy and RMSEs denoting as PCDAR= are obtained. Because of its nonlinearity feature of the fitness function, some combinations in PCDAR might not be Pareto optimal towards two goals of the direction accuracy and RMSE; then, we have to find out the Pareto optimal ones in step three below.
Third, given an element , if there is no which is better than in both directional accuracy and RMSE, then A is a pareto optimal case. According to the above definition of Pareto optimization, we screen out the bad combinations from PCDAR, and identify the Pareto ones inside:
Clearly, each combination in corresponds to a special parameter setting including the weight , the fuzzy inference and defuzzification method and the optimal . Therefore, when decision-makers determine the final element from with their preference towards directional accuracy and RMSE, the corresponding parameter setting is also made simultaneously, including , , fuzzy inference, and defuzzification method. Next, the final parameter setting is used to perform forecasting of new samples.
3. Empirical Analysis
We obtained crude oil daily spot prices from US Department of Energy: Energy Information Administration, unit: Dollars per Barrel. Brent crude oil spot prices (Brent-Europe) and West Texas Intermediate crude oil spot prices (WTI-Cushing, Oklahoma) are extracted from Nov. 13, 2017, to Sep. 28, 2018, with a total of 220 samples. The first 200 samples are used for modeling, while the latter 20 samples are for testing the prediction accuracy. The descriptive statistics of samples are shown in Table 1. The statistics of skewness and kurtosis clearly show that return distributions are not normal, leading to some difficulties in classical econometric modeling. However, as a nonparametric approach, the genetic-fuzzy information distribution method needs not consider such a constraint of return distribution.
3.1. Data Preprocessing
To apply the method of fuzzy information distribution, the first step is to determine the number of input indexes and the number of information controlling points of each index. As is known, the crude oil market assimilates and reacts to the new information with a time delay. The time length of the delay process depends on the maturity degree of crude oil market. So in the model the lagged returns are set as the input indexes (explanatory variables) and the returns of the following day as the output index (dependent variables). Matlab codes are written to carry out the fuzzy inference, and the genetic optimization is done by the GA(x) function in Matlab genetic algorithm toolbox, allowing for integer optimization. In crude oil spot market, the historical short-term price information in about one week is of particularly importance to affect the current oil price; and it also indicate that a larger does not necessarily enhance the performance substantially. Therefore the initial range of is set to be .
The intervals of information controlling points are set as 1-1.1 times the range of sample return, and it can be dynamically adjusted after future oil price data accumulation. In order to simplify the calculation, the information controlling points with equal steps are used. However, if there is enough information, a more flexible approach can be taken to arrange information controlling points in line with experts’ knowledge, such as placing more information controlling points over the interested range of return. Another rule is that the number of controlling points should be appropriate. We aim to determine the numbers of controlling points objectively by a genetic optimization algorithm. The initial ranges of and are both set to be a wide range of .
3.2. Fuzzy Inference and Forecasting
3.2.1. Generate Alternative Solutions
In order to achieve the best prediction effect, several weights are chosen in the interval with a step of 0.05 and then apply the genetic optimization program to determine the optimal parameters under each group of weights. Integer programming with genetic algorithm involves special creation, crossover, and mutation functions that enforce variables to be integers. Taking as an example, the evolving fitness values are shown in Figure 1. The estimated results for Brent and WTI crude oil spot price series are shown in Table 2.
is the number of input variables and are the numbers of input (output) information points. are the in-sample directional prediction accuracy with the maximum membership degree inference for . and are the out-of-sample directional prediction accuracy for with the maximum membership degree inference and the weighted average inference, respectively. and are the out-of-sample roots of mean square errors for with the maximum membership degree inference and the weighted average inference, respectively. and are the means of out-of-sample directional accuracy and RMSE.
3.2.2. Pareto Solutions
The parameter optimization problem under directional accuracy and RMSE is similar to a multi-objective programming. Because of the complexity of fitness function of fuzzy information model, however, the linear weighted average of two objectives method does not necessarily produce pareto solutions. Since the fitness function of fuzzy information model is the synthesis of four fuzzy inference modes, and each solution includes four sub-solutions, there are 84 sub-solutions in Table 2. All Pareto solutions are identified and the corresponding fuzzy inference methods are explained in Table 3, where RMSE and D are out-of-sample forecasting error and directional accuracy.
The bold estimators are optimal.
The solutions among pareto optimal solutions with much preference to directional accuracy are characterized. For Brent crude oil price series, is the final solution of by inference. For WTI crude oil price series, is the final solution of by inference. To present spatial structure of inference knowledge, three-dimensional stereograms of information distribution matrix at the final solutions are plotted in Figure 2.
(a) Brent crude oil prices
(b) WTI crude oil prices
As shown in Table 2, the fuzzy information distribution approach gives a high in-sample forecasted directional accuracy over 0.8. Additionally, under the optimal parameters for Brent and WTI crude oil price series, the out-of-sample one-step-ahead forecasted returns are calculated and then converted into the out-of-sample one-step-ahead forecasted prices. The oil spot prices and forecasting prices are shown in Figure 3.
(a) Brent crude oil prices
(b) WTI crude oil prices
Clearly, the in-sample predictions are better than the out-of-sample ones. The out-of-sample forecasted values and the direction consistency comparison are presented in Table 4. Besides reaching a good directional accuracy, the forecasted prices are also very close to the actual values, because the proper structure of fitness function in (19) can supply an efficient optimization rules to reduce the RMSEs.
Y represents the consistent forecasting results and N is for inconsistent cases.
By fuzzy inference, a fuzzy set is defined on the universe of crude oil return by (11). Therefore, the membership function of fuzzy set does provide a fuzzy probability distribution about oil returns. For example, the out-of-sample probability distributions of WTI oil returns are plotted in Figure 4. Although the forecasted fuzzy possibility curve is not very smooth, it supplies useful information about the uncertainty of oil prices.
3.2.4. Directional Forecasting Comparison
In order to test the effectiveness of the model in directional forecasting, the classical logistic regression and random walk with drift are taken as two benchmark models to compare the directional prediction accuracy. The results are promising.
First, we discuss logistic regression model. The number of lagged input indexes is the only one parameter for logistic regression model. The identified results of logistic regression model are shown in Table 5.
The bold estimators are optimal. is the directional accuracy, TN is true negative ratio, and TP is true positive ratio. The ROC (receiver operating characteristic) curve plots the true positive rate as a function of false positive rate for differing classification thresholds. AUC is used to measure the prediction performance that is the area under the ROC curve.
According to the principle of maximum and maximum AUC in logistic regression, the optimal lagged order for Brent crude oil price series is , and the one for WTI crude oil price series is . For Brent crude oil prices, the optimal directional accuracy (D) of the genetic-fuzzy information distribution model is 0.75 that is higher than the optimal directional accuracy of the logistic regression (D=0.7). For WTI crude oil prices, the optimal directional accuracy (D) of the genetic-fuzzy information distribution model is 0.75 that is also higher than that of the classical logistic regression (D=0.7). Because the logistic regression model can only predict the direction of price changes, the proposed genetic-fuzzy information distribution approach has obvious appealing of forecasting the direction, the magnitude and the fuzzy possibility distribution of price changes overall.
Second, we assume that the stock price obeys a geometric random walk process with a dynamic drift as follows:where represents a dynamic drift that is used to measure a trend in the price. is a white noise. The model is equivalent to . The drift equals the moving average of previous returns. The forecasting error RMSE and directional accuracy D are presented in Table 6. The optimal directional accuracy values for Brent oil prices and WTI oil prices are 0.55 and 0.50 respectively, but they are also lower than the corresponding directional accuracy values from genetic-fuzzy information distribution approach, which are 0.75 and 0.75 for Brent oil prices and WTI oil prices, respectively, in Table 3. Further, we find that genetic-fuzzy information distribution approach for WTI oil prices can reach a high precision with RMSE, D equal to 0.0147,0.75 in Table 3, which has less RMSE and higher D than the optimal result of random walk with RMSE, D equal to 0.0157,0.5 in Table 6.
The bold estimators are optimal.
In this paper, a new approach is proposed to enhance the directional forecast accuracy of short-term oil price. It is distinct from previous studies that the crude oil price series is viewed as an incomplete data set with inaccurate fuzzy information and then it is modeled by fuzzy information distribution theory. A new fused approach of genetic-fuzzy information distribution is adopted to forecast the direction of oil price changes. The genetic algorithm is used to optimize the assignment of fuzzy information controlling points, and a coding algorithm of multidimensional information controlling points is applied to make genetic-fuzzy information distribution approach feasible in its time series applications. With the crude oil spot prices of WTI and Brent as sample data, the empirical analysis demonstrates that the novel approach offers high accuracy on the directional forecasting of short-term oil prices, compared with the logistic regression and random walk.
The genetic-fuzzy information distribution can distribute the information of the high level oil price into two neighbour information points; thus it is a new method to construct the oil price distribution structure and then form a fuzzy knowledge-based inference to predict oil prices. It particularly helps to describe the true oil price behaviour hidden in oil price bubbles when oil prices present an unsustainable fast rise. Besides, we demonstrate an effective time series prediction in this paper, but it is worthy of selecting economic or political factors as input variables to enhance the prediction ability and performing the medium and long-term oil price forecasting in future.
The data used to support the findings of this study can be found on the website of US Department of Energy: Energy Information Administration, which is free for academic use.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
The research described in this paper was substantially supported by the National Natural Science Foundation of China under Grant no. 71871215; the Humanity and Social Science Youth Foundation of Ministry of Education of China under Grant no. 17YJC630012; the Postgraduate Research & Practice Innovation Program of Jiangsu Province under Grant no. KYCX17_1501.
- W. Q. Tang, L. B. Wu, and Z. X. Zhang, “Oil price shocks and their short- and long-term effects on the Chinese economy,” Energy Economics, vol. 32, no. 1, pp. S3–S14, 2009.
- J. D. Hamilton, “Nonlinearities and the macroeconomic effects of oil prices,” Macroeconomic Dynamics, vol. 15, no. 3, pp. 364–378, 2011.
- S. Bekiros, R. Gupta, and A. Paccagnini, “Oil price forecastability and economic uncertainty,” Economics Letters, vol. 132, pp. 125–128, 2015.
- S. B. Ramos and H. Veiga, “Risk factors in oil and gas industry returns: International evidence,” Energy Economics, vol. 33, no. 3, pp. 525–542, 2011.
- A. Merino and R. Albacete, “Econometric modelling for short-term oil price forecasting,” OPEC Energy Review, vol. 34, no. 1, pp. 25–41, 2010.
- C. Baumeister and L. Kilian, “Forecasting the real price of oil in a changing world: a forecast combination approach,” Journal of Business & Economic Statistics, vol. 33, no. 3, pp. 338–351, 2015.
- S. Reitz, J. C. Rülke, and G. Stadtmann, “Regressive oil price expectations toward more fundamental values of the oil price,” Jahrbücher für Nationalökonomie und Statistik, vol. 230, no. 4, pp. 454–466, 2010.
- H. Chiroma, S. Abdulkareem, and T. Herawan, “Evolutionary Neural Network model for West Texas Intermediate crude oil price prediction,” Applied Energy, vol. 142, pp. 266–273, 2015.
- T. A. Knetsch, “Forecasting the price of crude oil via convenience yield predictions,” Journal of Forecasting, vol. 26, no. 7, pp. 527–549, 2007.
- X. Zhang, K. K. Lai, and S.-Y. Wang, “A new approach for crude oil price analysis based on empirical mode decomposition,” Energy Economics, vol. 30, no. 3, pp. 905–918, 2008.
- C. Baumeister and L. Kilian, “Real-time forecasts of the real price of oil,” Journal of Business and Economic Statistics, vol. 30, no. 2, pp. 326–336, 2012.
- Y. Fan, Q. Liang, and Y.-M. Wei, “A generalized pattern matching approach for multi-step prediction of crude oil price,” Energy Economics, vol. 30, no. 3, pp. 889–904, 2008.
- L. Yu, S. Wang, and K. K. Lai, “Forecasting crude oil price with an EMD-based neural network ensemble learning paradigm,” Energy Economics, vol. 30, no. 5, pp. 2623–2635, 2008.
- A. Ghaffari and S. Zare, “A novel algorithm for prediction of crude oil price variation based on soft computing,” Energy Economics, vol. 31, no. 4, pp. 531–536, 2009.
- R. Jammazi and C. Aloui, “Crude oil price forecasting: experimental evidence from wavelet decomposition and neural network modeling,” Energy Economics, vol. 34, no. 3, pp. 828–841, 2012.
- Y. Wang, C. Wu, and L. Yang, “Forecasting crude oil market volatility: A Markov switching multifractal volatility approach,” International Journal of Forecasting, vol. 32, no. 1, pp. 1–9, 2016.
- A. Murat and E. Tokat, “Forecasting oil price movements with crack spread futures,” Energy Economics, vol. 31, no. 1, pp. 85–90, 2009.
- H. Shin, T. Hou, K. Park, C.-K. Park, and S. Choi, “Prediction of movement direction in crude oil prices based on semi-supervised learning,” Decision Support Systems, vol. 55, no. 1, pp. 348–358, 2013.
- L. Tang, W. Dai, L. Yu, and S. Wang, “A novel CEEMD-based eelm ensemble learning paradigm for crude oil price forecasting,” International Journal of Information Technology & Decision Making, vol. 14, no. 1, pp. 141–169, 2015.
- L. Yu, W. Dai, and L. Tang, “A novel decomposition ensemble model with extended extreme learning machine for crude oil price forecasting,” Engineering Applications of Artificial Intelligence, vol. 47, pp. 110–121, 2015.
- A. Kavousi-Fard, T. Niknam, and M. Golmaryami, “Short term load forecasting of distribution systems by a new hybrid modified FA-backpropagation method,” Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, vol. 26, no. 1, pp. 517–522, 2014.
- F. A. Oliveira, C. N. Nobre, and L. E. Zárate, “Applying Artificial Neural Networks to prediction of stock price and improvement of the directional prediction index - Case study of PETR4, Petrobras, Brazil,” Expert Systems with Applications, vol. 40, no. 18, pp. 7596–7606, 2013.
- H. Liu and D.-L. Zhang, “Analysis and prediction of hazard risks caused by tropical cyclones in Southern China with fuzzy mathematical and grey models,” Applied Mathematical Modelling: Simulation and Computation for Engineering and Environmental Systems, vol. 36, no. 2, pp. 626–637, 2012.
- S. Reitz, J.-C. Rülke, and G. Stadtmann, “Nonlinear expectations in speculative markets - Evidence from the ECB survey of professional forecasters,” Journal of Economic Dynamics and Control (JEDC), vol. 36, no. 9, pp. 1349–1363, 2012.
- A. Azadeh, M. Moghaddam, M. Khakzad, and V. Ebrahimipour, “A flexible neural network-fuzzy mathematical programming algorithm for improvement of oil price estimation and forecasting,” Computers & Industrial Engineering, vol. 62, no. 2, pp. 421–430, 2012.
- P. F. Christoffersen and F. X. Diebold, “Financial asset returns, direction-of-change forecasting, and volatility dynamics,” Management Science, vol. 52, no. 8, pp. 1273–1287, 2006.
- D. Senol and M. Ozturan, “Stock Price Direction Prediction Using Artificial Neural Network Approach: The Case of Turkey,” Journal of Artificial Intelligence Research, vol. 1, no. 2, pp. 70–77, 2010.
- Y. Kara, M. Acar Boyacioglu, and Ö. K. Baykan, “Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange,” Expert Systems with Applications, vol. 38, no. 5, pp. 5311–5319, 2011.
- A. Skabar, “Direction-of-change financial time series forecasting using a similarity-based classification model,” Journal of Forecasting, vol. 32, no. 5, pp. 409–422, 2013.
- Y. Yuan, “Forecasting the movement direction of exchange rate with polynomial smooth support vector machine,” Mathematical and Computer Modelling, vol. 57, no. 3-4, pp. 932–944, 2013.
- C. Baumeister, P. Guérin, and L. Kilian, “Do high-frequency financial data help forecast oil prices? The MIDAS touch at work,” International Journal of Forecasting, vol. 31, no. 2, pp. 238–252, 2013.
- C. Engel and J. D. Hamilton, “Long swings in the dollar: are they in the data and do markets know it,” American Economic Review, vol. 80, pp. 689–713, 1990.
- C. Pierdzioch, J.-C. Rülke, and G. Stadtmann, “Oil price forecasting under asymmetric loss,” Applied Economics, vol. 45, no. 17, pp. 2371–2379, 2013.
- D. D. Thomakos and T. Wang, “‘Optimal’ probabilistic and directional predictions of financial returns,” Journal of Empirical Finance, vol. 17, no. 1, pp. 102–119, 2010.
- J. J. Ahn, D. H. Kim, K. J. Oh, and T. Y. Kim, “Applying option Greeks to directional forecasting of implied volatility in the options market: An intelligent approach,” Expert Systems with Applications, vol. 39, no. 10, pp. 9315–9322, 2012.
- J. C. Rülke and C. Pierdzioch, “On the directional accuracy of forecasts of emerging market exchange rates,” International Review of Economics & Finance, vol. 38, pp. 369–376, 2015.
- C. Huang, “Information diffusion techniques and small-sample problem,” International Journal of Information Technology & Decision Making, vol. 1, no. 2, pp. 229–249, 2002.
- C. Huang and H. Inoue, “Soft risk maps of natural disasters and their applications to decision-making,” Information Sciences, vol. 177, no. 7, pp. 1583–1592, 2007.
- D.-F. Li, “A fuzzy closeness approach to fuzzy multi-attribute decision making,” Fuzzy Optimization and Decision Making, vol. 6, no. 3, pp. 237–254, 2007.
- Y. B. Jun and M. S. Kang, “Fuzzy positive implicative ideals of BCK-algebras based on the theory of falling shadows,” Computers & Mathematics with Applications, vol. 61, no. 1, pp. 62–67, 2011.
Copyright © 2018 Xinyu Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.