Mathematical Tools of Soft ComputingView this Special Issue
Prediction of Currency Volume Issued in Taiwan Using a Hybrid Artificial Neural Network and Multiple Regression Approach
Because the volume of currency issued by a country always affects its interest rate, price index, income levels, and many other important macroeconomic variables, the prediction of currency volume issued has attracted considerable attention in recent years. In contrast to the typical single-stage forecast model, this study proposes a hybrid forecasting approach to predict the volume of currency issued in Taiwan. The proposed hybrid models consist of artificial neural network (ANN) and multiple regression (MR) components. The MR component of the hybrid models is established for a selection of fewer explanatory variables, wherein the selected variables are of higher importance. The ANN component is then designed to generate forecasts based on those important explanatory variables. Subsequently, the model is used to analyze a real dataset of Taiwan's currency from 1996 to 2011 and twenty associated explanatory variables. The prediction results reveal that the proposed hybrid scheme exhibits superior forecasting performance for predicting the volume of currency issued in Taiwan.
The Central Bank of Taiwan is responsible for planning and producing the currency of the country. The volume of currency issued is primarily determined by the demand of the public. It is associated with the economic growth rate, seasonal factors, and the development of noncash payments. Because the issue of currency is very important from an economic point of view, the prediction of currency volume issued has become an important research topic [1–3].
In contrast to the error correction model (ECM) for predicting the volume of currency issued [4–7], this study applies two types of forecasting techniques to predict the volume of currency issued in Taiwan. The first technique is the single-stage forecasting modeling method, which includes the autoregressive integrated moving average (ARIMA), the multiple regression (MR), and the artificial neural networks (ANNs). ARIMA is a suitable modeling technique for making currency predictions because seasonal effects will be significantly involved [8, 9]. The ARIMA models are wellknown and flexible because they can represent different types of time series, such as pure autoregressive (AR), pure moving average (MA) and the combined AR and MA (ARMA) series. In addition, ARIMA models are widely used in forecasting many practical phenomena, such as product sales, food prices, and stock prices, among others [10–15]. However, the assumptions inherent to the linear form of ARIMA models often present difficulties in capturing the nonlinear pattern of the real data [10, 16, 17]. The MR model is also a widely used forecasting technique for many practical applications [15, 18–20]. This model has often been used to forecast based on known marketing variables and macroeconomic measures . However, MR modeling has been criticized for its strong modeling assumptions, such as variation homogeneity, and thus, its application has been limited. In contrast, the ANN modeling is data driven because it contains fewer a priori assumptions. Accordingly, the ANN is another alternative modeling scheme for predicting currency because the ANN allows to model nonlinearity and provides good forecasting characteristics [16, 17, 22, 23]. As a result, the ANN has been reported to exhibit better forecasting capability than the regression technique [22–28]. However, ANN is criticized for its long training process in designing the optimal network topology and because it is difficult to identify the relative importance of potential input variables [25, 26, 29, 30].
The second proposed technique is the two-stage hybrid modeling scheme. The general concept of using a hybrid scheme is to capture different patterns in the data by taking advantage of each individual model’s capability. The research findings indicated that the hybrid modeling is superior for improving the performance of each individual model [13, 16, 31–36]. Since both MR and ANN are very suitable for modeling the currency issued, this study considers a combination of MR and ANN as the proposed hybrid model. Because a large number of input variables for the ANN modeling may not be appropriate , the proposed MR technique is able to select a lower number of explanatory variables that have a greater importance. In the second stage of the hybrid modeling scheme, the important variables then serve as inputs for the proposed ANN prediction model.
This hybrid model is then used to analyze a real monthly dataset containing one response variable (i.e., Taiwan’s currency) and twenty associated explanatory variables obtained from January 1996 to December 2011. The real dataset makes it possible to compare predictions about Taiwan’s currency using the single-stage models and two-stage hybrid models. This study applies the first 14 years of data to build the forecasting models and then performs a confirmation test using the last two years of data. The rest of the study is organized in the following manner. Various forecasting methodologies are discussed in Section 2. The development and design of the single- and two-stage hybrid models are presented in Section 3. Practical data regarding the volume of currency issued in Taiwan are used to verify the single-stage models and the proposed hybrid models. The final section discusses the research findings and conclusions inferred from this study.
2. The Methodologies
In this study, we employ single-stage forecasting techniques, ARIMA, MR, and ANN, as well as a two-stage hybrid technique, MR-ANN, to predict the volume of currency issued. Additionally, to compare the performance of the different performance models, a real Taiwanese currency dataset is analyzed. The dataset consists of 192 records. Each sample record consists of 20 variables that are summarized in Table 1 (i.e., please see http://www.cbc.gov.tw/mp1.html for more details and descriptions about the dataset).
2.1. ARIMA Modeling
The time series data can be simply defined as observations made in a sequential order. Because seasonal effects are involved in the prediction of the currency issued, time series forecasting techniques should be used. Box and Jenkins [8, 9] have developed a well-known approach, ARIMA, for the prediction of a time series data. The ARIMA technique has proven viable for a wide variety of applications, ranging from economics and finance to traffic control and engineering.
A general ARIMA model can be described as follows: where is an unknown constant, are the working series values as a function of time , which are stationary after fitting a suitable transformation from the original time series , are the values of nonseasonal and seasonal transformations, respectively, is white noise at time , which independent and identical (iid) with normal distribution, and are the order (parameters) of autoregressive (AR) and moving average (MA) models, respectively, is the backward shift operator, defined as , and are the number of months in a year, and for the monthly data. is a polynomial function for a nonseasonal AR model, defined as , is a polynomial function for a seasonal AR model, defined as , is a polynomial function for a nonseasonal MA model, defined as , and is A polynomial function for a seasonal MA model, defined as .
Typically, the original time series (i.e., ) values are transformed into a stationary working series (i.e., ) to fit the ARIMA models. The transformation is usually performed using four combinations of and ; that is, , and . Once the stationary working series have been obtained, we can apply the sample autocorrelation function (SAF) and sample partial autocorrelation function (SPAF) to determine the order of , and for the seasonal ARIMA models. After performing a diagnostic assessment for the parameters and residuals, we obtain the forecasting models. In this study, the prediction capability of the models is compared using three criteria, including mean absolute percentage error (MAPE), root mean square error (RMSE), and mean absolute difference (MAD). These prediction measurements are defined as follows: where stands for the value of the residual at time .
2.2. ANN Modeling
In recent years, ANN has been widely applied in engineering, education, social science, medical research, business, and forecasting. A neural network is a massively parallel system comprised of highly interconnected, interacting processing elements based on neurobiological models . Due to its associated memory characteristic and its generalization capability, ANN has been increasingly utilized for modeling nonstationary processes [35, 37–42].
ANN can be classified into two categories: feedforward and feedback networks . The nodes in the ANN can be divided into three layers: the input, the output, and one or more hidden layers. The output of each neuron in the input layer is the same as the input to that neuron. For each neuron in the hidden layer and neuron in the output layer, the net inputs are given by where is a neuron in the previous layer, is the output of node and is the connection weight from neuron , to neuron . The neuron outputs are given by where is the input signal from the external source to the node in the input layer and is the bias. The transformation function shown in (4) is called a sigmoid function. Because this is one of the most commonly utilized functions, it is applied in this study.
The generalized delta rule is the conventional technique used to derive the connection weights of the feedforward network . Initially, a set of random numbers is assigned to the connection weights. Then, to determine the pattern with a target output vector , the sum of the minimized squared error is given by where is the number of output nodes.
2.3. Multiple Regression Modeling
Regression analysis is one of the most used statistical methods in modeling real-world applications. The modeling process involves setting up the relationships between one dependent (or response) variable and several independent (or explanatory) variables. The performance of the regression models is typically acceptable as long as the assumptions have been met. However, the assumptions of the regression model (e.g., variation homogeneity) often confine its application.
The general MR model can be represented as follows: where are referred to as model parameters and is a random variable called the error term. The error term accounts for the variability in that cannot be explained by the linear effect of the explanatory variables. In general, there are four assumptions about the error term in the MR model, including the following:(1) the is a normally distributed random variable;(2)the is a random variable with a mean value of zero; that is, ;(3)the variance of is denoted by and is the same for all values of the explanatory variables ;(4)the values of are independent.
Because collinearity among independent variables will lead to imprecise estimates and serious stability problems, the collinearity diagnosis procedure should be performed first before screening significant independent variables. Some well-known criteria such as the variance inflation factor (VIF) or tolerance can be applied to examine collinearity. The VIF is defined as follows: where is the coefficient of determination of a regression that evaluates all other independent variables. The tolerance is defined as the reciprocal of the VIF. It has been suggested that when the value of VIF is greater than 10, the sample set may have enough variation to suggest serious multicollinearity. Although several methodologies can be used to overcome the problems of collinearity, this study used the method in which one or several explanatory variables could be dropped from the model in order to lessen the collinearity and thus reduce the standard errors of the estimated regression coefficients of the explanatory variables remaining in the model. In addition to the simplicity and effectiveness, this method has another advantage of reducing the numbers of explanatory variables. This feature is quite suitable for hybrid modeling since it typically captures less explanatory variables for the initial stage of modeling.
In addition, when a large number of explanatory variables are involved in the MR design, a great amount of computation is required for examining a large volume of computer outputs, most of which is associated with poor MR models. As a consequence, three variable selection procedures are employed in this study. Those three selections include forward selection, backward elimination, and stepwise regression procedures. Given a dataset with twenty explanatory variables, this study uses those three selection procedures to determine the explanatory variables that lead to the best model. The selection procedures are iterative, wherein a single explanatory variable is added or deleted at each step of the procedure, and the new model is evaluated. The criterion for selecting an explanatory variable is based on statistics. The significant selected explanatory variables then serve as inputs to the ANN for the development of a two-stage, MR-ANN, hybrid model.
3. Modeling Results and Analysis
After performing various modeling techniques to predict the Taiwan’s currency, the results are reported and discussed in this section.
3.1. ARIMA Modeling Results
For the ARIMA modeling, this study divides the currency data into two groups. The first group contains 168 samples used for the design of the model, and the second group contains 24 samples used for the confirmation of the model. Figure 1 displays the original time plot for the 144 currency observations. The time series of these 144 observations is not stationary, and a different transformation must be performed.
After performing the identification, estimation and diagnostic assessment steps using the SAS package, we obtain the parameter estimates provided in Table 2. Accordingly, we use the following ARIMA model (i.e., (8)) to predict the currency issued in Taiwan: Additionally, Table 3 provides the autocorrelation test for residuals for the ARIMA model. It indicates that the ARIMA model in (8) is adequate.
3.2. ANN Modeling Results
The purpose of using ANN is to predict the currency issued in Taiwan. The structure of the ANN is described as follows. It has been reported that more than 75% of neural networks applications use the backpropagation neural network (BPNN) structure. Thus, this study uses the BPNN in designing the ANN forecasting model [37–40]. For the ANN developed herein, this study utilizes 20 input nodes and one output node. The hidden nodes were set to , where is the number of input variables. Thus, the following hidden nodes are chosen: 18, 19, 20, 21 and 22. In this study, the training data and testing datasets include 168 and 24 data vectors for every possible parameter setting. Since the learning rate of 0.01 is a very effective setting [35, 40], this study sets the values of the learning setting as 0.01 for the ANN modeling. In addition, since MAPE is one of the most important performance measurements for the forecasting capability, this study uses the smallest MAPE as the criterion for selecting the ANN topology.
After performing the ANN modeling, we found that the topology with a learning rate of 0.01 provides the best results and a minimum testing MAPE. Here, stands for the number of neurons in the input layer, the number of neurons in the hidden layer, and the number of neurons in the output layer, respectively. Table 4 lists the corresponding MAPE values for various settings of the ANN topologies. Accordingly, the ANN topology of with a learning rate of 0.01 is chosen for the model of ANN alone.
3.3. MR Modeling Results
This study considers currency issued (i.e., ) as the dependent variable and the twenty related economic variables (i.e., to ) as the explanatory variables. To exclude variables with high collinearity, the Pearson correlation coefficients between variables are used, and the results are provided in Table 5. When the correlation coefficient between variables and is greater than 0.7, we exclude the variable that has a lower relationship with (i.e., exclude the variable with a smaller correlation coefficient ). After discarding variables with high collinearity, seven explanatory variables, , , , , , , and , remain. The associated VIF result is provided in Table 6. Additionally, this model is simply referred to as the MRVIF, and it is described as follows: After using VIF to perform the parameter selection, this study used the typical statistical hypothesis tests to obtain the significant variables in the model. Accordingly, this study deleted the variables from seven retained variables (i.e., , , , , , , and ) whose absolute value is less than 1.96. In here, the type I error, , is chosen as 0.05. This study utilized the SPSS with a testing sample, and the estimates of the parameters are provided in Table 6. After performing the diagnostic assessment, we obtained an MR model that is selected by using the significance test. This model is referred to as the MRSIG, and it is described in (10).
After using VIF to perform the parameter selection, this study used the typical statistical hypothesis tests to obtain the significant variables in the model. Accordingly, this study deleted the variables from seven retained variables (i.e., , , , , , , and ) whose absolute value is less than 1.96. In here, the type I error, , is chosen as 0.05. This study utilized the SPSS with a testing sample, and the estimates of the parameters are provided in Table 7. After performing the diagnostic assessment, we obtained an MR model that is selected by using the significance test. This model is referred to as the MRSIG, and it is described as follows:
This study also used three selection techniques to develop the MR models for the currency issued. These three techniques include forward selection, backward elimination, and the stepwise regression analysis.
The concept of three variable selection procedures is described as follows. The forward selection is similar to the stepwise selection. The first explanatory variable selected for inclusion of the regression equation is the one with the largest positive or negative correlation with the dependent variable, . This explanatory variable is entered into the regression equation only if it satisfies the tolerance criterion for entry. If the first variable is entered, the explanatory variable which is not in the regression equation that has the largest partial correlation is considered next. The forward selection procedure would stop when there are no explanatory variables that meet the entry criterion. The back elimination procedure initially considers all explanatory variables to be included in the regression equation and then sequentially removed. The explanatory variable with the smallest partial correlation with the dependent variable is first for the removal. If that variable meets the tolerance criterion for elimination, it is removed. After the first variable is removed, the variable remaining in the regression equation with the smallest partial correlation is considered next. The procedure would stop when there are no variables in the regression equation that satisfy the removal criteria. At each step for the stepwise selection procedure, the explanatory variable which is not in the regression equation that has the smallest probability of is entered, if the probability is sufficiently small. Explanatory variables that have already existed in the regression equation are removed if their probability of is sufficiently large. The stepwise selection procedure would stop when no more variables are eligible for inclusion or removal.
In our experiment, all those three selection procedures resulted in the same MR model. Table 8 shows the SPSS results of the parameter estimates. This model is referred to as the MRSEL, and it is described as follows:
3.4. Hybrid Modeling Results
In our proposed two-step hybrid model, the first step is to obtain the appropriate input variables for the ANN model. Because this study utilizes different MR modeling selections, the explanatory variables in MRVIF, MRSIG, and MRSEL models serve as the input variables for ANN. Accordingly, this study employs three combinations of MR and ANN as the candidate hybrid models, wherein combinations of MRVIF and ANN, MRSIG and ANN, and MRSEL and ANN are referred to as MRVIF-ANN, MRSIG-ANN, and MRSEL-ANN, respectively.
When the first stage of hybrid modeling is completed, the ANN topology settings are established. Table 9 displays the various ANN topology settings for the hybrid models. As a result, we found that the topologies with a learning rate of 0.01 provide the best result for MRVIF-ANN, MRSIG-ANN and MRSEL-ANN, respectively.
3.5. The Modeling Results and Analysis
This study develops various forecasting models to predict the volume of currency issued in Taiwan. Table 10 provides the forecasting results, as well as the MAPE, MSE, and MAD of those models. A low MAPE, MSE, or MAD is associated with better forecasting accuracy. In comparison to the MAPE performance for the single-stage models in Table 10, we note that MR models exhibit better performance than the ARIMA and ANN models. However, the MSE, and MAD values of the MR models are larger than those of the ARIMA and ANN models. Accordingly, the forecasting results reveal that there is no significant difference among those single-stage models.
Nevertheless, our proposed hybrid models provide more accurate results than the single-stage models. In terms of MAPE, MSE or MAD, the three hybrid models are all lower than the four single-stage models. The MAPE percentage improvements of the proposed MRVIF-ANN model over the four single stage models, ARIMA, ANN, MRSIG, and MRSEL for the 24-period forecasts are 36.52%, 36.21%, 17.73, and 15.39%, respectively. Table 11 provides a comparison with respect to the overall improvement percentage in the single-stage models. As shown in Tables 10 and 11, the proposed hybrid models outperform all single-stage models.
The accurate prediction of currency volume issued is very important for the economic development of a country. This study performed a comparison of single-stage and hybrid models in predicting the volume of currency issued in Taiwan.
Because it is difficult to fully capture the characteristics of the real data, the hybrid scheme can be a good practical modeling approach. In this study, the concept of the proposed hybrid scheme takes advantage of each component model’s unique capability to capture patterns in the currency data. Different combinations of hybrid technique were proposed to overcome the deficiencies of single models and yield more accurate prediction results. In this study, the MAPE, MSE, and MAD are used to measure the forecasting capability. The forecasting results reveal that the hybrid models are more fruitful methods for improving the forecasting performance of each single-stage model.
The proposed hybrid technique is more effective than the single-stage modeling. However, due to the difficulty of obtaining other countries’ datasets, we are unable to perform the same procedures in reference to other countries. We do believe that the proposed hybrid approach is suitable for forecasting the currency issued for other countries in addition to Taiwan. We have described a framework for integrating several frequently used MR modeling methods and ANN techniques. The extension of these two-stage hybrid procedures to other techniques is currently under investigation.
This research was supported in part by the National Science Council of Taiwan, Grant no. NSC 99-2221-E-030-014-MY3.
G. L. Kaminsky and C. M. Reinhart, “The twin crises: the causes of banking and balance-of-payments problems,” American Economic Review, vol. 89, no. 3, pp. 473–500, 1999.View at: Google Scholar
D. F. Hendry and N. R. Ericsson, “An econometric analysis of UK money demand in monetary trends in the United States and United Kingdom by Milton Friedman and Anna J. Schwartz,” American Economic Review, vol. 81, pp. 8–38, 1991.View at: Google Scholar
D. F. Hendry, “Modeling the demand for narrow money in the United Kingdom and the United States. by David F. Hendry and Neil R. Ericsson,” European Economic Review, vol. 35, no. 4, pp. 883–886, 1991.View at: Google Scholar
T. Yoshida, “On the stability of Japanese money demand function: estimation results using error the correction model,” Monetary and Economic Studies, vol. 8, no. 1, pp. 1–48, 1990.View at: Google Scholar
Y. E. Shao, Y. F. Lin, and S. T. Yuan, “Integrated application of time series multiple-interventions analysis and knowledge-based reasoning,” Journal of Applied Statistics, vol. 26, no. 6, pp. 755–766, 1999.View at: Google Scholar
Y. E. Shao, “Multiple intervention analysis with application to sales promotion data,” Journal of Applied Statistics, vol. 24, no. 2, pp. 181–191, 1997.View at: Google Scholar
R. Sadownik and E. P. Barbosa, “Short-term Forecasting of industrial electricity consumption in Brazil,” Journal of Forecasting, vol. 18, no. 3, pp. 215–224, 1999.View at: Google Scholar
A. Ustundag, “A hybrid model for forecasting sales in Turkish paint industry,” International Journal of Computational Intelligence Systems, vol. 2, no. 3, pp. 277–287, 2009.View at: Google Scholar
A. M. S. Muniz, H. Liu, K. E. Lyons et al., “Comparison among probabilistic neural network, support vector machine and logistic regression for evaluating the effect of subthalamic stimulation in Parkinson disease on ground reaction force during gait,” Journal of Biomechanics, vol. 43, no. 4, pp. 720–726, 2010.View at: Google Scholar
M. Caselli, L. Trizio, G. De Gennaro, and P. Ielpo, “A simple feedforward neural network for the PM10 forecasting: comparison with a radial basis function network and a multivariate linear regression model,” Water, Air, and Soil Pollution, vol. 201, no. 1–4, pp. 365–377, 2009.View at: Publisher Site | Google Scholar
K. Subramanian, V. M. Periasamy, M. Pushpavanam, and K. Ramasamy, “Predictive modeling of deposition rate in electro-deposition of copper-tin using regression and artificial neural network,” Journal of Electroanalytical Chemistry, vol. 636, no. 1-2, pp. 30–35, 2009.View at: Publisher Site | Google Scholar
H. M. Chung and P. Gray, “Special section: data mining,” Journal of Management Information Systems, vol. 16, no. 1, p. 11, 1999.View at: Google Scholar
M. W. Craven and J. W. Shavlik, “Using neural networks for data mining,” Future Generation Computer Systems, vol. 13, no. 2-3, pp. 211–229, 1997.View at: Google Scholar
C. D. Hou and Y. E. Shao, “Integrated use of statistical-based approaches and machine learning techniques for tumors classification using microarray data,” Journal of Biobased Materials and Bioenergy. In press.View at: Google Scholar
D. E. Rumelhart, D. E. Hinton, and R. J. Williams, Learning Internal Representations by Error Propagation in Parallel Distributed Processing, MIT Press Cambridge, Cambridge, Mass, USA, 1986.
Y. E. Shao and B. S. Hsu, “Determining the contributors for a multivariate SPC chart signal using artificial neural networks and support vector machine,” International Journal of Innovative Computing, Information and Control, vol. 5, no. 12, pp. 4899–4906, 2009.View at: Google Scholar
Y. E. Shao, C. J. Lu, and C. C. Chiu, “A fault detection system for an autocorrelated process using SPC/EPC/ANN and SPC/EPC/SVM schemes,” International Journal of Innovative Computing, Information and Control, vol. 7, no. 9, pp. 5417–5428, 2011.View at: Google Scholar
A. Azadeh, M. Saberi, A. Kazem, V. Ebrahimipour, A. Nourmohammadzadeh, and Z. Saberirw, “A flexible algorithm for fault diagnosis in a centrifugal pump with corrupted data and noise based on ANN and support vector machine with hyper-parameters optimization,” Applied Soft Computing, vol. 13, no. 3, pp. 1478–1485, 2013.View at: Publisher Site | Google Scholar
R. Hedjar, “Adaptive neural network model predictive control,” International Journal of Innovative Computing, Information and Control, vol. 9, no. 3, pp. 1245–1257, 2013.View at: Google Scholar