Abstract

The Chinese currency, RMB, is developing as an international currency. Therefore, the effective strategy for trading RMB exchange rates would be attractive to international investors and policymakers. In this paper, we have constructed hybrid EMD-MLP models to forecast RMB exchange rates and developed a trading strategy based on these models. Empirical results show that the proposed hybrid EMD-MLP model always performs best based on both NMSE and criteria when the forecasting period is greater than five days. Moreover, we compare the models’ performance using different horizons and find that accuracy will increase with the growth of the forecasting horizons; however, the NMSE will become larger. Lastly, we adopt the best performing model to develop trading strategies with longer forecasting horizons when considering the number of profitable trading activities. If we consider a 0.3% transaction cost, the developed strategy will bring an annual return exceeding 10%, as well as enough trading opportunities.

1. Introduction

As the Chinese government continues the process of RMB internationalization, RMB currency trading becomes increasingly important in personal investment, corporate financial decision-making, governments’ economic policies, and international trade and commerce. The RMB has therefore attracted an increasing attention from policymakers, investment institutions, and entrepreneurs worldwide. One reflection of this is the 2015 finding of the Society for Worldwide Interbank Financial Telecommunication (SWIFT) that the RMB has overtaken the Japanese yen to become the fourth ranking world payment currency. The first milestone in the process of RMB internationalization is the establishment of an RMB offshore center in Hong Kong in 2004. In December 2008, the Chinese Premier announced the pilot program for cross border trade settlement in RMB, making the offshore RMB officially deliverable in Hong Kong. From July 19, 2010, the offshore market for RMB officially commenced. Recent literature [14] has found that RMB movements have an impact on regional currencies.

Due to the importance of the RMB, the purpose of this study is to create a reliable RMB forecasting model and provide a possible application for designing trading strategies. Two kinds of current rates for exchanging the US dollar (USD) into Chinese RMB will be considered. If the RMB is traded onshore (in mainland China), it is referred to as CNY, whereas if traded offshore (mainly in Hong Kong), it is designated as CNH. Since the onshore and offshore markets might respond differently to changes depending on financial markets, the CNY and CNH are not always at the same price level. Craig et al. [5] and Funke et al. [6] study CNY-CNH pricing differentials in detail.

It is well documented that the exchange rate series is considered as nonlinear and nonstationary time series and interactively influenced by many factors, which makes the accurate forecasting of exchange rate rather challenging. During past decades, traditional econometric and statistical techniques, including autoregressive integrated moving average (ARIMA), cointegration analysis, vector autoregression (VAR), and error correction (ECM) models, have been widely used in foreign exchange rates forecasting. However, in the real-world financial markets, exchange rate series are nonlinear and rarely form purely linear combinations [710]. Thus, the above traditional models always provide unreliable forecasting if one continue applying these traditional econometric and statistical models. The main reason of this deficiency is that traditional econometric and statistical models are constructed based on linear assumptions, which will be unable to capture the nonlinear features hidden in the exchange rate series.

Considering the limitation of traditional econometric and statistical models, many nonlinear artificial intelligence models (AI), such as artificial neural networks (ANN) [11, 12], feedforward neural networks (FNN) [13, 14], support vector regression (SVR) [1517], and genetic programming (GP) [18, 19], have been applied to investigate the forecasting ability of financial time series. Yu et al. [20] provide a complete review of foreign exchange rate forecasting with ANN and also introduce SVR and GP. However, many AI-based models also have their own shortcomings. For instances, ANN usually suffers from overfitting and local minima, while other models, including GP and SVR, are sensitive to parameter estimation. Recently, due to the complex characteristics of exchange rate series, several studies show that a single model is unable to capture all the features and make accurate forecasts [21].

To overcome above shortcomings, many researchers start to rely on the hybrid model to forecast exchange rate series accurately. The composite forecasting method allows us to obtain an approach to the dynamics underlying the data by combining the predictions obtained from different individual techniques. Álvarez-Díaz and Álvarez [22] attempt to exploit the nonlinear structure by constructing the genetic programming and neural networks composite method and apply this composite method to forecast Yen/US$ and Pound Sterling/US$ exchange rates. Other scholars also try to improve the forecasting accuracy of exchange rate series by applying hybrid forecasting models. For example, Zhao and Yang [23] and Wong et al. [24] use fuzzy clustering and ANN to forecast financial time series. Yu et al. [25] propose an online big data-driven oil consumption forecasting model based on Google trend by combining the relationship investigation and prediction improvement. Yu et al. [14] design a novel ensemble forecasting approach for complex time series by coupling sparse representation (SR) and feedforward neural network (FNN), i.e., the SR-based FNN approach.

The empirical mode decomposition (EMD) technique is first proposed by Huang et al. [26]. From the theoretical views, EMD is suitable for time series data in terms of decomposing the original data into components, which could break the forecasting task down into simpler forecasting subtasks. These decompositions consist of a finite and often small number of intrinsic mode functions (IMFs) and one residual. From the perspective of financial practice, various economic activities may cause a different impact periodicity on the exchange rate; for example, the company’s financial report is released quarterly, but government economic indicators are usually published once a year. EMD is potentially helpful in decomposing these effects and leads to improve forecasting works. There are several studies employing the EMD technique in hybrid models. For instance, Yu et al. [27] forecast crude oil prices with an EMD-based neural network model; Chen et al. [28] also combine EMD and the ANN approach to forecast tourism demand, and Lin et al. [29] propose a hybrid model using EMD and SVR for foreign exchange rate forecasting. In this empirical experiment, the original CNY or CNH price series, with characteristics of nonlinearity and nonstationarity, are divided into several independent subseries by the EMD technique and then partial or all IMFs and one residual are used to forecast. Tang et al. [30] combine the ensemble empirical mode decomposition (EEMD) with random vector functional link (RVFL) network and find the proposed EEMD-based RVFL network performs significantly better in terms of prediction accuracy than not only single algorithms such as RVFL network, extreme learning machine (ELM), kernel ridge regression, random forest, backpropagation neural network, least square support vector regression, and autoregressive integrated moving average, but also their respective EEMD-based ensemble variants. We summarize in Table 1 the main characteristics of these studies on individual and hybrid forecasting methods in the recent literature.

This study will focus on using ANN to forecast the RMB with different forecasting horizons of 1, 5, 10, 20, and 30 days. Nevertheless, Huang et al. [32] show that ANN performs better than the random walk while the forecasting horizon is less than five days, but for longer horizons, such as 10 and 30 days, the general performance of ANN is worse than the random walk model. In this study, a type of feedforward artificial neural network model called multilayer perceptron (MLP) is adopted. Empirical studies consist not only of the pure MLP model but also a hybrid model with EMD to improve forecasting performance.

In this study, the hybrid forecasting model combining MLP and EMD is similar to the work of Yu et al. [27]. However, this study further considers the influence of IMFs that have different levels of frequency. In concrete terms, higher frequency IMFs could be regarded as noise components, when our forecasting horizon is longer. Based on this concept, two hybrid models are proposed and named EMD-MLP and EMD-MLP. Comparing these to pure MLP, the former adjusts the original time series data by subtracting higher frequency IMFs and then uses MLP to compute the one-day ahead predictions. The EMD-MLP model applies the “divide-and-conquer” principle to construct a novel forecasting methodology, in which partial or all IMFs and one residual are used. According to the empirical evidence in this study, both types of hybrid models are superior to the pure MLP model, whether with 1-, 5-, 10-, 20-, or 30-day forecasting horizons. Moreover, the application of trading strategies is proposed. Although the transaction costs can make the profits practically disappear or become negative, as proved by Álvarez Díaz [31], this empirical analysis shows that, even considering a 0.3% transaction cost in each trade, the trading strategy based on EMD-MLP, on average, produces an annualized profit exceeding 10%.

In this paper, we will investigate RMB exchange rate forecasting and develop the relevant trading strategies based on the constructed models. The main contributions come from three aspects. First, this study focuses on the RMB, including the onshore RMB exchange rate (CNY) and offshore RMB exchange rate (CNH). We will use daily data to forecast RMB exchange rates with different horizons based on three types of models, i.e., MLP, EMD-MLP, and EMD-MLP. Second, we will consider not only the MLP model but also the hybrid EMD-MLP and EMD-MLP models to improve forecasting performance. It should be noted that the EMD-MLP is different from the methodology proposed by Yu et al. [27]. We regard some IMF components as noise factors and delete them to reduce the volatility of the RMB. Finally, we will choose the best forecasting models to construct the trading strategies by introducing different critical numbers and considering different transaction costs.

The remainder of the paper is organized in the following manner. Section 2 provides a brief description of ANN and EMD. The overall forecasting process and model notations of three kinds of forecasting models are also included in this part. In Section 3, two currency exchange rates of USD to RMB, CNY, and CNH, are used to test the effectiveness of the proposed methodology, and the selected models are applied in trading strategies. The conclusions drawn from this study are presented in Section 4.

2. Methodology

2.1. Artificial Neural Network (ANN)

This study considers MLP based on an error backpropagation algorithm. Figure 1 shows a simple MLP structure with three input nodes (X1, X2, and X3). One hidden layer consists of four hidden nodes and one output node (Y). The nodes are organized in layers and are usually fully connected by weights, which indicate the effects of the corresponding nodes. In each node of the hidden and output layers, all data are firstly processed by the integration function (also called the summation function), which combines all incoming signals, and secondly processed by the activation function (also called the transfer function), which transforms the output of the node. In general, the amount of the hidden layer is less than 3, due to the converging restriction.

Particularly, the MLP model is fitted by the training data, and then the testing data and an out-of-sample dataset are used to verify its forecasting performance. Through supervised learning algorithms, the parameters (weights and node intercepts) are adjusted iteratively by a process of minimizing the forecasting error function. Formally, an MLP with an input layer with n nodes, one hidden layer consisting of J hidden nodes, and an output layer with one output node calculates the following function:where is the vector of all input variables, is the intercept of the output node, is the intercept of the jth hidden node, denotes the weight corresponding to the node starting at the jth hidden node to the output node, and denotes the weight corresponding to the node starting at ith input node to the jth hidden node. Therefore, all hidden and output nodes calculate the function , where denotes the integration function, which is defined as , and denotes the activation function, which is often a bounded nondecreasing nonlinear and differentiable function. In this study, the logistic function () is used as the activation function.

Given inputs and the current weights, which are initialized with random values from a standard normal distribution, the MLP produces an output . Then, an error function is defined. This study selects the mean square error as follows:where N is the number of data samples and is the observed output. During the iterative training process, the above steps are repeated to adapt all weights until a prespecified criterion is fulfilled. In order to find a local minimum of the error function, the resilient backpropagation algorithm modifies the weights in the opposite direction of partial derivatives. According to Riedmiller and Braun [34], the weights are adjusted by the following rule:where t and k index the iteration steps and the weights, respectively. In order to speed up convergence, the learning rate increases if the corresponding partial derivative keeps the same sign; otherwise, it will decrease.

2.2. Empirical Mode Decomposition (EMD)

In this study, we further apply EMD to decompose the time series into several IMFs and remaining residues. These IMFs usually satisfy two conditions: the first is that the number of extrema and zero crossings must be equal or different by not more than one. Second, the mean value of the envelopes, which include both local maxima and minima, must be zero at all points.

Given the time series data , Huang et al. [26] propose a sifting process to decompose . The first step is to identify all local maxima and local minima of . Then, the upper and lower envelopes are defined by connecting all local extrema by a spline line. Next, for all points at the envelope, the mean value from upper and lower envelopes is calculated. Then, it follows computation of the first IMF of :

If does not meet the above two conditions, this study then takes as a new data series and repeats procedure (4). Thus, we calculate

In this calculation, is the mean value of the upper and lower envelopes of .

Repeating the same procedure until meeting both conditions, we get the first IMF component of , , that is

The stopping rule indicates that absolute values of the envelope mean must be less than the user-specified tolerance level. In this study, the tolerance level is denoted as the standard deviation of times 0.01. Other interesting stopping rules can be found in the works of Huang et al. [26] and Huang and Wu [35].

After extracting the component from , we denote another series , which contains all information except . Huang et al. [36] suggest a sifting stop criterion that is becomes a monotonic function or cannot extract more IMFs. Finally, the time series can be expressed aswhere n is the number of IMFs and is the final residue, which represents the central tendency of the data series . These IMFs are nearly orthogonal to each other and all have means of nearly zero. According to the above properties, it is possible to forecast all decompositions and summarize these estimations to predict .

2.3. Overall Forecasting Process and Model Notations

Considering the time series , , we would like to predict l-day ahead, which is denoted as . In this study, the input variables (past observations) include , , , and to , which represent the past price levels of 60 days, 20 days, 10 days, and the last 5 days, respectively. The output is the l-day ahead prediction. Formally, it could be shown as follows:where is a function determined by neural network training and is a weight vector of all parameters of MLP.

Three kinds of forecasting models are adopted in this study. The first is the pure MLP model with one and two hidden layers. The second is an EMD-MLP model, which uses EMD to subtract some volatile IMFs from the original data series and then uses the new data series to calculate the final prediction by MLP technology. Figure 2 indicates the procedure of the EMD(-1)-MLP model, which means the first IMF component is ignored. The third kind of model is named EMD-MLP, which generally consists of the following four steps:(1)Decompose the original time series into IMF components and one residual component via EMD(2)Determine how many IMF components are used, which depends on the length of the forecasting period(3)For each chosen IMF and residual component, the MLP model is used as a forecasting tool to model these components and to make the corresponding prediction(4)Add all prediction results to one value, which can be seen as the final prediction result for the original time series.

As an example, Figure 3 represents the above procedure of the EMD(-1)-MLP model. Time series data are decomposed via EMD and generates n IMF components, , and one residue . In addition to , we forecast each decomposition and then sum them as the prediction results for the original time series.

3. Empirical Experiments

3.1. Data

Two kinds of currency exchange rates of USD to RMB are considered in this study. If the RMB is traded onshore (in mainland China), it is referred to as CNY, and if traded offshore (mainly in Hong Kong), it is named CNH. Both time series data are downloaded from Bloomberg. For CNY, daily data from January 2, 2006, to December 21, 2015, with a total of 2584 observations are examined in this study. Since the CNH officially commenced on July 19, 2010, its sample period is from January 3, 2011, to December 21, 2015, with a total of 1304 observations. For training the neural network models, two-thirds of the observations are randomly assigned to the training dataset and the remainder is used as the testing dataset. In addition, it should be noted that the time series in price levels is used in the following analysis, rather than in return levels.

According to descriptions shown in Section 3.2, the EMD technique is used to decompose both CNY and CNH price series into several independent IMFs and one residual component. The decomposed results of CNY and CNH are represented in Figures 4 and 5. Comparing these two figures, the original data series and decompositions seem dissimilar, which is due to the different lengths of the sample periods. In fact, their IMFs have some similar characteristics. First, focusing on the sample period 2011–2015, their residual components have the same trend. Starting at about 6.5, they drop slightly to 6.2 and then rise slightly to 6.5. Second, the swing period of high-frequency IMFs is shorter and also the mean of absolute values of high-frequency IMFs is smaller when compared with low frequency. For example, in the series of CNY, the absolute values of , , and are 0.00298, 0.00596, and 0.02002, respectively. In our estimation model, it is not necessary to include all the high-frequency IMFs, as considering high-frequency IMFs may reduce the forecasting accuracy when the forecasting period is long. Therefore, in the model EMD-MLP, the high-frequency IMF is removed to get the denoised time series. In the model EMD-MLP, not only are all the decompositions chosen to forecast the exchange rate, but consideration is also given to using part of the decompositions to conduct forecasting, discarding the high-frequency IMF series.

3.2. Experimental Results

Two main criteria are considered in this empirical experiment: the normalized mean squared error (NMSE), and the directional statistic (), to evaluate the levels of prediction and directional forecasting, respectively. Typically, following [37], the NMSE is defined bywhere refers to the test dataset containing observations, is the l-day ahead prediction, is the actual value, and is the variance of . Clearly, the NMSE is one of the most important criteria for measuring the validity of the forecasting model, but from a business perspective, improving the accuracy of directional predictions can support decision-making, so as to generate greater profits. Furthermore, the Diebold-Mariano (DM) test [38] is adopted to investigate whether adding EMD could improve the forecasting performance of the MLP model. Specifically, the null hypothesis is that the EMD-MLP (or EMD-MLP) model has a lower forecast accuracy than the corresponding MLP model. For example, in the case of the EMD(-1)-MLP(5,3), the corresponding model infers the MLP(5,3) model.

Besides the NMSE evaluation, the directional statistic () is applied to measure the ability to predict movement direction [27, 39], which can be expressed aswhere if and otherwise. Pesaran and Timmermann [40] provide the directional accuracy (DAC) test to examine predictive performance. In this case, the DAC test is used to determine whether is significantly larger than 0.5. We have also adopted the excess profitability test proposed by Anatolyev and Gerko [41] in every related empirical study. Since both tests produce the same outcome, we only report the results of DAC tests in the following empirical studies.

Table 2 compares the forecasting performance, in terms of the NMSE, for the MLP, EMD-MLP, and EMD-MLP models. The NMSE is reported as the percentage for l-day ahead predictions where and 30. For each prediction period, the minimum NMSE is designated in bold font. Panel A of Table 2 displays the experimental results of the MLP models. It is clear that, with the increase in forecasting period, the NMSE becomes larger, suggesting reduced forecasting accuracy for a longer forecasting period. Also, the more complex MLP models with two hidden layers do not offer better forecasting accuracy and are even worse for 1-day ahead prediction.

The empirical results of EMD-MLP are discussed in Panel B of Table 2. EMD(-1)-MLP treats the highest frequency IMF as noise and applies the MLP model after subtracting the series from the original time series. Empirical results show that the NMSE of 1-day ahead forecasting dropped dramatically; however, improvement of 5-day ahead forecasting with EMD(-1)-MLP models is limited. EMD(-2)-MLP models, which deduct the two highest frequency IMF series, perform better than EMD(-1)-MLP models in the 5-day ahead forecasting experiment. In the same way, the EMD(-3)-MLP models perform better when the forecasting period is longer than 10 days. The results of EMD-MLP models are shown in Panel C of Table 2. Although the calculation process is more complicated, the forecasting performance of EMD-MLP is always better than MLP or EMD-MLP models, according to the criterion of NMSE. In addition, the random walk models with and without drifts (the random walk model without drifts predicts that all future values will equal the last observed value. Given a variable Y, the k-step-ahead forecast from period t of Y is . For the random walk model with drifts, the k-step-ahead forecast from period t of Y is , where is estimated by the average change of in the past 60 days) are used to examine the same data. The related results in Panel D of Table 2 indicate that, for the MLP-EMD and MLP-EMD models, their performance is obviously better than the random walk models in most situations.

Table 3 is based on the forecasting performance of the above models with , which shows similar experimental results to the NMSE criterion. The table shows that if EMD(-2)-MLP(5,3) is applied to forecasting the direction of the CNY series 30 days later, the hitting rate can be as high as 86.39%. In general, although the calculation of EMD-MLP models is more complicated, the forecasting ability appears to be the best of all the proposed models. The highest frequency IMF is therefore treated as noise and EMD(-1)-MLP is used for the forecast. In this way, the performance of the selected model would be relatively better, based on both NMSE and criteria. Besides, it should be noted that performance of predicting over longer periods is worse than shorter ones intuitively. In terms of NMSE, the experimental results are consistent with above argument, i.e., the NMSE performance of predicting 20 and 30-days ahead is poor. But the performance is totally different; our results show that predictions over longer periods often have higher (in order to verify the robustness, we also used three-fourths of the observations as a training dataset to reexamine the results of Tables 2 and 3. The related outcomes are shown in Tables 4 and 5. In addition, we considered a hyperbolic tangent function as the activation function and reported related results in Tables 6 and 7).

In addition to the above experiments, the dataset of CNY and CNH series from 2011 to 2015 is considered. We use the same method of dividing the data into the training set and testing set. According to the experimental results in Tables 2 and 3, we do not report MLP models and only adopt a number of EMD-MLP and EMD-MLP models to conduct predictions. The statistical results of NMSE and are shown in Tables 8 and 9, respectively.

Tables 8 and 9 indicate the EMD-MLP models perform better than the EMD-MLP models on both NMSE and criteria. Comparing the results of the CNY and CNH series, the NMSE of CNY is found to be less than that of CNH on average, no matter how long the forecasting period. The main reason for this is the original volatility of CNY series is smaller than that of CNH series. For the test part, forecasting performance improves with the growth of forecasting periods. Also, no significant difference is found for the results of CNY and CNH series based on criteria. For forecasting over 20 days ahead, the hitting rate exceeds 89%. Even for forecasting 1-day ahead, can achieve more than 79%.

3.3. Applying the Trading Strategy

This part introduces an application for designing a trading strategy for CNY (since the cases of CNY and CNH, from 2011 to 2015, produce the similar results, we do not report the latter in this paper). According to the results in Tables 2 and 3, in terms of NMSE and , the best model for predicting l-day ahead can be determined. The forecasting model can be trained based on past exchange rate series and produce l-day ahead predictions. According to the forecasting results, trading strategies are set as follows:where is the l-day ahead predictions at time s and τ is a critical number. We long (short) the CNY if l-day ahead predictions are greater (smaller) than the present price multiplied by . So, unless , people always conduct a transaction at time s and hold it for l days. For instance, consider the case with and . If and we use the EMD-MLP model to find , we immediately long CNY and hold it for 10 days. On the contrary, if and , we short it.

The reason of setup τ comes from the following several aspects: first, when we set , the number of transactions must be very large. Since high frequency trading comes with extremely high transaction costs, the erosion of profits should not be ignored and it usually makes the trading strategy fail in reality. Therefore, letting τ be larger than zero can reduce the number of trades. More importantly, τ could be regarded as a confidence level indicator, which produces long and short trading thresholds, i.e., and . Only when the l-day ahead prediction is larger (smaller) than the thresholds, we long (short) the CNY. Specifically, when we set , denoting that if the forecast price exceeds the present price by more than 1%, we buy it; on the contrary, if the forecast price exceeds the present price by less than 1%, we short it. To fit the practical situation, this empirical analysis also considers annual returns with transaction costs of 0.0%, 0.1%, 0.2%, and 0.3%, respectively.

Table 10 discusses the performance of the above trading strategy with , based on best models, selected according to Tables 2 and 3. It displays the best performance models with different forecasting periods. The returns shown in the last four columns have been adjusted to annual returns using different transaction costs. Firstly, it can be seen that the larger the l is, which means the forecasting period is longer, the larger the standard deviation will be. If there is no transaction cost, the return decreases with the increase of the forecasting period l. For instance, in Panel A, choosing a forecasting period of 1 day with and zero transaction cost, the annual return will reach 13.59% by choosing the best performance model EMD(-1)-MLP(3). However, extending the forecasting period to 30 days, the return will become 4.59% with the best performance model.

However, if transaction costs are considered, it is clear that the return increases as the forecasting period becomes longer, which is opposite to the case where there are no transaction costs. The annual return suffers significant falls in short forecasting periods after considering the transaction cost since the annual return of a short period is offset by the significant cost of high-frequency trading. For instance, with a 0.3% trading cost, the annual return of the best performance model for 30-day ahead forecasting is positive and is only 2.09%, but the annual return of EMD(-1)-MLP(3), which is the best performance for forecasting 1-day ahead, reaches as low as −61.41%. The annual return decreases with increasing transaction costs for every forecasting period.

Panels B and C, respectively, display trading strategy performance based on the best model when and . The 1-day forecasting case is ignored in this experiment since the predicted value of the 1-day period does not exceed τ. The standard deviation in the fourth column decreases with the growth of forecasting periods. Another finding in these two panels is that the annual return decreases as the transaction cost becomes larger in the same forecasting period, which is similar to Panel A. However, with the same transaction cost, the annual return decreases with the growth of the forecasting period, which is contrary to the situation in Panel A. This may be the result of setting τ = 0.5% or τ = 1%; the annual return will not be eroded by trading costs.

When we set τ = 0.5% for 5-day ahead forecasting, although the average annual return with different trading costs is at least 21.78%, there are only 32 trading activities in a total of 832 trading days. However, in choosing long period forecasting, such as 20 days ahead, trading activity is 220 with a 8.4% average annual return. In Panel C, the trading activities are less than in Panel B. If a 0.3% trading cost with τ = 1% is considered, the annual return of 30-day ahead forecasting will exceed 10%, and there are more than 130 trading activities. It can be seen that trading activities reduce with the growth of critical number τ. To sum up, applying EMD-MLP to forecast the CNY could produce some useful trading strategies, even considering reasonable trading costs.

4. Conclusions

In this paper, RMB exchange rate forecasting is investigated and trading strategies are developed based on the models constructed. There are three main contributions in this study. First, since the RMB’s influence has been growing in recent years, we focus on the RMB, including the onshore RMB exchange rate (CNY) and offshore RMB exchange rate (CNH). We use daily data to forecast RMB exchange rates with different horizons based on three types of models, i.e., MLP, EMD-MLP, and EMD-MLP. Our empirical study verifies the feasibility of these models. Secondly, in order to develop reliable forecasting models, we consider not only the MLP model but also the hybrid EMD-MLP and EMD-MLP models to improve forecasting performance. Among all the selected models, the EMD-MLP performs best, in terms of both NMSE and criteria. It should be noted that the EMD-MLP is different from the methodology proposed by Yu et al. [27]. We regard some IMF components as noise factors and delete them to reduce the volatility of the RMB. The empirical experiments verify that the above process could clearly improve forecasting performance. For example, Table 1 shows that the best models are EMD(-1)-MLP(3), EMD(-1)-MLP(5,3), and EMD(-2)-MLP(5,3). Finally, we choose the best forecasting models to construct the trading strategies by introducing different critical numbers and considering different transaction costs. As a result, we abandon the trading strategy of since the annual returns are mostly negative. However, given or , all annual returns are more than 5%, even including the transaction cost. In particular, by setting τ = 1% with 0.3% transaction cost, the annual return is more than 10% in different forecasting horizons. Thus, this trading strategy can help investors make decisions about the timing of RMB trading.

As the Chinese government continues to promoting RMB internationalization, RMB currency trading becomes increasingly important in personal investment, corporate financial decision-making, government’s economic policies, and international trade and commerce. This study is trying to apply the neural network into financial forecasting and trading area. Based on the empirical analysis, the forecasting accuracy and trading performance of RMB exchange rate can be enhanced. Considering the performance of this proposed method, the study should be attractive not only to policymakers and investment institutions but also to individual investors who are interested in RMB currency or RMB-related products.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Authors’ Contributions

All authors have contributed equally to this article.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (NSFC no. 71961007, 71801117, and 71561012). It was also supported in part by Humanities and Social Sciences Project in Jiangxi Province of China (JJ18207 and JD18094) and Science and Technology Project of Education Department in Jiangxi Province of China (GJJ180278, GJJ170327, and GJJ180245).