Stock Market Prediction on High-Frequency Data Using Generative Adversarial Nets

Zhou, Xingyu; Pan, Zhisong; Hu, Guyu; Tang, Siqi; Zhao, Cheng

doi:https://doi.org/10.1155/2018/4907423

Mathematical Problems in Engineering

On this page

Abstract Introduction Related Work Conclusion Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Computational Intelligence in Data-Driven Modelling and Its Engineering Applications

View this Special Issue

Research Article | Open Access

Volume 2018 | Article ID 4907423 | https://doi.org/10.1155/2018/4907423

Stock Market Prediction on High-Frequency Data Using Generative Adversarial Nets

Xingyu Zhou,¹Zhisong Pan,¹Guyu Hu,¹Siqi Tang,¹and Cheng Zhao^1,2

Academic Editor: Qian Zhang

Received06 Nov 2017

Revised21 Jan 2018

Accepted13 Feb 2018

Published15 Apr 2018

Abstract

Stock price prediction is an important issue in the financial world, as it contributes to the development of effective strategies for stock exchange transactions. In this paper, we propose a generic framework employing Long Short-Term Memory (LSTM) and convolutional neural network (CNN) for adversarial training to forecast high-frequency stock market. This model takes the publicly available index provided by trading software as input to avoid complex financial theory research and difficult technical analysis, which provides the convenience for the ordinary trader of nonfinancial specialty. Our study simulates the trading mode of the actual trader and uses the method of rolling partition training set and testing set to analyze the effect of the model update cycle on the prediction performance. Extensive experiments show that our proposed approach can effectively improve stock price direction prediction accuracy and reduce forecast error.

1. Introduction

Predicting stock prices is an important objective in the financial world [1–3], since a reasonably accurate prediction has the possibility to yield high financial benefits and hedge against market risks. With the rapid growth of Internet and computing technologies, the frequency for performing operations on the stock market had increased to fractions of seconds [4, 5]. Since year of 2009 the BM&F Bovespa (the Brazilian stock exchange) has worked in high-frequency, and the number of high-frequency operations has grown from 2.5% in 2009 to 36.5% in 2013. Aldridge and Krawciw [6] estimate that in 2016 high-frequency trading on average initiated 10%–40% of trading volume in equities and 10%–15% of volume in foreign exchange and commodities. These percentages suggest that the high-frequency stock market is a global trend.

In most cases, the forecast results are assessed from two aspects: the first is forecast error (chiefly the RMSE (Root Mean Square Error) or RMSRE (Root Mean Square Relative Error)) between real price and forecast value; the second is direction prediction accuracy, which means the percentage of correct predictions of price series direction, as upward and downward movements are what really matters for decision-making. Even small improvements in predictive performance can be very profitable [7, 8].

However, predicting stock prices is not an easy work, due to the complexity and chaotic dynamics of the markets and the many nondecidable, nonstationary stochastic variables involved [9]. Many researchers from different areas have studied the historical patterns of financial time series and have proposed various methods for forecasting stock prices. In order to achieve promising performance, most of these ways require careful selection of input variables, establishing predictive model with professional financial knowledge, and adopting various statistical methods for arbitrage analysis, which makes it difficult for people outside the financial field to use these methods to predict stock prices [10–12].

Generative adversarial network (GAN) was introduced by Goodfellow et al. [13], where images patches are generated from random noise using two networks trained simultaneously. Specifically, in GAN a discriminative net learns to distinguish whether a given data instance is real or not, and a generative net learns to confuse by generating high quality data. Although this approach has been successful and applied to a wide range of fields, such as image inpainting, semantic segmentation, and video prediction [14–16], as far as we know, it has not been used for stock forecasting.

This work uses basic technical index data as an input variable, which can be acquired directly from trading software, so that people outside the financial field can predict stock price through our method easily. This study introduces forecast error loss and direction prediction loss and shows that generative adversarial training [13] may be successfully employed for combining these losses to produce satisfying predict results, and we call this prediction architecture GAN-FD (GAN for minimizing forecast error loss and direction prediction loss). For the purpose of conforming to the practice of actual transactions, this work carries out rolling segmentation on training set and testing set of the raw data, and we will illustrate it in detail in the experimental section.

Overall, our main contributions are twofold: (1) we adapted generative adversarial network for the purpose of price prediction, which constitutes to our knowledge the first application of adversarial training to stock market, and extensive experiments show that our prediction model can achieve remarkable results and (2) we carry out rolling segmentation on training set and testing set of the raw data to investigate the effect the of model parameter update cycle on the stock forecast performance, and the experimental results show that smaller model update cycle can advance prediction performance.

In the remainder of this paper, we begin with a review of the literature on which algorithms have been used for the financial market prediction. Then we formulate the problem and propose our general adversarial network framework. Furthermore, in the experiments section, we presented the experimental analysis with the proposed model, as well as a comparison between the obtained results with those given by classical prediction models. Finally, conclusions and possible extensions are discussed.

This section introduce the related work from the stock market prediction method and the generative adversarial network.

2.1. Stock Market Prediction Method

According to the research developed in this field, we can classify the techniques used to solve the stock market prediction problems to twofold.

The first category of related work is econometric models, which includes classical econometric models for forecasting. Common methods are the autoregressive method (AR), the moving average model (MA), the autoregressive moving average model (ARMA), and the autoregressive integrated moving average (ARIMA) [17–19]. Roughly speaking, these models take each new signal as a noisy linear combination of the last few signals and independent noise terms. However, most of them rely on some strong assumptions with respect to the noise terms (such as i.i.d. assumption, -distribution) and loss functions, while real financial data may not fully satisfy these assumptions. By introducing a generalized autoregressive conditional heteroscedastic (GARCH) model for conditional variances, Pellegrini et al. [20] apply ARIMA-GARCH model to the prediction of financial time series.

The second category involves soft computing based models. Soft computing is a term that covers artificial intelligence which mimics biological processes. These techniques include artificial neural networks (ANN) [21, 22], fuzzy logic (FL) [23], support vector machines (SVM) [24, 25], particle swarm optimization (PSO) [26], and many others. Many authors have tried to deal with fuzziness along with randomness in option pricing models [27, 28]. Carlsson and Fullér [29] were the first to study the fuzzy real options and Thavaneswaran et al. [30] demonstrated the superiority of the fuzzy forecasts and then derived the membership function for the European call price by fuzzifying the interest rate, volatility, and the initial value of the stock price. Recently there has been a resurgence of interest in deep learning, whose basic structure is best described as a multilayer neural network [31]. Some literatures have established various models based on deep neural networks to improve the prediction ability of high-frequency financial time series [32, 33]. The ability of deep neural networks to extract abstract features from data is also attractive, Chong et al. [12] applied a deep feature learning-based stock market prediction model, which extract information from the stock return time series without relying on prior knowledge of the predictors and tested it on high-frequency data from the Korean stock market. Chen et al. [34] proposed a double-layer neural network for high-frequency forecasting, with links specially designed to capture dependence structures among stock returns within different business sectors. There also exist a few studies that apply deep learning to identification of the relationship between past news events and stock market movements [35–37].

However, to our knowledge, most of these methods require expertise to impose specific restrictions on the input variables, such as combining related stocks together as entry data [12], inputting different index data to different layers of the deep neural network [34], and converting news text into structured representation as input [36]. In contrast, our proposed forecasting model directly uses the data provided by the trading software as input, which reduce the barrier for ordinary investors.

2.2. Generative Adversarial Network

Generative adversarial network (GAN) is a framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model that captures the data distribution and a discriminative model that estimates the probability that a sample came from the training data rather than . The training procedure for is to maximize the probability of making a mistake. This framework corresponds to a minimax two-player game. In the space of arbitrary functions and D, a unique solution exists, with recovering the training data distribution and equal to 0.5 everywhere [13]. While and are defined by multilayer perceptrons in [13], most researches recently constructed and on the basis of Long Short-Term Memory (LSTM) [38] or convolutional neural network (CNN) [39] for a large variety of application.

LSTM is a basic deep learning model and capable of learning long-term dependencies. A LSTM internal unit is composed of a cell, an input gate, an output gate, and a forget gate. LSTM internal units have hidden state augmented with nonlinear mechanisms to allow state to propagate without modification, be updated, or be reset, using simple learned gating functions. LSTM work tremendously well on various problems, such as natural language text compression, handwriting recognition, and electric load forecasting.

CNN is a class of deep, feed-forward artificial neural networks that has successfully been applied to analyzing visual imagery. A CNN consists of an input layer and an output layer, as well as multiple hidden layers. The hidden layers of a CNN typically consist of convolutional layers, pooling layers, fully connected layers, and normalization layers. CNN also has many applications such as image and video recognition, recommender systems, and natural language processing.

Although there are a lot of literatures forecast stock price by using LSTM model, to the best of our knowledge, this paper is the first to adopt GAN to predict stock prices. The experimental part (Section 4.2) compares the prediction performances between GAN-FC and LSTM.

3. Forecasting with High-Frequency Data

In this section, we illuminate the details of the generative adversarial network framework for stock market forecasting with high-frequency data.

3.1. Problem Statement

Under the high-frequency trading environment, high-quality one-step forecasting is usually of great concern to algorithmic traders, providing significant information to market makers for risk assessment and management. In this article, we aim to forecast the price movement of individual stocks or the market index one step ahead, based solely on their historical price information. Our problem can be mathematically formalized as follows.

Let represent a set of basic indicators and denote the closing price of one stock for a 1-minute interval at time , where is the maximum lag of time. Given the historical basic indicators information and the past closing price , our goal is to predict the closing price for the next 1-minute time interval. There are literatures that examined the effects of different [7, 12, 40], but, in this work, we just set to 242 because each trading day contains 242-minute intervals in the China stock exchanges.

3.2. Prediction Model

The deep architecture of the proposed GAN-FD model is illustrated as in Figure 1. Since the stock data is a typical time series, we choose LSTM model, which is widely applied to time series prediction, as the generative model to predict output based on the input data ; that is,

The discriminative model is based on the CNN architecture and performs convolution operations on the one-dimensional input sequence in order to estimate the probability whether a sequence comes from the dataset or being produced by a generative model .

Our main intuition on why to use an adversarial loss is that it can simulate the operating habits of financial traders. An experienced trader usually predicts stock price through the available indicator data, which is the work of the generative model , and then judges the correct probability of his own forecast with the previous stock price, as the discriminative model does.

It is noteworthy that the structure of and in GAN-FD can be adjusted according to specific application, and the experimental part in this paper just proposed simple and framework (Section 4.2) for stock prediction. It is reasonable to believe that fine-tuning the structure of and can improve the predictive performance.

3.3. Adversarial Training

The training of the pair (, ) consists of two alternated steps, described below. For the sake of clarity, we assume that we use pure SGD (minibatches of size 1), but there is no difficulty to generalize the algorithm to minibatches of size by summing the losses over the samples.

Training (let be a sample from the dataset). In order to make the discriminative model as “confused” as possible, the generative model should reduce the adversarial loss in the sense that will not discriminate the prediction correctly. Classifying into class 1 and into class 0, the adversarial loss for iswhere is the sigmoid cross-entropy loss, defined as

However, in practice, minimizing adversarial loss alone cannot guarantee satisfying predictions. Imagine that could generate samples to “confuse” , without being close to , and then will learn to discriminate these samples, leading to generate other “confusing” samples, and so on. To address this problem, the generative model ought to decrease the forecast error loss; that is, losswhere or .

Furthermore, as mentioned above, stock price direction prediction is crucial to trading, so we define direction prediction loss function :where represents sign function.

Combining all these losses previously defined with different parameters , , and , we achieve the final loss on :

Then we perform one SGD iteration on to minimize while keeping the weights of fixed.

Training (let be a different data sample). Since the role of is just to determine whether the input sequence is or , the target loss is equal to the adversarial loss on D. While keeping the weights of fixed, we perform one SGD step on to minimize the target loss:

We train the generator and discriminator iteratively. The entire process is summarized in Algorithm 1, with minibatches of size .

(1)Set the learning rates and , and parameters
, , ;
(2)Initialize weights and .
(3)while not converged do
(4)Update the generator :
(5)Get new data samples (, ), (,
),…, (, )
(6)
(7)Update the discriminator :
(8)Get new data samples (, ), (,
),…, (, )
(9)
(10) end while

4. Experiments

4.1. Dataset

Next, we evaluate the performance of the proposed method based on the China stock market, ranging from January 1, 2016, to December 31, 2016. There are totally 244 trading days and each day contains 242-minute intervals, corresponding to 59048 time points. These stocks selected for the experiment should conform to three criteria: first, they should be the constituent stock of 300 (the CSI 300 is a capitalization-weighted stock market index designed to replicate the performance of 300 stocks traded in the Shanghai and Shenzhen stock exchanges); second, they were not suspended during the period we just mentioned, in case accidental events bring about significant impact on their price and affect forecast results; third, their closing prices in the start time, that is, January 1, 2016, are above 30 to ensure the volatility for high-frequency exchange. This leaves 42 stocks in the sample, which are listed in Table 1. The number of increasing directions and decreasing directions for each stock’s closing price per minute is also shown in Table 1, and their numbers are relatively close. The historical data was obtained from the Wind Financial Terminal, produced by Wind Information Inc. (the Wind Financial Terminal can be downloaded from http://www.wind.com.cn).

Many fund managers and investors in the stock market generally accept and use certain criteria for technical indicators as the signal of future market trends [12, 41]. This work selects 13 technical indicators as feature subsets by the review of domain experts and prior researches; that is, the input data at each moment (e.g., ) consists of 13 basic indicators that can be obtained directly from almost all trading software. These basic indicators are listed in Table 2, and their parameters are using the default value of the Wind Financial Terminal. As mentioned above, is defined as the closing price at each moment.

Most of the related articles use the traditional data partitioning method; that is, the entire dataset is directly split into training set and testing set [12, 22, 40, 42]. However, the trading style of the stock market changes frequently; for example, investors sometimes prefer stocks with high volatility and sometimes tend to invest in technology stocks. Therefore, we should update the model parameters regularly to adapt to the change of market style. In order to make experiments closer to real transactions, we carry out rolling segmentation on training set and testing set of the experimental data. As Figure 2 shows, in the beginning, we select the first days as training set, and the next days play the role of testing set. After the first round of experiments, we roll forward the time window for days, that is, choosing the day to the day as training set and the day to the day as testing set. Repeat until all the data has been experimented. In other words, this can be regarded as the model update cycle, and is the size of the corresponding training data.

4.2. Network Architecture

Given that the LSTM generator takes on the role of prediction and requires more accurate calculations of values than the CNN discriminator, we set the learning rate to 0.0004 and to 0.02. The LSTM cell in contains 121 internal (hidden) units and the parameters are initialized following the normal distribution . The architecture of discriminative model is presented in Table 3. We train GAN-FD with weighted by .

4.3. Benchmark Methods

To evaluate the performance of our proposed method, we include three baseline methods for comparison. The first model is ARIMA -GARCH, a fitted ARIMA model that forecasts future values of stock time series and the GARCH model forecasts future volatilities [20]. The second one is artificial neural networks (ANN). The parameter optimization method and model architectural is setting as in [21], except that the input layer node is changed to 13 and the network outputs the predicted value instead of two patterns (0 or 1). The third one is support vector machines (SVM). An RBF kernel is used and the parameter is setting as in [25].

We also inspect our GAN-FD model from several ways. The GAN-F model is using a GAN architectural for minimizing forecast error loss, with and . The GAN-D model is using a GAN architectural for minimizing direction prediction loss, with and . The LSTM-FD model is a LSTM model aiming at minimizing forecast error loss and direction prediction loss, with 121 internal units in LSTM. Obviously, the main difference between LSTM-FD and GAN-FD is the presence of adversarial training.

4.4. Evaluation Metrics

For each stock at each time , a prediction is made for the next time point based on a specific method. Assume the total number of time points being tested is ; we used the following criteria to evaluate the performance of different models.

(1) Root Mean Squared Relative Error (RMSRE)RMSRE is employed as an indicator for the predictive power or prediction agreement. A low RMSRE indicates that the prediction agrees with the real data (the reason why this paper uses RMSRE instead of RMSE is that RMSRE facilitates a uniform comparison of the results of 42 stocks).

(2) Direction Prediction Accuracy (DPA)whereDPA measures the percentage of accuracy relating to the series trend. A high DPA promises more winning trades.

4.5. Results

In order to investigate the effect of the model update cycle on the predictive performance, let and . In China stock exchange market, days represent one week, two weeks, one month, and one quarter.

Tables 4 and 5 show the average values of RMSRE and DPA with different (, ). The numbers clearly indicate that GAN-FD and its related methods perform better than three baseline methods in terms of RMSRE and DPA. This targeted method GAN-F brings some improvement in RMSRE, but it does not outperform three baseline methods in DPA. Contrary to GAN-F, GAN-D achieves better results in DPA but failed in RMSRE. LSTM-FD improves the results, since it combines forecast error loss with direction prediction loss for training. Finally the combination of the forecast error loss, direction prediction loss, and adversarial training, that is, GAN-FD, achieves the best RMSRE and DPA in the majority of scenarios.

Let us take a look at the effects of different (M, N) on the experiment. GAN-FD obtains the maximum average DPA (0.6956) and the minimum average RMSRE (0.0079) when (M, N) is (20, 5). It is interesting to note that all these methods work better when is 5 than when is 10 or 20, with smaller RMSRE and higher DPA. This implies that very short-term trends are best for predicting the next minute’s price. Therefore, a shorter model update cycle (e.g., is 5) is preferred. On the other hand, for the same , different will bring about some changes to the prediction results. From the experimental results, we suggest that should take the value greater than . This makes intuitive sense. If the training sample is inadequate, it would fail to train the model, especially in the volatile stock markets. We should also notice that when the training set is small while the testing set is large (i.e., (M, N) is (10, 20)), most of these methods perform the worst, and the DPA of these methods are no better than random guessing (i.e., 50%).

Table 6 shows the number of times for each method to achieve the minimum RMSRE over the 42 stocks. It is noticeable that the results of these three baseline methods are all zero. GAN-FD with its related methods is obviously better than these three baseline methods in RMSRE. Meanwhile, GAN-FD obtains the minimum RMSRE 246 times, accounting for 65.08% in these 378 scenarios (42 stocks and 9 groups (M, N)). The best performance appeared when (M, N) is (20, 5), with 40 stocks’ minimum RMSRE coming from GAN-FD.

Table 7 shows the number of times for each method to achieve the maximum DPA over the 42 stocks. Compared with the other six methods, GAN-FD achieves the maximum DPA 269 times, accounting for 71.16% in all scenarios. When (M, N) is (10, 5), the maximum DPA of 41 stocks in all 42 stocks comes from GAN-FD. Even when (M, N) is (20, 20), that is, the worst performance of GAN-FD cases, GAN-FD still obtains maximum DPA in 14 stocks. From the above analyses, the performance of the GAN-FD is significantly better than the other six ways.

The results of each representation are reported in Figures 3–11. We just focus on GAN-FD. As shown in Figures 3–5, the DPA of GAN-FD ranges around 64.59%–72.24% when is 5, and it slumps to 52.01%–62.71% when is 20, which is presented in Figures 9–11. When is 5, the RMSRE of GAN-FD over the 42 stocks varies between 0.48% and 1.49%, which is lower than other six methods in most cases, while the volatility is smaller. However, the RMSRE of GAN-FD increases dramatically and fluctuates violently when is 20, and it varies between 1.21% and 4.96%. This further shows that we should reduce the model update cycle and revise the model parameters regularly to adapt to the change of market style.

5. Conclusion

In this paper, we propose an easy-to-use stock forecasting model called GAN-FD, to assist more and more nonfinancial professional ordinary investors making decisions. GAN-FD adopts 13 simple technical indexes as input data to avoid complicated input data preprocessing. Based on the deep learning network, this model achieves prediction ability superior to other benchmark methods by means of adversarial training, minimizing direction prediction loss, and forecast error loss. Moreover, the effects of the model update cycles on the predictive capability are analyzed, and the experimental results show that the smaller model update cycle can obtain better prediction performance. In the future, we will attempt to integrate predictive models under multiscale conditions.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work is supported by the National Key Research Development Program of China (2017YFB0802800) and the National Natural Science Foundation of China (no. 61473149).

References

R. Al-Hmouz, W. Pedrycz, and A. Balamash, “Description and prediction of time series: a general framework of Granular Computing,” Expert Systems with Applications, vol. 42, no. 10, pp. 4830–4839, 2015.
View at: Publisher Site | Google Scholar
S. Barak and M. Modarres, “Developing an approach to evaluate stocks by forecasting effective features with data mining methods,” Expert Systems with Applications, vol. 42, no. 3, pp. 1325–1339, 2015.
View at: Publisher Site | Google Scholar
A. Booth, E. Gerding, and F. McGroarty, “Automated trading with performance weighted random forests and seasonality,” Expert Systems with Applications, vol. 41, no. 8, pp. 3651–3661, 2014.
View at: Publisher Site | Google Scholar
A. Bagheri, H. Mohammadi Peyhani, and M. Akbari, “Financial forecasting using ANFIS networks with Quantum-behaved Particle Swarm Optimization,” Expert Systems with Applications, vol. 41, no. 14, pp. 6235–6250, 2014.
View at: Publisher Site | Google Scholar
Y. Son, D.-J. Noh, and J. Lee, “Forecasting trends of high-frequency KOSPI200 index data using learning classifiers,” Expert Systems with Applications, vol. 39, no. 14, pp. 11607–11615, 2012.
View at: Publisher Site | Google Scholar
I. Aldridge and S. Krawciw, “Real-Time Risk: What Investors Should Know About FinTech,” in High-Frequency Trading, and Flash Crashes, John Wiley & Sons, Inc., Hoboken, NJ, USA, 2017.
View at: Publisher Site | Google Scholar
F. A. De Oliveira, C. N. Nobre, and L. E. Zárate, “Applying Artificial Neural Networks to prediction of stock price and improvement of the directional prediction index - Case study of PETR4, Petrobras, Brazil,” Expert Systems with Applications, vol. 40, no. 18, pp. 7596–7606, 2013.
View at: Publisher Site | Google Scholar
J. H. Niño-Peña and G. J. Hernández-Pérez, “Price direction prediction on high frequency data using deep belief networks,” Communications in Computer and Information Science, vol. 657, pp. 74–83, 2016.
View at: Publisher Site | Google Scholar
A. Marszałek and T. Burczyn'ski, “Modeling and forecasting financial time series with ordered fuzzy candlesticks,” Information Sciences, vol. 273, pp. 144–155, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
X. Li, X. Huang, X. Deng, and S. Zhu, “Enhancing quantitative intra-day stock return prediction by integrating both market news and stock prices information,” Neurocomputing, vol. 142, pp. 228–238, 2014.
View at: Publisher Site | Google Scholar
X. Wang, S. Bao, and J. Chen, “High-frequency stock linkage and multi-dimensional stationary processes,” Physica A: Statistical Mechanics and its Applications, vol. 468, pp. 70–83, 2017.
View at: Publisher Site | Google Scholar | MathSciNet
E. Chong, C. Han, and F. C. Park, “Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies,” Expert Systems with Applications, vol. 83, pp. 187–205, 2017.
View at: Publisher Site | Google Scholar
I. J. Goodfellow, J. Pouget-Abadie, M. Mirza et al., “Generative adversarial nets,” in Proceedings of the 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014, pp. 2672–2680, can, December 2014.
View at: Google Scholar
S. Iizuka, E. Simo-Serra, and H. Ishikawa, “Globally and locally consistent image completion,” ACM Transactions on Graphics, vol. 36, no. 4, article no. 107, 2017.
View at: Publisher Site | Google Scholar
P. Luc, C. Couprie, S. Chintala, and J. Verbeek, Semantic segmentation using adversarial networks, arXiv preprint, arXiv, 1611.08408, 2016, arXiv:1611.08408.
M. Mathieu, C. Couprie, and Y. LeCun, Deep multi-scale video prediction beyond mean square error, arXiv preprint, arXiv, 1511.05440, 2015, arXiv:1511.05440.
J. D. Hamilton, Time Series Analysis, vol. 2, Princeton University Press, 1994.
View at: MathSciNet
R. H. Shumway and D. S. Stoffer, Time series analysis and its applications, Springer Texts in Statistics, Springer, New York, NY, USA, 3rd edition, 2011.
View at: Publisher Site | MathSciNet
P. J. Brockwell and R. A Davis, TimE Series: Theory and Methods, Springer Science & Business Media, 2013.
S. Pellegrini, E. Ruiz, and A. Espasa, “Prediction intervals in conditionally heteroscedastic time series with stochastic components,” International Journal of Forecasting, vol. 27, no. 2, pp. 308–319, 2011.
View at: Publisher Site | Google Scholar
Y. Kara, M. Acar Boyacioglu, and Ö. K. Baykan, “Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange,” Expert Systems with Applications, vol. 38, no. 5, pp. 5311–5319, 2011.
View at: Publisher Site | Google Scholar
M. Ghiassi, J. Skinner, and D. Zimbra, “Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network,” Expert Systems with Applications, vol. 40, no. 16, pp. 6266–6282, 2013.
View at: Publisher Site | Google Scholar
M. R. Hassan, “A combination of hidden Markov model and fuzzy model for stock market forecasting,” Neurocomputing, vol. 72, no. 16-18, pp. 3439–3446, 2009.
View at: Publisher Site | Google Scholar
W. Huang, Y. Nakamori, and S.-Y. Wang, “Forecasting stock market movement direction with support vector machine,” Computers & Operations Research, vol. 32, no. 10, pp. 2513–2522, 2005.
View at: Publisher Site | Google Scholar
A. F., S. Elsir, and H. Faris, “A Comparison between Regression, Artificial Neural Networks and Support Vector Machines for Predicting Stock Market Index,” International Journal of Advanced Research in Artificial Intelligence, vol. 4, no. 7, 2015.
View at: Publisher Site | Google Scholar
R. Majhi, G. Panda, G. Sahoo, A. Panda, and A. Choubey, “Prediction of S&P 500 and DJIA stock indices using particle swarm optimization technique,” in Proceedings of the Proceeding of the IEEE Congress on Evolutionary Computation (CEC '08), pp. 1276–1282, Hong Kong, China, June 2008.
View at: Publisher Site | Google Scholar
S. S. Appadoo, Pricing Financial Derivatives with Fuzzy Algebraic Models: A Theoretical and Computational Approach [Ph.D. thesis], University of Manitoba, Winnipeg, 2006.
A. Thavaneswaran, K. Thiagarajah, and S. S. Appadoo, “Fuzzy coefficient volatility ({FCV}) models with applications,” Mathematical and Computer Modelling, vol. 45, no. 7-8, pp. 777–786, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
C. Carlsson and R. Fullér, “On possibilistic mean value and variance of fuzzy numbers,” Fuzzy Sets and Systems, vol. 122, no. 2, pp. 315–326, 2001.
View at: Publisher Site | Google Scholar
A. Thavaneswaran, S. S. Appadoo, and A. Paseka, “Weighted possibilistic moments of fuzzy numbers with applications to {GARCH} modeling and option pricing,” Mathematical and Computer Modelling, vol. 49, no. 1-2, pp. 352–368, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” Journal of Machine Learning Research, vol. 15, no. 1, pp. 1929–1958, 2014.
View at: Google Scholar | MathSciNet
A. M. Rather, A. Agarwal, and V. N. Sastry, “Recurrent neural network and a hybrid model for prediction of stock returns,” Expert Systems with Applications, vol. 42, no. 6, pp. 3234–3241, 2015.
View at: Publisher Site | Google Scholar
R. D. A. Araújo, A. L. I. Oliveira, and S. Meira, “A hybrid model for high-frequency stock market forecasting,” Expert Systems with Applications, vol. 42, no. 8, pp. 4081–4096, 2015.
View at: Publisher Site | Google Scholar
H. Chen, K. Xiao, J. Sun, and S. Wu, “A double-layer neural network framework for high-frequency forecasting,” ACM Transactions on Management Information Systems (TMIS), vol. 7, no. 4, article no. 11, 2017.
View at: Publisher Site | Google Scholar
A. Yoshihara, K. Fujikawa, K. Seki, and K. Uehara, “Predicting Stock Market Trends by Recurrent Deep Neural Networks,” in PRICAI 2014: Trends in Artificial Intelligence, vol. 8862 of Lecture Notes in Computer Science, pp. 759–769, Springer International Publishing, Cham, 2014.
View at: Publisher Site | Google Scholar
X. Ding, Y. Zhang, T. Liu, and J. Duan, “Deep learning for event-driven stock prediction,” in Proceedings of the 24th International Joint Conference on Artificial Intelligence, IJCAI 2015, pp. 2327–2333, arg, July 2015.
View at: Google Scholar
Z. Zhou, J. Zhao, and K. Xu, “Can online emotions predict the stock market in China?” in Proceedings of the International Conference on Web Information Systems Engineering, pp. 328–342, 2016.
View at: Google Scholar
S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.
View at: Publisher Site | Google Scholar
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2323, 1998.
View at: Publisher Site | Google Scholar
A. Arévalo, J. Niño, G. Hernández, and J. Sandoval, “High-frequency trading strategy based on deep neural networks,” in Proceedings of the International conference on intelligent computing, pp. 424–436, 2016.
View at: Publisher Site | Google Scholar
K.-J. Kim, “Financial time series forecasting using support vector machines,” Neurocomputing, vol. 55, no. 1-2, pp. 307–319, 2003.
View at: Publisher Site | Google Scholar
S. Madge, Predicting Stock Price Direction using Support Vector Machines, Independent Work Report Spring, 2015.

Copyright

Copyright © 2018 Xingyu Zhou et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

41235

Downloads

13888

Citations

Mathematical Problems in Engineering

Computational Intelligence in Data-Driven Modelling and Its Engineering Applications

Stock Market Prediction on High-Frequency Data Using Generative Adversarial Nets

Abstract

1. Introduction

2. Related Work

2.1. Stock Market Prediction Method

2.2. Generative Adversarial Network

3. Forecasting with High-Frequency Data

3.1. Problem Statement

3.2. Prediction Model

3.3. Adversarial Training

4. Experiments

4.1. Dataset

4.2. Network Architecture

4.3. Benchmark Methods

4.4. Evaluation Metrics

4.5. Results

5. Conclusion

Conflicts of Interest

Acknowledgments

References

Copyright