An Optimized Grey Dynamic Model for Forecasting the Output of High-Tech Industry in China
The grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and series related, abbreviated as GDMC, performs well in modelling and forecasting of a grey system. To improve the modelling accuracy of GDMC, interpolation coefficients (taken as unknown parameters) are introduced into the background values of the variables. The parameters optimization is formulated as a combinatorial optimization problem and is solved collectively using the particle swarm optimization algorithm. The optimized result has been verified by a case study of the economic output of high-tech industry in China. Comparisons of the obtained modelling results from the optimized GDMC model with the traditional one demonstrate that the optimal algorithm is a good alternative for parameters optimization of the GDMC model. The modelling results can assist the government in developing future policies regarding high-tech industry management.
Grey system theory [1, 2], a rising interdiscipline in uncertainty, takes the uncertain systems of “small samples and incomplete information” with “partial known information and partial unknown information” as the research object. This theory extracts valuable information mainly by generating and developing the “partial” known information. Then, the accurate demonstration of the system running behavior and the evolvement rule is done and effective monitoring is achieved. Grey system theory can be used like this to deal with the practical problems in a situation where the sample data size is not large enough to demonstrate the statistical law or the deterministic law of the system behavior.
At present, the most widely used grey model GM [1, 2]—a first-order one-variable grey differential equation—is proposed based on the aforementioned principle. Its modelling principle does not depend on distribution information from the raw data but on the application of a first-order accumulative generation operation (1-AGO), to make the sequence display grey exponent law behavior as a whole. Based on this, a first-order grey differential equation is constructed and solved. The forecast values are then derived from the first-order inverse accumulative generating operation (1-IAGO). Due to the fact that the construction of GM does not require a large sample and it is easy to build and calculate, GM and its improved variant models have been widely used [3–6].
The GM model , with relative variables acting as an associated series besides the predicted series, is a basic grey multivariable model. The forecasts of a time series may be considerably improved by using information coming from some associated series , . This is particularly true if changes in tend to be anticipated by changes in , . Nevertheless, the solution of the whitening differential equation for GM is rough —it can easily result in large errors in actual forecasting applications. Thus, for a long time, this model has been little used. In fact, the only existing research applications have proceeded on the basis of improved models [8–10]. The grey model with convolution integral, GMC, has been proposed by Tien . This is a new model which seeks to improve the traditional GM model. Theoretically, the modelling values produced by GMC are the exact solution of the traditional GM model and the grey control parameter (similar to that of the GM model) introduced into the model. When the number of variables in the model , the GMC model reduces to GM. Using GMC greatly improves the forecasting accuracy of the multivariable grey model and has been successfully applied to various different fields [11–13]. The multivariable grey model GDMC  is based on the model GMC. The higher frequency components of the 1-AGO data of the associated series were significant for the grey prediction system and the first-order derivatives of 1-AGO data of the associated series were added into the GDMC model.
In this study, interpolation coefficients are introduced into the background values of the variables in GDMC model. Then, by aiming to minimize the modelling error, the optimal parameters are solved using the particle swarm optimization algorithm. Thereby, the adaptability and forecasting accuracy of GDMC to real data are enhanced. The remainder of this paper is organized as follows. Section 2 explains the modelling methods, including the traditional GDMC and the optimization method. In Section 3, the traditional GDMC and the optimized GDMC are applied to forecast the output of high-tech industry in China. Finally, conclusions are presented in Section 4.
2.1. The Representation of the Grey Dynamic Model with Convolution Integral
Suppose that pairs of observations are available at equispaced time intervals consisting of inputs and an output from some dynamic system. The existing GDMC modelling process  is carried out as follows. Consider the original predicted series and the original associated series
Then the 1-AGO data for are given by the following equations, respectively:
The grey forecasting model based on the predicted 1-AGO series and the associated 1-AGO series is given by the following differential equation: where and are the developmental coefficient and the grey control parameter, respectively, and are the associated coefficients corresponding to the associated series , , respectively, is the number of entries for model building, is a period of delay, and are model parameters to be estimated.
Equation (7) is called the grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and series related, abbreviated as GDMC—the “1” represents the first-order derivative of the 1-AGO series of and the “” represents the total of relative series introduced into the grey differential equation.
2.2. The Evaluation of Parameters , and
The grey derivative for the first-order grey differential equation with 1-AGO is conventionally represented as
The background value of the grey derivative is taken as the mean of and . Those of the associated series are also taken as the mean of and for , respectively, for the determination of the model parameters in GDMC.
The least-squares solution for the model parameters in the GMC in (7) for t from 1 to r is given by where
2.3. The Evaluation of
In summary, in the right-hand side of (7), the discrete function can be obtained as
The 1-AGO modelling values of the predicted series can be derived using the initial condition as
The second term of the right-hand side in (13) can be evaluated approximately by the two-point Gauss numerical integration  with the linear assumption on between any two neighboring times; we have where the coefficients and are both equal to 1, the nodes and are equal to and , respectively, and is the unit step function. Applying 1-IAGO to (15) yields the following modelled values together with the forecasts: where is the number of entries to be forecasted or indirectly measured.
Assuming the system parameters in (7) to be constants in the postsampling period, then, by using the postsampling data combined with the data given for the corresponding associated series as a new input series, the corresponding forecasts or values of indirect measurement for the predicted series can be derived.
It is obvious that, when the number of associated series is zero, that is, when , (7) reduces to the grey single variable forecasting model GM.
2.4. Evaluation of the Modelling and Forecasting Accuracy
To evaluate the accuracy of the grey models Tien , who proposed the GDMC model, applied the root mean squared percentage error (RMSPE) to the priori sample period (RMSPEPR) and postsample period (RMSPEPO), respectively. Generally, the RMSPEPR and RMSPEPO are defined, respectively, as
2.5. Optimization of the GDMC Model Based on PSO Algorithm
In this study, to enhance the modelling and forecasting accuracy of the GDMC model, interpolation coefficients () are introduced into the background values of each of the variables in GDMC. Then the optimal , , are calculated with the objective of minimizing RMSPEPR.
Based on the above method, the background value of the grey derivative is taken as the weighted mean of and ; namely, . Those of the associated series are also taken as the weighted means of and ; that is, for in the determination of the model parameters in GDMC. When , the optimized GDMC model is reduced to a traditional GDMC model.
The particle swarm optimization (PSO) algorithm  is a population-based heuristic algorithm that simulates the social behavior as birds flocking to a promising position to achieve precise objectives in a multidimensional space. Each particle is a potential solution to the optimization problem. A particle represents a point in an -dimensional space, and its status is characterized through its position and velocity. The position for the particle can be represented as . The velocity of this particle can be represented by another -dimensional vector . The best previous position of the th particle can be represented as . The best position in the entire swarm is denoted as . To search for the optimal solution, each particle changes its velocity and position according to the following two formulas: where means the number of iterations. is called the inertia weight that controls the impact of previous velocity of particle on its current one. In a general way, we let , . and are positive constant parameters called acceleration factors which control the maximum step size. Typical values for and are . is a random number between zero and one.
To avoid the phenomenon of “shock” when particles near global optimal solution, we can employ a linear gradient strategy for the inertia weight: where is the total number of iterations.
This study uses the PSO algorithm to solve the optimal background value coefficients for GDMC model. The specific steps are as follows.
Step 1. Generate particles randomly in the -dimensional space. The position and velocity for the particle can be represented as
Step 2. Set the initial position of the particles as and generate an initial velocity vector randomly in the interval .
Step 3. According to (10), substitute into the matrix and establish the fitness function
Step 4. Calculate the fitness value for each particle.
Step 5. Compare each particle’s fitness to the global best position to adapt to the value; if it is the optimum, it can be taken as the best position; if not, turn to Step 6.
Step 6. Update particles velocity and position according to the evolution equation according to (19).
Step 7. If stopping criteria are met, show the output and its fitness, which corresponds to the optimal parameters and the RMSPEPR, or else go back to Step 3.
3. Forecasting the Output of High-Tech Industry in China
Forecasting the output of high-tech industry is essential for the development of projects and policy making. As Chinese high-tech industry is neonatal and so the data relating to existing economic indices are limited, it is difficult to apply existing statistical methods of analysis and forecasting to them. As a result, little research has been conducted on quantitative forecasting of Chinese high-tech industry. In this section, the advantage of the optimized GDMC model (abbreviated as OGDMC) over the traditional one is demonstrated by a real case study of high-tech industry in China.
3.1. Variables and Data
In economics theory, human resources and capital investment are the crucial factors of the economic system output . The economic output of high-tech industry is taken as the predicted series and is denoted as . And the annual average employment and investment are adopted as the relative variables of the output of high-tech industry and are denoted as and , respectively. The original data relating to the output, average employment, and investment of the high-tech industry in China for 2003–2010 are shown in Table 1. These data are all collected from the China Statistics Yearbook on High Technology Industry (2004–2011).
3.2. Modelling GDMC and OGDMC
In the following, the industrial output value is used as the forecasting variable. At the same time, with the average employment and the investment as relative variables, multivariable forecasting models GDMC and OGDMC models are established. The data of , , and from 2003 to 2010 is employed as the original modelling series to construct GDMC and OGDMC models.
Applying the GDMC model given by (7)–(16), the values of the parameters , , and in (7) and the estimates of the model parameters , , and in (10) can be obtained (Table 2). The resulting GDMC model from (7) has the form
Applying the OGDMC model given by (7)–(16) and the optimized method, the values of the parameters , , and in (7), the estimates of the model parameters , , and in (10), and the optimized parameters , , and can be obtained (listed in Table 4). The resulting OGDMC model from (7) has the form
3.3. Evaluation of the Grey Forecasting Models
In this study, the most commonly used four measurements in grey theory, including mean relative error (MRE), absolute degree of grey incidence (ADGI), ratio of standard deviation (RSD), and probability of small error (PSE) [2, 17], are used to evaluate the accuracy of the models involved. The levels of accuracy and their critical values are given in Table 6 . When all the measured values in a forecasting model meet the requirements of the critical values listed in the table, the model is applicable to the prediction. If they fall within the ranges of levels 1 and 2, this suggests that the model has high forecasting accuracy.
3.4. Empirical Results and Discussion
The modelling results for China’s high-tech industry using the two grey models described above are shown in Table 7 together with the four accuracy measurements. The results show that the GDMC model and the OGDMC model fall within levels 2 and 1, respectively. The OGDMC model reduces the MRE of the GDMC model from 0.0217 to 0.0045, increases the ADGI of the GDMC model from 0.94 to 1.00, and also reduces the RSD of the GDMC model from 0.12 to 0.06. This indicates that the OGDMC model can improve the modelling accuracy of the traditional model GDMC significantly.
Figure 1 demonstrates the degree of closeness between the modelling results of the models and the real values of the output of high-tech industry in China. Clearly, the model OGDMC is closer to the actual data than the traditional one. Moreover, as can be seen from a histogram of the modelling residual errors (Figure 2), the residual error from the OGDMC model is notably smaller than that of the GDMC. The reason for this lies in the fact that, through optimization of the background value, the modelling ability of the GDMC model can be further improved in the OGDMC model. The model coefficients , , , and can reveal the role and importance of employment and investment and their impact on the output of high-tech industry.
This study presents a PSO-algorithm-based optimization method for the grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and series related. According to empirical modelling results of the high-tech industry in China, the modelling accuracy of the traditional GDMC model can be effectively increased using a background value optimization method as proposed in this study. This is important in practice as the government of high-tech industry needs to make decisions based on traditional parameter solutions which may be constrained by an incorrect local optimal solution.
Due to the lack of more additional data published by China’s Statistic Department at present, the out-of-sample and the time-delayed forecasting results of this study have not fully been validated. Therefore, in future work, the out-of-sample and the time-delayed forecasting using both the GDMC and the OGDMC needs to be taken into account after more data is released by China’s Statistic Department. In addition, the grey forecasting model established in this study merely considers two basic input factors, that is, capital and labor, while ignoring the technical level. The main reason lies in the fact that sample data are rather limited in the short development of China’s high-tech industries. Moreover, it is challengeable to estimate the technology level of high-tech industries using small sample data. To address this problem, further researches are also needed.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The authors are grateful to the editors and the anonymous reviewers for their insightful comments and suggestions. The authors also thank the National Natural Science Foundation of China (Grant no. 71101132), the Philosophy and Social Science Foundation of Zhejiang Province, China (Grant no. 13ZJQN029YB), and the Academic Climbing Project for Young and Middle-Aged Leading Academic in the Universities of Zhejiang Province, China (Grant no. PD2013275), for financially supporting this study.
J. L. Deng, The Elements on Grey Theory, Huazhong University of Science and Technology Press, Wuhan, China, 2nd edition, 2002.
S. F. Liu and Y. Lin, Grey Information Theory and Practical Applications, Springer, London, UK, 1st edition, 2006.
W.-Y. Wu and S.-P. Chen, “A prediction method using the grey model GMC(1,n) combined with the grey relational analysis a case study on internet access population forecast,” Applied Mathematics and Computation, vol. 169, no. 1, pp. 198–217, 2005.View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
S. Gao and J. Y. Yang, Swarm Intelligence Algorithm and Its Application, China Water Power Press, Beijing, China, 2006.
Z. N. Li, Econometrics, Higher Education Press, Beijing, China, 2nd edition, 2010.