Research Article  Open Access
Body Fat Percentage Prediction Using Intelligent Hybrid Approaches
Abstract
Excess of body fat often leads to obesity. Obesity is typically associated with serious medical diseases, such as cancer, heart disease, and diabetes. Accordingly, knowing the body fat is an extremely important issue since it affects everyone’s health. Although there are several ways to measure the body fat percentage (BFP), the accurate methods are often associated with hassle and/or high costs. Traditional singlestage approaches may use certain body measurements or explanatory variables to predict the BFP. Diverging from existing approaches, this study proposes new intelligent hybrid approaches to obtain fewer explanatory variables, and the proposed forecasting models are able to effectively predict the BFP. The proposed hybrid models consist of multiple regression (MR), artificial neural network (ANN), multivariate adaptive regression splines (MARS), and support vector regression (SVR) techniques. The first stage of the modeling includes the use of MR and MARS to obtain fewer but more important sets of explanatory variables. In the second stage, the remaining important variables are served as inputs for the other forecasting methods. A real dataset was used to demonstrate the development of the proposed hybrid models. The prediction results revealed that the proposed hybrid schemes outperformed the typical, singlestage forecasting models.
1. Introduction
In recent years, cancer, heart disease, and diabetes have been reported to be the leading causes of death for most countries in the world [1, 2]. One of the most common risk factors for those diseases is the obesity, and excess of body fat often leads to obesity. One common cause of heart disease is the process called atherosclerosis. This happens when fat accumulates in the blood vessels, and it usually results in the thicker walls of the vessels. The thicker walls lead to a reduced flow of blood to the heart, and the heart becomes damaged resulting in a heart attack. It has been reported that excess body fat can increase the risk of six different types of cancers, including bowel, oesophagus, pancreas, kidney, endometrium, and breast cancers, respectively [3]. In addition, type II diabetes is also found in those who are carrying too much body fat. Therefore, how to avoid obesity has become a very important issue.
Although excess of body fat causes obesity, extremely low BFP is also undesirable, as there are minimum requirements for brain function. Accordingly, knowing the BFP can provide a great deal of information regarding the current state of health.
Maintaining a good BFP is a must for human health. However, accurate and convenient ways to measure body fat are not straightforward [4, 5]. For example, the hydrostatic weighing was reported to be a reliable method for the measurement of body fat content, but it is not convenient [4]. A new technology, Dual Energy Xray Absorptiometry (DEXA), is very accurate and precise to measure the body fat. However, DEXA may suffer from standardization issues; that is, results may vary with the specific equipment manufacturer, data collection methods, and/or software analysis. Consequently, it is desirable to have some convenient methods to predict the BFP.
While some typical studies have employed data mining techniques to classify the existence of certain diseases [2, 6, 7], the present study focuses on the development of intelligent forecasting techniques to effectively predict the BFP. The body fat datasets used in this study were real data obtained from Johnson [5]. The datasets contain the BFP which were determined by underwater weighing and 13 body circumference measurements for 252 people. Although one can employ the aforementioned variables to predict BFP through the use of multiple regression (MR) techniques or some machine learning approaches, the true relationship between these measurements and BFP may not be easy to determine. Several studies have used multiple regression techniques to build a forecasting model to estimate the BFP [5, 8–10]. However, the MR models are criticized for its strong assumptions such as variation homogeneity [2]. In addition, some data mining techniques, such as artificial neural network (ANN), multivariate adaptive regression splines (MARS), and support vector regression (SVR), have become alternatives in modeling forecasting problems due to their capability to capture complex nonlinear relationships among variables [11–13]. Those data mining techniques have been reported to have better forecasting capability than the regression technique [14–18]. Nevertheless, those techniques may have some limitations. For example, ANN has been criticized for its long training process in designing the optimal network’s topology. Also, ANN is unable to identify the relative importance of potential input variables [19–22]. Additionally, using a single technology to address all of the prediction problems may not always be possible [23].
To overcome the aforementioned difficulties and maintain the prediction accuracies of existing approaches for BFP, this study is aimed at proposing single and hybrid forecasting models to predict BFP. The singlestage forecasting modeling includes MR, ANN, MARS, and SVR approaches. The hybrid models integrate two modeling components. The first component of the model uses its own feature to capture the important but fewer explanatory variables. The second component of the hybrid schemes generates the predictions based on those explanatory variables. Because MR and MARS have great capability to select the important explanatory variables, in this study, the combinations of MR and ANN (MRANN), MR and MARS (MRMARS), MR and SVR (MRSVR), MARS and MR (MARSMR), MARS and ANN (MARSANN), and MARS and SVR (MARSSVR) are employed as the hybrid forecasting models.
In terms of prediction capability, this study compares the typical singlestage forecasting models and the proposed hybrid model for BFP application. The mean absolute percentage error (MAPE), the root mean square error (RMSE), and the mean absolute difference (MAD) are used as the forecasting accuracy measures. The superior prediction capability of the proposed hybrid approach is addressed. The remainder of this study is organized as follows. The following section introduces the methodologies of MR, ANN, MARS, and SVR, and a literature review is also provided. The designed MR, ANN, MARS, and SVR models are presented in Section 3, and a real BFP dataset is used to verify the typical and proposed forecasting models. The performances for all of the forecasting models are demonstrated and discussed. The final section addresses the research findings and concludes this study.
2. Research Methodologies
This study considers MR, ANN, MARS, SVR, and their hybrid modeling schemes as possible forecasting models for BFP. These methodologies are addressed as follows.
2.1. Multiple Regression Modeling
Multiple regression analysis can be deemed as one of the most used statistical methods in modeling realworld applications. The MR involves setting up the relationships between one response variable and several explanatory variables. The performance of MR is acceptable when the assumptions have been met. However, the assumptions of the MR model may confine its application. The general MR model is represented as follows: where are model parameters and is the error term. The accounts for the variability in that cannot be explained by the linear effect of the explanatory variables. There are four assumptions about the in MR model, and they are as follows.(1)The is a normally distributed random variable.(2)The is a random variable with a mean value of zero; that is, .(3)The variance of is denoted by and is the same for all values of the explanatory variables .(4)The values of are independent.
Also, because collinearity among explanatory variables will lead to imprecise estimates and serious stability problems, the collinearity diagnosis procedure should be performed first before screening important explanatory variables. In this study, a wellknown criterion, the variance inflation factor (VIF), is applied to examine collinearity. The VIF is described as follows: where is the coefficient of determination of a regression that evaluates all other explanatory variables. The tolerance is defined as the reciprocal of the VIF. It has been suggested that when the value of VIF is greater than 10, the sample set may have enough variation to suggest serious multicollinearity. This study used the technique where one or some explanatory variables could be dropped from the model in order to lessen the collinearity and thus reduce the standard errors of the estimated regression coefficients of the explanatory variables remaining in the model. In addition to the simplicity and effectiveness, this technique has another advantage of reducing the number of explanatory variables. This characteristic is very suitable for hybrid modeling since it usually captures less explanatory variables for the initial stage of modeling.
Typically, when considerable explanatory variables are involved in the MR design, a great amount of computation is required for examining a large volume of computer outputs, much of which are associated with poor MR models. As a consequence, three variable selection procedures are employed in this study. Those three selections include forward selection, backward elimination, and stepwise regression procedures. Given a BFP dataset with 13 explanatory variables, this study uses the aforementioned selection procedures to select the explanatory variables that lead to the best model.
2.2. Artificial Neural Network Modeling
Due to ANN’s associated memory characteristic and its generalization capability, ANN has been increasingly utilized for modeling nonstationary processes [24–28].
ANN is usually classified into two categories: feedforward and feedback networks [28]. The nodes in the ANN can be divided into three layers: the input, the output, and one or more hidden layers. The output of each neuron in the input layer is the same as the input to that neuron. For each neuron in the hidden layer and neuron in the output layer, the net inputs are given by where is a neuron in the previous layer, is the output of node , and is the connection weight from neuron to neuron . The neuron outputs are given by where is the input signal from the external source to the node in the input layer and is the bias. The transformation function shown in (4) to (6) is called a sigmoid function.
The generalized delta rule is the conventional technique used to derive the connection weights of the feedforward network [28]. Initially, a set of random numbers is assigned to the connection weights. Then, to determine the pattern with a target output vector , the sum of the minimized squared error is given by where is the number of output nodes.
2.3. Multivariate Adaptive Regression Splines Modeling
MARS has been generally applied in many fields [29–33]. The general MARS function can be represented as follows [33]: where and are parameters, is the number of basis functions (BF), is the number of knots, takes on values of either 1 or −1 and indicates the right or left sense of the associated step function, is the independent variable, and is the knot location.
A twostage process is usually used to choose the optimal MARS model. Initially, a large number of basis functions were used to fit the data. Secondly, the basis functions with the least contributions were deleted using generalized crossvalidation (GCV) criterion. A measure of variable importance can be obtained by observing the decrease in the calculated GCV values when a variable is removed from the model. The GCV can be expressed as follows:
2.4. Support Vector Regression Modeling
While support vector machine (SVM) is a powerful technique in machine learning areas, SVR can be deemed as a special form of SVMs. Due to its prediction capability, SVR has been used for predictions in many fields [34–36]. Based on the computation of a linear regression function in a highdimensional feature space, the inputs for SVR are mapped via a nonlinear function. The modeling of SVR can be described as follows. Suppose where is the weight vector, represents the model inputs, is a bias, and stands for a kernel function which uses a nonlinear function to transform the nonlinear input to be linear mode in a highdimensional feature space.
Usually, the regression modeling obtains the coefficients through minimizing the square error, which can be considered as empirical risk based on loss function. The insensitivity loss function was introduced [37], and it can be described as follows: where is the target outputs; defines the region of insensitivity. When the predicted value falls into the band area, the loss is zero. However, when the predicted value falls outside the band area, the loss is defined as the difference between the predicted value and the margin.
When empirical risk and structure risk are both considered, the SVR can be set up to minimize the following quadratic programming problem: where is the number of training data, represents the empirical risk, stands for the structure risk preventing overlearning and lack of applied universality, and is a modifying coefficient representing the tradeoff between empirical risk and structure risk. With an appropriate modifying coefficient , band area width , and kernel function, the optimum value of each parameter can be solved by Lagrange procedure.
The SVRbased regression function can be described as follows [37]: where and are Lagrangian multipliers and satisfy the equality . Additionally, since the radial basis function (RBF) is the most widely used kernel function [36], this study uses it for our experimental study. The RBF can be defined as follows: where denotes the width of the RBF.
3. BFP Data and Modeling Results
3.1. The BFP Dataset
In order to compare the forecasting performance of the singlestage and the proposed hybrid models, a real dataset of BFP is analyzed [5]. The dataset consists of 252 records. Each person consists of 15 variables and they are summarized in Table 1 (i.e., readers can refer to the website http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_BMI_Regression for more details and descriptions about the dataset).

In this dataset, the response variable BFP is denoted by . Since the BFP is difficult to measure, Siri’s equation, as in (15), is used to compute the value of :
In the initial “cleaning” phase for this real dataset, this study has found the body fat measurements of the 96th, 172nd, and 182nd cases are 0.4, 0.7, and −3.6, respectively. Generally speaking, those three measurements deviate too much from normal conditions, and this study has decided to delete those three cases. This study also deleted the 42nd case where the height is only 74.93 cm. Consequently, the sample size becomes 248 cases. With the 248 cases used in this study, the first 174 cases (around 70% of the total cases) were selected as the model training sample while the remaining 74 (around 30% of the total cases) will be retained as the testing sample.
3.2. Typical SingleStage Modeling Approaches
This study considers the BFP (i.e., ) as the response variable and the thirteen body circumference measurements (i.e., ) as the explanatory variables. The first variable, density determined from underwater weighing, in Table 1 is just for computing the values of BFP. To exclude variables with high collinearity, the Pearson correlation coefficients between variables are used. Table 2 shows the corresponding results. When the correlation coefficient between variables and is greater than 0.7, we exclude the variable that has a lower relationship with (i.e., exclude the variable with a smaller correlation coefficient ). After discarding variables with high collinearity, eight explanatory variables , , , , , , , and remain in the MR models.

In addition, this study employed three selection techniques to develop alternative MR models for BFP. These three techniques include forward selection, backward elimination, and the stepwise regression analysis. The forward selection is similar to the stepwise selection. The first explanatory variable is selected for inclusion of the regression equation is the one with the largest positive or negative correlation with the response variable, . This explanatory variable is entered into the regression equation only if it satisfies the tolerance criterion for entry. If the first variable is entered, the explanatory variable not in the regression equation that has the largest partial correlation is considered next. The forward selection procedure would stop when there are no explanatory variables that meet the entry criterion. The back elimination procedure initially considers all explanatory variables to be included in the regression equation and then sequentially removed. The explanatory variable with the smallest partial correlation with the dependent variable is first for the removal. If that variable meets the tolerance criterion for elimination, it is removed. After the first variable is removed, the variable remaining in the regression equation with the smallest partial correlation is considered next. The procedure would stop when there are no variables in the regression equation that satisfy the removal criteria. At each step for the stepwise selection procedure, the explanatory variable which is not in the regression equation that has the smallest probability of is entered, if the probability is sufficiently small. Explanatory variables that have already existed in the regression equation are removed if their probability of is sufficiently large. The stepwise selection procedure would stop when no more variables are eligible for inclusion or removal.
In this study, all those three selection procedures resulted in the same MR model. Table 3 lists the results of the parameter estimates. As shown in Table 3, all the values of VIFs of the remaining variables are smaller than 10. Accordingly, there is no high collinearity among these explanatory variables. This model is described in the following:

It has been reported that more than 75% of neural networks applications would employ the BPN structure, and, thus, this study uses the BPN in building the ANN forecasting model [8, 38]. Also, since onehiddenlayer network has been reported to be sufficient to model the complex system, this study considers one hidden layer for the ANN modeling structures [39–41]. This study uses 13 input nodes (or explanatory variables) and one output node. The hidden nodes range from to , where is the number of input variables. Thus, the hidden nodes are chosen as 11, 12, 13, 14, and 15, respectively. The ANN model has only one output node, the prediction of BFP. According to the findings of [42], the learning rates were set to be 0.01, 0.005, and 0.001, respectively.
In addition, since MAPE is one of the most important performance measurements for the forecasting capability, this study uses the smallest MAPE as the criterion for selecting the ANN topology. After performing the ANN modeling, this study found that the {13111} topology with a learning rate of 0.01 provides the best results and a minimum testing MAPE. Here,  stands for the number of neurons in the input layer, number of neurons in the hidden layer, and number of neurons in the output layer, respectively. Table 4 presents the corresponding MAPE values for various settings of the ANN topologies. Accordingly, the ANN topology of {13111} with a learning rate of 0.01 is chosen for the model of ANN alone.

This study also performed the MARS modeling to the BFP dataset; the selection results are displayed in Table 5. During the selection process, four important explanatory variables were chosen. The corresponding relative importance indicators are shown in the last column of Table 5. As a consequence, those seven important variables would be served as the input variables for intelligent hybrid modeling process. For SVR modeling, same as ANN modeling, we have 13 input variables. The two parameters, and gamma, were estimated to be 2^{3} and 2^{−7}, respectively.

3.3. Proposed Hybrid Modeling Approaches
In this study, the initial stage of the proposed hybrid modeling is to obtain the fewer but more important input variables for the second stage of forecasting models. Because this study utilizes MR and MARS modeling selections, the explanatory variables which were selected from MR and MARS models were used to serve as the input variables for other prediction models. Accordingly, this study employs six combinations of the candidate hybrid models to predict the BFP. Those hybrid models include MRANN, MRMARS, MRSVR, MARSMR, MARSANN and MARSSVR, respectively.
For the hybrid MRANN model, this study sets up 6 input nodes in the input layer, and the number of hidden nodes was set to 4, 5, 6, 7, and 8. The learning rates were identical to those used in the singlestage ANN model. The {661} topology with a learning rate of 0.01 provided the best results for the hybrid MRANN model. For the MARSANN hybrid model, this study used 4 input nodes in the input layer. The number of hidden nodes was set to 2, 3, 4, 5, and 6. Accordingly, the {431} topology with a learning rate of 0.01 provided the best results. For the other hybrid models, this study used 6 and 4 important explanatory variables, which are selected by using MR and MARS models, respectively, to predict BFP.
3.4. Prediction Results and Performance Comparison
In addition to presenting the singlestage prediction modeling, this study develops various hybrid models for prediction of BFP. This study considers the forecasting accuracy measures of MAPE, MSE, and MAD to address the forecasting performance for typical singlestage and the proposed hybrid models. The prediction measurements are defined as follows: where stands for the residual at time . A low MAPE, MSE, or MAD is associated with better forecasting accuracy.
The prediction results for single and the proposed hybrid modeling are listed in Table 6. In Table 6, by considering singlestage modeling approaches, we note that SVR model has the best performance in terms of MAPE, RMSE, or MAD indices. For the hybrid models, the MRMARS has the best prediction performance. In comparison to the singlestage and the proposed hybrid models in Table 6, one is able to observe that some of our proposed hybrid models provide more accurate results than the singlestage models. For example, in terms of MAPE, MSE, or MAD, the proposed hybrid MRMARS model possesses the lowest values among all the models.

Accordingly, the MAPE percentage improvements of the proposed MRMARS model over the single MR and MARS models are 4.2% and 9.5%, respectively. In addition, the MAPE percentage improvements of the proposed MRSVR model over the single MR and SVR models are 4.1% and 3.8%, respectively. Although the hybrid ANN models do not make any significant improvements, the fewer body circumference measurements were used for our proposed approach. This is another significant advantage of using our hybrid models.
4. Conclusion
Maintaining appropriate body fat is very crucial for human’s health. However, the measurement of the BFP is not straightforward. Accordingly, this study proposes the hybrid models to effectively predict BFP. Although the 13 body circumference measurements are involved in the real dataset, the proposed models are able to provide better predictions with fewer body circumference measurements. Actually, it is much more convenient to predict BFP with fewer body circumference measurements for most of the people.
The rationale of the proposed hybrid modeling was initially to obtain fewer important explanatory variables by performing MR and MARS modeling techniques. Those important variables were served as inputs for other designed prediction models. According to the modeling results, the proposed hybrid approaches were most appropriate for predicting the BFP. Additionally, the proposed hybrid MRMARS model was the best alternative because it contained fewer number of explanatory variables and provided the best prediction MAPE precision.
In order to obtain more accurate predictions of BFP, in addition to the existing 13 body circumference measurements, one may collect some other measurements so that the prediction precision can be increased. Moreover, the proposed hybrid modeling is not the only prediction technique that can be employed. One may combine other data mining techniques, such as rough set or genetic algorithms, to refine the structure of the hybrid models. The possibility to apply the same procedure to combine other methods as are the evolving systems deserves further research.
Conflict of Interests
The author declares that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work was partially supported by the National Science Council of the Republic of China (Grant no. NSC 1022221E030019). The author also gratefully acknowledges the helpful comments and suggestions of the reviewers, which have improved the presentation.
References
 M. Heron, National Vital Statistics Reports, vol. 59, 2011.
 Y. E. Shao, C. D. Hou, and C. C. Chiu, “Hybrid intelligent modeling schemes for heart disease classification,” Applied Soft Computing, vol. 14, part A, pp. 47–52, 2014. View at: Google Scholar
 http://www.wcrfuk.org/cancer_prevention/recommendations/weight_and_cancer.php .
 R. J. Maughan, “An evaluation of a bioelectrical impedance analyser for the estimation of body fat content,” British Journal of Sports Medicine, vol. 27, no. 1, pp. 63–66, 1993. View at: Google Scholar
 R. W. Johnson, “Fitting percentage of body fat to simple body measurements,” Journal of Statistics Education, vol. 4, no. 1, 1996. View at: Google Scholar
 J. de Jesús Rubio, F. OrtizRodriguez, C. R. MariacaGaspar, and J. C. Tovar, “A method for online pattern recognition of abnormal eye movements,” Neural Computing and Applications, vol. 22, no. 34, pp. 597–605, 2013. View at: Publisher Site  Google Scholar
 D. M. Vazquez, J. J. Rubio, and J. Pacheco, “A characterization framework for epileptic signals,” IET Image Processing, vol. 6, pp. 1227–1235, 2013. View at: Google Scholar
 O. L. Svendsen, J. Haarbo, B. L. Heitmann, A. Gotfredsen, and C. Christiansen, “Measurement of body fat in elderly subjects by dualenergy xray absorptiometry, bioelectrical impedance, and anthropometry,” The American Journal of Clinical Nutrition, vol. 53, no. 5, pp. 1117–1123, 1991. View at: Google Scholar
 D. S. Freedman, W. H. Dietz, S. R. Srinivasan, and G. S. Berenson, “Risk factors and adult body mass index among overweight children: the bogalusa heart study,” Pediatrics, vol. 123, no. 3, pp. 750–757, 2009. View at: Publisher Site  Google Scholar
 T. Sakai, S. Demura, and K. Fujii, “Validity of fat percentage evaluation for fatthin judgement derived from standard weight in preschool children,” Sport Sciences for Health, vol. 5, no. 3, pp. 105–112, 2009. View at: Publisher Site  Google Scholar
 Y. E. Shao and C. D. Hou, “Fault identification in industrial processes using an integrated approach of neural network and analysis of variance,” Mathematical Problems in Engineering, vol. 2013, Article ID 516760, 7 pages, 2013. View at: Publisher Site  Google Scholar
 C. J. Lu and Y. E. Shao, “Forecasting computer products sales by integrating ensemble empirical mode decomposition and extreme learning machine,” Mathematical Problems in Engineering, vol. 2012, Article ID 831201, 15 pages, 2012. View at: Publisher Site  Google Scholar
 R. Abirami and M. S. Vijaya, “Stock price prediction using support vector regression,” Communications in Computer and Information Science, vol. 269, pp. 588–597, 2012. View at: Google Scholar
 X. Ren and X. Lv, “Identification of extended hammerstein systems using dynamic selfoptimizing neural networks,” IEEE Transactions on Neural Networks, vol. 22, no. 8, pp. 1169–1179, 2011. View at: Publisher Site  Google Scholar
 J. H. PerezCruz, J. J. Rubio, E. RuizVelazquez, and G. SolisPerales, “Tracking control based on recurrent neural networks for nonlinear systems with multiple inputs and unknown deadzone,” Abstract and Applied Analysis, vol. 2012, Article ID 471281, 18 pages, 2012. View at: Publisher Site  Google Scholar
 J. J. Rubio, “Modified optimal control with a backpropagation network for robotic arms,” IET Control Theory and Applications, vol. 6, pp. 2216–2225, 2012. View at: Google Scholar
 J. P. Donate, X. Li, G. G. Sánchez, and A. S. de Miguel, “Time series forecasting by evolving artificial neural networks with genetic algorithms, differential evolution and estimation of distribution algorithm,” Neural Computing and Applications, vol. 22, no. 1, pp. 11–20, 2013. View at: Publisher Site  Google Scholar
 Y. E. Shao, “Prediction of currency volume issued in Taiwan using a hybrid artificial neural network and multiple regression approach,” Mathematical Problems in Engineering, vol. 2013, Article ID 676742, 9 pages, 2013. View at: Publisher Site  Google Scholar
 C.J. Lu, Y. E. Shao, and P.H. Li, “Mixture control chart patterns recognition using independent component analysis and support vector machine,” Neurocomputing, vol. 74, no. 11, pp. 1908–1914, 2011. View at: Publisher Site  Google Scholar
 S.M. Chou, T.S. Lee, Y. E. Shao, and I.F. Chen, “Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines,” Expert Systems with Applications, vol. 27, no. 1, pp. 133–142, 2004. View at: Publisher Site  Google Scholar
 Y. E. Shao, C. J. Lu, and Y. C. Wang, “A hybrid ICASVM approach for determining the fault quality variables in a multivariate process,” Mathematical Problems in Engineering, vol. 2012, Article ID 284910, 12 pages, 2012. View at: Publisher Site  Google Scholar
 C. J. Lu, Y. E. Shao, and C. C. Li, “Recognition of concurrent control chart patterns by integrating ICA and SVM,” Applied Mathematics & Information Sciences, vol. 8, no. 2, pp. 6811–6689, 2014. View at: Google Scholar
 Y. E. Shao and C.D. Hou, “Change point determination for a multivariate process using a twostage hybrid scheme,” Applied Soft Computing, vol. 13, no. 3, pp. 1520–1527, 2013. View at: Publisher Site  Google Scholar
 C.C. Chiu, Y. E. Shao, T.S. Lee, and K.M. Lee, “Identification of process disturbance using SPC/EPC and neural networks,” Journal of Intelligent Manufacturing, vol. 14, no. 34, pp. 379–388, 2003. View at: Publisher Site  Google Scholar
 Y. E. Shao, C.J. Lu, and C.C. Chiu, “A fault detection system for an autocorrelated process using SPC/EPC/ANN AND SPC/EPC/SVM schemes,” International Journal of Innovative Computing, Information and Control, vol. 7, no. 9, pp. 5417–5428, 2011. View at: Google Scholar
 A. Azadeh, M. Saberi, A. Kazem, V. Ebrahimipour, A. Nourmohammadzadeh, and Z. Saberirw, “A flexible algorithm for fault diagnosis in a centrifugal pump with corrupted data and noise based on ANN and support vector machine with hyperparameters optimization,” Applied Soft Computing, vol. 13, no. 3, pp. 1478–1485, 2013. View at: Google Scholar
 R. Hedjar, “Adaptive neural network model predictive control,” International Journal of Innovative Computing, Information and Control, vol. 9, pp. 1245–1257, 2013. View at: Google Scholar
 B. Cheng and D. M. Titterington, “Neural networks: a review from a statistical perspective,” Statistical Science, vol. 9, pp. 2–30, 1994. View at: Google Scholar
 F. Vidoli, “Evaluating the water sector in Italy through a two stage method using the conditional robust nonparametric frontier and multivariate adaptive regression splines,” European Journal of Operational Research, vol. 212, no. 3, pp. 583–595, 2011. View at: Publisher Site  Google Scholar
 V. L. Pilla, J. M. Rosenberger, V. Chen, N. Engsuwan, and S. Siddappa, “A multivariate adaptive regression splines cutting plane approach for solving a twostage stochastic programming fleet assignment model,” European Journal of Operational Research, vol. 216, no. 1, pp. 162–171, 2012. View at: Publisher Site  Google Scholar
 J. de Andrés, P. Lorca, F. J. de Cos Juez, and F. SánchezLasheras, “Bankruptcy forecasting: a hybrid approach using fuzzy cmeans clustering and multivariate adaptive regression splines (MARS),” Expert Systems with Applications, vol. 38, no. 3, pp. 1866–1875, 2011. View at: Publisher Site  Google Scholar
 M. JalaliHeravi, M. AsadollahiBaboli, and A. ManiVarnosfaderani, “Shuffling multivariate adaptive regression splines and adaptive neurofuzzy inference system as tools for QSAR study of SARS inhibitors,” Journal of Pharmaceutical and Biomedical Analysis, vol. 50, no. 5, pp. 853–860, 2009. View at: Publisher Site  Google Scholar
 H. M. Azamathulla and F.C. Wu, “Support vector machine approach for longitudinal dispersion coefficients in natural streams,” Applied Soft Computing, vol. 11, no. 2, pp. 2902–2905, 2011. View at: Publisher Site  Google Scholar
 M. CastroNeto, Y.S. Jeong, M.K. Jeong, and L. D. Han, “OnlineSVR for shortterm traffic flow prediction under typical and atypical traffic conditions,” Expert Systems with Applications, vol. 36, no. 3, pp. 6164–6173, 2009. View at: Publisher Site  Google Scholar
 S. Pang, L. Song, and N. Kasabov, “Correlationaided support vector regression for forex time series prediction,” Neural Computing and Applications, vol. 20, no. 8, pp. 1193–1203, 2011. View at: Publisher Site  Google Scholar
 W. Dai, Y. E. Shao, and C. J. Lu, “Incorporating feature selection method into support vector regression for stock index forecasting,” Neural Computing and Applications, vol. 23, no. 6, pp. 1551–1561, 2013. View at: Google Scholar
 V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 2000.
 A. Vellido, P. J. G. Lisboa, and J. Vaughan, “Neural networks in business: a survey of applications (1992–1998),” Expert Systems with Applications, vol. 17, no. 1, pp. 51–70, 1999. View at: Google Scholar
 G. Cybenko, “Approximation by superpositions of a sigmoidal function,” Mathematics of Control, Signals, and Systems, vol. 2, no. 4, pp. 303–314, 1989. View at: Publisher Site  Google Scholar
 K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural Networks, vol. 2, no. 5, pp. 359–366, 1989. View at: Google Scholar
 Y. E. Shao and C. D. Hou, “Hybrid artificial neural networks modeling for faults identification of a stochastic multivariate process,” Abstract and Applied Analysis, vol. 2013, Article ID 386757, 10 pages, 2013. View at: Publisher Site  Google Scholar
 D. E. Rumelhart, D. E. Hinton, and R. J. Williams, Learning Internal Representations by Error Propagation in Parallel Distributed Processing, MIT Press, Cambridge, Mass, USA, 1986.
Copyright
Copyright © 2014 Yuehjen E. Shao. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.