Classical and Hybrid Optimization Approaches and Their Applications in Engineering and EconomicsView this Special Issue
Research Article | Open Access
Xiangsheng Zhang, Feng Pan, "Parameters Optimization and Application to Glutamate Fermentation Model Using SVM", Mathematical Problems in Engineering, vol. 2015, Article ID 320130, 7 pages, 2015. https://doi.org/10.1155/2015/320130
Parameters Optimization and Application to Glutamate Fermentation Model Using SVM
Aimed at the parameters optimization in support vector machine (SVM) for glutamate fermentation modelling, a new method is developed. It optimizes the SVM parameters via an improved particle swarm optimization (IPSO) algorithm which has better global searching ability. The algorithm includes detecting and handling the local convergence and exhibits strong ability to avoid being trapped in local minima. The material step of the method was shown. Simulation experiments demonstrate the effectiveness of the proposed algorithm.
Glutamate fermentation is a complex microbial growth process. Glutamate bacterium draws raw material of nutrition and produces complex biochemical reactions in vivo specific enzyme . The reaction process is highly nonlinear, time-variant, and uncertain. It is very difficult to establish dynamic model in the application of fermentation control [2, 3].
Modeling the glutamate fermentation process under normal or abnormal conditions is major challenge as an optimization strategy is applied from the initial stage to the final stage. It is difficult to measure online substrate, biomass, and product concentrations; hence pH, dissolved oxygen (DO) concentration, and CO2 production are usually utilized in the bioprocess analysis. The process variables provide indications of the bioprocess condition. However, the information density of these data is usually low, and the multidimensional nature of the data usually makes it difficult to understand.
Support vector machines (SVMs), developed by Boster, Guyon, and Vapnik (1992), are a kind of machine learning method based on statistical learning theory [4–7]. It has become a hot research in the field of machine learning. SVM overcomes commendably such defects as dimensionality curse and overfitting that are apt to appear in some other conventional algorithms, such as neural networks [8, 9].
But there exists a problem in the practical application of SVM. This problem is how to select some SVM parameters so that the performance of SVM can be maximized. These SVM parameters mainly include the penalty constant , the relaxation factor , and the parameters in kernel function (e.g., the width in the RBF kernel function), and they affect the SVM performance more or less. Most existing approaches use leave-one-out (LOO) , gradient descent (GD) , genetic algorithm (GA) , particle swarm optimization (PSO) , and related parameter estimators. There does not exist a single universally effective method to select these SVM parameters. Generally, cross verification trial is used, but it involves ad hoc user interference factors or requires that the kernel function should be continuously differentiable, and the resulting SVM classifiers are prone to falling into local minima.
In order to overcome the shortcomings of the existing parameter selection approaches mentioned above, an attempt is made to jointly optimize the feature selection and the SVM parameters with some evolutionary algorithms such as improved particle swarm optimization (IPSO) algorithm, in hopes of improving the SVM performance in glutamate fermentation. This paper describes an IPSO algorithm which has better global search ability and then focuses on optimizing the parameters which can be used in the prediction of SVM modeling and state of glutamate fermentation.
The paper is organized as follows. In Section 2, the SVM with mixed kernel function is briefly reviewed. Section 3 describes the standard PSO algorithm. Then in Section 4 we present the new IPSO technique. The simulations results are provided in Section 4. Finally, concluding remarks are drawn in Section 5.
2. Mixed Kernel Function
The aim of this section is to introduce the notations and to review the concepts that are relevant to the development of the proposed model parameter selection method.
The development of SVM starts from the simplest case of two classes that is linearly separable. Its basic mathematical model can be given using the following training sample set:where is the data in features space and and are coefficients. They can be estimated by minimizing the regularized risk functionwhere is the so-called loss function measuring the approximate errors between the expected output and the calculated output and is a regularization constant determining the trade-off between the training error and the generalization performance. The second term, , is used as a measurement of function flatness. Introducing slack variables transforms (2) into the following constrained function:Using the duality principle, (3) could be changed toAlthough the nonlinear function is usually unknown, all computations related to can be reduced to the form , which can be replaced with a so-called kernel function that satisfies Mercer’s condition. Then, (1) becomes In (5), Lagrange multipliers and satisfy the equalities:Those vectors with are called support vectors, which contribute to the final solution.
The kernel functions are used to project the sample data into a high-dimensional feature space and then to find the optimal separation plane in it. Kernel functions used by SVM can be divided into global and local kernels. Smits and Jordaan studied representative mapping properties of global kernel function (polynomial kernel function) and local kernel function (RBF kernel) and proposed a mixed kernel function. It was shown that the mixed kernel function-based SVM has strong learning and generalization ability.
This paper selects two polynomial kernel functions and RBF kernel function to produce a hybrid kernel function given bywhere is the polynomial kernel function; is the RBF kernel function; adjusts the sizes of two kernel functions.
The parameters in the mixed kernel function SVM such as and the width coefficient in the kernel function exert a considerable influence on the performance of SVM. A large or small value of may lead to degraded generalization ability of SVM. The value of indicates the error expectation in the classification process of the sample data, and it affects the number of support vectors generated by the classifier, thereby affecting the generalization error of the classifier. If the value of is too big, the separating error is high, the number of support vectors is small, and vice versa . The parameters in the kernel function reflect the characteristics of the training data, and they also affect the generalization of SVM. Therefore, only after the choice of all these parameters is correctly made can the SVM achieve its best possible performance.
3. Improved Particle Swarm Optimization (IPSO)
The standard PSO algorithm has fast convergence speed, but the speed of particle gets more and more slowly later [15, 16]. A particle updates its velocity and location according to the following formula:where and represent the present and next velocity of particle and and are the present and next position of particle .
In order to reduce the possibility in the evolutionary process of particles leaving the search space, the particle velocity is usually limited to a certain range ; is the best previous position of particle ; is the best position among all particles in the population ; and are random real numbers in the range of ; and are acceleration constants; is the internal weight coefficient (i.e., the impact of the previous velocity of particle on its current one).
Because it is easy for the standard PSO to fall into local minima, which is called the phenomenon of being premature [17–19], improved particle swarm optimization algorithm is proposed which is the theory of detecting premature convergence. This algorithm combines the method of analyzing and treating premature convergence to avoid the premature phenomenon throughout the whole algorithm. The whole algorithm process is shown in Figure 1.
3.1. Premature Detection
The literature  pointed out that when the particle swarm has premature convergence, the particles in the swarm will appear as “aggregation” phenomenon, and the position of the particles determines the adaptability of particles, so the state of particle swarm can be tracked by the overall changes in the fitness of all particles.
The number of particles in the particle swarm is assumed to be is the fitness of particle, is the mean fitness, and is the particle swarm colony fitness variance defined aswhere is a normalized scaling factor, and it can restrict the size of . The value of the is determined using the following formula:Equation (9) shows that the variance of the colony fitness is the reflection of all particles in the swarm “aggregation” degree. The smaller the , the greater “aggregates” level particle swarm is. If the algorithm does not meet the termination condition, the “aggregation” will enable the group to lose diversity in early state. A premature detection can be made when ( is a given constant).
3.2. Premature Treatment
For premature state of the particle, the method combined by chaos algorithm and particle swarm algorithm can set up the position and velocity of particles and make it jump out of local minima. Use the classical logistic equation to achieve chaos sequence viawhere is the random number uniformly sampled from ; is control parameter, when it takes the value of 4, the Logistic equation is in complete chaos . Then, it will be introduced into the optimization space by the following formula:where is the range of variables. A global optimal solution has no practical significance to the algorithm when particle swarm runs into local optimum. We can put forward the renewal equation of particle premature condition after combining (11) and (12):where is the range of particle velocity. Thus, the particle can jump out of local optimum and back into the particle swarm optimization iteration.
3.3. Optimization Algorithm
In order to improve the precision and ability of generalization of the mixed kernel function, the following variance function can be used for the fitness function of IPSO algorithm which can respond to SVM regression performance directly. Considerwhere is the predictive value, is the measured value, and is the number of samples. The flow diagram of optimization algorithm is shown in Figure 2. The detailed process of modeling is as follows.
Step 1. Initialize particle swarm parameters , swarm size, and the maximum number of iterations and determine the weight factor and particle swarm flight speed range . Set a loss function parameter of SVM and judgment criterion of global convergence and premature convergence.
Step 2. The individual value of each particle is set to the current position, and then fitness is calculated for each particle, the fitness value of individual extreme corresponding to the best particle as the global extreme initial .
Step 4. Use (2) (the standard PSO algorithm) to execute the iterative calculation and update the position and velocity of particles.
Step 5. Compare and , and if the , update .
Step 6. Compare the updated and , and if then update .
Step 8. Evaluate (3) and (4) to calculate the variance of the population’s fitness and then judge whether the establishment of , and if established turn to Step 9 for premature treatment; otherwise turn to Step 4.
Step 10. Output the particle swarm optimal value, and the algorithm terminates.
4. Modeling and Simulation
4.1. Function Fitting Simulation
In order to test the effectiveness of the algorithm, one-dimensional function is selected to simulatewhere is the Gaussian noise with zero mean and variance of 0.1. In order to minimize the fitness function and optimize the mixed kernel function of SVM parameters using IPSO algorithm, we take 50 sets of data to constitute a hybrid kernel function training sample of SVM in the input variable domain. Among them, mixed kernel function SVM uses an insensitive loss function which is ; and are initialized, respectively, in , , and ; the population size is 20, and the premature judgment constant is 1. The maximum number of iterations is 50 times, and which is the fitness value as global convergence. The simulation results are given in Figure 3 and Table 1.
The simulation results show that the mixed kernel function of SVM model has a high accuracy constructed by IPSO high parameter optimization. It shows the feasibility and effectiveness of the improved particle swarm optimization algorithm.
4.2. Modeling and Application Based on IPSO
In this paper, a forecast model of glutamate fermentation concentration can be built by a mixed kernel function of SVM based on IPSO; the input which is the current time in fermentation process (H), dissolved oxygen (DO), concentration of glutamate (g/L), cell concentration (g/L), and residual sugar concentration (g/L) can estimate next moment of glutamate and residual sugar concentration.
In the experiment, in the 6 batches of production data, each number represents a complete fermentation process, in which the 4 batches of data are for training, and the other 2, used as the test sample to test its generalization ability. Modeling mixed kernel function SVM using an insensitive loss function , the general range of initializing particle swarm parameters is given: , , . Based on the training data for learning and the optimization and adjustment for mixed kernel function of SVM parameter by IPSO algorithm, it can get , , . Using the parameter set of glutamate fermentation model trained to predict test data, the results are as shown in Figures 4 and 5.
Figure 4 shows the changing behavior in glutamate production from 6 glutamate fermentation experiments. The production of glutamate increased in a nonlinear way over the fermentation period from 2 h to 38 h; glutamate increased very slowly in the early period during 2–5 h. It increased quickly from 20 to 60 g/L in 9–20 h and then increased slowly after 20 h. After 34 h, the production of glutamate kept stable at 70–75 g/L. Figure 5 shows the changing behavior of residual sugar concentration in glutamate fermentation experiments. The early period (usually the former 5 hours) is bacteria growth stage and the sugar concentration is higher. About 9 hours later fermentation turned into acid producing period, bacteria growth slowing down and a large number of glutamic acids accumulating. Most of sugar in the culture medium has been burned off, requiring a continuous consumption of nutrients.
In order to maintain stable environment for the growth of bacteria and prevent severe changes in sugar concentration which can disturb bacteria physiological metabolism, we can take a fed-batch flow rate strategy and keep the glucose concentration on a constant level, referring to the estimate results.
The average prediction error for glutamate concentration is 2.61%. The average prediction error residual sugar concentration is 3.82%. It can then be seen that this method has good modeling effect.
5. Conclusion and Future Work
In this paper, a new IPSO-based optimization algorithm was used to select the optimal parameters for the mixed kernel function of SVM. The simulation results show that the mixed kernel function-based SVM with optimally chosen parameters is accurate and reliable, and the optimum mixed kernel function of SVM model based on structural parameters has better learning precision.
|:||Support vector machine|
|:||Improved particle swarm optimization|
|:||Particle swarm optimization|
|:||Radial basis function|
|:||Set of training data|
|:||Regularized risk function|
|:||Hybrid kernel function|
|:||Polynomial kernel function|
|:||RBF kernel function|
|:||Present velocity of particle|
|:||Next velocity of particle|
|:||Present position of particle|
|:||Next position of particle|
|:||Best previous position of particle|
|:||Best position of particle among all particles|
|:||Internal weight coefficient|
|:||Number of particles in particle swarm|
|:||The fitness of particle|
|:||The mean fitness|
|:||Particle swarm colony fitness variance|
|:||Normalized scaling factor|
|:||Number of samples|
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This work was supported by the National Natural Science Foundation of China (Grant no. 61273131) and the Fundamental Research Funds for the Central Universities (JUDCF13040).
- J. Xiao, Z. P. Shi, P. Gao, H. Feng, Z. Duan, and Z. Mao, “On-line optimization of glutamate production based on balanced metabolic control by RQ,” Bioprocess and Biosystems Engineering, vol. 29, no. 2, pp. 109–117, 2006.
- X. Feng, T. Yu, and J. Wang, “Nonlinear GPC with in-place trained RLS-SVM model for DOC control in a fed-batch bioreactor,” Chinese Journal of Chemical Engineering, vol. 20, no. 5, pp. 988–994, 2012.
- X. J. Gao, P. Wang, Y. S. Qi, Y. Zhang, H. Q. Zhang, and A. J. Yan, “An optimal control strategy combining SVM with RGA for improving fermentation titer,” Chinese Journal of Chemical Engineering, vol. 18, no. 1, pp. 95–101, 2010.
- V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, Berlin, Germany, 1995.
- Y. F. Li, Z. F. Wang, and J. Q. Yuan, “On-line fault detection using SVM-based dynamic MPLS for batch processes,” Chinese Journal of Chemical Engineering, vol. 14, no. 6, pp. 754–758, 2006.
- Z. W. Liu, H. R. Cao, X. F. Chen, Z. J. He, and Z. J. Shen, “Multi-fault classification based on wavelet SVM with PSO algorithm to analyze vibration signals from rolling element bearings,” Neurocomputing, vol. 99, pp. 399–410, 2013.
- N. Cristianmini and J. Shawe-Taylor, An Introduction to Support Vector Machines, Cambridge University Press, Cambridge, UK, 2000.
- G. F. Smits and E. M. Jordaan, “Improved SVM regression using mixtures of kernels,” in Proceedings of the International Joint Conference on Neural Networks (IJCNN '02), pp. 2785–2790, May 2002.
- S.-X. Du and T.-J. Wu, “Support vector machines for regression,” Journal of System Simulation, vol. 15, no. 11, pp. 1580–1585, 1633, 2003.
- J. H. Lee and C. J. Lin, Automatic Model Selection for Support Vector Machines, Taiwan University, Taipei, Taiwan, 2000.
- O. Chapelle, V. Vapnik, O. Bousquet, and S. Mukherjee, “Choosing multiple parameters for support vector machines,” Machine Learning, vol. 46, no. 1–3, pp. 131–159, 2002.
- C. L. Zheng and L. C. Jiao, “Automatic parameters Selection for SVM based on GA,” in Proceedings of the 5th World Congress on Intelligent Control and Automation (WCICA '04), pp. 1869–1872, Piscataway, NJ, USA, June 2004.
- A. P. Mariano, C. B. B. Costa, D. D. F. de Angelis et al., “Optimisation of a fermentation process for butanol production by particle swarm optimisation (PSO),” Journal of Chemical Technology and Biotechnology, vol. 85, no. 7, pp. 934–949, 2010.
- W. Wang, X. M. Guo, S. Y. Wang et al., “On the method of Kernel function selection in support vector machine,” Journal of Liaoning Normal University ( Natural Science Edition), vol. 31, no. 1, pp. 1–4, 2008.
- Q. Wu, “A self-adaptive embedded chaotic particle swarm optimization for parameters selection of Wv-SVM,” Expert Systems with Applications, vol. 38, no. 1, pp. 184–192, 2011.
- X. Li, S.-D. Yang, and J.-X. Qi, “New support vector machine optimized by improved particle swarm optimization and its application,” Journal of Central South University of Technology, vol. 13, no. 5, pp. 568–572, 2006.
- Y. J. Kennedy and R. C. Eberhart, “Particle swarm optimization,” in Proceedings of the IEEE International Conference on Neural Networks, pp. 1942–1948, Perth, Austrilia, December 1995.
- R. Suzuki, F. Kawai, C. Nakazawa, T. Matsui, and E. Aiyoshi, “Parameter optimization of model predictive control by PSO,” Electrical Engineering in Japan, vol. 178, no. 1, pp. 40–49, 2012.
- Y.-B. Li, N. Zhang, and C.-B. Li, “Support vector machine forecasting method improved by chaotic particle swarm optimization and its application,” Journal of Central South University of Technology, vol. 16, no. 3, pp. 478–481, 2009.
- H. Y. Liu, Y. Lin, and J. S. Zhang, “A hybrid particle swarm optimization based on chaos strategy to handle local convergence,” Computer Engineering and Applications, no. 13, pp. 77–79, 2006.
Copyright © 2015 Xiangsheng Zhang and Feng Pan. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.