A Nonlinear System State Estimation Method Based on Adaptive Fusion of Multiple Kernel Functions

Xu, Daxing; Hu, Aiyu; Han, Xuelong; Zhang, Lu

doi:https://doi.org/10.1155/2021/5124841

Complexity

On this page

Abstract Introduction Conclusions Data Availability Conflicts of Interest References Copyright Related Articles

Research Article | Open Access

Volume 2021 | Article ID 5124841 | https://doi.org/10.1155/2021/5124841

A Nonlinear System State Estimation Method Based on Adaptive Fusion of Multiple Kernel Functions

Daxing Xu,¹Aiyu Hu,²Xuelong Han,¹and Lu Zhang¹

Academic Editor: Carlos Aguilar-Ibanez

Received12 Apr 2021

Accepted18 Jun 2021

Published24 Jun 2021

Abstract

With the development of the industry, the physical model of controlled object tends to be complicated and unknown. It is particularly important to estimate the state variables of a nonlinear system when the model is unknown. This paper proposes a state estimation method based on adaptive fusion of multiple kernel functions to improve the accuracy of system state estimation. First, a dynamic neural network is used to build the system state model, where the kernel function node is constructed by a weighted linear combination of multiple local kernel functions and global kernel functions. Then, the state of the system and the weight of the kernel functions are put together to form an augmented state vector, which can be estimated in real time by using high-degree cubature Kalman filter. The high-degree cubature Kalman filter performs adaptive fusion of the kernel function weights according to specific samples, which makes the neural network function approximate the real system model, and the state estimate follows the real value. Finally, the simulation results verify the feasibility and effectiveness of the proposed algorithm.

1. Introduction

With the development of industrial technology, the physical model of control object is also becoming more and more complicated [1, 2]. The linear system model can no longer accurately describe the system model, leading to new challenges for linear control theory. Although the classical control theory for linear systems is relatively mature, we are often faced with different types of nonlinear systems in practice; the characteristics include model nonlinearity, time variance, uncertain terms, and so on [3–5]. The state estimation of nonlinear systems has attracted more attention in recent decades. For nonlinear systems with unknown parameters, adaptive control with parameter approximation has been fully studied, and a reasonable adaptive law is designed to achieve stable control effects. For nonlinear systems whose models are completely unknown, the function approximation theory of neural networks makes it an effective approximation tool.

The kernel function is the core of the neural network [6–11]. The commonly used kernel functions include Gaussian kernel function, Fourier kernel function, linear kernel function, polynomial kernel function, and sigmoid kernel function. For the selection of the kernel function type, the most commonly used method is the cross-validation method, which uses different kernel functions to train the samples and selects the kernel function with the smallest overall error as the optimal kernel function. Machine training based on a single kernel function in a single feature space has great defects when dealing with uneven sample distribution. For example, an actual sample feature is a fusion of two basic features, where the first feature obeys a polynomial distribution and the second feature obeys the normal distribution. A single kernel function can only describe the characteristics of a certain aspect of the data, and it cannot properly represent the characteristics of different distributions [12]. Literatures [13–15] use a hybrid kernel with strong local and global information processing capabilities to deal with classification problems. This method can make up for the shortcomings of a single kernel function in processing sample local and global information, but it lacks an effective method to optimize the weighting coefficients of the two basic kernel functions. Kernel functions are divided into local kernel functions and global kernel functions according to their local and global capabilities. The local kernel function has a strong learning ability, while the global kernel function has a strong extrapolation ability [16, 17]. There are a variety of existing kernel functions, each with its own characteristics, and different kernel functions have different nonlinear processing capabilities. On the basis of maintaining the basic characteristics of the original kernel function, multiple local kernel functions and global kernel functions are linearly combined into a new kernel function, that is, the multiple kernel functions absorb the advantages of the local kernel function and the global kernel function. It can accurately reflect the characteristics of the actual sample [18, 19]. However, how to choose the weights of multiple kernel functions is a challenging problem.

In 1960, Kalman introduced state variables into the filtering theory and proposed the famous Kalman filter. But it is suitable for the linear time-invariant system model. For nonlinear systems, researchers have successively proposed extended Kalman filter, unscented Kalman filter, cubature Kalman filter, and so on [20–22]. For the problem of state estimation of nonlinear systems with unknown models, many scholars have combined the unscented Kalman filter algorithm with neural networks to solve practical problems. Literatures [21, 23, 24] use a neural network to establish a one-dimensional nonlinear time series model. The input and output of the model are the value of the time series at the current time and the next time, respectively. The unscented Kalman filter algorithm is used to simultaneously update the network weights and time series, and the method is compared with the standard neural network learning algorithm and a separate unscented Kalman filter estimation algorithm. It proves that the estimation effect of this method is better than other methods. However, when the dimensionality of the system is high, the unscented Kalman filter faces the problem of dimensionality catastrophe, which causes its estimation performance to be greatly reduced. In order to further improve the filtering accuracy, literature [25] proposed a high-degree cubature Kalman with arbitrary order volume rules. The filtering algorithm uses radial integration rules to optimize sigma points and weights, which greatly enhances the ability to handle high-dimensional nonlinear states, and the estimation accuracy and stability are also significantly improved.

The selection of multiple function fusion coefficients is actually a process of constant weight adjustment. This paper regards the fusion coefficients as part of the augmented system state. Since the kernel function is often nonlinear, the training of neural networks can be regarded as the state estimation of the nonlinear system. The problem is that the estimation of the fusion coefficient of the multiple function and the system state can be regarded as the optimal estimation of the state vector in filtering. Therefore, the main contributions of this paper include the following:(1)Building the unknown system state model using the weighted linear combination of multiple local functions and global functions. Then, establishing a nonlinear system model with augmented state.(2)By using the high-degree cubature Kalman filter to estimate the fusion coefficient and the system state in real time, a nonlinear system state estimation method based on adaptive fusion of multiple functions is proposed, and the optimal fusion coefficients are selected to improve the accuracy of state estimation.

2. Problem Formulation

Denote and as the local kernel functions and the global kernel functions, respectively. and represent the corresponding weight coefficients for the above kernel function, respectively. Then, the multiple kernel functions can be expressed as follows:

Further, the structure of the neural network state space model based on the multiple kernel function is shown in Figure 1.

represent the input sample nodes, and stand for the weight coefficients among all layers. denote the output sample nodes. The neural network model structure has three node layers, namely, input layer, hidden layer, and output layer. It is connected by weight coefficients, the input and output layers are at both ends, and the number of nodes in the middle hidden layer is selected according to actual requirements.

Since the system model is unknown, this paper uses a neural network based on multiple kernel functions to approximate it. Specifically, the nonlinear systems can be described as follows:where is an -dimensional state vector, is an -dimensional observation vector, functions and are known nonlinear functions, and and are independent zero-mean Gaussian white noise.

For general nonlinear systems, under the Gaussian hypothesis, the basic theory of Bayesian estimation can be combined with any order cubature rule to derive a high-order cubature Kalman filter. Similar to the unscented Kalman filter structure, high-order cubature Kalman filter is also divided into two steps: state prediction (time update) and measurement update. The high-degree cubature Kalman filter uses the phase difference cubature rule to solve the problem of dimensional explosion in high-dimensional systems. High-degree cubature rules satisfywhere , and is the j-th column of the unit vector matrix of n-dimensional space . is a general nonlinear function that has different forms in different filtering steps. and are the sets of points as shown below:

The weight coefficients and arewhere is the surface area of the unit sphere and . According to the moment matching method, when n = 2, the weight is

In this paper, we study how to combine high-degree cubature filter and neural networks to model unknown nonlinear systems and how to estimate the state of the system. Consequently, the addressed problem in this paper can be summarized as follows:(1)For the multiple kernel function in (1), how to construct a unified model of the combination of multiple kernel function weights and state variables to satisfy the requirements of the filter.(2)How to design a high-degree cubature filter to adaptively estimate the system state and kernel function weights.

3. Main Results

3.1. Establishment of Nonlinear System Model

When the model of the system is unknown, the neural network is used to approximate the system model, and then the optimal network node weight coefficients need to be solved. Meanwhile, the state is also unknown, and the state and weight coefficients are related. Therefore, we combine the original state and weight coefficient of the kernel functions , as a new state . Then, the original system equation and the augmented equation of the weight coefficient equation are considered as a new system model:where is the mathematical model established by the neural network for the nonlinear system:where is the sigmoid kernel function of the neural network, which has been proved to have good global classification performance in the application of neural network because it is a smooth function that is convenient to find derivatives [26]. is the weight coefficient of the neural network. The process noise and observation noise of the new system are independent zero-mean Gaussian white noise, and the corresponding covariance matrices are and .

Remark 1. Since the neural network based on multiple kernels is used to approximate the nonlinear function, it is necessary to solve the weight coefficients of the local kernel function and the global kernel function. By assuming that the weight coefficients are disturbed by Gaussian white noise, the coefficients and the state can be combined into an augmented state vector, so that a nonlinear system model based on the augmented state can be established.

3.2. Adaptive Fusion Filtering

In the past two decades, the extended Kalman filter has been widely used in the training of a neural network and as an optimizer of fuzzy membership functions for fuzzy classifiers. Tuning of multiple parameters of SVM can be viewed as an identification problem of a nonlinear dynamic system. Due to the truncation error introduced by the extended Kalman filter when linearizing the nonlinear system, the state estimation accuracy is low. The high-degree cubature Kalman filter has higher estimation accuracy than the extended Kalman filter because it uses radial integration rules to optimize sigma points and weights. Therefore, the high-degree cubature Kalman filter is exploited to estimate the the augmented state.

The parameter estimation model is established in the previous section, and the adaptive selection method of fusion coefficients is given below. The estimation process of the entire augmented state is shown in Figure 2. First, select some local kernel functions with learning ability and some global kernel functions with generalization ability from the commonly used kernel functions to form a multiple kernel function. Then, the weighted fusion coefficient and the original state are combined to form an augmented state vector, then the high-degree cubature Kalman filter is used for time update, and then the real output value of the data set is used for the high-degree cubature Kalman filter measurement update.

The specific algorithm is given as follows: Update the state:(1)At time k, assume that the error covariance at time k-1 is known, and the factorization is where the vector is the Cholesky factorization of .(2)Compute the cubature points: where , and the vector is where , represents an n-dimensional unit vector, and its i-th element is 1. are(3)Calculate the cubature points after propagation of state equation :(4)Compute one-step state prediction: where the weights are(5)Calculate the one-step prediction error covariance matrix: Update the measurement:(1)Factorization:(2)Calculate the state cubature point after update:(3)Compute the cubature point after the measurement equation has propagated:(4)Calculate one-step measurement and prediction at time k:(5)Calculate the innovation covariance matrix:(6)Compute the one-step prediction cross covariance matrix:(7)Calculate the gain matrix:(8)Update state as follows:(9)Error covariance matrix can be obtained by

Remark 2. For the known nonlinear system described in formulas (10) and (11), given the initial state of the state, the high-degree cubature Kalman filter can be performed according to the above two steps of time update and measurement update to obtain an augmented state vector value.

4. Simulation Example

The neural network approximation of the system model using the nonlinear filtering algorithm based on the Kalman filter framework has many practical applications, for example, the tracking problem of a moving target at a constant speed in a two-dimensional plane [22], the state estimation problem of the concentration and temperature of the reactant in the non-isothermal chemical stirring tower reactor [27], etc. The example considered in this paper is a commonly used discrete model of a nonlinear system as follows [28]:where and are independent zero-mean Gaussian white noises, with the variances and , respectively. The initial state is , and its estimate is set as . The initial state error covariance matrix is . The neural network model has two input nodes, two output nodes, and two hidden nodes. The simulation environment is Intel i5 CPU with 4G memory, and the simulation software uses Matlab R2013a.

In this simulation, the linear combination of Gaussian kernel function, Fourier kernel function, and linear kernel function is selected as the multiple kernel function, and the weight coefficients are, respectively, denoted as , , and . For convenience of comparison, denote MAEE as mean absolute estimation error, and we simply mark the algorithms as follows: EAFMKF: estimation algorithm based on adaptive fusion of multiple kernel functions. ESKF: estimation algorithm based on single sigmoid kernel function.

The simulation results are shown in Figures 3–7 and Table 1.

From the estimation curves of Figures 3 and 4, both EAFMKF and ESKF can perform a good tracking estimation on the two states, indicating that both algorithms are effective. From the estimation error curves of Figures 5 and 6, the error curve of ESKF is generally above EAFMKF, which means that the error of ESKF is obviously greater than that of EAFMKF. From the statistics of Table 1, it can be seen that the time consumption of EAFMKF is slightly higher than that of ESKF, but the accuracy of the state estimation of EAFMKF is much higher than that of ESKF. Specifically, estimation error of ESKF is more than twice that of the EAFMKF. This is mainly because the multiple function can accurately describe the characteristics of the sample by adaptively adjusting the weight coefficients, thereby making the established state model more accurate. As shown in Figure 7, the weights , , and of the neural network are an adaptive adjustment process in the whole estimation process, and they quickly stabilize to the corresponding values of 0.52, 0.21, and 0.27. These demonstrate the effectiveness of cubature Kalman filtering and neural network estimation algorithms.

5. Conclusions

This paper proposed a state estimation algorithm based on adaptive fusion of multiple kernel function for nonlinear systems with unknown state model. The system state model is built by using multiple kernel function, which is constructed by some local kernel functions and global kernel functions. Under this case, the characteristics of the actual sample can be fully characterized. Then, we put the weights of the multiple kernel function and the original state together as a augmented state. Futher, the high-degree cubature Kalman filter algorithm is used to estimate the augmented state in real time. Thus, we can obtain the optimal weight coefficients by the adaptive fusion of multiple kernel function, and the accuracy of the original states is significantly improved. Finally, a simulation example verifies the effectiveness of the proposed algorithm. In some practical applications, the state dimensionality is often very high. The next research content is how to choose a suitable approximate neural network to establish the state transition equation when the state dimension is high. Under this case, good state estimation results can be obtained while reducing the dimensionality of the estimation problem.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

References

H. B. Zhu and J. B. Ding, “A dynamic variance-based triggering scheme for distributed cooperative state estimation over wireless sensor networks,” Complexity, vol. 2021, Article ID 7851080, 12 pages, 2021.
View at: Publisher Site | Google Scholar
H. Wang, S. S. Xie, W. X. Wang, L. Wang, and J. B. Peng, “Investigation of unmeasured parameters estimation for distributed control systems,” Complexity, vol. 2020, Article ID 7518039, 15 pages, 2020.
View at: Publisher Site | Google Scholar
L. B. Prasad, H. O. Gupta, and B. Tyagi, “Intelligent control of nonlinear inverted pendulum dynamical system with disturbance input using fuzzy logic systems,” Recent Advances in Electrical and Electronic Engineering, pp. 136–141, 2011.
View at: Publisher Site | Google Scholar
W. Zhao, L. Tang, and Y. J. Liu, “Disturbance observer-based adaptive neural network control of marine vessel systems with time-varying output constraints,” Complexity, vol. 2020, Article ID 6641758, 12 pages, 2020.
View at: Publisher Site | Google Scholar
G. Khanna, S. K. Chaturvedi, and S. Soh, “Two-terminal reliability analysis for time-evolving and predictable delay-tolerant networks,” Recent Advances in Electrical and Electronic Engineering, vol. 13, no. 3, pp. 396–404, 2020.
View at: Publisher Site | Google Scholar
M. A. Islas, J. de Jesus Rubio, S. Muniz et al., “A fuzzy logic model for hourly electrical power demand modeling,” Electronics, vol. 10, no. 4, p. 448, 2021.
View at: Publisher Site | Google Scholar
J. de Jesus Rubio, “SOFMLS: online self-organizing fuzzy modified least-squares network,” IEEE Transactions on Fuzzy Systems, vol. 17, no. 6, pp. 1296–1309, 2009.
View at: Publisher Site | Google Scholar
H. S. Chiang, M. Y. Chen, and Y. J. Huang, “Wavelet-based EEG processing for epilepsy detection using fuzzy entropy and associative petri net,” IEEE Access, vol. 7, pp. 103255–103262, 2019.
View at: Publisher Site | Google Scholar
J. J. de Rubio, “Stability analysis of the modified levenberg-marquardt algorithm for the artificial neural network training,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–15, 2020.
View at: Publisher Site | Google Scholar
J. A. Meda-Campaña, “On the estimation and control of nonlinear systems with parametric uncertainties and noisy outputs,” IEEE Access, vol. 6, pp. 31968–31973, 2018.
View at: Publisher Site | Google Scholar
F. Furlán, E. Rubio, H. Sossa et al., “CNN based detectors on planetary environments: a performance evaluation,” Frontiers in Neurorobotics, vol. 14, p. 85, 2020.
View at: Publisher Site | Google Scholar
C.W. Hsu, C. C. Chang, and C.J. Lin, “A practical guide to support vector classification,” Department of Computer Science, National Taiwan University, Taiwan, China, 2020, Doctoral Thesis.
View at: Google Scholar
E. B. Huerta, B. Duval, and J.K. Hao, “A hybrid GA/SVM approach for gene selection and classification of microarray data,” Workshops on Applications of Evolutionary Computation, , pp. 34–44, 2006.
View at: Google Scholar
J. Zhou, T. Bai, and C. Suo, “The SVM optimized by culturegenetic algorithm and its application in forecasting share price,” in Proceedings of the 2015 IEEE Conference on Granular Computing, pp. 838–843, Hangzhou, China, August 2008.
View at: Publisher Site | Google Scholar
J. Zhou, T. Bai, A. Zhang, and J. Tian, “The integrated methodology of wavelet transform and GA based-SVM for forecasting share price,” in Proceedings of the 2016 IEEE Conference on Information and Automation, pp. 729–733, Changsha, China, June 2008.
View at: Publisher Site | Google Scholar
F. Kuang, S. Zhang, Z. Jin et al., “A novel SVM by combining kernel principal component analysis and improved chaotic particle swarm optimization for intrusion detection,” Soft Computing, pp. 1–13, 2015.
View at: Publisher Site | Google Scholar
Z. Xu, Z. Y. Dong, and W. Q. Liu, “Short-term electricity price forecasting using wavelet and SVM techniques,” in Proceedings of the Third International DCDIS Conference on Engineering Applications and Computational Algorithms, pp. 15–18, Guelph, Canada, May 2003.
View at: Google Scholar
S. K. Aggarwal, L. M. Saini, and A. Kumar, “Electricity price forecasting in deregulated markets: a review and evaluation,” International Journal Electric Power Energy System, vol. 31, no. 1, pp. 13–22, 2019.
View at: Google Scholar
T. Mu and A. K. Nandi, “Automatic tuning of L2-SVM parameters employing the extended Kalman filter,” Expert Systems, vol. 26, no. 2, pp. 160–175, 2009.
View at: Google Scholar
H. E. Yao, C. Zhou, and L. Y. Zheng, “Detection method against false data injection attack based on extended Kalman filter,” Electric Power, vol. 50, no. 10, pp. 35–40, 2019.
View at: Google Scholar
X. Wu and Y. Wang, “Extended and unscented Kalman filtering based feedforward neural networks for time series prediction,” Applied Mathematical Modelling, vol. 36, no. 3, pp. 1123–1131, 2016.
View at: Google Scholar
Z. T. Hu, G. Y. Yuan, and Y. M. Hu, “Training method of neural network based on cubature Kalman filter,” Control and Decision, vol. 31, no. 2, pp. 355–360, 2016.
View at: Google Scholar
H. L. Li, J. Wang, Y.Q. Chen et al., “On neural network-aided training algorithm based on the unscented Kalman filter,” in In Proceedings of the 29th Chinese Control Conference, pp. 1447–1450, Beijing, China, July 2010.
View at: Google Scholar
Z. Ronghui and J. Wan, “Neural network-aided adaptive unscented Kalman filter for nonlinear state estimaiion,” IEEE Signal Processing Letter, vol. 13, no. 7, pp. 445–448, 2018.
View at: Google Scholar
B. Jia, M. Xin, and Y. Cheng, “High-degree cubature Kalman filter,” Automatica, vol. 49, no. 2, pp. 510–518, 2013.
View at: Publisher Site | Google Scholar
Y. Ito, “Representation of functions by superpositions of a step or sigmoid function and their applications to neural network theory,” Neural Networks, vol. 4, no. 3, pp. 385–394, 1991.
View at: Publisher Site | Google Scholar
K. Salahshoor and A. S. Kamalabady, “On-line multivariable identification by adaptive RBF neural networks based on UKF learning algorithm,” in In Proceedings of the IEEE 2008 Chinese Control and Decision Conference, pp. 4754–4759, Yantai, China, July 2008.
View at: Publisher Site | Google Scholar
J. L. Zhou, D. H. Zhou, H. Wang et al., “Distribution function tracking filter design using hybrid characteristic functions,” Automatica, vol. 46, no. 1, pp. 101–109, 2010.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2021 Daxing Xu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

426

Downloads

793

Citations