International Journal of Aerospace Engineering
Volume 2015, Article ID 381478, 9 pages
http://dx.doi.org/10.1155/2015/381478
Research Article

A New Adaptive Square-Root Unscented Kalman Filter for Nonlinear Systems with Additive Noise

1School of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, China
2School of Electrical and Control Engineering, Xi’an University of Science & Technology, Xi’an 710054, China
3713th Institute of China Shipbuilding Industry Corporation, Zhengzhou 450002, China

Received 27 February 2015; Accepted 18 May 2015

Academic Editor: Paul Williams

Copyright © 2015 Yong Zhou et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

The Kalman filter (KF), extended KF, and unscented KF all lack a self-adaptive capacity to deal with system noise. This paper describes a new adaptive filtering approach for nonlinear systems with additive noise. Based on the square-root unscented KF (SRUKF), traditional Maybeck’s estimator is modified and extended to nonlinear systems. The square root of the process noise covariance matrix Q or that of the measurement noise covariance matrix R is estimated straightforwardly. Because positive semidefiniteness of Q or R is guaranteed, several shortcomings of traditional Maybeck’s algorithm are overcome. Thus, the stability and accuracy of the filter are greatly improved. In addition, based on three different nonlinear systems, a new adaptive filtering technique is described in detail. Specifically, simulation results are presented, in which the new filter was applied to a highly nonlinear model (i.e., the univariate nonstationary growth model (UNGM)) and compared with the standard SRUKF to demonstrate its superior filtering performance. The adaptive SRUKF (ASRUKF) algorithm can complete direct recursion and calculate the square roots of the covariance matrices of the system state and noise, which ensures the symmetry and nonnegative definiteness of the matrices and greatly improves the accuracy, stability, and self-adaptability of the filter.

1. Introduction

In 1960, Kalman described a recursive minimum mean-square estimation (RMMSE) solution of the linear discrete-time dynamical system filter problem [1]. Since then, the Kalman filter (KF) has been the subject of extensive research in many applications, particularly in autonomous or assisted navigation. However, many real-world systems are nonlinear. Sunahara and Bucy et al. extended the use of Kalman filtering through a linearization procedure, which is referred to as the extended Kalman filter (EKF) [2, 3].

The EKF applies the standard linear Kalman filter methodology by linearizing the true nonlinear system. This approach is suboptimal and can easily lead to divergence. Intuitively, with a fixed number of parameters, it should be easier to approximate a probability distribution than to approximate an arbitrary nonlinear function or transformation. Julier et al. proposed the unscented Kalman filter (UKF) as a derivative-free alternative to the EKF [4, 5]. The UKF is more accurate, more stable, and far easier to implement than the EKF [6–8], as proven in aircraft state estimation [9], unmanned aerial vehicle (UAV) attitude estimation [10], rotorcraft UAV actuator failure estimation [11], and satellite attitude estimation [12], among other fields. The UKF was extended to the square-root unscented Kalman filter (SRUKF) by van der Merwe and Wan [13].

All of the aforementioned filtering approaches can achieve good performance under certain assumptions. However, these assumptions are typically not entirely satisfied. Thus, the performance of the filters may be seriously degraded relative to the theoretical performance, which can potentially lead to divergence. To fulfill the requirement of achieving an optimal Kalman filter and to prevent divergence, the so-called adaptive Kalman filter (AKF) approach has been used to dynamically adjust the parameters of the supposedly optimum filter by estimating the unknown parameters online and by estimating the signal and noise statistics from the available data [14, 15]. Maybeck’s estimator, which was originally proposed by Mehra [16, 17] and later summarized by Maybeck [18], is a maximum-likelihood estimator. This simple, real-time AKF-type algorithm has become extremely popular [19–21].

However, there are two well-known disadvantages associated with the traditional Maybeck estimator. First, it is not very stable and often leads to a nonpositive semidefinite estimate of Q or R. Second, the Maybeck estimator cannot estimate Q and R simultaneously [18–21]. In addition, Maybeck’s algorithm is primarily designed for linear systems (such as the KF) or linearized nonlinear systems (such as the EKF). Thus, there are few applications of Maybeck’s algorithm in the nonlinear field.

To overcome these shortcomings, we developed the concept of a new adaptive square-root unscented Kalman filter (ASRUKF) in 2013 [22]. However, in that work, we only presented the algorithm briefly and did not take the next step of validating its effectiveness. Therefore, in this study, we detailed the ASRUKF for nonlinear systems with additive noise by extending the SRUKF with the traditional Maybeck estimator to nonlinear systems and then tested its effectiveness. To achieve these objectives, a new ASRUKF technique is described in detail based on three different nonlinear systems.

2. The AKF Based on Maybeck’s Estimator

2.1. Kalman Filter

The dynamical system considered here is a linear, discrete, stochastic sequence described by

x_k = Φ_{k,k−1} x_{k−1} + B_{k−1} u_{k−1} + w_{k−1},
z_k = H_k x_k + v_k, (1)

where x_k is the state vector, Φ_{k,k−1} is the state transition matrix taking the state from time k−1 to time k, u_{k−1} is a known exogenous input, B_{k−1} is the gain of u_{k−1}, z_k is the observation vector, and H_k is the observation mapping matrix. The process noise w_k and the measurement noise v_k are assumed to be additive, independent, white, and Gaussian with the following properties:

E[w_k] = 0, E[w_k w_j^T] = Q_k δ_{kj},
E[v_k] = 0, E[v_k v_j^T] = R_k δ_{kj},
E[w_k v_j^T] = 0. (2)

The Kalman filter is represented by the following equations:

x̂_{k|k−1} = Φ_{k,k−1} x̂_{k−1} + B_{k−1} u_{k−1},
P_{k|k−1} = Φ_{k,k−1} P_{k−1} Φ_{k,k−1}^T + Q_{k−1},
K_k = P_{k|k−1} H_k^T (H_k P_{k|k−1} H_k^T + R_k)^{−1},
x̂_k = x̂_{k|k−1} + K_k (z_k − H_k x̂_{k|k−1}),
P_k = (I − K_k H_k) P_{k|k−1}. (3)

As shown in (3), if the initial values x̂_0 and P_0 are given, then, according to the measurement z_k at time k, the state estimate x̂_k can be recursively calculated. The Kalman filter algorithm is shown in Figure 1.

Figure 1: Kalman filtering process.

The algorithm works in a two-step process, with a prediction step and an update step. The prediction step produces a priori estimates of the current state variables; the update equations then correct these prior estimates with the new measurement to obtain the a posteriori estimates. After these two steps, the process repeats, with the latest a posteriori estimates used as the starting point for the next a priori estimates. This recursive structure is one of the most attractive features of the Kalman filter and makes it easier to implement than other filters.
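As an illustration, one predict–update cycle of the linear Kalman filter in (3) can be sketched in Python as follows; the matrices Phi, B, H, Q, and R are generic placeholders supplied by the user.

```python
import numpy as np

def kf_step(x_est, P, z, u, Phi, B, H, Q, R):
    """One predict-update cycle of the linear Kalman filter in (3)."""
    # Prediction (a priori estimate)
    x_pred = Phi @ x_est + B @ u
    P_pred = Phi @ P @ Phi.T + Q

    # Update (a posteriori estimate)
    S = H @ P_pred @ H.T + R                  # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)       # Kalman gain
    x_new = x_pred + K @ (z - H @ x_pred)     # corrected state
    P_new = (np.eye(len(x_est)) - K @ H) @ P_pred
    return x_new, P_new
```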

2.2. Extended Kalman Filter

The EKF system is a nonlinear, discrete, stochastic sequence described by

x_k = f(x_{k−1}, u_{k−1}) + w_{k−1},
z_k = h(x_k) + v_k, (4)

where f and h are nonlinear functions of the system. The other parameters are defined as in (1). The EKF can be expressed as

x̂_{k|k−1} = f(x̂_{k−1}, u_{k−1}),
P_{k|k−1} = F_{k−1} P_{k−1} F_{k−1}^T + Q_{k−1},
K_k = P_{k|k−1} H_k^T (H_k P_{k|k−1} H_k^T + R_k)^{−1},
x̂_k = x̂_{k|k−1} + K_k (z_k − h(x̂_{k|k−1})),
P_k = (I − K_k H_k) P_{k|k−1}, (5)

where

F_{k−1} = ∂f/∂x evaluated at x̂_{k−1}, H_k = ∂h/∂x evaluated at x̂_{k|k−1}. (6)

The EKF algorithm uses a first-order Taylor series expansion to linearize the nonlinear functions f and h about the current estimates. The higher order terms are omitted, and the linearized model is established. The EKF is a suboptimal filter, which is suitable only when the filtering error and the prediction error are small.
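To make the linearization step concrete, a minimal Python sketch of one EKF cycle is given below; for simplicity the Jacobians are approximated by finite differences rather than derived analytically, and the exogenous input is omitted.

```python
import numpy as np

def numerical_jacobian(func, x, eps=1e-6):
    """Finite-difference Jacobian of func at x (stands in for the Taylor-series terms in (6))."""
    fx = func(x)
    J = np.zeros((len(fx), len(x)))
    for i in range(len(x)):
        dx = np.zeros(len(x))
        dx[i] = eps
        J[:, i] = (func(x + dx) - fx) / eps
    return J

def ekf_step(x_est, P, z, f, h, Q, R):
    """One EKF cycle: linearize f and h about the current estimates, as in (5)-(6)."""
    F = numerical_jacobian(f, x_est)          # state Jacobian at the previous estimate
    x_pred = f(x_est)
    P_pred = F @ P @ F.T + Q

    H = numerical_jacobian(h, x_pred)         # measurement Jacobian at the prediction
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ np.linalg.inv(S)
    x_new = x_pred + K @ (z - h(x_pred))
    P_new = (np.eye(len(x_est)) - K @ H) @ P_pred
    return x_new, P_new
```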

2.3. Unscented Kalman Filter

The UKF retains the recursive structure of the classic Kalman filter. Based on the unscented transformation (UT), the UKF chooses sample points that exhibit the same statistical characteristics as the system state. These sample points are nonlinearly transformed through the nonlinear state equations and measurement equations. The mean and covariance can then be calculated using a weighted combination of the transformed points. Finally, combined with the KF framework, the estimated system states can be recursively calculated.

The UT uses a set of deterministically chosen sample points (the so-called sigma points) to represent the input distribution. This approach treats the entire system as a “black box” and does not depend on the specific form of the nonlinear function or on the calculation of the Jacobian matrix.

The random vector x (with dimension L) is nonlinearly transformed according to y = f(x), where x̄ is the mean and P_x is the covariance of x. To obtain the statistical characteristics (mean and covariance) of y, a matrix χ of 2L + 1 sigma vectors is created:

χ_0 = x̄,
χ_i = x̄ + (√((L + λ)P_x))_i, i = 1, …, L,
χ_i = x̄ − (√((L + λ)P_x))_{i−L}, i = L + 1, …, 2L, (7)

where (√((L + λ)P_x))_i is the ith column of the square root of the matrix (L + λ)P_x. All weights that correspond to the sigma points are defined according to

W_0^(m) = λ/(L + λ),
W_0^(c) = λ/(L + λ) + (1 − α² + β),
W_i^(m) = W_i^(c) = 1/(2(L + λ)), i = 1, …, 2L,

where W^(m) is the weight of the mean, W^(c) is the weight of the covariance, and λ = α²(L + κ) − L is a scaling parameter. The constant α controls the distribution range of the sigma points around the mean x̄. Generally, α is a small, positive number (e.g., 10⁻⁴ ≤ α ≤ 1). The auxiliary scaling parameter is κ, which is typically chosen so that the covariance matrix remains positive semidefinite (e.g., κ = 0 or κ = 3 − L). The parameter β is used to incorporate prior knowledge of the distribution of the random vector x (for Gaussian distributions, β = 2 is optimal).
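A compact Python sketch of the sigma-point construction in (7) and the associated UT weights is the following; alpha, beta, and kappa take the typical default values discussed above.

```python
import numpy as np

def sigma_points_and_weights(x_mean, P, alpha=1e-3, beta=2.0, kappa=0.0):
    """Generate the 2L+1 sigma points of (7) and the corresponding UT weights."""
    L = len(x_mean)
    lam = alpha**2 * (L + kappa) - L
    S = np.linalg.cholesky((L + lam) * P)        # square root of (L + lambda) * P

    sigma = np.zeros((2 * L + 1, L))             # one sigma point per row
    sigma[0] = x_mean
    for i in range(L):
        sigma[i + 1] = x_mean + S[:, i]
        sigma[L + i + 1] = x_mean - S[:, i]

    Wm = np.full(2 * L + 1, 1.0 / (2 * (L + lam)))
    Wc = Wm.copy()
    Wm[0] = lam / (L + lam)
    Wc[0] = lam / (L + lam) + (1.0 - alpha**2 + beta)
    return sigma, Wm, Wc
```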

To obtain the mean and covariance of the output vector y, the sigma points in (7) are propagated through the nonlinear transformation:

Y_i = f(χ_i), i = 0, 1, …, 2L.

The mean and covariance of the output vector y can then be obtained as follows:

ȳ ≈ Σ_{i=0}^{2L} W_i^(m) Y_i,
P_y ≈ Σ_{i=0}^{2L} W_i^(c) (Y_i − ȳ)(Y_i − ȳ)^T.

The main advantage of the UKF is that it does not use linearization to calculate the state predictions and covariance matrices. Instead, it uses a deterministic sampling approach to capture the mean and covariance estimates with a minimal set of sigma points based on a square-root decomposition of the prior covariance [23].

According to the stochastic nonlinear discrete system shown in (4), the UKF algorithm is the following.

(I) Initialization (k = 0):

x̂_0 = E[x_0], P_0 = E[(x_0 − x̂_0)(x_0 − x̂_0)^T].

(II) Iteration (k = 1, 2, …):

(a) Time update. First, the sigma points are calculated, and the matrix χ_{k−1} is constructed:

χ_{k−1} = [x̂_{k−1}, x̂_{k−1} + γ√(P_{k−1}), x̂_{k−1} − γ√(P_{k−1})], γ = √(L + λ).

The sigma point set is nonlinearly transformed by the state equations, and the state and covariance are then predicted:

χ*_{i,k|k−1} = f(χ_{i,k−1}, u_{k−1}),
x̂_{k|k−1} = Σ_{i=0}^{2L} W_i^(m) χ*_{i,k|k−1},
P_{k|k−1} = Σ_{i=0}^{2L} W_i^(c) (χ*_{i,k|k−1} − x̂_{k|k−1})(χ*_{i,k|k−1} − x̂_{k|k−1})^T + Q_{k−1}.

Then, the sigma points are calculated again:

χ_{k|k−1} = [x̂_{k|k−1}, x̂_{k|k−1} + γ√(P_{k|k−1}), x̂_{k|k−1} − γ√(P_{k|k−1})].

The sigma point set is nonlinearly transformed by the measurement equations, and the predicted output of the system is calculated:

Z_{i,k|k−1} = h(χ_{i,k|k−1}),
ẑ_{k|k−1} = Σ_{i=0}^{2L} W_i^(m) Z_{i,k|k−1}.

(b) Measurement update. Calculate the filter gain:

P_{z,k} = Σ_{i=0}^{2L} W_i^(c) (Z_{i,k|k−1} − ẑ_{k|k−1})(Z_{i,k|k−1} − ẑ_{k|k−1})^T + R_k,
P_{xz,k} = Σ_{i=0}^{2L} W_i^(c) (χ_{i,k|k−1} − x̂_{k|k−1})(Z_{i,k|k−1} − ẑ_{k|k−1})^T,
K_k = P_{xz,k} P_{z,k}^{−1}.

Estimate the state value after correction:

x̂_k = x̂_{k|k−1} + K_k (z_k − ẑ_{k|k−1}).

Calculate the a posteriori state variance:

P_k = P_{k|k−1} − K_k P_{z,k} K_k^T.
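The complete iteration can be summarized by the following Python sketch of one additive-noise UKF step; f and h are user-supplied state and measurement functions that take and return one-dimensional arrays, and the exogenous input is omitted for brevity.

```python
import numpy as np

def ukf_step(x_est, P, z, f, h, Q, R, alpha=1e-3, beta=2.0, kappa=0.0):
    """One time/measurement update of the additive-noise UKF (Section 2.3, steps (a)-(b))."""
    L = len(x_est)
    lam = alpha**2 * (L + kappa) - L
    Wm = np.full(2 * L + 1, 1.0 / (2 * (L + lam)))
    Wc = Wm.copy()
    Wm[0] = lam / (L + lam)
    Wc[0] = Wm[0] + (1.0 - alpha**2 + beta)

    def sigmas(m, C):
        S = np.linalg.cholesky((L + lam) * C)
        return np.vstack([m, m + S.T, m - S.T])        # rows are sigma points

    # (a) Time update: propagate sigma points through the state equation
    X = np.array([f(s) for s in sigmas(x_est, P)])
    x_pred = Wm @ X
    P_pred = (X - x_pred).T @ np.diag(Wc) @ (X - x_pred) + Q

    # Redraw sigma points around the prediction and propagate through h
    Xp = sigmas(x_pred, P_pred)
    Z = np.array([h(s) for s in Xp])
    z_pred = Wm @ Z

    # (b) Measurement update
    Pzz = (Z - z_pred).T @ np.diag(Wc) @ (Z - z_pred) + R
    Pxz = (Xp - x_pred).T @ np.diag(Wc) @ (Z - z_pred)
    K = Pxz @ np.linalg.inv(Pzz)
    x_new = x_pred + K @ (z - z_pred)
    P_new = P_pred - K @ Pzz @ K.T
    return x_new, P_new
```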

2.4. AKF

In the AKF, the uncertain parameters that need to be adapted may be part of the system model, through Φ_{k,k−1} or H_k, or part of the statistical information, through Q_k and R_k [24]. The AKF based on Maybeck’s estimator is as follows.

Initialize with x̂_0 = E[x_0] and P_0 = E[(x_0 − x̂_0)(x_0 − x̂_0)^T].

For k ≥ N, Maybeck’s estimator averages the most recent innovation statistics to estimate the noise covariance, where N is the window size. Maybeck’s estimator thus sets the number of updates being averaged together, that is, the window of past measurements that are averaged [20].
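As an illustration, one common innovation-based, windowed form of such an estimator (cf. [18, 24]) can be sketched in Python as follows; it is a sketch of the general idea rather than a verbatim restatement of the estimator used here.

```python
import numpy as np
from collections import deque

class WindowedNoiseEstimator:
    """Windowed (size N) estimate of R from innovations, in the spirit of Maybeck's estimator.

    R_hat = mean over the last N steps of (nu_j nu_j^T - H_j P_pred_j H_j^T),
    where nu_j = z_j - H_j x_pred_j is the innovation at step j.
    """

    def __init__(self, window_size):
        self.window = deque(maxlen=window_size)

    def update(self, innovation, H, P_pred):
        nu = np.atleast_1d(innovation)
        self.window.append(np.outer(nu, nu) - H @ P_pred @ H.T)
        # Note: this plain average is not guaranteed to stay positive semidefinite,
        # which is exactly the weakness the ASRUKF is designed to remove.
        return np.mean(self.window, axis=0)
```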

2.5. The New ASRUKF

The KF, EKF, and UKF assume that the statistical properties of the system noise are known. However, in the real world, the process noise covariance matrix Q or the observation noise covariance matrix R is often unknown. In addition, these parameters can vary with time. Hence, to adjust Q and R, an adaptive Kalman filter must be constructed, which is important for improving the filtering accuracy and stability. Combined with the square-root filter algorithm, the traditional Sage-Husa adaptive filtering algorithm can be improved. The new ASRUKF can estimate the square root of Q or R in a straightforward manner such that positive semidefiniteness is guaranteed, which greatly improves the stability and accuracy of the filter.

Based on (1), Sage and Husa proposed an adaptive filtering algorithm based on maximum a posteriori estimation [25]. Building on this algorithm, the forgetting factor (sometimes called the “fading factor” [26, 27]) was introduced by Deng and Guo [28]; with this modification, the algorithm is able to estimate unknown time-varying noise. The calculation proceeds as follows: (I) initialization (k = 0); (II) iteration (k = 1, 2, …), consisting of (a) a time update and (b) a measurement update, in which the statistical properties of the measurement noise are estimated, the corrected state value and the a posteriori state covariance are calculated, and the statistical properties of the process noise are then estimated using the weight d_k = (1 − b)/(1 − b^{k+1}), where b is the forgetting factor and 0.95 < b < 0.99.
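The role of the forgetting factor can be illustrated with a short Python sketch of the fading-memory update of R; this is a simplified fragment of the Sage-Husa recursion (the full algorithm also adapts the noise means and Q).

```python
import numpy as np

def fading_weight(k, b=0.97):
    """Time-varying weight d_k = (1 - b) / (1 - b**(k + 1)) for forgetting factor b."""
    return (1.0 - b) / (1.0 - b ** (k + 1))

def fading_R_update(R_prev, innovation, H, P_pred, k, b=0.97):
    """Exponentially weighted (fading-memory) update of the measurement noise covariance."""
    d = fading_weight(k, b)
    nu = np.atleast_1d(innovation)
    return (1.0 - d) * R_prev + d * (np.outer(nu, nu) - H @ P_pred @ H.T)
```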

Based on the SRUKF, traditional Maybeck’s estimator can be modified and extended to nonlinear systems. Thus, a new adaptive filtering approach for nonlinear systems with additive noise is developed. According to the three different structures of the nonlinear problem, the three new ASRUKF models are the following.

Both the process and the measurement equations are nonlinear (PNMN):

x_k = f(x_{k−1}, u_{k−1}) + w_{k−1},
z_k = h(x_k) + v_k,

where f and h are nonlinear functions. The new ASRUKF for PNMN is the following.

Initialize with x̂_0 = E[x_0] and S_0 = chol{E[(x_0 − x̂_0)(x_0 − x̂_0)^T]}, where S_0 is the square root (Cholesky factor) of the initial state covariance.

For k = 1, 2, …, calculate the sigma points:

χ_{k−1} = [x̂_{k−1}, x̂_{k−1} + γ S_{k−1}, x̂_{k−1} − γ S_{k−1}].

The time update equations are the following:

Estimate the square root of the measurement noise matrix according to (31) and augment the sigma points; the measurement update equations then follow. Finally, estimate the square root of the process noise matrix according to (38), where d_k = (1 − b)/(1 − b^{k+1}), b is the forgetting factor, and typically 0.95 < b < 0.99. The weights (W^(m) and W^(c)) of the mean and covariance are given by

W_0^(m) = λ/(L + λ),
W_0^(c) = λ/(L + λ) + (1 − α² + β),
W_i^(m) = W_i^(c) = 1/(2(L + λ)), i = 1, …, 2L,

where λ = α²(L + κ) − L is a scaling parameter. The constant α determines the spread of the sigma points around the mean and is typically set to a small, positive value (e.g., 10⁻⁴ ≤ α ≤ 1). The constant κ is a secondary scaling parameter. β is used to incorporate the prior knowledge of the distribution (for Gaussian distributions, β = 2 is optimal) [4–6]. In addition, γ = √(L + λ).

The new ASRUKF makes use of three powerful linear algebra techniques: the QR decomposition, Cholesky factor updating, and efficient least squares, which are briefly reviewed in [6].
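Because NumPy has no built-in equivalent of MATLAB’s cholupdate, the first two tools can be sketched as follows; qr_sqrt plays the role of the QR-based square root and cholupdate performs a rank-one update or downdate of an upper-triangular factor.

```python
import numpy as np

def qr_sqrt(A):
    """Upper-triangular factor R such that R.T @ R = A @ A.T, via the QR decomposition of A.T
    (the 'qr' step of the square-root filter; rows may carry a sign ambiguity)."""
    return np.linalg.qr(A.T, mode="r")

def cholupdate(R, x, sign=+1.0):
    """Rank-one update (sign=+1) or downdate (sign=-1) of an upper-triangular factor R, so that
    the result R1 satisfies R1.T @ R1 = R.T @ R + sign * outer(x, x).
    Assumes R has a positive diagonal and, for a downdate, that the result stays positive definite."""
    R = R.copy()
    x = x.astype(float).copy()
    n = len(x)
    for i in range(n):
        r = np.sqrt(R[i, i] ** 2 + sign * x[i] ** 2)
        c, s = r / R[i, i], x[i] / R[i, i]
        R[i, i] = r
        if i + 1 < n:
            R[i, i + 1:] = (R[i, i + 1:] + sign * s * x[i + 1:]) / c
            x[i + 1:] = c * x[i + 1:] - s * R[i, i + 1:]
    return R
```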

Equations (31) and (38) are different from the standard SRUKF. These equations enhance the adaptive ability of the filter.

The process equation is linear, and the measurement equation is nonlinear (PLMN):

x_k = Φ_{k,k−1} x_{k−1} + B_{k−1} u_{k−1} + w_{k−1},
z_k = h(x_k) + v_k.

Compared with PNMN, the new ASRUKF for PLMN is obtained by replacing (30) with (41).

The process equation is nonlinear, and the measurement equation is linear (PNML):

x_k = f(x_{k−1}, u_{k−1}) + w_{k−1},
z_k = H_k x_k + v_k.

Compared with PNMN, the new ASRUKF for PNML can be described using (43) instead of (31)–(34). Application notes are as follows: (a) if Q is known, replace the estimated square root of Q with its known value and only estimate the square root of R; conversely, if R is known, replace the estimated square root of R with its known value and only estimate the square root of Q; (b) if both Q and R are unknown, based on Franz’s scheme [20], estimate their square roots alternately, as illustrated by the sketch below.
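A purely schematic Python sketch of the alternating scheme in note (b) follows; filter_step, estimate_SQ, and estimate_SR are hypothetical user-supplied callables standing in for the ASRUKF equations of Section 2.5.

```python
def run_alternating_asrukf(measurements, x0, S0, SQ0, SR0,
                           filter_step, estimate_SQ, estimate_SR):
    """Schematic driver for note (b): alternate the noise square-root estimates step by step.

    filter_step, estimate_SQ, and estimate_SR are hypothetical callables supplied by the user.
    """
    x, S, SQ, SR = x0, S0, SQ0, SR0
    for k, z in enumerate(measurements):
        x, S, aux = filter_step(x, S, z, SQ, SR)   # one ASRUKF cycle; aux carries innovations etc.
        if k % 2 == 0:
            SQ = estimate_SQ(SQ, aux)              # even steps: refresh the square root of Q
        else:
            SR = estimate_SR(SR, aux)              # odd steps: refresh the square root of R
    return x, S, SQ, SR
```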

3. Simulation and Verification

To illustrate the advantages of the new ASRUKF, an example is presented, in which the univariate nonstationary growth model (UNGM) is estimated. This model was previously used as a benchmark [29]. In this case, the new ASRUKF resulted in better performance than that of the standard SRUKF. All codes were developed in MATLAB.

The dynamic state-space model for the UNGM can be written as

x_k = α x_{k−1} + β x_{k−1}/(1 + x_{k−1}²) + γ cos(1.2(k − 1)) + w_{k−1},
y_k = x_k²/20 + v_k,

where w_{k−1} and v_k are zero-mean Gaussian noises with covariances Q and R, respectively, and α, β, and γ here denote fixed model coefficients (not the UT parameters of Section 2). In this example, the model coefficients, the noise statistics, and the initial conditions were fixed, and two cases were considered: (1) R is known, and the square root of Q is estimated; (2) Q is known, and the square root of R is estimated. The simulation results (Figures 2 to 5) show that if either Q or R is known, the new ASRUKF can rapidly estimate the other one, which improves the filter performance.
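For reference, the UNGM is commonly specified with the coefficients 0.5, 25, and 8; the following Python sketch generates one realization under that assumption, with the noise variances q and r as illustrative placeholders.

```python
import numpy as np

def simulate_ungm(n_steps=100, q=1.0, r=1.0, x0=0.1, seed=0):
    """Generate one trajectory of the univariate nonstationary growth model (UNGM)."""
    rng = np.random.default_rng(seed)
    x = np.zeros(n_steps)
    y = np.zeros(n_steps)
    x_prev = x0
    for k in range(n_steps):
        x[k] = (0.5 * x_prev + 25.0 * x_prev / (1.0 + x_prev ** 2)
                + 8.0 * np.cos(1.2 * k) + rng.normal(0.0, np.sqrt(q)))
        y[k] = x[k] ** 2 / 20.0 + rng.normal(0.0, np.sqrt(r))
        x_prev = x[k]
    return x, y
```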

Figure 2: State tracking of the SRUKF and ASRUKF (R is known; Q is unknown).

As shown in Figure 2, the estimation errors of the SRUKF and the new ASRUKF are almost the same during the initial stage. After the 15th step in the simulation, the state estimate of the ASRUKF follows the true value accurately, with an estimation error no greater than 1. Over the same stage, the state estimate of the SRUKF cannot follow the true value, and its estimation error can be larger than 7. Thus, the ASRUKF has a greater adaptive ability than the SRUKF.

As shown in Figure 3, by the 15th step in the simulation, the ASRUKF can identify the standard deviation of the process noise. After the 30th step, the process noise estimate of the ASRUKF is almost the same as the theoretical value. In contrast, the SRUKF cannot estimate the statistical characteristics of the unknown noise. Therefore, compared with the SRUKF, the ASRUKF has a better filtering ability.

Figure 3: The process noise estimation with the ASRUKF (R is known).

With the initial value of the measurement noise set to one-tenth of its theoretical true value, the SRUKF and the ASRUKF are used to estimate the state. Then, 100 iterations were performed in MATLAB. The simulation results are shown in Figures 4 and 5.

Figure 4: State tracking of the SRUKF and ASRUKF (Q is known; R is unknown).
Figure 5: The measurement noise estimation with the ASRUKF (Q is known).

Figure 4 shows that, for the highly nonlinear UNGM, the filtering performance of the ASRUKF is better than that of the SRUKF. After the 10th step in the simulation, the state estimation error of the ASRUKF is less than 1, whereas the error of the SRUKF is greater than 2.5.

In Figure 5, when the statistical characteristics of the system measurement noise are unknown, the SRUKF uses the initial assumption, which is one-tenth of the theoretical true value, to recursively estimate the system state. In contrast, at approximately the 15th step in the simulation, the ASRUKF can identify the standard deviation of the measurement noise. Therefore, the accuracy of the ASRUKF is greater than that of the SRUKF.

4. Conclusions

In this paper, based on the standard SRUKF, the traditional Maybeck estimator is modified, creating a new adaptive filtering approach (i.e., the adaptive square-root unscented Kalman filter, ASRUKF). With the new filter, the positive semidefiniteness of Q or R is guaranteed. Moreover, several of the shortcomings of the traditional Maybeck algorithm are overcome such that the stability and accuracy of the filter are greatly improved. In addition, based on three different nonlinear systems, three new adaptive filtering models were described in detail. Compared with conventional filtering algorithms, the algorithm proposed in this study effectively improves the self-adaptability of filters.

This paper only discussed adaptive filtering approaches for nonlinear systems with additive noise. Further studies will address nonadditive noise.

The filter developed in this study can estimate the square root of Q or R in a straightforward manner such that positive semidefiniteness is guaranteed, which greatly improves the stability and accuracy of the filter. This study thus provides a valid mathematical tool for multisensor data fusion with uncertain process noise, which could greatly help navigation techniques for autonomous mobile robots and intelligent vehicles.

Conflict of Interests

The authors declare no conflict of interests.

Acknowledgments

The authors would like to express their great appreciation to the National Natural Science Foundation of China (nos. 51207129 and 51307137), the Natural Science Basic Research Plan in Shaanxi Province of China (no. 2015JM6302), and the Fundamental Research Funds for the Central Universities (no. 310201401JCQ01018). Additionally, the authors would like to thank Associate Professor Q. X. Zhou and colleagues for their hard work and kind help in performing various experiments with this system.

References

  1. R. E. Kalman, “A new approach to linear filtering and prediction problems,” Journal of Basic Engineering, vol. 82, no. 1, pp. 35–45, 1960.
  2. Y. Sunahara, “An approximate method of state estimation for nonlinear dynamical systems,” in Proceedings of the Joint Automatic Control Conference, University of Colorado, Boulder, Colo, USA, 1969.
  3. R. S. Bucy and K. D. Senne, “Digital synthesis of non-linear filters,” Automatica, vol. 7, no. 3, pp. 287–298, 1971.
  4. S. J. Julier, J. K. Uhlmann, and H. F. Durrant-Whyte, “New approach for filtering nonlinear systems,” in Proceedings of the American Control Conference, pp. 1628–1632, Evanston, Ill, USA, June 1995.
  5. S. J. Julier, J. K. Uhlmann, and H. F. Durrant-Whyte, “A new method for the nonlinear transformation of means and covariances in filters and estimators,” IEEE Transactions on Automatic Control, vol. 45, no. 3, pp. 477–482, 2000.
  6. S. J. Julier and H. Durrant-Whyte, “Navigation and parameter estimation of high speed road vehicles,” in Proceedings of the Robotics and Automation Conference, Nagoya, Japan, 1995.
  7. R. van der Merwe and E. A. Wan, “Efficient derivative-free Kalman filters for online learning,” in Proceedings of the European Symposium on Artificial Neural Networks (ESANN '01), Bruges, Belgium, 2001.
  8. P. H. Li and T. W. Zhang, “Unscented Kalman filter for visual curve tracking,” in Proceedings of the Workshop on Statistical Methods in Video Processing, pp. 13–18, Copenhagen, Denmark, June 2002.
  9. P. N. Dwivedi, P. G. Bhale, A. Bhattacharya, and R. Padhi, “State estimation using UKF and predictive guidance for engaging barrel roll aircrafts,” in Proceedings of the American Control Conference (ACC '13), pp. 6175–6180, June 2013.
  10. M. Rhudy, Y. Gu, J. Gross, and M. R. Napolitano, “Evaluation of matrix square root operations for UKF within a UAV GPS/INS sensor fusion application,” International Journal of Navigation and Observation, vol. 2011, Article ID 416828, 11 pages, 2011.
  11. J. Qi, J. Ha, and Z. Wu, “Rotorcraft UAV actuator failure estimation with KF-based adaptive UKF algorithm,” in Proceedings of the American Control Conference (ACC '08), Seattle, Wash, USA, June 2008.
  12. H. E. Soken and S. Sakai, “Adaptive tuning of the unscented Kalman filter for satellite attitude estimation,” Journal of Aerospace Engineering, vol. 28, no. 3, 2015.
  13. R. van der Merwe and E. A. Wan, “The square-root unscented Kalman filter for state and parameter estimation,” in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, Utah, USA, 2001.
  14. W. Ding, J. Wang, C. Rizos, and D. Kinlyside, “Improving adaptive Kalman estimation in GPS/INS integration,” The Journal of Navigation, vol. 60, no. 3, pp. 517–529, 2007.
  15. D.-J. Jwo and S.-H. Wang, “Adaptive fuzzy strong tracking extended Kalman filtering for GPS navigation,” IEEE Sensors Journal, vol. 7, no. 5, pp. 778–789, 2007.
  16. R. K. Mehra, “On the identification of variances and adaptive Kalman filtering,” IEEE Transactions on Automatic Control, vol. 15, pp. 175–184, 1970.
  17. R. K. Mehra, “Approaches to adaptive filtering,” IEEE Transactions on Automatic Control, vol. AC-17, no. 5, pp. 693–698, 1972.
  18. P. S. Maybeck, Stochastic Models, Estimation and Control (Volume 2), Academic Press, New York, NY, USA, 1982.
  19. D. J. Lee and K. T. Alfriend, “Adaptive sigma point filtering for state and parameter estimation,” in Proceedings of the AIAA/AAS Astrodynamics Specialist Conference and Exhibit, Providence, RI, USA, 2001.
  20. F. D. Busse, Precise formation-state estimation in low earth orbit using carrier differential GPS [Ph.D. dissertation], Stanford University, 2003.
  21. Y. Shi, C. Han, and Y. Liang, “Adaptive UKF for target tracking with unknown process noise statistics,” in Proceedings of the 12th International Conference on Information Fusion (FUSION '09), pp. 1815–1820, Seattle, Wash, USA, July 2009.
  22. Y. Zhou, Y. F. Zhang, and J. Z. Zhang, “A new adaptive square-root unscented Kalman filter for nonlinear systems,” Applied Mechanics and Materials, vol. 300-301, pp. 623–626, 2013.
  23. A. U. Mageswari, J. J. Ignatious, and R. Vinodha, “A comparitive study of Kalman filter, extended Kalman filter and unscented Kalman filter for harmonic analysis of the non-stationary signals,” International Journal of Scientific & Engineering Research, vol. 3, no. 7, pp. 1–9, 2012.
  24. A. H. Mohamed and K. P. Schwarz, “Adaptive Kalman filtering for INS/GPS,” Journal of Geodesy, vol. 73, no. 4, pp. 193–203, 1999.
  25. A. P. Sage and G. W. Husa, “Adaptive filtering with unknown prior statistics,” in Proceedings of the Joint Automatic Control Conference, pp. 760–769, Boulder, Colo, USA, 1969.
  26. Q. Xia, M. Rao, Y. Ying, S. Shen, and Y. Sun, “A new state estimation algorithm—adaptive fading Kalman filter,” in Proceedings of the 31st IEEE Conference on Decision and Control, pp. 1216–1221, IEEE, Tucson, Ariz, USA, December 1992.
  27. Q. Xia, M. Rao, Y. Ying, and X. Shen, “Adaptive fading Kalman filter with an application,” Automatica, vol. 30, no. 8, pp. 1333–1338, 1994.
  28. Z. Deng and Y. Guo, “Dynamic prediction of the oil and water outputs in oil field,” Acta Automatica Sinica, vol. 9, pp. 121–126, 1983.
  29. J. H. Kotecha and P. M. Djuric, “Gaussian particle filtering,” IEEE Transactions on Signal Processing, vol. 51, no. 10, pp. 2592–2601, 2003.