Improved Particle Filter Using Clustering Similarity of the State Trajectory with Application to Nonlinear Estimation: Theory, Modeling, and Applications

Jiao, Ziquan; Feng, Zhiqiang; Lv, Na; Liu, Wenjing; Qin, Haijian

doi:https://doi.org/10.1155/2021/9916339

Journal of Sensors

Special Issue: Information Fusion and Its Applications for Smart Sensing

Research Article | Open Access

Volume 2021 | Article ID 9916339 | https://doi.org/10.1155/2021/9916339

Ziquan Jiao,¹,² Zhiqiang Feng,² Na Lv,³ Wenjing Liu,⁴ and Haijian Qin²

Academic Editor: Ying-Ren Chien

Received 08 Mar 2021

Revised 13 Apr 2021

Accepted 10 May 2021

Published 26 May 2021

Abstract

A clustering similarity particle filter based on state trajectory consistency is presented for the mathematical modeling, performance estimation, and smart sensing of nonlinear systems. Starting from an information fusion model based on the consistency principle of the spatial state trajectory, the predicted observation information of the current particle filter (original trajectory) and future multistage Gaussian particle filter (modified trajectory) are selected as the state trajectories of the sampling particles. Clustering similarity methods are used to measure the state trajectories of the sampling particles and the actual system (reference trajectory). The importance weight of a first-order Markov model is updated with the measurement results. By integrating the targeted compensation scheme of the latest measurement information into the sequential importance sampling process, the adverse effects of the particle degradation phenomenon are effectively reduced. The convergence theorems of the improved particle filter are proposed and proved. The improved filter is applied to practical cases of nonlinear process estimation, economic statistical prediction, and battery health assessment, and the simulation results show that the improved particle filter is superior to traditional filters in estimation accuracy, efficiency, and robustness.

1. Introduction

Nonlinear phenomena are common in nature and in engineering. State estimation is a popular research topic with important theoretical and practical value for nonlinear problems, and it has been applied to target tracking and navigation, fault diagnosis and detection, process feedback and control, biochemical reaction and extraction, and economic prediction and control. Nonlinear state estimation is used in a wide range of fields, especially in industry. Applications include longitudinal vehicle speed estimation [1], fault detection of piezoceramic actuators [2], battery health assessment [3], state detection of impending rollover [4], and state estimation of dynamic systems with hysteresis [5]. Many solutions have been proposed, such as the Luenberger observer [6], robust observer [7], Gaussian process regression, Kalman filter [8], proportional integral observer [9], unknown input observer [10], high-gain observer [11], and nonsmooth observer [12]. For example, under the assumption of Gaussian noise, the Kalman filter (KF) calculates the conditional probability density of random variables using a recursive formula and iteratively updates the linear minimum variance estimate. Variants derived from it include the extended KF, unscented KF, invariant extended KF [13], and adaptive extended KF [14]. These methods have many advantages and are widely applied to nonlinear state estimation, but there is still considerable room for improvement in handling the nonlinear complexity and environmental noise uncertainty of different practical applications.

With the rapid development of computer technology, the particle filter (PF) algorithm based on Bayesian and Monte Carlo theories has shown many advantages and considerable potential for estimation problems involving nonlinear and non-Gaussian systems. The prototype algorithm, sequential importance sampling (SIS), was formed in the middle of the last century and was mainly applied in physics and automatic control. Due to inherent sample degradation and computer hardware limitations, the study of the PF algorithm slowed until 1993, when Gordon et al. [15, 16] introduced a resampling strategy to SIS and developed sequential importance resampling (SIR), which improved the method and laid the theoretical foundation for the PF. With the development of stochastic probability theory and Monte Carlo methods, the auxiliary particle filter (APF) [17] and Gaussian (sum) particle filter (GPF) [18, 19] were proposed. Introduced by Guarniero et al. [20], the APF is based on the idea that the proposal built from the latest observation will approach the optimal proposal distribution if an auxiliary variable is introduced to represent the prior probability of the current state. However, when the system noise is strong, the filtering accuracy is difficult to guarantee due to the lack of information. The GPF algorithm of Sun et al. [21] uses a Gaussian distribution to approximate the posterior probability density function (PDF) of the system state under the basic SIS framework, and the mean and variance are obtained recursively. Its filtering performance depends heavily on the problem’s degree of nonlinearity and is limited by the dimension of the system variables [22]. As engineering structures become more complex, the degree of nonlinearity of systems is growing. Although these algorithms somewhat improve the performance of PFs, issues remain, such as low estimation accuracy and poor stability due to particle degradation and depletion, and they do not meet the needs of modern engineering. Moreover, because the PF is a probabilistic method, its nonlinear estimates are themselves uncertain [23].

For this reason, a clustering similarity particle filter (CSPF) based on the consistency principle of the spatial state trajectory is presented. The clustering similarity method is used to measure the state distance between the actual system and the sampling particles, including the observation information of the current state filtering and future multistage state prediction, to guide the generation and improvement of new distributions and update the weight calculation of the importance sampling process. This compensates for the standard PF algorithm's use of the prior PDF as the importance function, which can prevent the occurrence of particle degradation and significantly improve the accuracy and robustness of estimation. The resampling strategy in the traditional PF algorithm is abandoned to eliminate particle depletion, which improves the accuracy of uncertainty quantification and the efficiency of the algorithm. The methods reviewed above simply modify the proposal distribution. The designed method uses clustering theory to measure the similarity of observation information corresponding to multistage (from to ) state trajectories so as to guide the generation and updating of the latest proposal distribution, which significantly increases the computational complexity of the designed method. To ensure the efficiency of the improved method for different nonlinear state estimation applications, the following two aspects can be improved. (a) The order of the state trajectory should be selected reasonably. Theoretically, the higher the order of the selected trajectory, the more accurately the corresponding observation information expresses the actual state and the more accurate the estimation result, but the computational efficiency decreases greatly. (b) The number of sampling particles should be reduced appropriately. As the number of particles increases, the sampling probability density function gradually approaches the probability distribution of the actual state, so the accuracy of state estimation improves while the computational effort increases. For these reasons, it is necessary to balance estimation accuracy against computational efficiency. The nonlinear state estimation results are largely affected by the signal information, which involves the quality and scale of the dataset for the research object, appropriate parameter identification, and state tracking training methods. Across different research and application objects, increasing the quality and scale of experimental datasets that contain more physical model information can improve state estimation performance; an appropriate parameter training method ensures that the model obtains as much useful information as possible from the dataset and helps to establish an appropriate state space model. Based on the above measures, and under consistent preconditions, the designed method can greatly improve the accuracy of state results compared to traditional estimation methods, and it also improves computational efficiency.

The remainder of this paper is structured as follows. Section 2 discusses nonlinear system theory. In Section 3, an improved CSPF algorithm is proposed based on the analysis of the defects of the PF. Section 4 provides a theoretical explanation of the improved algorithm and proves the relevant theorems. Section 5 compares the simulation results of the proposed algorithm and the traditional improved PF algorithm. We discuss our conclusions in Section 6.

2. Theory Statement

We summarize the basic definitions and properties of the state space and optimal Bayesian recursion theory of nonlinear systems.

2.1. State Space Model

We set as a random probability space and define two actual vector stochastic processes: and , where sample space is the set of all possible outcomes, event space is a set of outcomes in the sample space, probability function assigns a probability to each event in the event space, is the state process, and is the observation information. Let and be the dimensions of the state and observation information , respectively, corresponding to the state space, and define as the set on -dimensional Euclidean space . Most nonlinear systems can take the form of a dynamic state space (DSS) [24]: where is the discrete-time (stage) index, is the set of system states at time , and is the observation information corresponding to state . and are known state transition and observation functions, respectively, corresponding to the state transition kernel PDF and observation likelihood PDF in the statistical description. The system shift noise and measurement noise are independent and identically distributed (i.i.d.) sequences that obey any PDF form.

The state space follows the first-order Markov process; i.e., the state of the current moment is only related to the state of the previous moment. Assuming an initial distribution , the probability density functions of the state transition kernel PDF and observation likelihood PDF are Lebesgue measures: where and are the probability functions under the influence of shift noise and measurement noise , respectively.
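As a rough illustration of the DSS form described above, the following Python sketch simulates a scalar first-order Markov model. The function name `simulate_dss`, the additive-noise structure, and the callables passed in are assumptions made for illustration only; the general model in the text allows the noise to enter the transition and observation functions in any form.

```python
def simulate_dss(transition, observation, x0, steps, shift_noise, meas_noise):
    """Simulate x_k = f(x_{k-1}, k) + u_k and y_k = h(x_k) + v_k.

    Additive noise is an assumption of this sketch. `shift_noise` and
    `meas_noise` are zero-argument callables returning one noise draw each.
    """
    states, observations = [], []
    x = x0
    for k in range(1, steps + 1):
        # First-order Markov property: x_k depends only on x_{k-1}.
        x = transition(x, k) + shift_noise()
        # The observation depends only on the current state.
        y = observation(x) + meas_noise()
        states.append(x)
        observations.append(y)
    return states, observations
```

With noise-free callables (`lambda: 0.0`) the function simply iterates the deterministic maps, which is a convenient way to check a model before adding noise.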

2.2. Optimal Bayesian Recursion Theory

In Bayesian theory [15], the state of a nonlinear system at time is updated based on the observation information to obtain a minimum mean squared error (MSE) estimate. The optimal state estimation of the system is the conditional expected value of the posterior PDF . Based on the premise that the state variable and observation function follow a first-order Markov process, the posterior PDF is obtained by two steps of recursive iterations: prediction and updating. and are defined as the space path information of the state process and observation likelihood from time to , respectively.

2.2.1. Prediction

Combined with the transition kernel PDF calculated by the state space equation, the prior PDF is predicted using the Chapman–Kolmogorov equation and the posterior PDF at time : where the state PDF specifies the conditional probability of given , and specifies the conditional probability of given .

2.2.2. Updating

The prior PDF is updated using the observation likelihood PDF at time , and the posterior PDF of state is obtained as where the observation PDF specifies the conditional probability of given , and specifies the conditional probability of given .
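For reference, the prediction and updating steps are usually written in the following standard textbook form. The notation ($x_k$ for the state at time $k$, $y_{1:k}$ for the observations up to time $k$) is an assumption here and may differ from the symbols used in the numbered equations of the paper.

```latex
% Prediction (Chapman--Kolmogorov equation)
p(x_k \mid y_{1:k-1}) = \int p(x_k \mid x_{k-1})\, p(x_{k-1} \mid y_{1:k-1})\, \mathrm{d}x_{k-1}

% Updating (Bayes' theorem)
p(x_k \mid y_{1:k}) = \frac{p(y_k \mid x_k)\, p(x_k \mid y_{1:k-1})}
                           {\int p(y_k \mid x_k)\, p(x_k \mid y_{1:k-1})\, \mathrm{d}x_k}
```

The denominator of the updating step is the normalizing constant $p(y_k \mid y_{1:k-1})$, which is the integral that generally has no closed form for nonlinear, non-Gaussian models.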

The system state PDF measure is defined as

The joint posterior PDF measure can be obtained by Bayes’ theorem (Equations (3) and (4)):

The marginal posterior PDF measure is obtained by

Similarly, and are, respectively, defined as the sampling particle path information of the state process and observation likelihood from time to .

Definition 1. Suppose that is a probability measure and represents an arbitrary function, and are arbitrary function variables, is the PDF of the transfer kernel satisfying the Markov process, and the following calculation method is defined: According to the above symbols, for any function , Bayesian theory (prediction and updating processes) can be redefined, using Equation (8), as From Equation (9), it is concluded that Except for a small number of dynamic models, it is difficult to obtain an analytic solution in Bayesian theory (Equations (6)–(9), (11), and (12)) and the exact solution of the posterior probability for general nonlinear and non-Gaussian systems.

3. Particle Filter

To solve the complex problem in the above optimal Bayesian filtering algorithm, Monte Carlo sampling is used instead of an integral operation [25]. The idea is to use a discrete distribution with a series of random samples and their corresponding weights to approximate the posterior PDF measure and calculate the expected value of the samples to estimate the actual system state . The importance PDF is generally used to represent the discrete distribution to obtain the sampling particle set to calculate the posterior empirical measure distribution : where is the Dirac delta function. With the sampling number , the empirical measure is infinitely close to the actual posterior PDF measure .
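The idea of replacing an integral with weighted random samples can be sketched in a few lines of Python. This is a minimal self-normalized importance sampling example under stated assumptions: the helper names, the Gaussian target, and the Gaussian proposal are illustrative choices and are not taken from the paper.

```python
import math
import random


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def weighted_particle_estimate(phi, target_pdf, proposal_pdf, proposal_sample, n, rng):
    """Estimate E[phi(x)] under target_pdf from n particles drawn from the proposal."""
    # Draw particles from the importance (proposal) distribution.
    particles = [proposal_sample(rng) for _ in range(n)]
    # Unnormalized importance weights: target density over proposal density.
    raw = [target_pdf(x) / proposal_pdf(x) for x in particles]
    total = sum(raw)
    weights = [w / total for w in raw]
    # The weighted sum plays the role of the integral against the empirical measure.
    return sum(w * phi(x) for w, x in zip(weights, particles))
```

For example, with a target of mean 1 and a wider zero-mean proposal, `weighted_particle_estimate(lambda x: x, ...)` should return a value close to 1 once `n` is in the thousands, and the approximation tightens as `n` grows.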

3.1. Sequential Importance Resampling (SIR) Filter

Since the posterior PDF distribution is unknown, it is necessary to construct the importance PDF to satisfy the requirements of the Monte Carlo sampling method and make up for the shortcoming that sampling cannot be carried out in the target distribution, and is typically selected during the SIR process.

Assuming that the posterior PDF measure at time is known and the particle set is at time , the prediction measure of the prediction stage can be obtained as

When the number of sample particles is large enough, the prediction measure is infinitely close to the actual state . The Monte Carlo approximate posterior measure is obtained by substituting the prediction measure into Equation (9):

The above formula is equivalent to where is the weight of the importance PDF after normalization of all sampling particles , and the posterior measure is the weighted sum of the Dirac delta function. The above process is called SIS filtering.

After several updating iterations, the weights of some particles in the SIS process may be small enough to ignore, which cannot be avoided due to the shortcomings of the algorithm. To overcome this, resampling is usually used to solve the degradation problem of the standard PF algorithm. By duplicating particles with higher weights and discarding those with smaller weights, the particle set is gathered in the high-probability posterior region to obtain the approximate value of the unweighted empirical distribution measure :

In essence, resampling draws repeated samples from the empirical distribution measure , and the new particle set obtained by this method approximates the actual posterior measure . Common resampling methods are random, systematic, multinomial, and residual resampling. The process of the standard PF algorithm is shown as Algorithm 1.

Step 1. Particle initialization
At time , for
Sample
At time , for
Step 2. Sequential importance sampling
Predict , sample
Evaluate the normalized importance weights
, and
Step 3. Resampling strategy
Calculate effective samples
Compare resampling threshold
if
, and
else
Step 4. State estimation
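
The steps of Algorithm 1 can be sketched roughly as follows for a scalar model. This is a minimal bootstrap SIR step under stated assumptions, not the authors' MATLAB implementation: additive Gaussian shift and measurement noise, systematic resampling, and a threshold expressed as a fraction of the particle count are all choices made here, and every function and parameter name is hypothetical.

```python
import math
import random


def effective_sample_size(weights):
    """Effective number of particles for normalized weights."""
    return 1.0 / sum(w * w for w in weights)


def systematic_resample(particles, weights, rng):
    """Systematic resampling: one random offset, then n evenly spaced pointers."""
    n = len(particles)
    start = rng.random() / n
    cumulative = weights[0]
    i = 0
    resampled = []
    for j in range(n):
        u = start + j / n
        # Advance to the particle whose cumulative weight interval contains u.
        while u >= cumulative and i < n - 1:
            i += 1
            cumulative += weights[i]
        resampled.append(particles[i])
    return resampled


def sir_step(particles, weights, y, k, f, h, shift_std, meas_std, rng, ess_fraction=0.5):
    """One predict / weight / estimate / resample cycle of a bootstrap filter."""
    n = len(particles)
    # Importance sampling: propagate each particle through the transition prior.
    predicted = [f(x, k) + rng.gauss(0.0, shift_std) for x in particles]
    # Weight update with a Gaussian observation likelihood (an assumption).
    raw = [w * math.exp(-0.5 * ((y - h(x)) / meas_std) ** 2)
           for w, x in zip(weights, predicted)]
    total = sum(raw)
    new_weights = [w / total for w in raw] if total > 0.0 else [1.0 / n] * n
    # State estimate as the weighted mean, taken before resampling in this sketch.
    estimate = sum(w * x for w, x in zip(new_weights, predicted))
    # Resample only when the effective sample size falls below the threshold.
    if effective_sample_size(new_weights) < ess_fraction * n:
        predicted = systematic_resample(predicted, new_weights, rng)
        new_weights = [1.0 / n] * n
    return predicted, new_weights, estimate
```

Calling `sir_step` once per observation, feeding the returned particles and weights back in, gives the full filter. Computing the estimate before resampling is a common variance-reducing choice; the listing above places it after the resampling step, and both give the same estimate in expectation.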

3.2. Clustering Similarity Particle Filter (CSPF)

The standard PF algorithm is simple in structure and easy to execute. Under the optimal estimation, the approximate estimated value of the algorithm converges to the actual state value. However, there are some issues in practical engineering applications.

3.2.1. Particle Degradation Phenomenon

The standard PF introduces the importance PDF distribution in the SIS process, which causes the variance of the particle weight to accumulate with each iteration. The importance weights corresponding to most particles tend to zero, resulting in a particle degradation phenomenon [26]. The above effects lead to a significant waste of computing resources, with the result that the approximate estimation cannot accurately describe the posterior distribution of the actual state. This degradation phenomenon cannot be avoided due to defects of the algorithm.

3.2.2. Particle Depletion Problem

A resampling strategy is an effective and important method to improve particle degradation. By resampling the discrete approximate posterior PDF distribution obtained by the importance sampling process, samples with larger weights are duplicated many times under the guidance of the particle motion and the distribution of the state at the previous moment so that the number of effective particles increases and degradation is suppressed. However, resampling is likely to cause the abandonment or loss of some low-weight particles, which causes the resampled particles to prematurely move away from the actual state posterior region. This results in sample dilution [27] and eventually in the increase of state estimation variance, which greatly diminishes filtering performance.

In view of the above problems, our improved PF algorithm relies on the consistency principle of the spatial state trajectory [28]; i.e., the closer the state trajectory of a particle is to the actual state trajectory, the more likely the particle state represents the actual state. Clustering similarity theory is used to measure the degree of trajectory consistency: the higher the similarity, the closer the particle is to the actual state. The particle weights of the SIS process are updated accordingly to mitigate particle degradation. The improved algorithm abandons the resampling strategy, which fundamentally eliminates the particle depletion problem.

The particle set from time to time is selected as the state trajectory at time , where and are predefined constants. The original trajectory set follows the filtering process [15], and the modified trajectory set complies with the prediction algorithm [18]. Because the actual state is unknown, the observation likelihood information is used to represent the consistency parameter of the state trajectory. Depending on the particle state trajectory, the corresponding observed likelihood trajectory set is determined as where the measurement noise is ; i.e., the observation equation is a known function determined by the specific research objects without noise interference. The observed likelihood trajectory corresponding to the actual state (reference trajectory) is . In this work, a clustering method using distance-based similarity is selected to analyze the trajectory consistency, and the distance similarity measurement [29] of the observed likelihood trajectories of the actual state and sampling particles is calculated as where is the distance similarity function, , and is the measurement type parameter. To increase the reliability of the algorithm, the distance similarity function is transformed to an exponential similarity function: where is the gradient factor. The importance weights and corresponding to times and can be calculated as where is the PDF of the observed likelihood noise. Using the above algorithm, the original trajectory set of the process and the corresponding importance weights at time represent the posterior PDF of the system state, modified trajectory set of the GPF-predicted distribution, and corresponding importance weights at time , which can approximately represent the predicted PDF . Therefore, the state estimate value can be obtained by the filtering operation, and the state estimate can be calculated by the prediction step. The implementation of the improved PF algorithm is as follows.

(1) Estimation. This step is consistent with the estimation process used to extract the particle distribution set .

(2) Updating. The weights and are determined and normalized to and , respectively, to estimate and predict system states and :

The above steps constitute an iterative process of the improved algorithm. Unlike the standard PF, this method uses an estimate-update-filter (prediction) process without resampling. The improved PF algorithm (CSPF) is shown as Algorithm 2.

Step 1. Particle initialization
At time , for
Sample
At time , for
Step 2. Importance sampling
Predict , sample
Step 3. Similarity measurement
Particle state trajectories
Draw

Calculate
Exponential transformation
Step 4. Recursive importance weights
Revised proposal distribution

Update weights
Normalize
Step 5. State estimation
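
The similarity measurement of Step 3 can be illustrated with a short Python sketch. Because the paper's exact distance and transformation formulas are not reproduced here, this example assumes a Minkowski distance (with `order` standing in for the measurement type parameter) and an exponential transform `exp(-gradient * distance)` (with `gradient` standing in for the gradient factor). All function names are hypothetical.

```python
import math


def minkowski_distance(reference, trajectory, order=2.0):
    """Minkowski distance between two observation trajectories of equal length."""
    if len(reference) != len(trajectory):
        raise ValueError("trajectories must have the same length")
    return sum(abs(a - b) ** order for a, b in zip(reference, trajectory)) ** (1.0 / order)


def exponential_similarity(distance, gradient=1.0):
    """Map a distance in [0, inf) to a similarity in (0, 1]; closer means larger."""
    return math.exp(-gradient * distance)


def similarity_weights(reference, particle_trajectories, order=2.0, gradient=1.0):
    """Normalized similarity scores, one per particle trajectory."""
    scores = [
        exponential_similarity(minkowski_distance(reference, t, order), gradient)
        for t in particle_trajectories
    ]
    total = sum(scores)
    return [s / total for s in scores]
```

A particle whose noise-free observation trajectory coincides with the reference trajectory receives the largest score, and scores decay smoothly with distance, so no particle is discarded outright as it would be under resampling.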

4. Convergence Proof

The proposed algorithm is based on bootstrap filtering theory, and Bayesian state estimation can be realized by a weighted bootstrap method [15, 30]. It is assumed that the sampling particle set is derived from a continuous PDF, . The posterior PDF and are proportional, and is a known function. If the sample number , then the discrete distribution of particles composed of and its corresponding weights can be regarded as approaching the actual posterior PDF . Referring to Equation (16), the posterior PDF of system state is proportional to the product of the observation likelihood function and prior PDF , which can be equivalent to , and the weight in Equation (22) can be regarded as the observation likelihood function equivalent to , which follows bootstrap filtering theory and is reasonable and effective.

4.1. Convergence of the Improved Algorithm

Suppose that the probability density measure space on set is the probability measure set on the largest-dimensional Euclidean space with convergence topology and set is the measure space. and are two continuous function sequences: . In the stochastic filtering setup, space will be all probability measure spaces on -dimensional Euclidean space .

Definition 2. and , respectively, represent the mapping relationships of measure and of measure . We define as the mapping relation (prediction) satisfied on measure set : This holds for any measure . Therefore, substituting the continuous function in the prediction Equation (11), we obtain The prediction measure expression can be obtained as

Definition 3. Referring to Equation (12), we define as the mapping relation (updating) satisfied on measure set : The Bayesian filtering process can be expressed as where the operator “” represents the composite mapping function.

Definition 4. Setting and as the conversion functions of measure and of measure , respectively, the Bayesian filtering process can be expressed as In an abstract environment, the PF algorithm uses the Monte Carlo method to solve a problem for which it is difficult to obtain the exact analytical integral solution in Bayesian theory. The principle is to generate a series of samples from the target distribution to approximately estimate the partial characteristics of the actual state, and the estimation result is only the expectation of a “good performance” function, which can be approximated as the average value: When , the estimated value converges to the expected value . It can be assumed that where is the most basic digital feature to measure the centralized position or average level of a random variable , and is a numeric characteristic of the dispersion of the random variable .
Based on the law of large numbers and the central limit theorem [31], it can be concluded that where is the probability function.
Therefore, for the analytical solution of the integral operation, the disturbance caused by the Monte Carlo sampling method is inevitable, mainly because the estimated value is based on a random and limited sample set. However, under the guarantee of the law of large numbers and the central limit theorem, when the number of particles tends to infinity, the disturbance is minimal and satisfies the following Gaussian distribution : When , the state estimate converges to the real expected value , and the estimation variance decreases with the increase of the number of sample particles.
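The shrinking of the Monte Carlo disturbance with the number of particles can be checked numerically. The sketch below is only an illustration of the law of large numbers and central limit theorem argument; the function names and the test function used in the usage note are assumptions.

```python
import random
import statistics


def mc_mean(phi, sampler, n, rng):
    """Plain Monte Carlo average of phi over n independent draws."""
    return sum(phi(sampler(rng)) for _ in range(n)) / n


def estimator_spread(phi, sampler, n, repeats, rng):
    """Standard deviation of the Monte Carlo estimate across repeated runs."""
    estimates = [mc_mean(phi, sampler, n, rng) for _ in range(repeats)]
    return statistics.pstdev(estimates)
```

For instance, estimating the second moment of a standard normal variable with `phi = lambda x: x * x`, the spread for 2500 samples should be roughly one fifth of the spread for 100 samples, in line with a standard deviation proportional to the inverse square root of the sample number.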
From the above analysis, it can be concluded that the particle filter is based on Bayesian filtering and can be combined with the Monte Carlo sampling method to generate a sampling disturbance function [32]. The perturbation Equations (33) and (34) can be expressed as The process formulas (29) and (30) of the particle filter algorithm can be expressed as where is the initial value . Our improved algorithm uses clustering analysis to measure the similarity of multistage measurement information [33] as the proposed distribution to replace the prior PDF in the SIS process: The importance weight calculation is updated and modified as follows: Substituting this in Equation (28), the updating formula of the improved algorithm becomes where represents the mapping relationship of the improved algorithm measure and Monte Carlo measure . Referring to Equations (37) and (30), the improved PF can be expressed as

Theorem 5. It is assumed that the state transition kernel function satisfies the first-order Markov process, and the observation likelihood function is continuous, bounded, and strictly positive in . Under the condition of the Monte Carlo sampling disturbance , the improved PF algorithm measure converges to the theoretical value (actual state value) of Bayesian optimal filtering:

Proof. In the PF algorithm, the Monte Carlo sampling disturbance is random and uncertain. is set as a random disturbance, the sample number , and the independent variable . For all measures , where is an i.i.d. random variable with measure . According to the algorithm and the simplification of Equations (29) and (41), we can obtain At time , the measure of the Bayesian prediction stage is , and the sampling disturbance measure of the PF prediction stage is . Using the i.i.d. random variables , , and , we can obtain where is the solution function of set expectation, represents the supremum norm in the domain , and . The summed expectation of the number of sampled particles from 1 to is Hence, This implies that for the prediction stage, the measure at a certain time can be expressed as Referring to Equation (40), for and any function , the updating stage measure can be obtained as This result is compared with Equations (28) and (50), and it is concluded that the improved algorithm has the same measures as the Bayesian filter in the updating stage, i.e., Combining Equations (44), (45), (49), and (51), we can obtain Therefore, the improved PF algorithm (CSPF) based on the clustering similarity of the state trajectories still converges to the actual state under the interference of Monte Carlo sampling disturbances. Theorem 5 is proved.☐

4.2. Convergence of the Mean Squared Error of Results

Combined with the conditions and conclusions of Section 4.1, we analyze the convergence of the results by calculating the boundary of the MSE of the improved algorithm [34]. We demonstrate that the convergence of the reasoning process is related to the number of sample particles at each stage of the algorithm. Suppose that in the neighborhood of , represents a measure sequence of random probability and satisfies . For any function , it can be obtained from Theorem 5 that

Theorem 6. It is assumed that the state transition kernel function satisfies the first-order Markov process, and the observation likelihood function is continuous, bounded, and strictly positive in . For any function , there must be a real constant satisfying where .

Proof. According to the improved PF algorithm, the proof is divided into prediction and updating parts.☐

Lemma 7. Refer to the prediction stage in Algorithm 2 (steps 1 and 2) and assume that the conditions set by Theorem 6 are met. When , there must be a real constant , and the prediction measure satisfies We use induction to complete the proof. When time , for any function , there must be a real constant satisfying From step 1 in Algorithm 2, when , i.i.d. particles are sampled from the prior PDF measure , . Using the Marcinkiewicz–Zygmund inequality [35], we obtain Thus, when , Equation (55) is proved, and converges to .
At time , for any function , there must be a real constant satisfying At time , step 2 in Algorithm 2 can be derived: By substituting the prediction stage formula (11) in the above formula, we obtain Setting as the generated by particle set and combining this with the Monte Carlo method, we obtain Substituting in the above formula, Referring to Equation (58), there must be a real constant: Using the Minkowski inequality, we obtain where . Lemma 7 is proved.

Lemma 8. Refer to the updating stage in Algorithm 2 (steps 3–5) and assume that the conditions set by Theorem 6 and Lemma 7 are met. When , for any function , there must be a real constant , and the prediction measure satisfies where , setting as the generated by particle set .
Combined with the updating stage, we can obtain the following using Equation (12): Substituting Equation (39) in this result yields Similarly, using the Minkowski inequality and Equation (64), we obtain where . Lemma 8 is proved, and the MSE of the improved PF algorithm is convergent.

5. Practical Applications

The performance of a PF-based state estimation method is mainly determined by the construction of the spatial state model and the quality of the algorithm. An accurate spatial state model is mainly achieved by a reasonable selection of prior knowledge of the physical mechanism, as reflected in empirical physical equations, scientific parameter identification and training, multifeature search, and optimization of the noise distribution. The quality of the algorithm is limited by the particle degradation caused by an unreasonable proposal distribution and by the particle depletion caused by the resampling strategy. Therefore, obtaining the best performance from each state estimation method, so that the comparison is fair, depends on the following. (a) The architecture of the spatial state model is consistent. The same empirical physical equation and noise distribution are used in the application cases. The state tracking ability of each method is fully used, and the model parameter identification is trained. (b) The advantages of different methods are fully exploited. Commonly used and effective strategies are random, multinomial, systematic, and residual resampling. The applicability and accuracy of resampling strategies vary for different complex nonlinear systems. For the application cases studied in this paper, analysis of the simulation test results shows that the systematic resampling strategy performs best.

Therefore, given these performance improvements, the improved method can be applied flexibly in practice to solve engineering problems, provided that the application scenario guarantees an accurate state space model of the research object, scientific identification and training of physical parameters, multifeature search, and optimization of the noise distribution. We compared the clustering similarity particle filter (CSPF) to several traditional filtering algorithms on two examples. All methods were implemented in MATLAB, and the root mean squared error (RMSE) was used to evaluate accuracy.

Example 1. The advantages of the improved algorithm can be seen in both Gaussian and non-Gaussian nonlinear systems. A typical univariate nonstationary growth model (UNGM) [36], whose state distribution is highly nonlinear and bimodal, is selected; it is widely used in social and economic fields such as the evaluation of urban development and short-term predictions of insurance stocks and bank interest. The following model was selected to verify the effectiveness of the CSPF algorithm: where the measurement noise , and . The initial state was , , , , , and . The number of particles was . The simulation steps were , where . The correlation coefficients were and , and the gradient factor was . SIR, APF, GPF, and the proposed algorithm were implemented in MATLAB, and 100 trials were conducted. The computer processor speed was 2.7 GHz, and the memory capacity was 8 GB. The RMSE was used to evaluate the performance of the algorithms, where and are, respectively, the real and estimated state values of the system.
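To make the experimental setup more concrete, the sketch below simulates a UNGM and runs a plain bootstrap (SIR) particle filter on it, then reports the RMSE. All specifics are assumptions: the model coefficients are the form of the UNGM commonly used in the particle filtering literature, not necessarily the one used in this paper, and the noise standard deviations, initial state, step count, and particle count are illustrative values chosen here. The filter shown is the SIR baseline with multinomial resampling, not the CSPF.

```python
import numpy as np

def ungm_transition(x, k):
    """Commonly used UNGM state transition (assumed form, without noise)."""
    return 0.5 * x + 25.0 * x / (1.0 + x ** 2) + 8.0 * np.cos(1.2 * k)

def simulate_ungm(steps, q_std, r_std, rng, x0=0.1):
    """Generate a true state sequence and noisy observations."""
    x = np.empty(steps)
    y = np.empty(steps)
    prev = x0
    for k in range(steps):
        prev = ungm_transition(prev, k) + q_std * rng.standard_normal()
        x[k] = prev
        y[k] = prev ** 2 / 20.0 + r_std * rng.standard_normal()
    return x, y

def bootstrap_filter(y, n_particles, q_std, r_std, rng, x0=0.1):
    """SIR baseline: transition prior as proposal, multinomial resampling."""
    particles = x0 + rng.standard_normal(n_particles)
    estimates = np.empty(y.size)
    for k, obs in enumerate(y):
        particles = ungm_transition(particles, k) + q_std * rng.standard_normal(n_particles)
        # Gaussian log-likelihood of the observation, shifted for stability.
        log_w = -0.5 * ((obs - particles ** 2 / 20.0) / r_std) ** 2
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
        estimates[k] = np.sum(w * particles)
        particles = particles[rng.choice(n_particles, size=n_particles, p=w)]
    return estimates

def rmse(truth, estimate):
    """Root mean squared error between two sequences."""
    diff = np.asarray(truth, dtype=float) - np.asarray(estimate, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

rng = np.random.default_rng(42)
truth, observations = simulate_ungm(50, q_std=1.0, r_std=1.0, rng=rng)
estimates = bootstrap_filter(observations, 500, q_std=1.0, r_std=1.0, rng=rng)
print(f"RMSE over 50 steps: {rmse(truth, estimates):.3f}")
```

Because the observation depends on the square of the state, the posterior is often bimodal, which is one reason this model is a popular stress test for nonlinear filters.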

The basic framework of the particle filter has a state estimation part and an update part. State estimation predicts the state at the current time from the state prediction of the previous time and the system state shift equation (including shift noise). Because the state variables are inevitably disturbed by various noises in the working environment, the estimated values must be corrected with actual observations so that they are as close to the real state values as possible; this is the update step, which uses the observation values and the measurement noise. In this way, the probability density function of the unknown system state is constructed from prior knowledge and actual observation data. The shift and measurement noise therefore directly and strongly affect the nonlinear estimation method. Taking this case as an example, we set different kinds of shift noise to analyze the performance of the nonlinear estimation methods. The two types of shift noise were Gaussian noise and non-Gaussian noise , where , , and . The non-Gaussian noise followed a heavy-tailed distribution, and the acceptance-rejection sampling procedure used in Monte Carlo simulations [25] was adopted to achieve noise sampling with a confidence level of 97.5%. The simulation results are shown in Figures 1–3 and Tables 1 and 2.
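The acceptance-rejection procedure mentioned above can be sketched as follows. The heavy-tailed target here is a Student-t distribution with 3 degrees of freedom and the proposal is a standard Cauchy distribution; both are assumptions chosen for illustration, since the paper's actual non-Gaussian density and parameters are not recoverable from the text. The 97.5% confidence level mentioned above is not modeled.

```python
import numpy as np

def student_t3_pdf(x):
    """Density of the Student-t distribution with 3 degrees of freedom."""
    return 2.0 / (np.pi * np.sqrt(3.0) * (1.0 + x ** 2 / 3.0) ** 2)

def cauchy_pdf(x):
    """Density of the standard Cauchy proposal."""
    return 1.0 / (np.pi * (1.0 + x ** 2))

# Smallest constant M with t3_pdf(x) <= M * cauchy_pdf(x) for all x;
# the ratio of the two densities is maximal at |x| = 1.
ENVELOPE = 9.0 / (4.0 * np.sqrt(3.0))

def sample_heavy_tailed(n, rng):
    """Draw n samples from the t3 target by acceptance-rejection."""
    accepted = np.empty(0)
    while accepted.size < n:
        proposal = rng.standard_cauchy(n)
        u = rng.random(n)
        # Accept a proposal with probability target / (M * proposal density).
        keep = u * ENVELOPE * cauchy_pdf(proposal) <= student_t3_pdf(proposal)
        accepted = np.concatenate([accepted, proposal[keep]])
    return accepted[:n]

rng = np.random.default_rng(1)
noise = sample_heavy_tailed(20000, rng)
print(f"median = {np.median(noise):.3f}, share within [-1, 1] = {np.mean(np.abs(noise) < 1):.3f}")
```

The expected acceptance rate is 1 / `ENVELOPE`, roughly 77%, so the loop usually finishes in two passes.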

The consistency of the state trajectory of the improved PF algorithm (clustering similarity particle filter (CSPF)) was measured by the Euclidean spatial distance (ECSPF), , and Chebyshev spatial distance (CCSPF), . Figures 1 and 2 compare the PF algorithms for a single run of the nonlinear system in Gaussian and non-Gaussian noise environments (the improved algorithm is represented by ECSPF), which allows the state estimation to be evaluated directly. Figure 3 compares the RMSE of the simulation results after 15 simulation runs. Algorithm labels ending in the letter G are denoted by solid curves and correspond to Gaussian system noise , and those ending in the letters NG are denoted by dotted curves and correspond to non-Gaussian system noise . Tables 1 and 2 show the mean and variance of the state estimation results after 100 simulation runs to compare the accuracy of the improved algorithm with that of the other algorithms.
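The two distance measures named above can be sketched in Python as follows. The distance functions are the standard Euclidean and Chebyshev definitions; the mapping from distance to importance weight (`similarity_weights`, an exponential kernel with a `scale` parameter) is a hypothetical choice made only for illustration and is not the weight update defined in the paper.

```python
import numpy as np

def euclidean_distance(trajectories, reference):
    """Euclidean distance between each particle trajectory and the reference."""
    return np.sqrt(np.sum((trajectories - reference) ** 2, axis=-1))

def chebyshev_distance(trajectories, reference):
    """Chebyshev (maximum absolute deviation) distance to the reference."""
    return np.max(np.abs(trajectories - reference), axis=-1)

def similarity_weights(distances, scale=1.0):
    """Hypothetical kernel: smaller distance gives a larger normalized weight."""
    w = np.exp(-(distances - distances.min()) / scale)
    return w / w.sum()

# Illustrative data: one reference trajectory and three particle trajectories.
reference = np.array([1.0, 2.0, 3.0])
particle_trajectories = np.array([
    [1.0, 2.0, 3.0],
    [1.5, 2.5, 2.0],
    [4.0, 2.0, 3.0],
])

d_euclid = euclidean_distance(particle_trajectories, reference)
d_cheby = chebyshev_distance(particle_trajectories, reference)
weights = similarity_weights(d_euclid)
print(d_euclid, d_cheby, weights)
```

The Euclidean measure accumulates deviations over the whole trajectory, whereas the Chebyshev measure reacts only to the single largest deviation, so the two can rank the same particles differently.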

Figures 1 and 2 show that the state estimation accuracy of CSPF was significantly better than that of the other algorithms in both noise environments, and the CSPF estimate followed the fluctuations of the actual state curve closely at every time step. By contrast, the other algorithms were not consistent at each time point. As shown in Figure 3 and Tables 1 and 2, the means and variances of the two improved algorithms were lower than those of SIR, APF, and GPF. For example, the prediction accuracy (variance) of CSPF was improved by 65%–69% (67%–89%) in the Gaussian noise environment and by 52%–57% (14%–54%) in the non-Gaussian noise environment. Therefore, the improved algorithm was more accurate and more stable, and ECSPF performed particularly well.

For the same running time, the state estimation accuracy of the different algorithms was determined by adjusting the number of sampling particles and the order of the state trajectory. The efficiency of the improved algorithm was then verified by comparing the corresponding operational cost (see Table 3 for details). In Gaussian and non-Gaussian noise environments, the improved algorithm with Euclidean and Chebyshev distance similarity measures was compared with SIR, APF, and GPF. For the same operational cost, i.e., for the same computing and simulation times, the improved algorithm not only used the fewest sampling particles but also had significantly better accuracy than the other three algorithms. The RMSE was reduced by factors of about 3 and 2 in Gaussian and non-Gaussian noise environments, respectively. This verified that the improved algorithm had higher operational efficiency and substantial advantages in computation time.

Example 2. We assessed the health status of a lithium-ion battery. Through the state tracking and capacity training of historical samples, the physical model parameters of the empirical degradation and distribution information of the noise were identified and optimized [3]. Different nonlinear methods were used to establish state tracking and remaining useful life (RUL) prediction models for the battery. The performances of various algorithms were evaluated based on the state tracking effect and prediction accuracy.

In order to solve the problem regarding battery health assessment, it is particularly important to ensure the accurate structure of the decay physical model (state space model) and scientific parameter identification; this is done under the premise of the reasonable selection of the state tracking training set (noise distribution) and initial parameters. In this paper, parameter identification is carried out based on the attenuation information of state tracking historical samples to construct the RUL prediction model [37]. Since the accuracy of state tracking and RUL prediction largely depends on the physical model of the battery capacity degradation, the development of the model requires physical knowledge of the system [38]. This is usually represented by information collected by sensors, including battery parameters (e.g., charge-discharge voltage and current, power, electrochemical impedance spectroscopy, frequency, and temperature), to build an equivalent circuit model [39] to characterize the degradation trend of battery capacity, as shown in Figure 4, which includes the open-circuit voltage (), electrolyte resistance , polarization current , double-layer voltage , capacitance , charge transfer resistance , Warburg impedance , load current , and terminal voltage .

Relying on the attenuation mechanism of the electrochemical characteristics, the relationship between capacity degradation and internal impedance of the battery was determined using statistical regression. The simulated attenuation characteristic of the impedance increased with the number of charge-discharge cycles to obtain a double-exponential empirical degradation model [40, 41], where is the battery capacity and is the number of charge-discharge cycles. The unknown model parameters and are related to the battery impedance, and are related to the rate of capacity degradation, and is the exponential function.
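For readers without access to the typeset equation, the sketch below shows the double-exponential capacity model in the form in which it commonly appears in the battery prognostics literature, Q(k) = a·exp(b·k) + c·exp(d·k). The symbol names `a`, `b`, `c`, `d`, the random-walk parameter update, and all numerical values are assumptions for illustration; they are not the parameters identified in this paper.

```python
import numpy as np

def double_exponential_capacity(k, a, b, c, d):
    """Capacity after k charge-discharge cycles (assumed model form)."""
    k = np.asarray(k, dtype=float)
    return a * np.exp(b * k) + c * np.exp(d * k)

def propagate_parameters(theta, noise_std, rng):
    """Hypothetical state transition: random walk on the model parameters."""
    theta = np.asarray(theta, dtype=float)
    return theta + noise_std * rng.standard_normal(theta.shape)

# Illustrative parameter values only, chosen so that capacity decays slowly.
theta = np.array([-1e-4, 0.05, 0.9, -0.001])
cycles = np.arange(0, 151)
capacity = double_exponential_capacity(cycles, *theta)

rng = np.random.default_rng(3)
theta_next = propagate_parameters(theta, noise_std=np.array([1e-6, 1e-4, 1e-3, 1e-5]), rng=rng)
print(capacity[[0, 50, 100, 150]], theta_next)
```

In a particle filter built on such a model, each particle would carry its own parameter vector, and the observation equation would compare the modeled capacity with the measured capacity at each cycle.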

A tracking and training model was constructed using historical samples of the battery capacity to estimate the model parameters of the empirical degradation in real time and optimize the multicharacteristic noise [42]. The identified physical model effectively converged to the gradual trend of actual battery degradation, which provided reasonable identification parameters and effective initial values to establish an RUL prediction model. This enabled the updating of the state space model composed of the state transition equation (physical model of the battery system), which can represent the recursion law of the system, and the observation equation (data characteristic measurement relationship), which can transform the implicit information of the system to visible output. The state estimation of the current time is obtained by the state transfer process and the prediction results of the previous time. Using the error between the actual measurement (noise interference) and the estimated observation information at the current moment, a weighted correction term is generated to realize the updating process. This allows the state estimation and prediction system model to be obtained with high reliability, and the process of battery state tracking and RUL predictive evaluation can be realized. A flowchart is shown in Figure 5.

Using the state tracking process to identify the model parameters and predict the battery RUL, the SIR, APF, and CSPF algorithms were compared (the GPF is not suitable for this example because its application is limited by the dimension of the system variables). State tracking parameters were identified before the charge-discharge cycle ; i.e., we estimated the optimal model parameters that minimize the error between the predicted value of the algorithm and the actual value of the experiment. After the charge-discharge cycle , the battery capacity was predicted to determine whether the failure threshold was exceeded. The original capacity degradation data of the lithium-ion battery came from the open-source experimental data of the Center for Advanced Life Cycle Engineering (CALCE) at the University of Maryland. An Arbin BT2000 battery system was the experimental platform, and the test data were stored in Excel format. A data sample with a normalized capacity of 0.90 Ah, M8, was selected as the test set for comparison of the state assessment methods. In each charge-discharge cycle, the battery was charged at the standard constant current of 0.5 C until it reached 4.2 V and was then switched to constant-voltage charging until the current decayed to 0.05 A; the discharge was considered complete when the voltage dropped to 2.7 V [43], as shown in Table 4. Because of measurement equipment error and human misoperation during data collection, the battery dataset contained a small amount of abnormal data, which was removed because it would have affected the quality of the history dataset. The size of the dataset also affects the operational cost, and simplifying the data by downsampling reduces this cost. Therefore, the original data were preprocessed, filtered, and simplified.

Generally, 80% of the rated capacity was taken as the end-of-life threshold , and the actual failure threshold was 133 cycles according to open-source data, where the state tracking set was and the number of particles was . The simulation results are shown in Figure 6 and Tables 5 and 6. Figure 6 can be used to compare the state tracking and the capacity prediction accuracy of the different algorithms. Tables 5 and 6 compare the state tracking and prediction results obtained from multiple simulation runs. The evaluation indicators of the state tracking effect are the sum of squared error (SSE), , , and coefficient of determination (R²). The closer the error indicators were to 0 and the closer R² was to 1, the better the state tracking. The evaluation indicators of the prediction accuracy were the mean, variance, relative error, and of the prediction failure threshold.
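The indicators and the end-of-life rule described above can be sketched as follows. The sum of squared error and the coefficient of determination follow their standard definitions; MSE and RMSE are included as plausible candidates for the indicators whose names are missing from the text, which is an assumption. The function names and the toy data are hypothetical.

```python
import numpy as np

def tracking_metrics(actual, predicted):
    """Standard goodness-of-fit indicators for a tracked capacity curve."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    residual = actual - predicted
    sse = float(np.sum(residual ** 2))
    mse = sse / actual.size
    total = float(np.sum((actual - actual.mean()) ** 2))
    return {"SSE": sse, "MSE": mse, "RMSE": mse ** 0.5, "R2": 1.0 - sse / total}

def end_of_life_cycle(cycles, capacity, rated_capacity, fraction=0.8):
    """First cycle whose capacity falls below fraction * rated capacity."""
    threshold = fraction * rated_capacity
    below = np.flatnonzero(np.asarray(capacity, dtype=float) < threshold)
    return int(np.asarray(cycles)[below[0]]) if below.size else None

# Toy data for illustration only; not taken from the CALCE dataset.
metrics = tracking_metrics([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 5.0])
eol = end_of_life_cycle([1, 2, 3, 4], [0.90, 0.80, 0.71, 0.60], rated_capacity=0.90)
print(metrics, eol)
```

With a rated capacity of 0.90 Ah, the 80% rule places the threshold at 0.72 Ah, so the toy curve reaches end of life at its third cycle.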

Figure 6: (a) SIR; (b) APF; (c) ECSPF; (d) CCSPF.

Figure 6 and Table 5 show that, compared with the SIR and APF algorithms, the state tracking indicators of the proposed CSPF algorithm were superior, and the Euclidean distance measure yielded the best results. The excellent state tracking of the CSPF method also guarantees that the attenuation mechanism tracked in the early period is the same as that used for later-stage predictions over the whole life cycle of the battery. As shown in Figure 6 and Table 6, after 100 simulations using SIR, APF, ECSPF, and CCSPF, the average failure thresholds of RUL prediction were 113, 115, 123, and 122 cycles, respectively, and the relative error was within about 15.0%. Taking ECSPF as an example, the variance, relative error, and of the RUL prediction failure threshold were significantly lower than those of SIR and APF, and the prediction trend was closer to the actual capacity degradation curve. The relative error of ECSPF was better than those of SIR and APF by about 50% and 45%, respectively, and the threshold variances were lower by factors of about 26 and 80, respectively. was about 55% and 60% greater, respectively. The results show that the prediction accuracy of CSPF was higher than those of the other algorithms, the dispersion of the particle set in the prediction results was the smallest, the expressed uncertainty was the lowest, and the algorithm was more robust.
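The relative errors quoted above can be reproduced from the reported numbers alone. The short sketch below uses only the actual failure threshold (133 cycles) and the four average predicted thresholds given in the text; the variable and function names are illustrative.

```python
actual_cycle = 133
mean_predicted = {"SIR": 113, "APF": 115, "ECSPF": 123, "CCSPF": 122}

# Relative error of each average predicted failure threshold.
relative_error = {
    name: abs(value - actual_cycle) / actual_cycle
    for name, value in mean_predicted.items()
}

def improvement(better, baseline):
    """Fractional reduction in relative error of `better` versus `baseline`."""
    return 1.0 - relative_error[better] / relative_error[baseline]

for name, err in relative_error.items():
    print(f"{name}: {100 * err:.1f}%")
print(f"ECSPF vs SIR: {100 * improvement('ECSPF', 'SIR'):.1f}%")
print(f"ECSPF vs APF: {100 * improvement('ECSPF', 'APF'):.1f}%")
```

This gives relative errors of 15.0%, 13.5%, 7.5%, and 8.3% for SIR, APF, ECSPF, and CCSPF, and improvements of 50.0% and 44.4% for ECSPF over SIR and APF, which agrees with the rounded figures of about 50% and 45% in the text.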

6. Conclusions

To address the difficulties of the standard PF algorithm in nonlinear system state estimation, such as low precision, instability, and low computational efficiency, we proposed an improved PF algorithm (CSPF) based on the consistency principle of the spatial state trajectory. Relying on the model construction of spatial trajectories between sampled particles and actual states, current and future multistage measurement information was predicted by SIS and GPF to form trajectories combining the original and modified trajectories. The similarity of the combined trajectories was measured by clustering analysis to guide the generation of new distributions and update the particle importance weights, which mitigates the particle degradation phenomenon. Because resampling is not adopted in the improved algorithm, the problem of particle depletion is fundamentally eliminated. The convergence of the CSPF and of the MSE of its results was proved. The effectiveness of CSPF was verified by comparison with current methods in cases of socioeconomic prediction and battery health assessment. The application experiments showed that, compared to SIR, APF, and GPF, CSPF had higher accuracy and better robustness for the state estimation of nonlinear systems under the influence of Gaussian or non-Gaussian noise. The primary conclusions are summarized as follows:
(1) In Gaussian and non-Gaussian noise environments, the prediction accuracy of CSPF was improved significantly, by more than 50% in typical cases.
(2) The latest measurement information was used to update the new proposal distribution, and the computational cost of CSPF was reduced by adjusting the number of sampling particles and the order of the state trajectory. In a typical case, the RMSE of CSPF was reduced by a factor of more than 2 in the same computing time.
(3) The first-order Markov model was modified by the clustering similarity of state trajectories so that the state tracking indicators improved, which provided an accuracy guarantee for the prediction model. In the battery application, the prediction accuracy of CSPF was the best, and the relative error of the RUL failure threshold was maintained within 8%.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors sincerely thank the Guangxi Engineering Technology Research Center of Ship Digital Design and Advanced Manufacturing and the Intelligentized Robotic Welding Technology Laboratory of Shanghai Jiao Tong University for their intelligence support and valuable comments in performing this research. We appreciate the generous financial support of the National Natural Science Foundation of China (Nos. 51969001 and 61763006), the Guangxi Natural Science Foundation of China (Nos. 2018GXNSFAA138080 and 2021GXNSFBA075023), the Guangxi Science and Technology Plan Project of China (No. GuikeAD18281007), and the Innovation Project of Guangxi Graduate Education (YCBZ2019050).

References

[1] X. Ding, Z. Wang, L. Zhang, and C. Wang, “Longitudinal vehicle speed estimation for four-wheel-independently-actuated electric vehicles based on multi-sensor fusion,” IEEE Transactions on Vehicular Technology, vol. 69, no. 11, pp. 12797–12806, 2020.
[2] Z. Zhou, Y. Tan, and R. Dong, “Fault detection of piezoceramic actuator using non-smooth observer,” International Journal of Applied Electromagnetics and Mechanics, vol. 47, no. 4, pp. 975–991, 2015.
[3] Q. Wang, Z. Wang, L. Zhang, P. Liu, and Z. Zhang, “A novel consistency evaluation method for series-connected battery systems based on real-world operation data,” IEEE Transactions on Transportation Electrification, vol. 7, no. 2, pp. 437–451, 2021.
[4] C. Wang, Z. Wang, L. Zhang, D. Cao, and D. G. Dorrell, “A vehicle rollover evaluation system based on enabling state and parameter estimation,” IEEE Transactions on Industrial Informatics, vol. 17, no. 6, pp. 4003–4013, 2021.
[5] Z. Zhou, Y. Tan, and X. Liu, “State estimation of dynamic systems with sandwich structure and hysteresis,” Mechanical Systems and Signal Processing, vol. 126, pp. 82–97, 2019.
[6] D. Luenberger, “An introduction to observers,” IEEE Transactions on Automatic Control, vol. 16, no. 6, pp. 596–602, 1971.
[7] Z. Zhou, Y. Tan, R. Dong, and L. Zhang, “Fault detection for sandwich systems with hysteresis based on robust observer,” International Journal of Applied Electromagnetics and Mechanics, vol. 49, no. 4, pp. 577–595, 2015.
[8] R. E. Kalman, “A new approach to linear filtering and prediction problems,” Journal of Basic Engineering, vol. 82, no. 1, pp. 35–45, 1960.
[9] Z. Zhou and X. Liu, “State and fault estimation of sandwich systems with hysteresis,” International Journal of Robust and Nonlinear Control, vol. 28, no. 13, pp. 3974–3986, 2018.
[10] Y. Guan and M. Saif, “A novel approach to the design of unknown input observers,” IEEE Transactions on Automatic Control, vol. 36, no. 5, pp. 632–635, 1991.
[11] H. K. Khalil and L. Praly, “High-gain observers in nonlinear feedback control,” International Journal of Robust and Nonlinear Control, vol. 24, no. 6, pp. 991-992, 2014.
[12] Z. Zhou, Y. Tan, Y. Xie, and R. Dong, “State estimation of a compound non-smooth sandwich system with backlash and dead zone,” Mechanical Systems and Signal Processing, vol. 83, pp. 439–449, 2017.
[13] K. S. Phogat and D. E. Chang, “Discrete-time invariant extended Kalman filter on matrix Lie groups,” International Journal of Robust and Nonlinear Control, vol. 30, no. 12, pp. 4449–4462, 2020.
[14] Q. Meng, Y. Sun, and Z. Cao, “Adaptive extended Kalman filter (AEKF)-based mobile robot localization using sonar,” Robotica, vol. 18, no. 5, pp. 459–473, 2000.
[15] N. J. Gordon, D. J. Salmond, and A. F. M. Smith, “Novel approach to nonlinear/non-Gaussian Bayesian state estimation,” IEE Proceedings F Radar and Signal Processing, vol. 140, no. 2, pp. 107–113, 1993.
[16] M. S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp, “A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking,” IEEE Transactions on Signal Processing, vol. 50, no. 2, pp. 174–188, 2002.
[17] M. K. Pitt and N. Shephard, “Filtering via simulation: auxiliary particle filters,” Journal of the American Statistical Association, vol. 94, no. 446, pp. 590–599, 1999.
[18] J. H. Kotecha and P. M. Djuric, “Gaussian particle filtering,” IEEE Transactions on Signal Processing, vol. 51, no. 10, pp. 2592–2601, 2003.
[19] J. H. Kotecha and P. M. Djuric, “Gaussian sum particle filtering,” IEEE Transactions on Signal Processing, vol. 51, no. 10, pp. 2602–2612, 2003.
[20] P. Guarniero, A. M. Johansen, and A. Lee, “The iterated auxiliary particle filter,” Journal of the American Statistical Association, vol. 112, no. 520, pp. 1636–1647, 2017.
[21] Y. Sun, X. Ran, Y. Li, G. Zhang, and Y. Zhang, “Thruster fault diagnosis method based on Gaussian particle filter for autonomous underwater vehicles,” International Journal of Naval Architecture and Ocean Engineering, vol. 8, no. 3, pp. 243–251, 2016.
[22] M. L. Psiaki, J. R. Schoenberg, and I. T. Miller, “Gaussian sum reapproximation for use in a nonlinear filter,” Journal of Guidance, Control, and Dynamics, vol. 38, no. 2, pp. 292–303, 2015.
[23] X. Tang, K. Liu, X. Wang, B. Liu, F. Gao, and W. D. Widanage, “Real-time aging trajectory prediction using a base model-oriented gradient-correction particle filter for lithium-ion batteries,” Journal of Power Sources, vol. 440, pp. 227118.1–227118.11, 2019.
[24] X. Fu and Y. Jia, “An improvement on resampling algorithm of particle filters,” IEEE Transactions on Signal Processing, vol. 58, no. 10, pp. 5414–5420, 2010.
[25] A. T. Cemgil, “A tutorial introduction to Monte Carlo methods, Markov Chain Monte Carlo and particle filtering,” Academic Press Library in Signal Processing, vol. 1, pp. 1065–1114, 2014.
[26] M. Ahwiadi and W. Wang, “An adaptive particle filter technique for system state estimation and prognosis,” IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 9, pp. 6756–6765, 2020.
[27] R. Havangi, “Robust evolutionary particle filter,” ISA Transactions, vol. 57, pp. 179–188, 2015.
[28] A. Salarpour and H. Khotanlou, “Direction-based similarity measure to trajectory clustering,” IET Signal Processing, vol. 13, no. 1, pp. 70–76, 2019.
[29] H. He and Y. Tan, “Unsupervised classification of multivariate time series using VPCA and fuzzy clustering with spatial weighted matrix distance,” IEEE Transactions on Cybernetics, vol. 50, no. 3, pp. 1096–1105, 2020.
[30] J. Candy, “Bootstrap particle filtering,” IEEE Signal Processing Magazine, vol. 24, no. 4, pp. 73–85, 2007.
[31] L. Zhang, Y. Zhu, P. Shi, and Y. Zhao, “Resilient asynchronous filtering for Markov jump neural networks with unideal measurements and multiplicative noises,” IEEE Transactions on Cybernetics, vol. 45, no. 12, pp. 2840–2852, 2015.
[32] Z. Zhou, Y. Tan, and P. Shi, “Fault detection of a sandwich system with dead-zone based on robust observer,” Systems & Control Letters, vol. 96, pp. 132–140, 2016.
[33] H. He, Y. Tan, and J. Xing, “Unsupervised classification of 12-lead ECG signals using wavelet tensor decomposition and two-dimensional Gaussian spectral clustering,” Knowledge-Based Systems, vol. 163, pp. 392–403, 2019.
[34] Z. Zhou, Y. Tan, Y. Xie, and R. Dong, “Soft measurement of states of sandwich system with dead zone and its application,” Measurement, vol. 87, no. 1, pp. 219–234, 2016.
[35] H. Zhou, Z. Deng, Y. Xia, and M. Fu, “A new sampling method in particle filter based on Pearson correlation coefficient,” Neurocomputing, vol. 216, pp. 208–215, 2016.
[36] M. Rhif, A. B. Abbes, I. Farah, B. Martínez, and Y. Sang, “Wavelet transform application for/in non-stationary time-series analysis: a review,” Applied Sciences, vol. 9, no. 7, 2019.
[37] K. Liu, Y. Shang, Q. Ouyang, and W. D. Widanage, “A data-driven approach with uncertainty quantification for predicting future capacities and remaining useful life of lithium-ion battery,” IEEE Transactions on Industrial Electronics, vol. 68, no. 4, pp. 3170–3180, 2021.
[38] A. Wen, J. Meng, J. Peng, L. Cai, and Q. Xiao, “Online parameter identification of the lithium-ion battery with refined instrumental variable estimation,” Complexity, vol. 2020, Article ID 8854618, 12 pages, 2020.
[39] B. Saha, K. Goebel, and J. Christophersen, “Comparison of prognostic algorithms for estimating remaining useful life of batteries,” Transactions of the Institute of Measurement and Control, vol. 31, no. 3-4, pp. 293–308, 2009.
[40] R. B. Wright, C. G. Motloch, J. R. Belt et al., “Calendar- and cycle-life studies of advanced technology development program generation 1 lithium-ion batteries,” Journal of Power Sources, vol. 110, no. 2, pp. 445–470, 2002.
[41] W. He, N. Williard, M. Osterman, and M. Pecht, “Prognostics of lithium-ion batteries based on Dempster-Shafer theory and the Bayesian Monte Carlo method,” Journal of Power Sources, vol. 196, no. 23, pp. 10314–10321, 2011.
[42] Z. Wei, J. Zhao, D. Ji, and K. J. Tseng, “A multi-timescale estimator for battery state of charge and capacity dual estimation based on an online identified model,” Applied Energy, vol. 204, pp. 1264–1274, 2017.
[43] Z. Jiao, X. Fan, X. Zhang, Y. Luo, and Y. Liu, “State tracking and remaining useful life predictive method of Li-ion battery based on improved particle filter algorithm,” Transactions of China Electrotechnical Society, vol. 35, no. 18, pp. 3979–3993, 2020.

Copyright

Copyright © 2021 Ziquan Jiao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
