Abstract

In the car control systems, it is hard to measure some key vehicle states directly and accurately when running on the road and the cost of the measurement is high as well. To address these problems, a vehicle state estimation method based on the kernel principal component analysis and the improved Elman neural network is proposed. Combining with nonlinear vehicle model of three degrees of freedom (3 DOF), longitudinal, lateral, and yaw motion, this paper applies the method to the soft sensor of the vehicle states. The simulation results of the double lane change tested by Matlab/SIMULINK cosimulation prove the KPCA-IENN algorithm (kernel principal component algorithm and improved Elman neural network) to be quick and precise when tracking the vehicle states within the nonlinear area. This algorithm method can meet the software performance requirements of the vehicle states estimation in precision, tracking speed, noise suppression, and other aspects.

1. Introduction

Electronic control systems are increasingly being applied to modern vehicles to improve its safety, stability, and comfort. Electronic stability program (ESP), four wheel steering (4WS), and active steering control (ASC) improve the handling stability of a vehicle by controlling the body lateral and yawing motion. Active roll control (ARC) and active body control (ABC) improve the driving comfort by controlling the vertical movement and roll stability. The premise condition to realize the particular functions of these electronic control systems is to obtain accurate state parameters of vehicle when car is running on the road. Those parameters include vertical speed, lateral acceleration, yaw velocity, sideslip angle, and so on. Due to the measuring technology and its cost, it is quite a challenge to measure all the important state variables directly. At present, the soft sensor technology based on state estimation has been successfully applied to the estimation of vehicle dynamics and kinematics parameter information. Venhovens and Naab [1] and Gustafsson et al. [2] estimated the lateral acceleration, yaw velocity, tire force, and road tire friction coefficient by using Kalman filter (KF). Wenzel et al. [3, 4] estimate vehicle’s state variables and parametric variables by adopting dual extended Kalman filter parallel estimating method. Some other scholars use Luenberger observer and sliding mode observer to estimate sideslip angle and yawing force. Among all these estimation algorithms, the state estimation method based on KF is the most widely applied and has achieved relatively better results in field tests. However, the design difficulties of the KF algorithm are (1) the requirement of accurate knowledge for the model of the system observed; (2) the requirement of the statistical properties of the known system noise and the measurement noise.

Principal component analysis (PCA) is a kind of linear algorithm. It only considers the second-order statistics and can only extract the linear relation of the data. When nonlinear relation exists among a large number of variables of a nonlinear system, the principal component analysis (PCA) will not meet the requirements any more. While the kernel principal component analysis (KPCA) is a nonlinear extension of the principal component analysis (PCA) [57]. Through nonlinear mapping, it transfers the original data from data space into feature space and then determines the principal components in high dimensional feature space through principal component analysis (PCA). Thus it has a strong capability of nonlinear approximation.

Literature [8] did the clustering by altering the objective function in the FCM-AWA algorithm using kernel distance to improve the classification accuracy under noise circumstance. Literatures [9, 10] introduced Mexican hat wavelet function into the kernel principal component analysis (KPCA) and presented a feature reduction method based on wavelet kernel-PCA (WK-PCA). This method can identify and diagnose the rotor fault modes rapidly and efficiently. Literature [11] proposed the Bayes Support Vector Machine estimation method with high forecasting accuracy and robustness.

Elman neural network is based on BP network. It improves the network’s processing capability of dynamic information by adding a context node. This thesis proposes a new vehicle state estimation method based on the modified Elman neural network. This method uses error analysis to adjust the gain factors in real time, thereby achieving real time updating of the network structure.

The paper comes up with a soft sensor modeling method based on the kernel principal component analysis and the modified Elman neural network by combining Mahalanobis Distance, the kernel principal component analysis (KPCA), and the modified Elman neural network. When used in the soft sensor of the key states such as the vehicle’s yaw rate, this method is proved to have higher forecasting accuracy in simulation tests and satisfied the software performance requirements of the vehicle states estimator.

2. Vehicle Model and Principles of KPCA-IENN

2.1. Nonlinear Dynamic Model of Vehicle

The 2-DOF model used to analyze the vehicle’s yaw motion and lateral motion is a linear model, which is on the assumption that the vehicle is driving in uniform velocity. But in reality, longitudinal velocity is subject to change at any time and this changing longitudinal velocity has a significant impact on the yaw and lateral motion and constitutes a nonlinear relation among the state variables. This paper established the vehicle state estimation soft sensor model based on the longitudinal, lateral, and yaw 3-DOF nonlinear vehicle model. Figure 1 is the 3-DOF vehicle dynamic model.

As is shown in Figure 1, is an absolute coordinate system fixed in the horizontal plane, and the is the vehicle coordinate fixed on the gravity center, where the axis overlaps with the vehicle’s longitudinal axis and is positive toward the direction of the vehicle movement; axis is through the vehicle gravity center and is positive to the left. All the angles and torques within the plane of are positive counter-clockwise and all the components of vectors are positive with the axis direction. The state equations and measurement equations of vehicle nonlinear dynamic model of 3 DOF are shown as

In the equation, is the yaw velocity; is the sideslip angle; is the longitudinal velocity; is the front wheel angle; is the lateral acceleration; is the longitudinal acceleration; is the vehicle mass; is the moment of inertia about the -axis; is front wheelbase; is the rear wheelbase; is the front wheel effective cornering stiffness; is the rear wheel cornering stiffness.

With the steering wheel angle as control input and the steering wheel to the front wheel angle transmission ratio , we get

2.2. Soft Sensor Technology

Soft sensor technology is also named soft instrument technology [7]. The basic principle is to get the estimation value of the primary variables through the establishment of the mathematical model by using those easily detectable assistant variables that are closely related to those primary variables of difficult detection. The commonly used methods for soft sensor technology include the mechanism analysis modeling, the regression analysis, state estimation, pattern recognition, neural networks, fuzzy mathematics, process tomography, the correlation analysis, and so forth. The state estimation based soft sensor technique adopted in this thesis deducted the estimation of the primary variables using detectable information and the algorithm (the kernel principal component analysis and the improved Elman neural network). The technique realized the measurement for the state parameters that are otherwise hard to measure by conventional method. Figure 2 is the vehicle state estimation system based on soft sensor technique. In Figure 2, , , and represent measurable process noise, measurement noise, and control variable, respectively. and are the real value and estimated value of the object outputs, respectively. is the estimated value of the object state with the discrete moments .

2.3. The Reduction of the Feature Parameters Based on the Kernel Principal Component Analysis

The basic idea of the kernel principal component analysis is to implicitly map the data in input space to feature space through kernel function and then to process the kernel principal component analysis in the feature space. This is a powerful approach of extracting nonlinear features, realizing the reduction of the feature parameters, reducing of data redundancy, and getting the primary feature vectors with low classification error rate and high sensitivity with the driving conditions.

2.3.1. The Selection of Feature Parameters

There are many feature parameters that can be used to represent the vehicle status, such as average running time, the average acceleration time, the average deceleration time, the average time of driving at uniform velocity, the average idling time, the average running distance, the maximal driving speed, the average speed, the standard deviation of speed, the maximum vehicle acceleration, the maximum vehicle deceleration, the average acceleration during the acceleration period, the average deceleration during the deceleration period, the standard deviation of the acceleration, and the changing rate of the throttle opening. Firstly, this thesis reduces the feature parameters representing vehicle status via correlation analysis. In other words, it uses sequences and to represent the feature parameters pair-wise respectively, in the equation, . Choose any two time series when and then compare the level of similarity of the two sequences according to

In the equation, “” is the conjugation of .

Sort the above results of operation from highest irrelevance to the lowest and then make correlation analysis on the first 15 parameters and the categories of working condition. Then rate the correlation from high to low, and finally we get the 10 feature parameters as the classification standard for working conditions; they are the maximal driving speed, average speed, the minimum deceleration, the average acceleration, the uniform velocity time ratio, idling time ratio, the average acceleration during the acceleration period, the average deceleration during the deceleration period, the average degree of the throttle opening, and the time when the throttle opening degree is zero.

2.3.2. The Reduction of the Feature Parameters Based on the Kernel Principal Component Analysis

Through nonlinear mapping , the kernel principal component analysis transfers the original data () from data space into the in a new high dimensional feature space and then gets the nonlinear principal components which enables the original data to have better divisibility by using principal component analysis (PCA) on these after mapping data.

The covariance matrix for the after mapping data is as follows:

If we set the eigenvalue for the covariance matrix to be and the relative feature vector to be , then

For all the , do dot-product of each after mapping sample with both sides of (5), respectively,

When , there is coefficient (); use linear combination of to represent feature vector

Substitute (4) and (7) into the above inner product formula, and then get

To avoid complex direct calculations, in the feature space, we define the elements of the symmetric matrix as kernel function

In the equation, .

As is symmetric matrix, so (5) can be simplified as

Solve (10); we will get the required eigenvalues and the feature vectors (). Reduce the system inputted feature parameters via the method of keeping the first feature vectors. The eigenvalues of the diagonalized matrix are denoted as , if the number of the smallest nonzero eigenvalues for is set to be , and then with (9) and (10), we could get

In the equation, .

The projection of the after mapping data on the feature vectors () is the nonlinear principal components of the original data obtained through nonlinear mapping

In the equation, is the th coefficient of the th eigenvalue of the matrix for feature vector .

This thesis chooses as Gaussian based radial basis function

In the equation, is the standard deviation.

If the mapped data are non-zero mean, mean value method needs to be used on kernel function

In the equation, are the constant matrices with as its element; and are mapped data after being treated with mean value method.

2.4. Improved Elman Neural Network
2.4.1. Modified Elman Neural Network

Elman regression neural network is a kind of typical dynamic neural network. Based on the feed forward neural network, it adds one more node, the context node. Through the delay and storage of the output of the hidden node by the context node, the output of the hidden node can self-link to the input of the hidden node, thereby enabling the network to be sensitive to the historical data, increasing the capacity of the network’s processing capability of dynamic information and realizing dynamic modeling.

The characteristics of the existing modified Elman neural network is an added self-feedback connection with a fixed gain as , so that the output of the context node at the time equals the output of the hidden node at time plus times of the output of the context node at the time [12]. Figure 3 shows the network structure. The difference between the original and the improved Elman network is the different values for . But both are fixed values. This thesis proposes a determination method for variable gain based on real time error analysis, which gives the network a better real time performance.

Comparing with the standard Elman network, the modified Elman network not only has a more flexible approximation to high order system essentially but can also achieve the same effect as gained by the standard Elman network when it adopts dynamic BP algorithm training.

The expressions between each layer of the modified Elman neural network are

In the equation, connection weight is the connection weight matrix of the context node and the hidden node; is the connection weight matrix of the input node and the hidden node; is the connection weight matrix of the hidden node and the output node; and represent the output of context node and the hidden node, respectively; represents the output of the output node, with as the self–join feedback gain factor. and are often hyperbolic tangent functions or sigmoid functions.

2.4.2. The Value Determination of the Modified Elman Neural Network

By analyzing the equation (15), we know that the bigger the value is, the greater the influence the output of the context node at time has on the output of the node at time . Suppose the calculation result is at the time , and the true value is and at the time and the time , respectively; then discuss the following 3 circumstances:(1)when : if , make ; if , make ; if , make , if , make ,(2)when : if , make ; if , make ; if , make , if , make ,(3)when : if , ; if , , if , make .

2.4.3. The Modified Elman Neural Network Learning Algorithm

The modified Elman neural network overall error objective function can be expressed as

Calculating the partial derivatives of to weight values , , and , we can get the gradient descent algorithm of the modified Elman neural network:

In the equation, , , and are the learning steps of , , and , respectively.

2.5. Modeling Based on Kernel Principal Component Analysis and Modified Elman Neural Network

Combining the Mahalonobis distance, kernel principal component analysis (KPCA), and improved Elman neural network, after Mahalonobis distance getting rid of the fault data among the sampling data, we can select the kernel principal components using KPCA and establish the neural network.

Specific steps are shown in Figure 4 and as follows: (1) generate the sample data set and preprocess the sample data (the data transform and error handling); (2) calculate the Mahalonobis distance between each sample data point and the nearest point, and the distance set of all the nearest points is expressed with , where . Then calculate the median value of and compare each value of with ; if any , then the sample contains gross error data; otherwise the sample is normal. Sort the gross error data out from the new sample data set. (3) Select principle components by KPCA. There are total principle components, when the accumulative contribution rate accounted for over 90%, and choose the first principle components. (4) Use the principle components as the input of the neural network, initialize parameters to be (), and choose a batch of data to train the neural network. (5) Use the last batch of data as prediction set. Keep the best parameters when the predicting error satisfies the requirements. Substitute these parameters (driving state) into the measurement and control applications. Use them as the initial weights of the network model (driving state) and get into the on-line learning.

3. Results and Discussion

To validate the KPC-IENN algorithm to be accurate and reliable, the double lane change test is adopted. Using SUV’s real vehicle parameters, virtual simulation of the sharp double lane change test is based on the ISO3888-1-1999 standard with sampling period of 0.025 s. During the test, the vehicle passes quickly through the benchmarks set on the road, and the passage is shown in Figure 5. Turn the steering wheel, with the observation of lateral acceleration to estimate the yaw velocity, the sideslip angle, and so on.

To stimulate the external disturbances that affect the vehicle, suppose the vehicle was affected by an instantaneous crosswinds gust shown in Figure 6 when passing through the double lane.

The samples were analyzed by KPCA. The accumulative contribution rate has already reached 90.09% as seen in Table 1. is the complexity to compute the kernel principal components.

Figures 7 and 8 are the estimated results of the vehicle longitudinal velocity, the yaw velocity, the sideslip angle, and other key state parameters based on the KPC-IENN algorithm, and and then compare them with the actual output of the stimulation.

Judging from Figures 7 and 8, the vehicle nonlinear state based on the KPC-IENN algorithm is in line with the reality. The estimated values of this algorithm are very close to the stimulation values and the errors are under control. When = 4~10 s, the vehicle is entering into the lane driving state. The vehicle longitudinal velocity, the yaw velocity, the sideslip angle, and other state parameters change drastically in this time interval. Judging from Figures 6, 7, and 8, the KPC-IENN algorithm used multiple suboptimal fading factor to force the estimated residual to have orthogonality and increased the tracking efficiency for the states with abrupt changes, thereby tracking the changing state values effectively.

4. Conclusions

This thesis proposed a KPC-IENN algorithm which is applicable to the states with abrupt changes and established a soft sensor method for the key states needed in the controlling of the lateral, longitudinal, and yaw stability of the vehicle body. In the sharp double lane change test, the estimated values gained by the KPC-IENN algorithm were proved to be highly accurate and satisfied the software performance requirements of the vehicle state estimators. The stimulation results of the Matlab/SIMULINK cosimulation showed that the KPC-IENN algorithm can estimate the speed, sideslip angle, and yaw velocity with accuracy and strong real time performance as well as an effective tracking speed and an outstanding property of restraining noises. It is suitable for online estimation of vehicle states. This thesis validated the accuracy and effectiveness of the KPC-IENN soft sensor algorithm using real vehicle parameters in stimulated environment. The next step is to use the established soft sensor model and the estimation algorithm in field test through hardware in loop simulation technique, which can further verify its practicability and reliability.

Competing Interests

The authors declare no conflict of interests.

Authors’ Contributions

All authors contributed in the theory and analysis developed in the manuscript and in finalizing the manuscript. All authors have read and approved the final manuscript.

Acknowledgments

The present work is supported by Shandong Province Crucial R&D Plan Project, China (no. 2015GGX105008), and Shandong Provincial Science and Technology Development Plan Project, China (no. 2014GGX105001).