Abstract

Lithium-ion batteries have been widely used as energy storage systems and in electric vehicles due to their desirable balance of energy and power densities as well as their continually falling price. Accurate estimation of the state-of-charge (SOC) of a battery pack is important in managing the health and safety of battery packs. This paper proposes a compact radial basis function (RBF) neural model to estimate the SOC of lithium battery packs. Firstly, a suitable input set strongly correlated with the pack SOC is identified from directly measured voltage, current, and temperature signals by a fast recursive algorithm (FRA). Secondly, an RBF neural model for battery pack SOC estimation is constructed using the FRA strategy to prune redundant hidden layer neurons. Then, the particle swarm optimization (PSO) algorithm is used to optimize the kernel parameters. Finally, a conventional RBF neural network model, an improved RBF neural model using the two-stage method, and a least squares support vector machine (LSSVM) model are also used to estimate the battery SOC as a comparative study. Simulation results show that the generalization error of SOC estimation using the novel RBF neural network model is less than half of that using the other methods. Furthermore, the model training time is much less than that of the LSSVM method and of the improved RBF neural model using the two-stage method.

1. Introduction

Lithium-ion batteries have been widely used as energy storage devices and in electric vehicles due to their desirable balance of energy and power densities. Compared with single lithium battery cells, a lithium battery pack with hundreds or even thousands of battery cells connected in parallel and series is able to provide the required power in various applications [1–3]. The battery management system (BMS) plays an important role in maintaining safe and efficient operation of the battery. The state-of-charge (SOC) of a Li-ion battery pack is a key parameter affecting the battery life, safety, and efficient operation [4, 5]. Based on an accurate estimation of SOC, effective management strategies can be developed to avoid overcharging and overdischarging, prolong the cycle life of batteries, and prevent safety incidents [6]. Furthermore, with correctly estimated SOC information, drivers can plan their driving time properly.

Due to the complex nonlinear characteristics of Li-ion batteries, SOC cannot be measured directly in real-time applications, and it needs to be inferred from other measurable variables [7]. Since a battery pack may consist of hundreds or even thousands of battery cells, the computational effort for modelling increases accordingly. Besides, the inconsistency of cells in a battery pack varies over the life of the battery. Thus, it is a challenge to accurately estimate the SOC of the battery pack. Recently, a number of methods have been proposed to improve battery pack SOC estimation, and they can be grouped into three general approaches. The first approach integrates the cell model into the structure of the battery pack [8, 9]. However, the inconsistency between different cells in a battery pack is ignored.

In the second category, the single-cell SOC estimation approach is directly extended to battery packs, including the open circuit voltage method [10], the ampere-hour integral method [11], the Kalman filter [12, 13], and the equivalent electric circuit model [8]. These methods treat the battery pack as a “big battery” [14], which makes the SOC estimation simpler and faster. However, the simple model is based on the precise mechanism of single cells. Due to the inconsistency between different battery cells, estimation error inevitably exists.

The third category includes various statistical methods. Plett first proposed the Bar-Delta Filter method in 2009 [15], which uses a Sigma Point Kalman Filter (SPKF) to estimate the average SOC of the battery pack and Delta Filters to estimate the variance between each cell’s characteristics and the average characteristics. However, the accuracy of the SOC estimation, which is the key requirement, remains a challenge. Dai et al. [16] and Sun and Xiong [14] proposed a dual time-scale Kalman filter based on the equivalent electrical circuit model (EECM), in which the differences in the internal resistance of the battery cells are considered. Zheng et al. [17, 18] used the extended Kalman filter (EKF) based on a cell mean model (CMM) and a cell difference model (CDM) to estimate the mean SOC of the battery cells and the SOC differences among them, respectively. This method still requires internal information about the battery pack. Deng et al. [19] proposed a data-driven method in which an efficient feature selection method is combined with an autoregressive Gaussian process regression (GPR) model [20, 21] to estimate the SOC of a battery pack. A challenge for GPR modelling is its computation time.

In summary, despite the aforementioned progress in battery pack SOC estimation, developing a simple yet accurate model is still an important issue in real-life battery applications. Data-driven methods [22] have gained a lot of interest in recent years for solving highly nonlinear classification and regression problems. The advantages of data-driven methods are their flexibility and model-free [23] characteristics, which make it easy to create new models. As a class of data-driven methods [24], machine learning approaches such as support vector regression [25], the Kalman filter [12, 13, 17], and backpropagation (BP) neural networks [26] have been successfully used in SOC estimation and prediction. However, the selection of the dataset and input features for building these models is still ad hoc, relying on trial and error.

To overcome some shortcomings of the aforementioned methods for battery pack SOC estimation, this paper presents an improved RBF method using a fast recursive algorithm (FRA) to estimate the SOC of a battery pack. The FRA method [27] can be used for both neural input selection [28] and hidden layer node selection [29–31] in the configuration of RBF networks. Compared with [32], the average cell temperature, the time mean pack voltage, the time mean pack temperature, and the time mean loop current, all computed over 10-second intervals, are added to the initial candidate pool of input variables, alongside other input candidates such as the maximum cell voltage, the minimum cell voltage, the average cell voltage, and the loop current. The statistical variables are adopted to reduce the complexity of the model, and the cell information is used to overcome the inconsistency among single cells. Then, a compact subset of these candidate variables is selected as the model input by the FRA method. On this basis, an improved RBF model built by the FRA method is used to predict the SOC of the battery pack. The proposed RBF model is automatically constructed by selecting the hidden layer nodes using the FRA method. Furthermore, the parameters of the RBF kernel are optimized by the particle swarm optimization (PSO) algorithm.

The rest of this paper is organized as follows. Section 2 introduces the input selection based on the FRA method. In Section 3, the application of the improved RBF neural network to SOC estimation of the battery pack is introduced in detail. The experimental and simulation results are compared in Section 4. Finally, Section 5 concludes the paper.

2. Input Selection Using FRA

Based on the theory of series expansion, polynomial NARMAX models can achieve the same modelling performance as various neural networks if certain conditions are satisfied [28]. The input selection of RBF neural network is thus simplified to determining the structure of the polynomial NARMAX model. The structure of the polynomial NARMAX model can be efficiently detected by selecting important polynomial terms using the FRA method with low computational complexity [27].

2.1. FRA Method

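Consider the following multiple-input single-output system represented by a linear-in-the-parameters model:

$$y(t) = \sum_{i=1}^{n} \theta_i \varphi_i(\mathbf{x}(t)) + \varepsilon(t), \tag{1}$$

where $y(t)$, $\mathbf{x}(t) \in \mathbb{R}^{m}$, and $\varepsilon(t)$ are the output variable, input variable vector, and model error at time instant $t$, respectively. Herein, $m$ and $n$ denote the number of input variables and model terms (mapping functions), respectively. $\varphi_i(\cdot)$ is the nonlinear mapping function, and $\theta_i$ are the linear coefficients for the mapping functions.

For $N$ given training samples, the system model is expressed in the following matrix form:

$$\mathbf{y} = \boldsymbol{\Phi}\boldsymbol{\theta} + \boldsymbol{\Xi}, \tag{2}$$

where $\boldsymbol{\Phi} = [\boldsymbol{\phi}_1, \ldots, \boldsymbol{\phi}_n] \in \mathbb{R}^{N \times n}$ is formed by the mapping functions, $\boldsymbol{\phi}_i = [\varphi_i(\mathbf{x}(1)), \ldots, \varphi_i(\mathbf{x}(N))]^{T}$, $\mathbf{y} = [y(1), \ldots, y(N)]^{T}$ is the output vector, $\boldsymbol{\theta} = [\theta_1, \ldots, \theta_n]^{T}$ denotes the linear parameters, and $\boldsymbol{\Xi} = [\varepsilon(1), \ldots, \varepsilon(N)]^{T}$ is the error vector.

Following [27], the minimal cost function obtained using the least-squares method is given as

$$J(\hat{\boldsymbol{\theta}}) = (\mathbf{y} - \boldsymbol{\Phi}\hat{\boldsymbol{\theta}})^{T}(\mathbf{y} - \boldsymbol{\Phi}\hat{\boldsymbol{\theta}}), \tag{3}$$

where

$$\hat{\boldsymbol{\theta}} = (\boldsymbol{\Phi}^{T}\boldsymbol{\Phi})^{-1}\boldsymbol{\Phi}^{T}\mathbf{y}. \tag{4}$$

Thus, the minimal cost function is reformulated as follows:

$$J(\hat{\boldsymbol{\theta}}) = \mathbf{y}^{T}\left(\mathbf{I} - \boldsymbol{\Phi}(\boldsymbol{\Phi}^{T}\boldsymbol{\Phi})^{-1}\boldsymbol{\Phi}^{T}\right)\mathbf{y}. \tag{5}$$

Use the definitions below, where $\boldsymbol{\Phi}_k = [\boldsymbol{\phi}_1, \ldots, \boldsymbol{\phi}_k]$ contains the first $k$ columns of $\boldsymbol{\Phi}$:

$$\mathbf{M}_k = \boldsymbol{\Phi}_k^{T}\boldsymbol{\Phi}_k, \qquad \mathbf{R}_k = \mathbf{I} - \boldsymbol{\Phi}_k \mathbf{M}_k^{-1} \boldsymbol{\Phi}_k^{T}, \qquad \mathbf{R}_0 = \mathbf{I}, \tag{6}$$

so that the cost function of the model with the first $k$ terms is $J_k = \mathbf{y}^{T}\mathbf{R}_k\mathbf{y}$.

The change of the minimal cost function induced by an additional mapping function $\boldsymbol{\phi}_{k+1}$ is given as follows:

$$\Delta J_{k+1}(\boldsymbol{\phi}_{k+1}) = J_k - J_{k+1} = \mathbf{y}^{T}(\mathbf{R}_k - \mathbf{R}_{k+1})\mathbf{y}. \tag{7}$$

Using the propositions detailed in [27], equation (7) is rewritten as follows:

$$\Delta J_{k+1}(\boldsymbol{\phi}_{k+1}) = \frac{\left(\mathbf{y}^{T}\boldsymbol{\phi}_{k+1}^{(k)}\right)^{2}}{\left(\boldsymbol{\phi}_{k+1}^{(k)}\right)^{T}\boldsymbol{\phi}_{k+1}^{(k)}}, \tag{8}$$

where $\boldsymbol{\phi}_{k+1}^{(k)} = \mathbf{R}_k\boldsymbol{\phi}_{k+1}$.

Obviously, this change only concerns the additional mapping function $\boldsymbol{\phi}_{k+1}$. Then, define the recursive matrix $\mathbf{A} = [a_{k,i}]$ and recursive vector $\mathbf{b} = [b_k]$, the elements of which are defined as follows:

$$a_{k,i} = \left(\boldsymbol{\phi}_k^{(k-1)}\right)^{T}\boldsymbol{\phi}_i, \qquad b_k = \left(\boldsymbol{\phi}_k^{(k-1)}\right)^{T}\mathbf{y}, \qquad 1 \le k \le i \le n. \tag{9}$$

Therefore, the net contribution induced by $\boldsymbol{\phi}_{k+1}$ is expressed as

$$\Delta J_{k+1}(\boldsymbol{\phi}_{k+1}) = \frac{b_{k+1}^{2}}{a_{k+1,k+1}}. \tag{10}$$

And the linear coefficients are estimated by

$$\hat{\theta}_j = \frac{b_j - \sum_{i=j+1}^{k}\hat{\theta}_i a_{j,i}}{a_{j,j}}, \qquad j = k, k-1, \ldots, 1. \tag{11}$$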
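The selection procedure described above can be illustrated with a short NumPy sketch. It is not the authors' implementation: the function name `fra_select` and the variables `P`, `A`, and `b` are hypothetical, and for readability the sketch stores the projected regressors explicitly instead of working only with the recursive matrix. It assumes the number of requested terms does not exceed the rank of the regression matrix.

```python
import numpy as np

def fra_select(Phi, y, n_terms):
    """Greedy forward selection of columns of Phi by net contribution.

    Assumption: n_terms does not exceed the rank of Phi.
    Returns the selected column indices and their linear coefficients.
    """
    Phi = np.asarray(Phi, dtype=float)
    y = np.asarray(y, dtype=float)
    P = Phi.copy()                      # column i holds the projected regressor
    selected, A, b = [], [], []
    for _ in range(n_terms):
        den = np.einsum("ij,ij->j", P, P)   # squared norm of each projected column
        num = (P.T @ y) ** 2                # squared correlation with the output
        gain = np.where(den > 1e-10, num / np.maximum(den, 1e-10), -np.inf)
        j = int(np.argmax(gain))            # term with the largest net contribution
        p = P[:, j].copy()
        selected.append(j)
        A.append(p @ Phi)                   # one row of the recursive matrix
        b.append(p @ y)                     # one element of the recursive vector
        P -= np.outer(p, p @ P) / den[j]    # remove the selected direction
    k = len(selected)
    theta = np.zeros(k)
    for r in range(k - 1, -1, -1):          # back substitution for the coefficients
        s = sum(theta[c] * A[r][selected[c]] for c in range(r + 1, k))
        theta[r] = (b[r] - s) / A[r][selected[r]]
    return selected, theta
```

On noise-free synthetic data generated from two columns of a random matrix, this sketch returns those two columns, and the coefficients agree with an ordinary least-squares fit on the selected columns.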

2.2. Input Selection for Battery Pack SOC Estimation

For battery pack SOC estimation, the measured variables, including the maximum and minimum voltages of the battery cells, the overall voltage of the battery pack, the circuit current, and the average temperature of the battery cells, are all possible candidates for the model inputs, and the targeted SOC is taken as the model output. Based on the moving average method, the mean of the values of a time sequence within a shifting time window can be used to suppress random and other noise. Accordingly, the averages of the pack voltage, the pack temperature, and the loop current over every 10-second interval are calculated as the time mean pack voltage, the time mean pack temperature, and the time mean loop current, respectively. Both the measured variables and the statistical variables are considered as augmented variables for estimating the battery pack SOC. The SOC of a battery pack is a time sequence, so both the past values of the model input variables and the past values of the model output are critical to the estimation of the next SOC value. However, not all of the historical data are needed for SOC estimation, so the maximum order of time lags for these input variables should be determined in advance.
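As a minimal sketch of the two preprocessing steps described above, the following NumPy code computes a trailing-window mean and builds a matrix of lagged values. The function names `window_mean` and `lagged_matrix` are hypothetical, and the window is assumed to cover the current sample and the preceding ones (10 samples at a 1 s sampling period); the paper does not specify the exact alignment of the window.

```python
import numpy as np

def window_mean(x, width=10):
    """Mean over the current and previous width-1 samples.

    At the start of the sequence, fewer than `width` samples are available,
    so the mean is taken over the samples seen so far.
    """
    x = np.asarray(x, dtype=float)
    csum = np.cumsum(np.insert(x, 0, 0.0))      # csum[k] = x[0] + ... + x[k-1]
    idx = np.arange(1, len(x) + 1)
    start = np.maximum(idx - width, 0)
    return (csum[idx] - csum[start]) / (idx - start)

def lagged_matrix(x, max_lag):
    """Columns x(t-1), ..., x(t-max_lag) for t = max_lag, ..., len(x)-1."""
    x = np.asarray(x, dtype=float)
    cols = [x[max_lag - d: len(x) - d] for d in range(1, max_lag + 1)]
    return np.column_stack(cols)
```

For example, `lagged_matrix(pack_voltage, 3)` would give one candidate column per time lag of a (hypothetical) `pack_voltage` sequence, aligned with the output samples from index 3 onwards.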

To select the RBF neural network inputs, the problem is converted into the polynomial model construction. Thus, the input selection problem is formulated as equation (1).

Herein, the mapping functions are selected using the following polynomial terms:where , , and and .

Then, the neural network model inputs can be identified by selecting the most significant polynomial terms using the FRA method. The input selection method is detailed in Algorithm 1.

Algorithm 1: Input selection using the FRA method.
Require: the maximum cell voltage vector, the minimum cell voltage vector, the average cell voltage vector, the average temperature of the battery cells, the circuit current, the maximal order of time lags for the inputs, the maximal order of time lags for the output, the maximal number of selected terms, and the minimal training error.
Ensure: the SOC vector of the battery pack.
(1) Initialization: form the regression matrix for polynomial term selection.
(2) for each selection step, up to the maximal number of selected terms, do
(3)   calculate the recursive matrix and recursively update the recursive vector.
(4)   calculate the net contribution of the terms using equation (10).
(5)   select the significant term.
(6) end for
(7) Input selection: find the order of the time lags from the selected model terms.
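The flow of Algorithm 1 can be sketched as follows. This is an illustrative simplification, not the authors' code: it uses only first-order (linear) lagged terms instead of the full set of polynomial terms, it deflates the candidate columns explicitly, and the function name `select_lagged_inputs`, the dictionary layout, and the use of the residual sum of squares as the `min_error` criterion are all assumptions.

```python
import numpy as np

def select_lagged_inputs(signals, target, max_lag, max_terms, min_error):
    """Greedy selection of lagged terms; returns [(signal name, lag), ...].

    signals: dict mapping a name to a 1-D sequence with the same length as target.
    Include the target itself in `signals` to allow autoregressive terms.
    """
    y = np.asarray(target, dtype=float)[max_lag:]
    names, cols = [], []
    for name, x in signals.items():
        x = np.asarray(x, dtype=float)
        for lag in range(1, max_lag + 1):
            names.append((name, lag))
            cols.append(x[max_lag - lag: len(x) - lag])   # x(t - lag)
    P = np.column_stack(cols)          # candidate regressors, deflated in place
    resid = y.copy()                   # current model residual
    chosen = []
    while len(chosen) < max_terms and resid @ resid > min_error:
        den = np.einsum("ij,ij->j", P, P)
        num = (P.T @ resid) ** 2
        gain = np.where(den > 1e-10, num / np.maximum(den, 1e-10), -np.inf)
        j = int(np.argmax(gain))       # term with the largest error reduction
        if not np.isfinite(gain[j]):
            break                      # no usable candidate is left
        p = P[:, j].copy()
        chosen.append(names[j])
        resid -= p * (p @ resid) / den[j]
        P -= np.outer(p, p @ P) / den[j]
    return chosen
```

The time lags of each input are then read directly from the returned `(name, lag)` pairs, which corresponds to step (7).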

3. Improved RBF Model for the SOC Estimation

This paper aims to develop an accurate yet simple model for battery pack SOC estimation. Deng et al. proposed a two-stage algorithm based on the leave-one-out method [30] to increase the performance of RBF neural networks. The selection procedure is automatically terminated by the predicted-residual-sums-of-squares (PRESS) error so that the constructed RBF neural model is parsimonious and accurate. In this paper, the FRA method is used instead of the two-stage algorithm for RBF neural network construction, which reduces the modelling complexity. To ensure the accuracy of the model, the particle swarm optimization (PSO) algorithm is used to optimize the kernel parameters.

3.1. General RBF Neural Network

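An RBF neural model can be formulated as a linear-in-the-parameters model like equation (1) as follows:

$$y(t) = \sum_{i=1}^{n} \theta_i \varphi_i(\mathbf{x}(t); \mathbf{c}_i, \sigma_i) + \varepsilon(t),$$

where $y(t)$, $\mathbf{x}(t)$, and $\varepsilon(t)$ are the output, the input vector, and the model error at time instant $t$, $\theta_i$ are the output weights, and $\varphi_i(\cdot\,; \mathbf{c}_i, \sigma_i)$ is the radial basis activation function of the $i$th hidden node, which is often chosen as a Gaussian function. The additional parameters $\mathbf{c}_i$ are the centers, and $\sigma_i$ denotes the RBF widths.

Similar to equation (2), the RBF neural model is formulated in the matrix form as follows:

$$\mathbf{y} = \boldsymbol{\Phi}\boldsymbol{\theta} + \boldsymbol{\Xi},$$

where $\boldsymbol{\Phi} = [\boldsymbol{\phi}_1, \ldots, \boldsymbol{\phi}_n]$ is the output matrix of the hidden nodes and $\boldsymbol{\phi}_i = [\varphi_i(\mathbf{x}(1); \mathbf{c}_i, \sigma_i), \ldots, \varphi_i(\mathbf{x}(N); \mathbf{c}_i, \sigma_i)]^{T}$ for $N$ training samples.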
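To make the matrix form concrete, the sketch below builds the hidden-node output matrix for Gaussian basis functions and fits the output weights by least squares. The parameterization exp(-||x - c||^2 / (2 sigma^2)) is one common choice and is an assumption here, since the exact Gaussian form is not given in the text; the function names are hypothetical.

```python
import numpy as np

def rbf_design_matrix(X, centers, widths):
    """Hidden-node outputs: Phi[t, i] = exp(-||x(t) - c_i||^2 / (2 * sigma_i^2)).

    X: (N, m) inputs, centers: (n, m), widths: (n,) positive values.
    The factor 2 in the denominator is an assumed convention.
    """
    X = np.asarray(X, dtype=float)
    centers = np.asarray(centers, dtype=float)
    widths = np.asarray(widths, dtype=float)
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)  # squared distances
    return np.exp(-d2 / (2.0 * widths[None, :] ** 2))

def fit_output_weights(Phi, y):
    """Least-squares estimate of the linear output weights."""
    theta, *_ = np.linalg.lstsq(Phi, np.asarray(y, dtype=float), rcond=None)
    return theta
```

With the centers and widths fixed, the model is linear in the output weights, which is why the linear least-squares machinery of Section 2.1 applies to the hidden-node selection.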

3.2. Improved RBF Neural Model

The performance of the RBF neural model is related to the number of hidden layer nodes and the kernel parameters. Therefore, the construction of an RBF network can be regarded as an optimization problem over the number of hidden layer nodes, the kernel parameters, and the connection weights. To improve the accuracy and real-time performance of Li-ion battery pack SOC estimation, the FRA method is used to establish an accurate and compact RBF neural model.

Using the improved RBF neural model based on the FRA method, the hidden layer nodes are selected according to the net contribution of the hidden layer node output. At the same time, the nonlinear kernel parameters are optimized by the particle swarm optimization method. Particle swarm optimization (PSO) [33] is a nonlinear parameter optimization algorithm based on swarm intelligence, and it has been widely used for nonlinear parameter optimization. Because the method is simple and easy to implement, it is applied to the parameter optimization of the RBF kernel function. According to [30], leave-one-out (LOO) cross-validation and the associated predicted-residual-sums-of-squares (PRESS) error are used as an index to select hidden layer nodes and automatically terminate the selection procedure. At each step, the hidden layer node giving the maximal reduction in PRESS error is selected. Thus, the net contribution is changed to the following equation:

$$\Delta J_{k+1} = J_k - J_{k+1}, \qquad J_k = \frac{1}{N}\sum_{t=1}^{N}\left(\frac{\varepsilon_k(t)}{r_k(t)}\right)^{2}, \qquad k = 0, 1, \ldots, n-1,$$

where $N$ and $n$ are the number of samples and the maximum number of hidden layer nodes, respectively, and $\varepsilon_k(t)$ and $r_k(t)$ are the model error and the $t$th diagonal element of the matrix $\mathbf{R}_k$ defined in equation (6) at time instant $t$, respectively.
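The PRESS error can be computed without refitting the model once per sample, because for a linear-in-the-parameters model the leave-one-out residual equals the ordinary residual divided by the corresponding diagonal element of the residual matrix. The sketch below illustrates this identity and checks it against brute-force leave-one-out fitting. The function names and the averaging over the number of samples are assumptions, and the regression matrix is assumed to have full column rank.

```python
import numpy as np

def press_error(Phi, y):
    """Mean squared leave-one-out residual, computed from a single fit."""
    Phi = np.asarray(Phi, dtype=float)
    y = np.asarray(y, dtype=float)
    hat = Phi @ np.linalg.solve(Phi.T @ Phi, Phi.T)   # projection onto the regressors
    R = np.eye(len(y)) - hat                          # residual matrix
    eps = R @ y                                       # ordinary residuals
    r = np.diag(R)                                    # per-sample scaling factors
    return float(np.mean((eps / r) ** 2))

def press_error_bruteforce(Phi, y):
    """Same quantity obtained by refitting with each sample left out in turn."""
    Phi = np.asarray(Phi, dtype=float)
    y = np.asarray(y, dtype=float)
    errs = []
    for t in range(len(y)):
        keep = np.arange(len(y)) != t
        theta = np.linalg.lstsq(Phi[keep], y[keep], rcond=None)[0]
        errs.append(y[t] - Phi[t] @ theta)
    return float(np.mean(np.square(errs)))
```

In a node selection loop, the PRESS error would be evaluated for each candidate hidden node, and selection would stop once adding a node no longer reduces it.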

Based on the PRESS-based net contribution, the improved RBF neural network optimized by the PSO method is shown in Algorithm 2.

Algorithm 2: Improved RBF neural model optimized by PSO.
Require: the selected input variable matrix in equation (2), the upper/lower bounds of the variables and of the velocities, the size of the population, the maximum number of iterations, the crossover factors, and the acceleration of the particle velocity.
Ensure: the SOC vector of the battery pack.
(1) Initialization.
(2) while the stopping criterion is not met do
(3)   Initialization: set the initial centers and widths of the RBF basis functions, which form the initial nonlinear parameters, and set the initial velocities.
(4)   for each PSO iteration, up to the maximum number of iterations, do
(5)     construct the candidate RBF basis vectors.
(6)     calculate the required matrices, with the recursive matrix obtained as in Algorithm 1.
(7)     calculate the associated vector.
(8)     find the candidate regressor that gives the minimal PRESS error, and record its index, the minimal PRESS error, and the best position of each particle.
(9)     compare with the last best position and obtain the global optimal position.
(10)    update the velocity and position of each particle using the PSO update rule [33], in which random numbers weight the steps towards the best positions.
(11)  end for
(12)  add the candidate feature with the minimal PRESS error to the regression matrix.
(13) end while
(14) Identification: calculate the linear coefficients using equation (11).

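The velocity and position update in step (10) can be illustrated with a generic PSO sketch. It uses the widely used inertia-weight form of PSO, which may differ in detail from the variant used in this paper. The names `pso_minimize`, `w`, `c1`, and `c2`, the default parameter values, and the velocity limit of 20% of the search range are assumptions. In the setting of Algorithm 2, the objective would be the PRESS error as a function of the center and width of a candidate hidden node.

```python
import numpy as np

def pso_minimize(objective, lower, upper, n_particles=20, n_iter=100,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` over a box using inertia-weight PSO."""
    rng = np.random.default_rng(seed)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    v_max = 0.2 * (upper - lower)                      # assumed velocity bound
    x = rng.uniform(lower, upper, size=(n_particles, lower.size))
    v = rng.uniform(-v_max, v_max, size=x.shape)
    p_best = x.copy()                                  # best position of each particle
    p_val = np.array([objective(p) for p in x])
    g = int(np.argmin(p_val))
    g_best, g_val = p_best[g].copy(), float(p_val[g])  # global best so far
    history = [g_val]
    for _ in range(n_iter):
        r1 = rng.random(x.shape)
        r2 = rng.random(x.shape)
        v = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)
        v = np.clip(v, -v_max, v_max)
        x = np.clip(x + v, lower, upper)               # keep particles inside the bounds
        val = np.array([objective(p) for p in x])
        better = val < p_val
        p_best[better] = x[better]
        p_val[better] = val[better]
        g = int(np.argmin(p_val))
        if p_val[g] < g_val:
            g_best, g_val = p_best[g].copy(), float(p_val[g])
        history.append(g_val)
    return g_best, g_val, history
```

The recorded global best value never increases from one iteration to the next, which makes it a convenient quantity to monitor for convergence.
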
3.3. Battery Pack SOC Estimation

As mentioned earlier, the battery pack SOC is estimated using the improved RBF neural network. The schematic diagram of the proposed method for battery pack SOC estimation is illustrated in Figure 1.

As shown in Figure 1, the proposed method consists of three parts. In the first part, the inputs are determined from the measurements, including the voltage of each battery cell, the voltage of the battery pack, the terminal current, the temperature of the battery cells, the SOC of each battery cell, and the SOC of the battery pack. Before the model inputs are determined by the FRA method, the candidate inputs are expanded by finding the maximum, the minimum, and the mean of the cell-level quantities. Then, the delayed sequences obtained by using the delay operator are adopted to produce the polynomial terms. Thus, the inputs are selected from the terms in the resultant nonlinear autoregressive moving average with exogenous inputs (NARMAX) model. In the second part, the improved RBF model is trained using the FRA method combined with the PSO method. Finally, the SOC of the battery pack is predicted using the built RBF model, in which the kernel parameters (centers and widths), the number of hidden layer nodes, and the weights to the outputs are optimized by PSO [33].

4. Simulation Results

We first consider a pack with 216 battery cells of the 18650 type connected in series. Eight battery packs in the same configuration were tested. In these tests, the circuit current, the terminal voltage of the battery pack, the terminal voltage of each individual cell, and the temperature between two battery cells are measured every 1 s. The SOC of the battery pack and of each individual battery cell are estimated every 1 s by the battery management system.

The data collected from a battery pack are often too large to be used directly to establish the estimation model. Since the ageing of battery capacity can be ignored over a short period, the training samples are selected every 30 s to build the improved RBF model. Then, the model inputs are chosen by FRA for the battery pack SOC estimation.

For the FRA method, the maximum voltage of the battery cells, the minimum voltage of the battery cells, the average voltage of the battery pack, the voltage of the battery pack, the mean voltage of the past 10 measurements, the mean current of the past 10 measurements, the mean temperature of the past 10 measurements, the circuit current, and the average temperature of the battery cells are adopted as the candidate inputs, and the estimated SOC is adopted as the output. Finally, a subset of lagged terms of these variables is selected as the model inputs. To verify the selected inputs, the improved RBF model for SOC estimation built using the selected inputs is compared with the one built using inputs chosen by experience (trial and error). The performance using the different inputs is shown in Table 1.

In Table 1, the RMSE (root mean square error) and the maximum absolute error are shown. Clearly, the model using the selected inputs performs much better, with the RMSE of the absolute error almost always within . The simulations are illustrated in Figures 2 and 3. They show that the SOC estimation is more accurate using the selected inputs, for which the generalization error is less than that obtained using the inputs chosen by experience.
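The two error measures reported in Table 1 can be computed as in the following sketch. The function name `soc_errors` and the example values are hypothetical and are not taken from the paper's data.

```python
import numpy as np

def soc_errors(soc_true, soc_pred):
    """Return (RMSE, maximum absolute error) between two SOC sequences."""
    err = np.asarray(soc_pred, dtype=float) - np.asarray(soc_true, dtype=float)
    rmse = float(np.sqrt(np.mean(err ** 2)))
    max_abs = float(np.max(np.abs(err)))
    return rmse, max_abs

# Illustrative values only (SOC in percent).
rmse, max_abs = soc_errors([50.0, 50.0, 50.0, 50.0], [53.0, 46.0, 50.0, 50.0])
```

The RMSE summarizes the typical error over the whole validation sequence, while the maximum absolute error captures the worst single estimate, so the two measures are complementary.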

Then, the proposed model is compared with the conventional RBF method, the general least squares support vector machine (LSSVM) method, and the improved RBF neural model optimized by the two-stage method (TSS_RBF) [30]. The performance of these methods is shown in Table 2.

According to Table 2, the proposed RBF method takes more time than the conventional RBF method to train the model, but the validation RMSE of the proposed RBF model is just half of that of the conventional RBF model. The LSSVM model and the improved RBF neural model using the two-stage method each take almost 50 times longer to train than the proposed RBF model. Meanwhile, the validation RMSE of the proposed RBF model is lower than that of the other methods. The simulation results are shown in Figures 4 and 5. It is clear that the proposed RBF model has excellent generalization capability and obtains a more accurate SOC than the other methods.

5. Conclusions

To estimate the SOC of a battery pack accurately, it is necessary to adopt a data-driven method to handle the inconsistencies among the cells in the pack. This paper first uses the FRA method to select the input variables and thereby improve the precision of the model, because the input features are important to the accuracy of RBF neural networks. The experimental results show that better SOC estimation can be achieved when a compact set of model inputs is selected. Then, the FRA method is further used to improve the construction of the RBF neural network for battery pack SOC estimation. The hidden nodes of the RBF neural network are again selected using the FRA method, and the particle swarm optimization algorithm is used to optimize the kernel parameters. The results show that the improved RBF model can achieve high estimation accuracy at an acceptable time cost.

Data Availability

The processed data used to support the findings of this study are included within the article. The data source is provided by the partner of Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, and can be obtained from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This paper was partially funded by the NSFC under Grant 61673256, Key Project of Science and Technology Commission of Shanghai Municipality under Grant 19500712300, and 111 Project under Grant D18003.