Abstract

A method based on fusion of multiple features is proposed to assess and accurately describe the performance degradation of lithium-ion batteries in this paper. First, the discharge voltage signal of lithium-ion batteries under real-time monitoring is analyzed from the perspective of time domain and complexity to obtain the values of multiple features. Then, the multi-feature parameters undergo a spectral regression process to reduce the number of dimensions and to eliminate redundancy, and on the basis of this regression, a Gaussian mixture model is established to model the health state of batteries. Thus, the degree of lithium-ion battery performance degradation can be quantitatively assessed using the Bayesian inference-based distance metric. A case calculation experiment is carried out to verify the effectiveness of the method proposed in this paper. The experimental results demonstrate that, compared with other assessment methods, the performance degradation assessment method proposed in this paper can be used to monitor the degradation process of lithium-ion batteries more effectively and to improve the accuracy of condition monitoring of batteries, thereby providing powerful support for making maintenance decisions.

1. Introduction

As a kind of ideal power supply energy, which can be reused, lithium-ion battery has been widely used in many fields. However, the performance of lithium battery will decline over time, which will not only lead to system failure, but also lead to serious safety accidents [1]. Working condition monitoring, performance degradation assessment, and remaining useful life (RUL) prediction of lithium-ion battery are of great significance for improving the safety and reliability of the battery system and preventing battery accidents. The RUL prediction of lithium-ion battery can be divided into two steps: battery health status recognition and performance prediction. In terms of prediction model, Zou [2] used parameter degradation model and traceless particle filter algorithm to predict RUL of battery, discovering that the method for predicting the RUL of the battery was more accurate when considering the degradation of the performance parameters of the battery capacity state. Ma [3] proposed an improved trackless Kalman filter based on Markov chain-Monte Carlo to predict RUL, and an empirical model of four-parameter power function reflecting battery capacity was established. Linear least square method was used to estimate capacity parameters, and finally life prediction was realized with high prediction accuracy. Yang [4] used two logarithmic empirical capacity models to capture two-phase nonlinear sag degradation problem of lithium-ion batteries, and an improved particle filter algorithm was used to update the model’s state parameters to predict the RUL. The prediction model has achieved good results, but mostly used the battery capacity or single parameter as a feature to predict. However, the battery capacity is difficult to directly measure online, and a single parameter cannot describe the performance degradation of battery. Therefore, it is advantageous to accurately reflect the state of battery performance and improve the prediction accuracy of residual life by extracting characteristic parameters from multiple perspectives and fusing them into a health index, and this paper focuses on performance degradation assessment of lithium batteries.

To avoid problems such as insufficient maintenance or over-maintenance of lithium-ion batteries, it is necessary to carry out condition-based maintenance for batteries. The quantitative assessment of the degree of performance degradation is a prerequisite for condition-based maintenance of lithium-ion batteries. It is difficult to measure the residual capacity of batteries in real time [5], many researchers try to assess the performance of the battery using the approach of analyzing variations in the discharge voltage. Sbarufatti proposed a method that combines particle filters and radial basis function neural networks to predict the discharge duration of lithium-ion batteries [6]. Xu proposed an effective model based on state space for modeling the performance degradation of batteries. The model adopts an analysis method based on the sequentially observed discharging profiles and uses the expectation-maximization (EM) algorithm in conjunction with the extended Kalman filter (EKF) algorithm to estimate and update the model parameters and state [7]. Xu also analyzed the distribution of discharge voltage and combined the discharge process and the degradation process to develop a hierarchical model for comprehensive prediction of discharge duration in different cycles and the remaining effective cycles [8]. Widodo used the sample entropy of the discharge voltage to predict the performance of the battery [9].

Although some of the methods proposed in the above studies can be used to make effective assessment, they still fall short of the expectation of accurately and comprehensively assessing the degradation process of the batteries. This is because most of these methods only process a single feature of the discharge voltage, which is not very sensitive to variations in the battery state. To solve this challenge, this paper proposes a method based on the fusion of multiple features for assessing the performance degradation of lithium-ion batteries. In fact, the multi-feature fusion approach has been widely used in assessing the performance of bearings, but relatively fewer attempts have been made to adopt this approach for assessing the performance of batteries.

In researching the performance degradation assessment, effectively extracting multi-dimensional features and establishing an effective degradation assessment model are two critical issues. A lot of work has been done with regard to the first critical issue. Principal component analysis (PCA) is one of the most widely used data compression algorithms. Dong used PCA to reduce the dimensions of the parameters in bearing degradation process, and they obtained good results [10]. Yu proposed a method based on local preserving projection (LPP) to reduce the dimensions of the multi-dimensional features in the time domain, frequency domain, and time-frequency domain. The method is applied to the performance degradation assessment of bearings and verified the feasibility of the method through experiments [11]. Cai proposed a spectral regression (SR) method, which is a new type of model, for supervised, unsupervised, and semi-supervised regression. Using SR to reduce the dimensions makes it possible to solve the projection function using the regression framework, thereby circumventing the thorny issue of decomposing the features of the dense matrix [12]. With regard to the second critical issue, Yu proposed a method based on multiple two-layer sparse autoencoder neural networks to fuse the features and used experiment to verify the superiority of this method [13]. Yu applied the Gaussian mixture model (GMM) model to the health assessment of bearings and achieved quantitative analysis of performance degradation of the bearings [11]. Batista proposed an algorithm based on improved support vector machine (SVM), reducing the error rate in diagnosis [14]. Ai proposed a fusion method based on n-dimensional characteristic parameters distance [15]. Hassani proposed a fusion model utilizes zSlices-based representation of general type-2 fuzzy logic systems to combine different SVMs [16]. Zhang proposed an exergy-based fusion method to fuse multiple transient states with the information of multiple sensors obtained in state monitoring [17]. Shao proposed a feature fusion method based on the deep belief network (DBN), taking advantage of the excellent learning ability of DBN [18]. Zhou proposed an NCA-based feature extraction method to reduce dimensionality and used a coupled hidden Markov model based on multichannel data acquisition for fault diagnosis [19]. In addition, Wang [20] also summarized methods to evaluate the health indicators based on Markov model, self-organizing mapping, wavelet change, and fuzzy c-means transformation. All of the above methods have yielded good assessment results.

On the basis of the above analysis, this paper proposes a performance degradation assessment method based on the fusion of multiple features. First, the discharge voltage signals of lithium-ion batteries under real-time monitoring are analyzed to obtain features in the dimensions of time domain and complexity. Then, the multiple features undergo an SR processing to reduce the number of dimensions and eliminate redundancy; based on these processes, GMM is established to model the health status of batteries. Thus, the degree of performance degradation of batteries can be quantitatively assessed using the Bayesian inference-based distance (BID) metric. The experimental results show that the assessment method proposed in this paper can be used to monitor the degradation process of lithium-ion batteries more effectively and to improve the accuracy of condition monitoring of batteries. To the best of our knowledge, single characteristic parameters are used in most studies of lithium battery status assessment. However, there is redundancy and difference among parameters, so the characterization of lithium battery state is not accurate enough. Therefore, in this paper, the characteristic parameters are extracted from the perspectives of statistical measurement and complexity; after that the extracted features are fused by the GMM-BID model proposed in this paper. Compared with a single parameter, the health index can characterize the performance degradation degree of lithium battery more accurately.

2. Feature Extraction

The capacity of battery is a parameter that can accurately evaluate the performance, but it is difficult to obtain directly, which can only be obtained in the laboratory or offline state. Therefore, it is necessary to select some parameters that can be monitored online to estimate the degradation state of lithium-ion batteries. During the charging process, the charging mode of the battery is generally fixed; that is, the charging voltage and current are also fixed. The discharge current depends on the system payload, which can affect the health of the battery, but cannot reflect the degradation of the health state of the battery. At present, most studies have found that the discharge voltage of the battery will tend to degrade with an increase in the number of discharge cycles [611, 21]. Therefore, this paper focuses on analyzing the easy-to-measure discharge voltage of lithium-ion batteries to make a more accurate assessment of the battery performance degradation.

2.1. Time Domain Features

The voltage values of all of the discharge cycles of a lithium-ion batteries can be regarded as a time series of voltage. It is necessary to analyze this time series to extract the features that reflect the statistical pattern of the time series. The most common method for analyzing a time series is time domain analysis. Charakopoulos A.K. [22] estimated mainly nonlinear measures such as mutual information combined with descriptive statistics measures, as well as some linear and nonlinear dynamic detectors such as Hurst exponent, detrended fluctuation analysis, and Hjorth parameters.

With the increase of charging and discharging times of lithium-ion battery, its internal resistance and energy loss increase, which shows that the discharge duration will be shorter and shorter after the battery is fully charged. Therefore, the article adopted the method of statistical measurement such as the mean and RMS values to describe the changing trend of battery discharge voltage. Also, regarding the trend of the battery voltage at the same time interval increasing with the number of charge and discharge cycles, fixed-interval voltage difference can be constructed to characterize the degree of battery performance degradation.

2.1.1. Statistical Measurement

There are some conventional time domain features based on statistical measurement (see Table 1).

2.1.2. Fixed-Interval Voltage Difference

When a series of discharge voltage values are obtained by sampling the discharge voltage at fixed intervals, the voltage discrepancy between two neighboring samples is called the fixed-interval voltage difference. As the battery is used successive times (that is, the number of discharge cycles of a lithium-ion battery increases), the battery ages continuously, which is reflected by the increasingly shorter discharge duration. During each discharge cycle, the discharge voltage of a battery decreases from a higher voltage value to a lower value.

Each fixed time interval is denoted as , and the fixed-interval voltage difference in the -th discharge cycle is where is the voltage value at the end of the time interval and is the voltage value at the beginning of the time interval . The sequence of fixed-interval voltage values extracted from the discharge cycles can then be expressed aswhere is the number of discharge cycles.

2.2. Complexity

Entropy is a way to describe the complexity of a time series or the degree of internal chaos. Originally a concept in physics, the concept of entropy gradually expanded with the development of information theory and has become an indicator of system instability. Entropy is now widely used in system state monitoring and signal analysis. At present, the various forms of entropy (such as information entropy, sample entropy, and approximate entropy) are used as the characteristic parameters for measuring complexity.

Derived from the approximate entropy, the sample entropy provides a better entropy representation in signal analysis. Compared with other metrics such as related dimensions, sample entropy is better suited for classifying complex systems, including characterization of the certainty and randomness of the time series of finite data points [9]. Sample entropy has been applied in bearing fault diagnosis [2325]. Therefore, in this paper, we chose sample entropy as a useful complement to the characteristic parameters. The algorithm works in the following way:

Assume that there is an -dimensional time series obtained via sampling at fixed time intervals.

Define the associated parameters of the sample entropy where the real number is the length of the comparative vector and the real number is the metric of similarity.

Reconstruct the -dimensional vector where .

For , count the number of vectors that satisfy the following condition: where is defined aswhere is an element of the vector is the distance between the vectors and is determined from the maximum difference of the corresponding elements, and the value range of is ().

Calculate the average of for all values, and denote the average value as , which can be expressed as

Let , and repeat steps and to obtainwhere , .

The sample entropy is defined as

In practical applications, will never be . Therefore, when is a finite value, the sample entropy is estimated to be

2.3. Spectral Regression

Given a data set , the mapping element of can be found in the low dimension space for a given point , and we can find a transformation matrix that satisfies .

A graph was constructed with samples using the idea of graph embedding, a process of mapping the data of a graph (usually a high dimensional dense matrix) into a low dimensional dense vector. Each vertex of the graph corresponds to a sample (the -th point is ). If all of the weights of the connections between sample and sample are represented by a weight matrix , then is the weight of the connection between the -th point and -th point. The process of finding the linear projection function using the SR method is divided into two steps: Solve , where D is the diagonal matrix . The feature vector corresponding to the first feature value is defined as . Obtain the projection vector by solving the following regularized least-square:where is the regularized parameter. The original data can then be dimension-reduced to an -dimensional vector:

3. Performance Degradation Assessment Model Based on Fusion of Multiple Features

Data fusion can be defined as a combination of multiple sources to obtain improved information; in this context, improved information means less expensive, higher quality, or more relevant information. The available data fusion techniques can be classified into three nonexclusive categories: (i) data association, (ii) state estimation, and (iii) decision fusion [26]. The data fusion method described in this paper is a state estimation fusion method.

3.1. GMM

GMM is a clustering algorithm that uses a Gaussian distribution as the parameter model and is trained using the EM algorithm. As a simple extension of the Gaussian model, GMM uses a combination of multiple Gaussian distributions to characterize the distribution of data, which is expressed aswhere is the number of model mixing, is a weight coefficient of the model (), is the Gaussian distribution function (the -th average value is and the covariance matrix is ) denoted as . The model can be rewritten in the following form:

After the GMM model was established using the data sets for the normal operation state, a quantitative indicator based on BID [27] was used to perform quantitative assessment of the degradation degree of lithium-ion batteries.

Suppose that there are Gaussian classes. The -th component can be denoted as , and its occurrence probability can be denoted as . The probability of the test point then belongs to the component (denoted as ) and can be calculated using the following formula:

where is the a priori probability that can be calculated using the model data. can be calculated using the following formula:

where is the average value of the -th Gaussian component, is the covariance matrix of the -th Gaussian component. Next, the distance from to each component is defined as

The BID value can be obtained by performing a weighted summation over all of the component distance values of a test point:

3.2. Performance Degradation Assessment Model Based on Fusion of Multiple Features

On the basis of the above analysis, this paper proposes a method based on the fusion of multiple features for assessing the performance of lithium-ion batteries. The flowchart of the method is shown in Figure 1.

Figure 1 shows that the performance degradation assessment process is divided into an offline phase and online phase. The specific steps of each phase are as follows:

(1) Off-Line Phase. (A) Extract initial features from the training data and use the SR method to reduce the dimensions of the original features.

(B) Select the health state training data to establish a GMM model and obtain the parameters of the model.

(C) Calculate the BID value of the test data using the GMM parameters to describe the performance degradation process of lithium-ion batteries.

(2) Online Phase. (A) Extract initial features from the training data and use the SR method to reduce the dimensions of the original features.

(B) Calculate the BID value between the test data and the health state GMM.

4. Lithium-Ion Battery Performance Degradation Assessment Experiment and Results Analysis

4.1. Data Source

A case calculation experiment was carried out to verify the reliability and effectiveness of the proposed method in assessing the performance degradation of lithium-ion batteries. In the experiment, the experimental data sets were provided by NASA [28]. A set of four Li-ion batteries (# 5, 6, 7, and 18) were run through 3 different operational profiles (charge, discharge, and impedance) at room temperature. Charging was carried out in a constant current (CC) mode at 1.5A until the battery voltage reached 4.2V and then continued in a constant voltage (CV) mode until the charge current dropped to 20mA. Discharge was carried out at a constant current (CC) level of 2A until the battery voltage fell to 2.7V, 2.5V, 2.2V, and 2.5V for batteries 5, 6, 7, and 18, respectively. Impedance measurement was carried out through an electrochemical impedance spectroscopy (EIS) frequency sweep from 0.1Hz to 5kHz. Repeated charge and discharge cycles result in accelerated aging of the batteries while impedance measurements provide insight into the internal battery parameters that change as aging progresses. The experiments were stopped when the batteries reached end-of-life (EOL) criteria, which was a 30% fade in rated capacity (from 2Ahr to 1.4Ahr).

In order to save space, the batteries No. 5, No.6, and No. 18 of the four batteries are randomly selected for analysis. Figure 2 depicts the relationship between the discharge voltage and the discharge time of the three batteries. As seen in the figure, the different discharge cycles share a common pattern. In the early stage of discharge, the discharge voltage declines gradually, and the discharge time has a gentle slope. The slope of the decline increases gradually over time. In the last stage of discharge, the voltage drops rapidly. As the number of discharge cycles increases, the slope of the decline of the discharge voltage becomes increasingly steep. The variation of the discharge voltage in each discharge cycle differs from the voltage variation in any other discharge cycle; hence, it is necessary to assess and manage the battery.

4.2. Comparison of Clustering Ability Concerning Degradation Degree

In the experiment, 132 original data sets of battery number 18 are extracted. After a normalization process, 25 sets of normal state data, 25 sets of slight degradation data, and 25 sets of severe degradation data are used to verify the clustering ability and correctness of the SR dimension reduction method proposed in this paper. Furthermore, the SR algorithm is compared with the LPP and PCA algorithms. Figure 3 depicts the effects of the three algorithms in dimension reduction (only the first two dimensions are shown for each algorithm). To compare the three feature extraction methods more directly, the clustering algorithm K-means is used as the degradation degree classifier to calculate the cluster recognition rates of the three methods. The results are shown in Table 2. Figure 3 shows that, after the dimension reduction, the PCA method failed to cluster the three states (i.e., the three states overlap with each other), which means that PCA has the worst clustering effect among the three algorithms. The LPP and SR algorithms have better clustering effects, as indicated by their ability to clearly distinguish the three states. Between these two algorithms, SR is superior to LPP. Table 2 also shows that PCA has the lowest cluster recognition rate among the three algorithms. The cluster recognition rate of SR is 97.33%, which is 4% higher than that of the LPP algorithm. In summary, the SR dimension reduction algorithm has stronger clustering ability and recognition rate. Therefore, the SR method is more suitable for on-line monitoring.

4.3. Results of Performance Degradation Assessment

Seven representative features (namely root mean square (RMS), standard deviation, mean, skewness, kurtosis, fixed-interval voltage difference, and sample entropy) are used to assess the performance degradation of battery numbers 5, 6, and 18 using the existing data sets. After several trials, the number of the length of the comparative vector and the metric of similarity of sample entropy are set to , respectively. The variation curves of the six features are shown in Figure 4. The following can be observed from the diagrams: For the same battery, different features exhibit different variation trends. In terms of overall trends, RMS, mean, and kurtosis tend to decline with fluctuations, whereas the standard deviation, skewness, fixed-interval voltage difference, and sample entropy tend to rise with fluctuations. Therefore, the extracted features could be used to characterize the performance of Lithium-ion batteries; however, these features have some drawbacks. Standard deviation, skewness, and kurtosis fluctuate violently and are not sensitive to the degradation of performance, making it difficult to determine the degree of degradation. It is observed that the standard deviation, skewness, kurtosis, and sample entropy of battery numbers 5 and 6 have a “run-in” period in the early stage, during which the values fluctuate violently without a significant trend. When the battery is diagnosed during this “run-in” period, it is hard to see a trend towards degradation.

To enable more accurate online monitoring of the battery degradation, this paper proposes a quantitative indicator obtained by fusing multiple multi-dimensional features to effectively describe battery degradation.

As soon as the battery is used, it begins to degrade. In order to reduce the error of the health state model, the first 4% of all cycles were selected for training. On the basis of the above analysis, the first 4% sets of health state data of battery numbers 5, 6, and 18 are chosen to establish the GMM model. A GMM with 2 Gaussian components is used to model the normal state data, and the first two dimensions after dimension reduction by SR are used as the input data. After the GMM model is trained with the normal state data, all of the test data is input into the model in groups, and one BID value is calculated using each data group, as shown in Figures 5, 6, and 7. Figure 5 shows that the BID value of battery number 5 effectively and clearly reflects the performance degradation process of the lithium-ion battery. Similarly, the trend towards battery degradation is reflected in Figures 6 and 7. The BID curve increases with the increase of the number of battery discharge cycles.

The assessment method based on SR-GMM-BID is superior to the methods that use only a single feature. First, BID value is sensitive to the performance degradation of the battery, and the trend of lithium-ion battery degradation can be clearly seen. Second, the fused indicator does not fluctuate violently in the early stage and improves the reliability of assessment.

4.4. Comparative Analysis
4.4.1. Comparison with the Assessment Method Based on the Support Vector Data Description (SVDD) Algorithm

The SVDD algorithm is an algorithm that is well-suited for assessing the performance of bearings [29]. In the experiment, the first 4% of all data groups were taken as the normal state data to train the SVDD model; the kernel function was defined as a Gaussian kernel, and the penalty factor was set to 0.3. The value of the kernel distance between the test data and the trained hypersphere was used as the quantitative performance assessment parameter, and the assessment results based on SR-SVDD are shown in Figures 8, 9, and 10.

4.4.2. Comparison with the Assessment Method Based on Another GMM Based Indicator

A negative logarithmic likelihood probability (NLLP) value was used to evaluate the performance of the bearing [30]; that is, after establishing the GMM model in normal state, a logarithmic likelihood value is obtained for each set of test data. The larger the value is, the farther the deviation is. In general, logarithmic likelihood values are less than 0, so negative logarithmic likelihood values are used as assessment parameter, and the assessment results based on SR-GMM-NLLP are shown in Figures 11, 12, and 13.

4.4.3. Comparison with the Assessment Method Based on Self-Organizing Map(SOM)

A data fusion method based on SOM is used to evaluate the bearing state [31]. SOM was used for network training, and multi-feature fusion model was constructed. After weight vector was obtained, the minimum quantization error (MQE) from each sample to the weight vector was calculated as the indicator, and the assessment results based on SOM are shown in Figures 14, 15, and 16.

4.4.4. The Results of Comparative Analysis

Spearman rank correlation coefficient is a typical analytical method to describe the linear relationship between two data sequences [32]. In order to prove the superiority of the method proposed in this paper, the four fusion parameters and battery capacity were, respectively, analyzed by Spearman correlation. The interval of coefficient r is . When r is greater than 0, it means positive correlation; when r is less than 0, it means negative correlation. The closer the absolute value of r is to 1, the more linearly dependent it is.

The correlation analysis results are shown in Table 3. The four health indicators are negative correlated with battery capacity. The absolute value of correlation coefficient of BID is significantly higher than DI, NLLP, and MQE. It indicates that the BID index has the best correlation with the battery capacity and can be used for the input of battery remaining life prediction (RUL) to improve the prognosis accuracy.

5. Conclusions

This paper proposes a model based on fusion of multiple features for assessing performance degradation of lithium-ion batteries. The study yielded the following findings:

The method based on fusion of multiple features proposed in this paper can be used to conduct effective assessment of the performance degradation of lithium-ion batteries.

Compared with the PCA and LPP methods, the dimension reduction method based on SR can better cluster the measurement data into different degradation stages for lithium-ion batteries.

The SR-GMM-BID-based quantitative performance assessment method proposed in this paper can better characterize the degradation trend of batteries than the SVDD, NLLP, and SOM based methods.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Authors’ Contributions

Anchen Wang and Ying Zhang contributed equally to this work. Conceptualization is done by Anchen Wang, Ying Zhang, and Hongfu Zuo; Data curation is done by Anchen Wang and Ying Zhang; Formal analysis is done by Anchen Wang and Ying Zhang; Methodology is done by Anchen Wang, Ying Zhang, and Hongfu Zuo; Software is done by Anchen Wang and Ying Zhang; Validation is done by Hongfu Zuo; Writing – original draft is done by Anchen Wang; Writing – review & editing are done by Ying Zhang and Hongfu Zuo.

Acknowledgments

This research was supported by National Natural Science Foundation of China, grant number 51505229.