Abstract

This paper introduces a game theory approach to implement power flow distribution mission for a fuel cell/battery hybrid system considering uncertain power information. To fully describe the vying interaction relationship between the fuel cell and the battery, we design the power distribution problem as a noncooperative game problem, in which the fuel cell and the battery are deemed to be two interactional players, and each one chooses proper amount of power supply to maximize its own optimization function relying on the other chosen. Different from all previous research work in the published papers, the power demand information of the adopted driving cycle is assumed to be absolutely known. In this paper, we discuss the case that when the power demand is uncertain how the players act and the Nash Equilibrium can be effectively achieved. Three original contributions are made. First, we develop the utility function for each player taking into account the uncertain behavior of the power demand due to inaccurate prediction of driving cycle. Second, an iterative algorithm with a fuzzy logical controller for correction is proposed to reduce the influence of uncertain power demand information on the decisions of the players. Finally, the effectiveness is validated by a comparison simulation test.

1. Introduction

In the last two decades, the worsening environment and energy shortage problem have brought significant challenge to the global automotive industry. Again, new energy vehicles, which are powered using alternative energy for replacing traditional gasoline and diesel oil, have been paid more close attention [13]. Of all the candidate energy storage devices, the fuel cells are supposed to be good potential for directly superseding automobile internal combustion engines to power electric vehicles with considerable driving range for users. However, the slow dynamic response of the fuel cells will directly influence vehicle speed-up and speed-down performance. In addition, the fuel cells cannot supply negative power demand, which thus give a limit to recover the braking energy of the vehicle. To correct for these flaws and improve vehicle performance, the fuel cell hybrid systems have been suggested in the literature for practical vehicle application. In a fuel cell hybrid system, a battery or a supercapacitor pack is usually considered as auxiliary device for supply the drastic components of the power demand [46].

For a fuel cell/battery hybrid system, energy management control strategy is crucial for exploiting the advantages of the hybridization by coordinating the power flow between the fuel cell and the battery. State machine energy strategies were introduced to realize the state switch of the fuel cell and the battery to satisfy the load power demand as well as maintain the energy level within predefined range in [7, 8]. This strategy is simple and effortless for online implementation. However, the strategy lacks flexibility and adaptation to variation of the load power demand due to the limit of these fixed rules. To deal with complex and uncertain variable relation of the control strategy, fuzzy controllers were developed for a fuel cell hybrid system [9, 10]. Compared with the state machine strategy, the control performance can be improved to some extent due to improved flexibility and adaptation. However, the two types of strategies mentioned above are developed based on predefined deterministic or fuzzy logical rules, which largely depend on knowledge and experience of a designer. Therefore, these strategies cannot achieve optimal control performance.

To obtain enhanced control performance, the rules of the strategies are often optimized using an offline optimization algorithm for a given driving cycle. The introduced algorithms include genetic algorithm [11], particle swarm optimization [12], simulated annealing optimization [13], DIRECT global optimization [14], and dynamic programming algorithm [15]. However, these optimization algorithms are centralized and only deal with single objective optimization problem. When a system has multiple optimization objectives, they are usually weighted and transformed into a single objective function to optimize using these algorithms. These weights were usually set to be fixed and rely on the preferences of the designers. In this way, however, the interaction between the competing optimization objectives (such as hydrogen economy and battery life cost) cannot be captured fully and the optimization result therefore lacks of objectivity.

Game theory provides one promising solution to the centered optimization dilemma by implementing independent optimization operation for each control object. Owing to its unique advantage in dealing with interaction and conflicting interests for coupled multiagent system, they have been widely applied to smart grid for demand response management [1618] and sustainable energy system planning problem [19]. Some literatures have reported its successful application for energy management in different hybrid systems. A game theory (GT) controller was developed to capture tradeoff between driver operation and vehicle performance for a JLRF2 HEV in [20]. This application anticipates that the driving style of the driver is intrinsically coupled with vehicle fuel economy and emission performance. Test conclusions indicated that the GT control strategy by less calibration cost could achieve better performance compared with the baseline control for real world driving cycles. In [21], the GT strategy is applied for a fuel cell/battery/supercapacitor hybrid system to maximize a payoff function consisting of powertrain efficiency and vehicle performance. Simulation conclusions demonstrated the GT approach could benefit the two objectives simultaneously. Another example of the GT to energy management for an engine/battery/supercapacitor hybrid system is described in [22]. The author formulated a current control problem among the three power sources using noncooperative game theory approach. It was analytically proved that the Nash equilibrium can be iteratively calculated and the preference of each player, namely, the fuel consumption and battery life, could be satisfied simultaneously.

For energy management problem of hybrid system, each power source is personalized as a participant and which chooses certain amount of power supply to maximize its own optimization function depending on the actual power demand of driving cycle, which can be predictively obtained by employing some predictive algorithms, such as neural network [23], support vector machine [24], and Markov chain models [25], as well as advanced sensor technologies, such as radar [26] or geographic information system (GPS) [27]. For most of previous research works, the games are developed based on the assumption that the prediction is completely accurate. The decision of each participant is completely believable. From the opposite perspective, this paper focuses on the problem when the predictive driving cycle information is inaccurate and how the players in the game to adjust their individual strategies and achieve their final equilibrium. The main contributions that are fundamentally different from prior research in the literature are summarized as follows. First, we propose the game scenario that the players make their decisions encountering uncertain power demand information. Second, an iterative algorithm with a fuzzy logical controller for correction is proposed to ensure that the Nash Equilibrium can be effectively reached. Finally, the effectiveness is validated by a comparison simulation test.

The remainder of the paper is organized as follows. Section 2 presents the fuel cell and the battery hybrid energy storage system modeling. The game theory energy management strategy considering uncertain power information is described in Section 3. A case study is given to evaluate the effectiveness of the strategy in Section 4, followed by conclusive remarks in Section 5.

2. Hybrid Energy Storage System Modeling

The object plant of this study is a fuel cell/battery hybrid energy storage system with parallel topology structure, as shown in Figure 1.

2.1. Modeling of Fuel Cell

Owing to energy conversion loss, the actual yielded voltage of a fuel cell can be described by the following expression: where is actual yielded voltage, is Nernst voltage, is awaken voltage, and is internal resistance voltage. The Nernst voltage can be written bywhere is fuel cell open-circuit voltage, is experience factor, is kinetic temperature, is air coefficient, is Faraday constant, is hydrogen flow stress, is oxygen flow stress, and is reaction delay effect. , and can be calculated by the following variable relations:where is molar coefficient of hydrogen reaction, is course coefficient of hydrogen reaction, is flow of hydrogen input-output, is correction factor, is molar coefficient of oxygen reaction, is course coefficient of oxygen reaction, is flow of oxygen input-output, is fixed factor, and is delay coefficient of input-output flow. The hydrogen and oxygen flow rate of fuel cell can be depicted as follows:where is flow size and is use efficiency.

The awaken voltage is written as function of current and temperature as follows:where , =1,2,3,4, is a constant coefficient and is oxygen AOEL, which is given by

The internal resistance voltage is direct proportion to current when the resistance is assumed to be a constant.Finally, the total voltage of the fuel cell stack with N cells in series can be calculated byThe specific parameters and their values used in the model are given in Table 1.

2.2. Modeling of Battery

The adopted battery equivalent circuit model in this study is illustrated in Figure 2.

According to the electrotechnics principle, the work voltage of the battery is induced by where is the work voltage of the battery, is the work current of the battery, is capacitance terminal voltage, and is the internal resistance. The work current can be obtained by the following equation:where is load power demand.

In the circuit, the voltage and current are used to describe Polarization effect of the small RC branch and can be obtained by where is related to time constant T and equal to −exp  + 1 , and , and is simulation step length.

The state of charge (SOC) of the battery is estimated by an ampere-hour integration method.

In (16), the actual SOC is influenced by its initial energy state SOC0, as well as battery coulomb efficiency φ and charging/discharging coefficients and kdis. Cbat is nominal capacity supplied by manufacturer. The correlation parameter in the battery model is given in Table 2.

2.3. Modeling of DC/DC Converter

The DC/DC converter is used to control voltage of the fuel cell and the battery for satisfying different load power demand. Owing to frequent boost and buck operations, the efficiency of energy management is largely influenced by the DC converter. The efficiency curve of the adopted DC converter in this study is illustrated in Figure 3. It can be seen that the DC efficiency is related to voltage and current of the power source. Therefore the output power of the DC converter can be written by

3. Game Theory Energy Management Strategy considering Uncertain Power Information

In this section, the optimization functions of the fuel cell and the battery are first developed. Based on the optimization functions, the optimal power values of each power source can be obtained. After that, we employ game theory formulation to describe the competing interaction between the fuel cell and the battery. In this study, the optimization objective of the fuel cell is to maximize its hydrogen economy.

The efficiency curve of a fuel cell with maximum 50 kW power capacity is shown in Figure 4. The efficiency data is obtained from ADVISOR software. It can be found that the hydrogen efficiency is directly related to fuel cell power. To assess the hydrogen consumption, the relationship between the hydrogen power and the net power needs to be first established. For this, a quadratic function is adopted and expressed by where is the hydrogen power of the fuel cell, is the net power of the fuel cell, and , , and are the fit coefficients. The fitted result is shown in Figure 5.

Then, the objective function of the fuel cell can be formulated as follows: where is the unit hydrogen price ($/g), =0.00341$/g [28], and is the lower heating value of hydrogen (J/g).

For the battery, due to vulnerable nature to use conditions, its degradation is often considered as an optimization objective in energy management. In this study, a battery capacity model is adopted to quantify the battery degradation according to the charging/discharging conditions [29]. The degradation model is empirically built using the following mathematic description:where is the index coefficient, Qloss is the actual capacity decrement of the battery, T is the Kelvin scale (K), is the awaken energy (J mol−1), and R is the air factor (J (mol−1/k)−1). z is the law coefficient of the battery power, Crate is the charging/discharging rate, and is the correct coefficient of the battery charging/discharging rate. All parameters used in the model are given in Table 3.

The actual charging/discharging amount of the battery is quantified by

Considering the battery capacity degradation range from its nominal scale Cbat to 0.8 Cbat, therefore the cost function of battery degradation can be formulated as follows:where Qbat is the normal capacity of the battery, Ubat is the normal voltage of the battery, and p is the capacity cost of the battery, p=1600$/kWh [30].

For the descried topology structure in this study, the demands of the fuel cell are strongly coupled with that of the battery and their optimization objectives are competing; e.g., the hydrogen consumption cost can be reduced at the cost of sacrificing the battery degradation cost, and vice versa. For such a coupled hybrid system, it is crucially important to capture the tradeoff between the fuel cell and the battery. To achieve this purpose, we employ noncooperative game theory to formulate the competing interaction, in which one player strategically chooses the amount of the load power it could supply so as to maximize its own optimization function according to the load power demand and the strategy the other player adopted. In such a way, the two players can interact with each other and finally reach their equilibriums.

To clarify this problem, each component of the game formulation defined in the form will be explained fully as follows.

(1) Here, “F” represents the fuel cell, and “B” represents the battery. In the game, they act as two competing players and respond to each other.

(2) The strategy of each player is the power it could supply which corresponds to the amount of the power demand from the powertrain satisfying their individual constraints, such as current and power constraints.

(3) The utility function is used to quantify the level of satisfaction that the optimization objective of each player is achieved.

The utility function of each player is formed in quadratic form, which is continuous and quasiconcave in the strategy space for ensuring the existence of the Nash equilibrium and as well as its uniqueness. For the fuel cell, we consider the following utility function:

In (24), is a constant coefficient and is the weighting coefficient (). The item is used to quantify the closeness of the actual power and its optimal value that minimizes the hydrogen consumption cost. The item is used to quantify difference between the actual power supplied by the fuel and the load power demand based on the assumption that the battery minimum degradation cost is achieved.

The utility function in the same form is also applied for the battery and written by

The differential expressions of the utility function in (24) and (25) with and are derived as follows:

When the players try to pursue their individual minimum cost, mathematically we have

By solving the above two equations, the strategies for the fuel cell and the battery are explicitly described by

From (30) and (31), the Nash Equilibrium is the power sequence (, , i, j=1,2,…,N) that minimizes the hydrogen consumption and the battery degradation simultaneously. To reach the Nash Equilibrium, an iterative algorithm is needed to find the appropriate power pair at each instant by adjusting the weighting coefficient . The specific iteration process is shown in Figure 6.

From Figure 6, the optimal powers and that, respectively, minimize hydrogen consumption and battery degradation costs are calculated using particle swarm optimization for a given driving cycle. Detailed description of the algorithm process is expounded in [31]. In this study, we assume that the given driving cycle is deviating from its real condition due to inaccurate prediction. In this case, the reliability of the optimal power and depends on prediction accuracy of driving cycle. To account for the influence of inaccurate information on the decisions of the players, the optimal powers and in the utility functions are weighted using a confidence coefficient. Then the utility function in (24) and (25) should be modified by where represents the confidence coefficient. The more accurate the prediction is, the smaller the coefficients are. In general, the prediction accuracy of the driving cycle can be evaluated using mean value (MV) and the standard deviation (SD) of prediction error for a given length prediction horizon. To analyze this problem, a fuzzy logic controller, as shown in Figure 7, is designed. The input variables of the fuzzy logic controller are the MV and the SD of prediction error and the length of prediction horizon (LH); the output variable is the coefficient .

In the fuzzy controller, the fuzzified descriptions of the input and output variables are realized by using the triangular membership function, which are shown in Figures 8 and 9. In the membership function, the MV and SD of the vehicle speed error are divided into five fuzzy subsets. Specifically, the MV includes “L (Low)”, “ML (Medium Low)”, “M (Medium)”, “MH (Medium High)”, and “H (High)”. The SD includes “S (Small)”, “MS (Medium Small)”, “M (Medium)”, “MB (Medium Big)”, and “B (Big)”. The LH is divided into “S (Short)”, “M (Medium)”, and “L (Long)”. The confidence coefficient is divided into “L (Low)”, “ML (Medium Low)”, “M (Medium)”, “MH (Medium High)”, and “H (High)”.

The MV and the SD are considered for each prediction horizon. Consequently, three subrule bases can be obtained and shown in Table 4.

4. Case Study

In this simulation study, the UDDS (Urban Dynamometer Driving Schedule) is adopted to serve as the actual driving cycle. An inaccurate prediction case is incorporated in the driving cycle, which is shown in Figure 10. The simulation vehicle main parameters are given in Table 5.

The prediction horizons are set to three different lengths, e.g., 5s, 10s, and 15s. When vehicle is running, for each prediction horizon, the mean value and standard deviation of the vehicle speed error can be online calculated according to the actual speed profile and the predictive result. The information is subsequently used by the fuzzy logic controller to calculate the confidence coefficient. For the purpose of performance comparison, the same two game theory based strategies are implemented with and without taking prediction error into consideration.

Figures 1113 show the comparison results of the fuel cell power, the battery power, and SOC. In Figure 11, the prediction horizon is set to 5s. From Figure 11(a), it can be observed that the fuel cell power with prediction error into consideration is much smaller than that without prediction error into consideration. This is because the optimal power in the power distribution game, which has obvious effect on Nash equilibrium, e.g., the amount of the power distributed to each power source, is directly out of its real value due to the inaccurate prediction information. Consequently, the hydrogen consumption cost increases. This phenomenon becomes sometimes good or bad according to the actual prediction accuracy. When the fuel cell power increases, on the contrary, the battery power will be reduced while the power demand is supplied by the fuel cell and the battery together. Accordingly, the battery powers are compared and shown in Figure 11(b). Owing to less accumulated charging and discharging power in the case that the prediction error is ignored, hence the battery degradation cost is reduced. The cost is specifically listed in the Table 6. For 5s prediction horizon, the hydrogen consumption and battery degradation cost are 0.02068$ per kilometer and 0.009692$ per kilometer; hence the total cost is 0.03037$ per kilometer, which is smaller than that in the case that the prediction error is out of consideration. This is because the hydrogen consumption cost is obviously higher than the battery degradation cost. Although the total cost can be reduced by improving the battery power, the range is limited to the battery parameters, such as maximum power and state of charge.

Figures 12 and 13 show the power comparison of the fuel cell and the battery with 10s and 15s prediction horizon, respectively. It can be further found that the fuel cell power is improved as the prediction horizon is extended. Accordingly, the battery power is reduced and the total cost is increased. This phenomenon can be explained that when the prediction horizon is extended, more prediction errors get involved in predictive information, which leads to a relatively low reliability of the optimal power.

It can be further observed from the evolution of the battery SOC that a big SOC drop occurs at the time interval from 200s to 300s, in which the peak discharge power is effectively supplied by the battery. Two advantages can be found by the proposed consideration compared with the conventional one. First, the battery is reasonably controlled to supply peak power with less adverse influence of prediction error. Second, the battery SOC is always maintained within appropriate range. As a result, the battery always has enough space for absorbing the regenerative braking energy while satisfying the acceleration demand.

To further verify the effectiveness of the proposed strategy, a fuzzy logic controller is developed for a performance comparison. The controller has two inputs that are the load power and the battery SOC, and one output is the fuel cell power, which is illustrated in Figure 14.

In the developed fuzzy logical controller, the triangular membership functions are used for depicting the fuzzy relationships of load power, battery SOC, and fuel cell power within their possible variation range Figures 1517. The load power has five membership functions including negative big (NB), negative small (NS), medium (M), positive small (PS), and positive big (PB). The negative power represents the power demand being absorbed by the hybrid system, and the positive power represents the power demand being supplied by the hybrid system. The battery SOC has three membership functions including low (L), medium (M), and high (H). In this controller, the reference battery SOC is assumed to be 0.8. If the battery SOC is low, more regenerative power should be distributed to the battery; on the contrary, if the battery SOC is high, the minimum power should be withdrawn from the fuel cell regardless of the load power. The case where battery SOC is medium, the fuel cell power should be controlled to adapt to variation of the load power. The fuel cell power has four membership functions including small (S), medium (M), big (B), and too big (TB). The fuzzy logic rule base is given in Table 7.

Two strategies are both implemented on the condition that the driving cycle prediction is inaccurate, as shown in Figure 10. The simulation results are given in Figures 1618. From Figure 16, it can be observed that the peak power and regenerative power of the load can be more effectively absorbed by the battery using the proposed strategy compared with the fuzzy logic. Benefitting from this, the fuel battery power is shaved and smoothed (clearly seen from Figure 17), which could potentially extend the life of the fuel cell. From the evolution of the battery SOC, it can be obviously found that the SOC fluctuates more frequently to participate in the power distribution over the whole driving cycle by the proposed strategy. Besides, the SOC is also regulated within a reasonable fluctuation range for ensuring the battery has enough space for absorbing the peak or regenerative power.

5. Conclusion

In this paper, a game theory energy management strategy for a fuel cell/battery hybrid energy storage system has been proposed. The competing interaction between the fuel cell and the battery has formulated a noncooperative game, in which each player pursues its own maximum utility depending on the power demand information. The uncertain behavior of the power demand due to inaccurate prediction of driving cycle has been considered when the utility function is designed for each player. Iterative algorithm has been designed to implement the strategy by introducing a fuzzy logic controller for correction. Comparison results reveal that the proposed method can effectively reduce the influence of the prediction accuracy on the Nash Equilibrium. Both the cost of the fuel economy and the battery degradation are concurrently curtailed to a minimum level.

This study tries to propose a new approach for solving energy management problem of a fuel cell/battery hybrid energy storage system. Its effectiveness has been fully demonstrated by comparison analysis. However, the proposed approach needs deep knowledge of the each control plant and therefore cannot be directly extrapolated to other hybridizations with different power sources. This constitutes one of the main disadvantages.

Data Availability

The Matlab data used to support the findings of this study are included within the article. The specific iteration process of the algorithm is also described in the paper.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by Project of Liaoning Province Major Technology Platform, Grant JP2017002; Guidance Plan of Natural Science Foundation of Liaoning Province, Grant 20180551280; National Science Foundation of China, Grant 51675257; Project of Liaoning Province Innovative Talents, Grant LR2016054.