Abstract

This paper presents a neural network predictive control strategy to optimize power distribution for a fuel cell/ultracapacitor hybrid power system of a robot. We model the nonlinear power system by employing time variant auto-regressive moving average with exogenous (ARMAX), and using recurrent neural network to represent the complicated coefficients of the ARMAX model. Because the dynamic of the system is viewed as operating- state- dependent time varying local linear behavior in this frame, a linear constrained model predictive control algorithm is developed to optimize the power splitting between the fuel cell and ultracapacitor. The proposed algorithm significantly simplifies implementation of the controller and can handle multiple constraints, such as limiting substantial fluctuation of fuel cell current. Experiment and simulation results demonstrate that the control strategy can optimally split power between the fuel cell and ultracapacitor, limit the change rate of the fuel cell current, and so as to extend the lifetime of the fuel cell.

1. Introduction

As the rapid development of modern industrial technology, Ocean technology, and space technology, more and more mobile robots are demanded in these areas. Because of the advantages in operating time, weight, and dimensions, proton exchange membrane (PEM) fuel cells have been considered as alternative power sources for mobile robots.

A mobile robot usually has multiple freedoms, which cause the electric load drastically to fluctuate. Sudden changes in power may significantly reduce the operating life of fuel cells in a long term [1, 2]. Furthermore, fuel cells have the characteristics of unidirectional power flow and they cannot absorb the energy from regenerative braking of a robot. As a result, fuel cells are usually arranged with auxiliary power sources to form hybrid power systems and drive mobile robots. Ultracapacitors are highly suitable for the bulk of the transient power demands since the charge/discharge current of an ultracapacitor can vary in a wide range. In this paper we choose a bank of ultracapacitors as auxiliary power source.

A smart power split strategy is indispensable to enhance performance and lifetime of the hybrid power system. Jiang et al. [3] presented an adaptive control algorithm that adjusted the output current set point of the fuel cell. Ferreira et al. [4], Li et al. [5], and Kim et al. [6] developed a fuzzy controller to optimally distribute the power between the fuel cell and the battery. Rodatz et al. [7] designed an optimal control strategy to minimize the hydrogen consumption in a hybrid fuel cell system. Paladini et al. [8] proposed an optimal control strategy to power a vehicle with both fuel cell and battery to reduce fuel consumption. Lin et al. [9] studied a dynamic programming (DP) algorithm based on the fuel consumption and exhaust gas emission for a parallel electric vehicle. These strategies are effective in dealing with system efficiency but address little the lifetime of the fuel cell stack due to rapid load demand variations. Zhang et al. [10] presented a wavelet-transform algorithm to identify and allocate power demands with different frequency contents to corresponding sources to achieve an optimal power management control algorithm. This algorithm can protect fuel cell effectively but is complex and difficult to apply online. Xu et al. [11, 12] and Simmons et al. [13] proposed optimal real-time energy management strategies for a proton electrolyte membrane (PEM) fuel cell bus based on the Pontryagin’s Minimal Principle and the determined dynamic programming (DDP). Ziogou et al. [14] deployed a dynamic optimization approach based on nonlinear model of fuel cell. Li et al. [15] developed a constrained model predictive control of a solid oxide fuel cell based on genetic optimization.

Undoubtedly, the fuel cell power systems are nonlinear. Therefore, the global optimization based energy management strategies depend on nonlinear models of the fuel cell power systems and are time costly. Model predictive control (MPC) has been recognized as a powerful methodology for controlling a wide class of nonlinear dynamic system [16]. In this paper we use MPC appropriately, distribute power between the fuel cell and ultracapacitor, avoid frequent fluctuation of fuel cell current, and so enhance the transient performance and extend the operating life of the hybrid system.

There have been three main methods for nonlinear system modeling and predictive control [17]. The first one uses a piecewise linearization to describe the nonlinear behavior of a system. Each model is effective only in a small region, which results in that a mass of models is required [18]. The second one directly employs nonlinear models, but these involve a nonlinear online optimization problem with constraints, which is usually time-consuming and may even be unable to guarantee a feasible solution for real time control [19]. The third method is to use a local linearization approach representing a nonlinear plant, which is valid and simplifies the implement [2024].

This paper proposes an ARMAX (Autoregressive Moving Average with Exogenous input) modeling approach for fuel cell power systems. Time-variant coefficients of the ARMAX model are estimated by a recurrent neural network. The RNN-ARMAX model is an equal linear model of the fuel cell power system. Therefore, we design linear constrained model predictive control based on the RNN-ARMAX model for the nonlinear fuel cell power system. The design and implementation of the controller are significantly simplified and the method can protect fuel cell from substantial fluctuation of current by trading off transient current demand from the fuel cell to the ultracapacitor, according to constraints and weighting matrices of the output errors.

The remainder of this paper is organized as follows. Section 2 describes RNN-ARMAX modeling of the fuel cell power system. MPC is designed in Section 3. In Section 4, we implement and discuss simulation results. Conclusions are given in Section 5.

2. RNN-ARMAX Modeling

We aim at the optimization of electric power distribution between the fuel cell and ultracapacitor of a fuel cell robot.

2.1. System Structure and Description

The fuel cell power system studied in this paper, as shown in Figure 1, is designed for a mobile robot. The electrical output of the PEM fuel cell is connected to the load through a unidirectional DC/DC converter, and an ultracapacitor bank is also connected to the load through a bidirectional DC/DC converter to form a hybrid fuel cell system. The ultracapacitor bank should supply peak power and be recharged by the fuel cell.

The distribution of power between the fuel cell and the ultracapacitor depends on the duty ratio of the DC/DC converters. Duty ratio of a DC/DC converter is defined as the ratio of switch on time interval, , to switching period ; that is,

There is one duty ratio, , in the unidirectional DC/DC converter for controlling output power of the fuel cell. In the bidirectional DC/DC converter, one duty ratio, , is for charging the ultracapacitor, and the other, , is for discharging the ultracapacitor. Power distribution is optimized by controlling the three duty ratios.

2.2. Identification

The hybrid system is a multiple input and multiple output nonlinear system. The control input variables are three duty ratios of the power converters. Input variables are expressed as

The output variables contain output voltage of the fuel cell and the state of charge of the ultracapacitor and so forth. Output variables are chosen as where is voltage of the fuel cell, is current of the fuel cell, is current of the ultracapacitor, is state of charge of the ultracapacitor, is the bus voltage: and is the bus current, respectively. Power demanded by the load, , is viewed as a disturbance to the system. We can describe the model as the following nonlinear function: whereis the regression vector with known order and ,andare dimensions of output and input,is the system disturbance, andis an unknown nonlinear function, respectively.

If we design MPC based on direct use of the nonlinear model, it involves the online solution of a higher order nonlinear optimization problem with constraints, which is usually computationally expensive and may even be unable to guarantee a feasible solution for real time control.

Here we use RNN-ARMAX to model the system. Performing Taylor expansion on the nonlinear functionaround the regionas

We introduce the notation where and the coefficientsare nonlinear function of.

We have a regression form of the system described by (4) as follows:

Here the parameter vectoris time variant. The recurrent neural network (RNN) that consists of feed-forward and feedback connections is well known to be capable of modeling and control nonlinear system. We use RNN to estimate. The recurrent neural network modeling principle is shown in Figure 2.

The RNN is expressed as where is output of the RNN andare weights for the RNN among the output layer, the input layer, and the hidden layer. Define , , and as the node amounts of the output layer, the input layer, and the hidden layer, respectively.  and are expressed as

Then the output of the system is predicted by where and

The performance criterionof the neural network is then defined by where is sampled output of the system. Therefore, the weights are adjusted to reduce the cost functionto a minimum value by the gradient descent method. The weight vectors are updated along with whereis a positive learning rate.

Let be the quotient and remainder of , respectively. If , then set . Else set .  , , and    are then calculated as follows: where

The update rules of (15) call for a proper choice of the learning rate. For a small value ofthe convergence is guaranteed but the speed is slow; ifis too big, the algorithm becomes unstable. Here we develop a guideline in selecting the learning rate properly. A discrete Lyapunov function is given by where

Thus the change of Lyapunov function due to the training process is obtained by

The error difference due to the learning is represented by whererepresents a change in an arbitrary weight vector.

From the update rule (15),

Then we have the following general convergence theorem.

Theorem 1. is the learning rate for the weights of RNN andis the usual Euclidean norm in. Then the convergence is guaranteed ifis chosen as

Proof. From equations (20)–(22),can be calculated as
To guarantee,should satisfy the following inequality
From inequalities (25) and (26), we obtain
Namely,  satisfies
This proves the theorem.

We can establish a state space model from the matrix polynomials (7), (8), and (9) by defining a state vector given by

A state space model can then be given by where

Model (30) is a state space representation of MIMO RNN-ARX model (4). The parameters in andare estimated by the RNN, and the state   at time can be easily obtained by (29) according to the present output , the past input/output data, and output of the RNN.

3. Controller Design

A predictive controller will be designed to predict the output trajectory of the fuel cell power system and compute a series of control actions, subject to constraints, that will minimize the difference between the predicted trajectory and desired trajectory. A prominent advantage of this controller over other control schemes is its ability to deal with constraints in a systematic and straightforward manner.

To design predictive controller for the system, an objective function is defined as [18] where is predictive horizon, is the estimated output of the system at instant through models based on information available at instant. is the desired output at instant , and , are weighting matrices on output errors and control, respectively. We choose the control horizon to be equal to the prediction horizon and defineand, where,, and are penalties on errors in   and, respectively., andare penalties on and , respectively.

Substituting state equations (30) into (32), the equation is abbreviated as where is system output at instant , and are constant matrices calculated through the system model and matrices .

Consider the following:

In the hybrid system, there are several limits to deal with. Rapid variation on current will reduce lifetime of fuel cell, so it is required to constrain the fluctuation of fuel cell current; that is, where is the acceptable maximum value. Moreover, the state of charge of the ultracapacitor, the current of the ultracapacitor, and the voltage of the fuel cell should be limited to some expected range: where and are the lower limitations, , , and are the upper limits, respectively. These limitations are determined by the characteristics of the ultracapacitor and fuel cell.

A prominent advantage of MPC is its ability to deal with constraints. Deduced from equations (30), (32) and inequalities (35)–(38), the control optimization is transformed to the following constrained quadratic programming problem: where  , and are constant matrices obtained from (30) and inequalities (35)–(38). We can solve this optimal problem using the neural network method investigated in [25].

4. Experiment and Simulation

The hybrid fuel cell system, as shown in Figure 1, is designed to power a robot. The rated power is 500 W. The DC bus voltage is controlled around 24 V. The PEM fuel cells have 40 cells and an active area of 22 cm2. The ultracapacitor is 200 F and the rated voltage is 24 V. The value of capacitance can be realized by a bank of 8 ultracapacitors, each with capacitance of 1600 F and a rated voltage of 3 V, connected in series. The upper and lower limits of are 1 and 0.45, respectively. The maximum stored energy is 16 W h, although only 12.76 W h is available between the maximum and minimum of . This 12.76 W h corresponds to an average power at 500 W for 92 seconds and that is sufficient to buffer the fuel cell from acceleration transients.

4.1. Modeling Experiment and Simulation

When real input and output data of the PEM fuel cell was sampled, the operating parameters are shown in Table 1.

The collected data are equally divided into two groups. The first group is used for modeling and the second group is used for validating. The simulated and measured V-I characteristics curves of the fuel cell are shown in Figure 3. Current of the ultracapacitor changes as Figure 4, and the simulated and measured voltage curves are shown in Figure 5. It is shown that the RNN-ARMAX model closely matches the practical fuel cell power system.

4.2. Control Simulation

Control performances of constrained and unconstrained MPCs are studied and compared to validate the proposed constrained MPC. The constraints of the constrained MPC are listed in Table 2.

A typical load cycle that is used in simulation and the power profile, as shown in Figure 6, is considered as the power demand.

The simulation results for both the unconstrained and the constrained MPC are shown in Figure 7. It is shown that, there exist significant perturbations in current of fuel cell for unconstrained MPC. This phenomenon may cause oxygen starvation because the dynamic response of oxygen supply is slower, while in the case of the constrained MPC, current and voltage are much smoother.

In the case of constrained MPC, the oscillation of of the ultracapacitor is much larger than that of the unconstrained MPC. The reason is that constrained MPC draws much more energy from the ultracapacitor to supply the peak load and so limits perturbations of the current of the fuel cell.

Constraint results are shown in Figure 8. It’s exciting that the maximum rate of change of the fuel cell is 0.4 A/s, the minimum voltage of the fuel cell is 27.5 V, the charge and discharge current of the ultracapacitor are no more than 30 A, and the of the ultracapacitor is between 0.45 and 1. It is shown that these variables change in the desired and constrained ranges. These phenomena demonstrate that the constraints on the fuel cell power system are valid.

The power split under the constrained MPC is shown in Figure 9. We set the minimum voltage of the fuel cell as 27.5 V and the corresponding maximum power of fuel cell as 500 W. It is noticed that the fuel cell power changes in low speed and is no more than 500 W. The high frequency power demands are squeezed from the ultracapacitor. Furthermore, ,   and other constrained variables satisfy their constraints. Consequently, the output power of the fuel cell is well controlled and it is helpful to extend the operating life of the fuel cell.

5. Conclusions

RNN-ARMAX model was established and linear constrained MPC was developed and verified for a fuel cell power system. The proposed approach, different from other approaches, models the nonlinear fuel cell power system as linear time varying system. Accordingly, linear constrained MPC can be used to globally optimize power distribution and deal with limitations. The design and implementation of the controller are significantly simplified and the method can protect fuel cell from substantial fluctuation of current by trading off transient current demand from the fuel cell to the ultracapacitor.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Contracts 61374050 and 51106116, the “863” Program of China under Contract 2012AA110601, and the Fundamental Research Funds for the Central Universities of China (WUT: 2013-II-016).