About this Journal Submit a Manuscript Table of Contents
Mathematical Problems in Engineering
Volume 2012 (2012), Article ID 581040, 22 pages
doi:10.1155/2012/581040
Research Article

Optimality Condition-Based Sensitivity Analysis of Optimal Control for Hybrid Systems and Its Application

State Key Laboratory of Industrial Control Technology, Institute of Industrial Process Control, Zhejiang University, Hangzhou 310027, China

Received 4 April 2012; Revised 21 June 2012; Accepted 11 July 2012

Academic Editor: Jun Hu

Copyright © 2012 Chunyue Song. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Gradient-based algorithms are efficient to compute numerical solutions of optimal control problems for hybrid systems (OCPHS), and the key point is how to get the sensitivity analysis of the optimal control problems. In this paper, optimality condition-based sensitivity analysis of optimal control for hybrid systems with mode invariants and control constraints is addressed under a priori fixed mode transition order. The decision variables are the mode transition instant sequence and admissible continuous control functions. After equivalent transformation of the original problem, the derivatives of the objective functional with respect to control variables are established based on optimal necessary conditions. By using the obtained derivatives, a control vector parametrization method is implemented to obtain the numerical solution to the OCPHS. Examples are given to illustrate the results.

1. Introduction

In many fields of applications, such as powertrain systems of automobiles and multistage chemical processes, dynamics of the systems involve a sequence of distinct modes with fixed mode transition order, forming a hybrid system characterized by the coexistence and interaction of discrete and continuous dynamics (the mode is commonly denoted by a discrete state of the systems in hybrid systems literature). To achieve some overall optimal performance for the systems, the duration and the admissible continuous control function of each mode must be determined as a whole [13]; thus, it necessitates the use of theories and techniques for the analysis and synthesis of hybrid dynamical systems. With the growing importance of hybrid models, various classes of hybrid systems for analysis, design, and optimization have been addressed by research communities in recent years. For more discussions on various literature results, the reader is referred to [48], and the references therein.

The existed results on OCPHS can be divided into the following two categories. One is about the optimal control theory on OCPHS. The theory inherits conventional optimal control theory and can be regarded as the extension of conventional optimal control theory [3, 914]. When control can take any value, Xu and Antsaklis [3] and Hwang et al. [9] addressed the variational method for hybrid systems. Sussmann [10], Shaikh and Caines [11], and Dmitruk and Kaganovich [12] established the Maximum Principle for hybrid systems with control constraints. Branicky et al. [14] and Bensoussan and Menaldi [13] provided the dynamic programming principle for general hybrid systems.

The other results focus on how to compute optimal control for OCPHS, which can be carried out by using a wide variety of methods (see [3, 6, 11, 1520] and the references therein). Given a prespecified order of mode transitions, Xu and Antsaklis [3] obtained the optimal continuous control and optimal switching instants based on parameterization of the switching instant for switching hybrid systems with free control. Under a fixed switching sequence of modes, Attia et al. [19] considered an optimization problem for a class of impulsive hybrid systems where continuous control function is not involved. When switching hybrid systems with control constraints are considered, Shaikh and Caines [11] proposed two algorithms for obtaining the optimal control. As far as switching hybrid systems without external continuous control function are concerned, Egerstedt et al. [6] and Johnson and Murphey [18] derived the gradients and second-order derivatives of the cost functional, respectively, and used them to design an associated algorithm to get the mode transition instants. Based on the hybrid Maximum Principle, Taringoo and Caines [20] provided gradient geodesic and Newton geodesic algorithms for the optimization of autonomous hybrid systems, and convergence analysis for the algorithms was also provided. From the view of dynamic programming, Seatzu et al. [16] provided an optimal state feedback control law to switched piecewise affine autonomous systems. Generally, these algorithms pose the hierarchy [17, 21, 22], and the basic module of the hierarchical algorithms is how to get optimal continuous control and optimal mode transition instants, though the main challenge of OCPHS is how to get the optimal mode transition order. The basic module of the hierarchical algorithms is commonly gradient based due to that gradient information can provide a better searching direction and hence reduce computation burden and help the gradient-based algorithms converge quickly, which motivates us to pay attention to the sensitivity analysis of optimal control for hybrid systems.

Although the derivative of cost functional with respect to switching instants has been discussed in the aforementioned literature [3, 6, 18], the derivative of cost functional with respect to control function is not involved. When hybrid systems are considered, due to the coexistence and interaction of discrete and continuous dynamics, the derivative of cost functional w.r.t control functions is nontrivial and is not directly formulated by 𝜕 𝐻 / 𝜕 𝑢 as conventional optimal control indicates, where 𝐻 is the Hamiltonian function. The derivative will be a function of the derivatives of continuous states w.r.t control functions at the instants of subsequent modes. In this paper, the derivatives of cost functional w.r.t control functions are established analytically, which can facilitate the design of associated gradient-based algorithms.

Motivated by the work of Vassiliadis et al. [1, 2] and Jennings et al. [23], in this paper, optimal control problem of hybrid systems (OCPHS) with mode invariants which describe the conditions that continuous states have to satisfy at this mode are considered. Based on optimal necessary conditions, the derivatives of the objective functional w.r.t control variables, that is, the mode transition instant sequence and admissible continuous control functions, are derived analytically. As a result, a control vector parametrization method is implemented to obtain the numerical solution to optimal control of the hybrid systems with the obtained derivatives. The sensitivity analysis in Vassiliadis et al. [1, 2] is similar to the work, in which the sensitivity of states w.r.t control parameters is directly obtained from the state equations and the sensitivity of objective functional with respect to control parameters is not involved. In contrast, this paper derives the derivatives of cost functional w.r.t control variables based on the optimality conditions and gives the explicitly expression of the derivatives. Therefore, the main contributions of this paper are listed as follows. (a) Optimality conditions-based sensitivity analysis of optimal control for hybrid systems with mode invariants are given explicitly, and (b) following the given derivatives, a control vector parameterization method is designed to obtain the numerical solution. Compared with the existing results on the OCPHS with fixed mode transition order, the settings in this paper cover not only the control constraints, but also the continuous states constraints, which makes the results here more general.

The paper is organized as follows. In the next section, the hybrid system with mode invariants and its optimal control problem are formulated. In Section 3, the equivalent problem and associated optimal conditions are analyzed. The derivatives of the objective functional w.r.t control variables are established in Section 4, and a control vector parametrization approach is also proposed in this section. Some numerical examples are presented in Section 5, and Section 6 contains conclusions.

Terminology and Notation
denotes the set of positive integers. and + denote the set of real numbers and nonnegative real numbers, respectively. 𝐴 𝑇 denotes the transpose of a vector (or a matrix) 𝐴 . 𝐶 𝑙 ( [ 𝑎 , 𝑏 ] , 𝑛 ) denotes the family of continuous functions 𝑓 from [ 𝑎 , 𝑏 ] to 𝑛 with up to 𝑙 order derivatives. denotes the Euclidean norm.

2. Hybrid Systems and Its Optimal Control Problem

2.1. Hybrid Systems

Engineered systems, such as chemical engineering systems and powertrain systems of automobiles, always undergo multiple modes which are represented by a discrete state 𝑖 taking values from set 𝐼 { 1 , 2 , , 𝑀 } and pose hybrid characters. The evolution of discrete state 𝑖 is determined by mode transition sequence. A mode transition sequence schedules the sequence of active modes 𝑖 𝑗 , 𝑖 𝑗 𝐼 and is a sequence of pairs of ( 𝑡 𝑗 1 , 𝑖 𝑗 ) , which can be defined by { ( 𝑡 0 , 𝑖 1 ) , ( 𝑡 1 , 𝑖 2 ) , } ( 𝜃 , 𝜋 ) where 𝜃 { 𝑡 0 , 𝑡 1 , } and 𝜋 { 𝑖 1 , 𝑖 2 , } are referred to as mode transition instants and mode transition order, respectively. A pair of ( 𝑡 𝑗 1 , 𝑖 𝑗 ) indicates that at instant 𝑡 𝑗 1 , the hybrid system transits from mode 𝑖 𝑗 1 to mode 𝑖 𝑗 . During the time interval [ 𝑡 𝑗 1 , 𝑡 𝑗 ) , mode 𝑖 𝑗 is active and unchanged.

The mode transition order 𝜋 of the considered hybrid dynamical systems is known a priori. Without loss of generality, it is supposed that the mode transition order is { 𝑖 1 , 𝑖 2 , , 𝑖 𝐾 } over the finite horizon [ 𝑡 0 , 𝑡 𝑓 ] , 𝑖 𝑗 𝐼 , 𝑗 = 1 , 2 , , 𝐾 . Moreover, according to each distinct mode, the continuous states are restricted in a specified range which is referred to as mode invariants. Here, the mode invariants are formulated by a set of inequalities. Thus, for each mode 𝑖 𝑗 𝐼 and its active horizon [ 𝑡 𝑗 1 , 𝑡 𝑗 ) , the dynamics of the considered systems can be formulated by ̇ 𝑥 = 𝑓 𝑖 𝑗 𝑝 ( 𝑥 , 𝑢 ) , 𝑖 𝑗 𝑥 𝑡 ( 𝑥 ) < 0 , 𝑗 1 = 𝜓 𝑖 𝑗 𝑥 𝑡 𝑗 1 , g 𝑖 𝑗 𝑥 𝑡 𝑗 = 0 , ( 2 . 1 ) where 𝑥 𝑛 , 𝑢 𝐔 𝑖 𝑗 𝑚 is a piecewise continuous function, 𝑓 𝑖 𝑗 𝑛 × 𝐔 𝑖 𝑗 𝑛 , 𝑡 𝑗 is the mode transition instant when a particular mode transition occurs, 𝑝 𝑖 𝑗 , 𝜓 𝑖 𝑗 , and 𝑔 𝑖 𝑗 are 𝑖 𝑗 < 𝑛 , 𝑛 and 𝑟 𝑖 𝑗 𝑛 dimensional vectors for 𝑖 𝑗 𝐼 , respectively. 𝑛 , 𝑚 , 𝑖 𝑗 , 𝑟 𝑖 𝑗 . To make the hybrid systems formulated by (2.1) well defined, the following assumption is needed.

Assumption 2.1. For any 𝑖 𝑗 𝐼 , 𝑓 𝑖 𝑗 𝐶 𝑙 ( 𝑛 × 𝐔 𝑖 𝑗 ; 𝑛 ) , 𝑙 1 , 𝑙 , and such that a uniform Lipschitz condition holds, that is, there exists 𝐾 𝑓 < such that 𝑓 𝑖 𝑗 ( 𝑥 , 𝑢 ) 𝑓 𝑖 𝑗 𝑥 , 𝑢 𝐾 𝑓 𝑥 𝑥 , ( 2 . 2 ) where 𝑥 , 𝑥 𝑛 , 𝑢 𝐔 𝑖 𝑗 .

Remark 2.2. 𝑝 𝑖 𝑗 ( 𝑥 ) < 0 indicates mode invariant for mode 𝑖 𝑗 𝐼 , which describes the conditions that the continuous states have to satisfy at this mode and can be referred to as the path constraints of the continuous states in Vassiliadis et al. [1, 2].

Remark 2.3. 𝑔 𝑖 𝑗 ( 𝑥 ( 𝑡 𝑗 ) ) = 0 can be referred to as mode transition conditions which describe the conditions on the continuous states under which a particular mode transition takes place. When mode 𝑖 𝑗 is active over [ 𝑡 𝑗 1 , 𝑡 𝑗 ) , then, at 𝑡 𝑗 , 𝑥 meets an ( 𝑛 𝑟 𝑖 𝑗 ) -dimensional smooth manifold 𝑆 𝑖 𝑗 = { 𝑥 𝑔 𝑖 𝑗 ( 𝑥 ) = 0 } and mode transition from 𝑖 𝑗 to 𝑖 𝑗 + 1 occurs. The mode transition conditions implicitly define the mode 𝑖 𝑗 's active horizon [ 𝑡 𝑗 1 , 𝑡 𝑗 ) . To prevent Zeno behavior from occurrence, 𝑡 𝑗 1 < 𝑡 𝑗 is assumed. Physically, the mode transition conditions are always the boundary of closure of the mode invariant 𝑝 𝑖 𝑗 < 0 .

Remark 2.4. 𝑥 ( 𝑡 𝑗 1 ) = 𝜓 𝑖 𝑗 ( 𝑥 ( 𝑡 𝑗 1 ) ) is the outcome of the mode transition and describes the effect that the transition will have on the continuous states. It can be viewed as junction conditions in Vassiliadis et al. [1, 2]. It is assumed that 𝜓 𝑖 𝑗 𝐶 𝑙 ( 𝑛 ) , 𝑙 1 , 𝑙 .

Remark 2.5. Basically, for general hybrid systems, the evaluation of 𝑖 should be formulated by a function of impulsive control or a graph, which generates mode transition sequence, as formulated in Song and Li [24] and Cassandras and Lygeros [8]. However, the order of the mode transition 𝜋 is known a prior here thus, the evaluation of 𝑖 is determined only by the transition instants 𝑡 𝑗 , and the evaluation function of 𝑖 is omitted here.
Besides Assumption 2.1, to make the considered systems to be well defined, there are some additional assumptions on mode invariants and mode transition conditions should be imposed. Here, it is supposed that the mode invariants and mode transition conditions meet the requirements as in Taringoo and Caines [20].

2.2. Optimal Control Problem for Hybrid Systems

Let 𝐿 𝑖 𝐶 𝑙 ( 𝑛 × 𝐔 𝑖 ; ) be a running cost function, 𝜑 𝑖 𝑗 𝐶 𝑙 ( 𝑛 ; + ) be a discrete state transition cost function, and 𝜙 𝐶 𝑙 ( 𝑛 ; + ) be a terminal cost function, 𝑖 , 𝑗 𝐼 , 𝑙 1 , 𝑙 , respectively. The optimal control problem for the hybrid systems (2.1) is stated as follows.

Optimal Problem A
Consider a hybrid system formulated by (2.1), given a fixed time interval [ 𝑡 0 , 𝑡 𝑓 ] and a prespecified mode transition order 𝜋 = { 𝑖 1 , 𝑖 2 , , 𝑖 𝐾 } , find a continuous control 𝑢 𝐔 𝑖 𝑗 in each mode 𝑖 𝑗 𝐼 and mode transition instants 𝜃 = { 𝑡 1 , , 𝑡 𝐾 1 } , such that the corresponding continuous state trajectory 𝑥 departs from a given initial state 𝑥 ( 𝑡 0 ) = 𝑥 0 and meets an ( 𝑛 𝑙 𝑓 ) -dimensional smooth manifold 𝑆 𝑓 = { 𝑥 𝜗 ( 𝑥 ) = 0 , 𝜗 𝑛 𝑙 𝑓 } , 𝑙 𝑓 , at 𝑡 𝑓 and the cost functional 𝑥 𝑡 𝐽 ( 𝜃 , 𝑢 ) = 𝜙 𝑓 + 𝑡 𝑓 𝑡 0 𝐿 𝑖 ( 𝑡 ) ( 𝑥 ( 𝑡 ) , 𝑢 ( 𝑡 ) ) 𝑑 𝑡 + 𝐾 1 𝑗 = 1 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝑥 𝑡 𝑗 ( 2 . 3 ) is minimized.

Remark 2.6. As it is well known, when 𝑡 0 and 𝑡 𝑓 are unknown points in some fixed interval 𝑇 + , this problem can be transformed to one with fixed time essentially by introducing an additional state variable.

There are fruitful strategies about how to compute OCPHS (see [15] and the references therein), and the basic idea is briefly reviewed as follows for completeness.

Obtaining the optimal control for hybrid systems is very difficult due to the interactions between the continuous states and discrete states which produce a mode transition sequence that increases the feasibility range of the decision variables. One algorithm framework for dealing with this complexity is the decomposition method as follows: m i n ( ( 𝜋 , 𝜃 ) , 𝑢 ) 𝐽 ( ( 𝜋 , 𝜃 ) , 𝑢 ) = m i n ( 𝜋 , 𝜃 ) m i n 𝑢 𝐽 ( 𝑢 ( 𝜋 , 𝜃 ) ) = m i n 𝜋 m i n 𝜃 m i n 𝑢 𝐽 ( ( 𝑢 , 𝜃 ) 𝜋 ) , ( 2 . 4 ) where 𝐽 ( 𝑏 ) means that 𝑏 is given.

According to this framework, the master problem is how to get the optimum of the inner functional, that is, minimize 𝐽 ( 𝑢 , 𝜃 ) given 𝜋 . The key point of finding the optimal solution of 𝐽 ( 𝑢 , 𝜃 ) is how to get the sensitivity of the objective with respect to control variables, which provides a better direction for searching and hence reduces computational burden and help associated algorithms converge quickly and accelerate the primary problem convergence eventually.

In next section, the derivatives of cost functional with respect to control variables are established analytically based on optimality condition, which can facilitate the design of associated gradient-based algorithms.

3. Equivalent Problem and Its Optimal Conditions

When control vector parametrization methods are implemented to obtain numerical solution to the OCPHS, updating the parameters of control profiles should be at the same time point when iterative procedure is running. However, the fact is that the mode active horizon [ 𝑡 𝑗 1 , 𝑡 𝑗 ) for mode 𝑖 𝑗 𝐼 is varying during the procedure running, so a fixed horizon should be introduced, which will guarantee the updating of parameters of control profiles is at the same time point. For this purpose, let 𝜏 [ 0 , 𝐾 ] be a time independent variable, and 𝑡 [ 𝑡 𝑗 1 , 𝑡 𝑗 ) can be formulated by 𝑡 = 𝑡 𝑗 1 + 𝑡 ( 𝜏 ( 𝑗 1 ) ) 𝑗 𝑡 𝑗 1 [ , 𝜏 𝑗 1 , 𝑗 ) , 𝑗 = 1 , , 𝐾 . ( 3 . 1 )

In addition, to deal with mode invariants constraints 𝑝 𝑖 𝑗 ( 𝑥 ) < 0 , slack algebraic variable 𝑠 𝑖 𝑗 = [ 𝑠 𝑖 𝑗 1 , , 𝑠 𝑖 𝑗 𝑖 𝑗 ] 𝑇 𝑖 𝑗 + is introduced for each mode 𝑖 𝑗 𝐼 , such that 𝑝 𝑖 𝑗 ( 𝑥 ) + d i a g [ 𝑠 𝑖 𝑗 1 , , 𝑠 𝑖 𝑗 𝑖 𝑗 ] 𝑠 𝑖 𝑗 = 0 . For 𝜏 [ 𝑗 1 , 𝑗 ) , denote 𝐱 𝑗 ( 𝜏 ) 𝑥 ( 𝑡 𝑗 1 + ( 𝜏 ( 𝑗 1 ) ) ( 𝑡 𝑗 𝑡 𝑗 1 ) ) , 𝐮 𝑗 ( 𝜏 ) 𝑢 ( 𝑡 𝑗 1 + ( 𝜏 ( 𝑗 1 ) ) ( 𝑡 𝑗 𝑡 𝑗 1 ) ) , 𝐬 𝑗 ( 𝜏 ) 𝑠 𝑖 𝑗 ( 𝑡 𝑗 1 + ( 𝜏 ( 𝑗 1 ) ) ( 𝑡 𝑗 𝑡 𝑗 1 ) ) , and let 𝐱 = [ 𝐱 1 , , 𝐱 𝐾 ] 𝑇 , 𝐮 = [ 𝐮 1 , , 𝐮 𝐾 ] 𝑇 , and 𝐬 = [ 𝐬 1 , , 𝐬 𝐾 ] 𝑇 .

According to the above definition, the Optimal Problem A can be transcribed into an equivalent Optimal Problem B as follows:

Optimal Problem B
Given a fixed interval [ 0 , 𝐾 ] , find continuous inputs 𝐮 𝐔 𝑖 1 × × 𝐔 𝑖 𝐾 , 𝐬 𝑖 1 + × × 𝑖 𝐾 + and 𝜃 , such that the corresponding continuous state trajectory 𝐱 1 departs from a given initial state 𝐱 1 ( 0 ) = 𝑥 0 and 𝐱 𝐾 meets an ( 𝑛 𝑙 𝑓 ) -dimensional smooth manifold 𝑆 𝑓 = { 𝐱 𝐾 𝜗 ( 𝐱 𝐾 ) = 0 , 𝜗 𝑛 𝑙 𝑓 } at 𝐾 , and the cost functional 𝐱 𝐽 ( 𝜃 , 𝐮 , 𝐬 ) = 𝜙 𝐾 + ( 𝐾 ) 𝐾 𝑗 = 1 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 ( 𝜏 ) , 𝐮 𝑗 ( 𝜏 ) , 𝐬 𝑗 ( 𝜏 ) 𝑑 𝜏 + 𝐾 1 𝑗 = 1 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝐱 𝑗 ( 𝑗 ) ( 3 . 2 ) is minimized, subject to 𝑑 𝐱 𝑗 ( 𝜏 ) = 𝑓 𝑑 𝜏 𝑖 𝑗 𝐱 𝑗 ( 𝜏 ) , 𝐮 𝑗 𝑡 ( 𝜏 ) 𝑗 𝑡 𝑗 1 𝑓 𝑖 𝑗 𝐱 𝑗 ( 𝜏 ) , 𝐮 𝑗 , 𝐱 ( 𝜏 ) 𝑗 ( 𝑗 1 ) = 𝜓 𝑖 𝑗 𝐱 𝑗 1 ( ( 𝑗 1 ) ) , 𝑔 𝑖 𝑗 𝐱 𝑗 ( 𝑗 ) = 0 , ( 3 . 3 ) where 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 = 𝑡 𝑗 𝑡 𝑗 1 𝐿 𝑖 𝑗 , 𝐿 𝑖 𝑗 = 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 + 𝑀 𝑖 𝑗 𝑙 = 1 𝑝 𝑖 𝑗 𝑙 𝐱 𝑗 + 𝑠 2 𝑖 𝑗 𝑙 2 , ( 3 . 4 ) and 𝑀 is a large positive constant.

According to Theorems 2 and 3 in Dmitruk and Kaganovich [12], when 𝑀 is big enough Optimal Problem B is equivalent to Optimal Problem A.

Remark 3.1. The penalty function term, say, 𝑀 𝑖 𝑗 𝑙 = 1 ( 𝑝 𝑖 𝑗 𝑙 ( 𝐱 𝑗 ) + 𝐬 2 𝑖 𝑗 𝑙 ) 2 , cannot always guarantee the state satisfies the mode invariant conditions. However, the method works well in practice; moreover, the mode transition order is fixed in this paper which reduces the negative effect of the penalty function method for OCPHS.

For 𝜏 [ 𝑗 1 , 𝑗 ) , 𝑗 = 1 , , 𝐾 , let 𝜆 𝑗 𝑛 , and define Hamiltonian function 𝐻 𝑗 by 𝐻 𝑗 𝜆 𝑗 , 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 = 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 + 𝜆 𝑇 𝑗 𝑓 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , ( 3 . 5 ) and according to Sussmann [10], Shaikh and Caines [11], and Dmitruk and Kaganovich [12], the following Theorem 3.2 holds.

Theorem 3.2. In order that 𝐮 and 𝐬 are optimal for Optimal Problem B, it is necessary that there exist vector functions 𝜆 𝑗 , 𝑗 = 1 , , 𝐾 , such that the following conditions hold:(a)for almost any 𝜏 [ 𝑗 1 , 𝑗 ) , the following state equations hold: 𝑑 𝐱 𝑗 ( 𝜏 ) = 𝑓 𝑑 𝜏 𝑖 𝑗 𝐱 𝑗 ( 𝜏 ) , 𝐮 𝑗 , ( 𝜏 ) ( 3 . 6 ) (b)for almost any 𝜏 [ 𝑗 1 , 𝑗 ) , the following costate equations hold: ̇ 𝜆 𝑗 𝜕 𝐿 = 𝑖 𝑗 𝜕 𝐱 𝑗 𝑇 𝜕 𝑓 𝑖 𝑗 𝜕 𝐱 𝑗 𝑇 𝜆 𝑗 , ( 3 . 7 ) (c)for a.e. 𝜏 [ 𝑗 1 , 𝑗 ) , 𝐻 𝑗 𝜆 𝑗 , 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 = 0 , ( 3 . 8 ) (d)minimality condition: for all 𝜏 [ 𝑗 1 , 𝑗 ) , m i n 𝐮 𝑗 𝐔 𝑖 𝑗 , 𝐬 𝑗 𝑖 𝑗 + 𝐻 𝑗 𝜆 𝑗 , 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 = 0 , ( 3 . 9 ) (e)transversality conditions for 𝜆 𝑗 , 𝜆 𝑗 + 1 ( 𝑗 ) = 𝛽 𝑗 𝜆 , 𝑗 = 1 , , 𝐾 1 , 𝑗 ( 𝑗 ) = 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑇 𝛼 𝑗 𝜕 𝜓 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑇 𝛽 𝑗 + 𝜕 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑇 𝜆 , 𝑗 = 1 , , 𝐾 1 , 𝐾 ( 𝐾 ) = 𝜕 𝜙 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑇 + 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑇 𝛼 𝐾 , ( 3 . 1 0 ) where 𝛼 𝑗 𝑖 , 𝛽 𝑗 𝑛 are Lagrangian multipliers. Based on Theorem 3.2, the sensitivity analysis is established in the next section for Optimal Problem B.

4. Sensitivity Analysis and Parametrization Method

For finding numerical solution to the OCPHS effectively, based on Theorem 3.2, the derivatives of the objective functional 𝐽 ( ) with respect to the control 𝐮 , 𝐬 , and the mode transition instant 𝑡 𝑗 , 𝑗 = 1 , , 𝐾 1 are established in this section, and by using the obtained derivatives associated parametrization method is proposed.

4.1. Sensitivity Analysis

Lemma 4.1. The derivatives of 𝐱 𝑗 ( 𝑗 ) , 𝑗 = 1 , , 𝐾 , w.r.t 𝑡 𝑘 and 𝐮 𝑘 are given, respectively, as follows for 𝑘 = 1 , , 𝐾 1 , 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 = 0 , 𝑗 = 1 , , 𝑘 1 , 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝑡 𝑘 = 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝑑 𝐱 𝑘 + 1 ( ( 𝑘 + 1 ) ) 𝑑 𝑡 𝑘 = Ω 𝑘 + 1 , 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 = 𝑗 𝑙 = 𝑘 + 2 Φ 𝑙 ( 𝑙 , 𝑙 1 ) 𝑑 𝜓 𝑖 𝑙 𝑑 𝐱 𝑙 1 ( ( 𝑙 1 ) ) Ω 𝑘 + 1 , 𝑗 = 𝑘 + 2 , , 𝐾 , ( 4 . 1 ) 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 = 0 , 𝑗 = 1 , , 𝑘 1 , 𝛿 𝐱 𝑘 ( 𝑘 ) 𝛿 𝐮 𝑘 = Γ 𝑘 ( 𝜏 ) , 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 = 𝑗 𝑙 = 𝑘 + 1 Φ 𝑙 ( 𝑙 , 𝑙 1 ) 𝑑 𝜓 𝑖 𝑙 𝑑 𝐱 𝑙 1 ( ( 𝑙 1 ) ) Γ 𝑘 ( 𝜏 ) , 𝑗 = 𝑘 + 1 , , 𝐾 , ( 4 . 2 ) where Ω 𝑘 + 1 = Φ 𝑘 + 1 ( 𝑘 + 1 , 𝑘 ) 𝑑 𝜓 𝑖 𝑘 + 1 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 , Γ ( 𝑘 ) 𝑘 𝑡 ( 𝜏 ) = 𝑘 𝑡 𝑘 1 Φ 𝑘 ( 𝑘 , 𝜏 ) 𝜕 𝑓 𝑖 𝑘 𝜕 𝐮 𝑘 , Φ 𝑙 ( 𝜏 , 𝑣 ) = e x p 𝜏 𝑣 𝑡 𝑙 𝑡 𝑙 1 𝜕 𝑓 𝑖 𝑙 𝜕 𝐱 𝑙 . 𝑑 𝑎 ( 4 . 3 )

Note that 𝐱 ( 𝑡 𝑗 ) is a functional vector of 𝐮 𝑘 , and the expression 𝛿 𝐱 𝑗 / 𝛿 𝐮 𝑘 is used, where the notation 𝛿 𝐱 𝑗 / 𝛿 𝐮 𝑘 is the functional derivatives which describe the response of the functional 𝐱 𝑗 to an infinitesimal change in the function 𝐮 𝑘 at each point.

Proof. The proof of (4.1) is only going to be shown for easily reading. The proof for (4.2) can be found in Appendix.
When 𝑗 = 1 , , 𝑘 1 , 𝐱 𝑗 ( 𝑗 ) and 𝐱 𝑗 + 1 ( 𝑗 ) are independent of 𝑡 𝑘 , and obviously 𝑑 𝐱 𝑗 ( 𝑗 ) / 𝑑 𝑡 𝑘 = 0 holds. In the case of 𝑗 = 𝑘 , 𝐱 𝑘 ( 𝑘 ) is a function of 𝑡 𝑘 which gives rise to 𝑑 𝐱 𝑘 ( 𝑘 ) / 𝑑 𝑡 𝑘 = 𝑓 𝑖 𝑘 ( 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) ) .
Case i. ( 𝑗 = 𝑘 + 1 ). In this case, 𝐱 𝑘 + 1 is a function of 𝑡 𝑘 and 𝐱 𝑘 + 1 ( 𝑘 ) , and we have 𝑑 𝐱 𝑘 + 1 ( 𝜏 ) 𝑑 𝑡 𝑘 = 𝜕 𝐱 𝑘 + 1 𝜕 𝑡 𝑘 + 𝜕 𝐱 𝑘 + 1 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) 𝜕 𝑡 𝑘 . ( 4 . 4 )
Note that in (4.4), 𝜕 𝐱 𝑘 + 1 / 𝜕 𝑡 𝑘 is produced by the perturbation of 𝑡 𝑘 , and ( 𝜕 𝐱 𝑘 + 1 / 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) ) ( 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) / 𝜕 𝑡 𝑘 ) is produced by the perturbation of 𝐱 𝑘 + 1 ( 𝑘 ) with respect to 𝑡 𝑘 . Obviously, for 𝜏 [ 𝑘 , 𝑘 + 1 ) , 𝜕 𝐱 𝑘 + 1 ( 𝜏 ) 𝜕 𝑡 𝑘 = 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 . ( 𝑘 ) ( 4 . 5 )
The solution to 𝜕 𝐱 𝑘 + 1 ( 𝜏 ) / 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) is given by 𝜕 𝐱 𝑘 + 1 ( 𝜏 ) 𝜕 𝐱 𝑘 + 1 𝑘 + 𝑡 = 𝐼 + 𝑘 + 1 𝑡 𝑘 𝜏 𝑘 𝜕 𝑓 𝑖 𝑘 + 1 𝜕 𝐱 𝑘 + 1 𝜕 𝐱 𝑘 + 1 ( 𝑣 ) 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) 𝑑 𝑣 . ( 4 . 6 )
Equation (4.6) is a linear system about 𝜕 𝐱 𝑘 + 1 / 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) . Define the state transition matrix Φ 𝑙 ( 𝜏 , 𝑣 ) by Φ 𝑙 ( 𝜏 , 𝑣 ) = e x p 𝜏 𝑣 𝑡 𝑙 𝑡 𝑙 1 𝜕 𝑓 𝑖 𝑙 ( 𝑎 ) 𝜕 𝐱 𝑙 ( , 𝑎 ) 𝑑 𝑎 ( 4 . 7 ) according to (4.6), and we have 𝜕 𝐱 𝑘 + 1 ( 𝜏 ) 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) = Φ 𝑘 + 1 ( 𝜏 , 𝑘 ) . ( 4 . 8 )
Thus, 𝑑 𝐱 𝑘 + 1 ( 𝜏 ) 𝑑 𝑡 𝑘 = Φ 𝑘 + 1 ( 𝜏 , 𝑘 ) 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) 𝜕 𝑡 𝑘 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 . ( 𝑘 ) ( 4 . 9 )
At transition instants 𝑡 𝑗 , since 𝐱 𝑗 + 1 ( 𝑗 ) = 𝜓 𝑖 𝑗 + 1 ( 𝐱 𝑗 ( 𝑗 ) ) , so 𝑑 𝐱 𝑗 + 1 ( 𝑗 ) 𝑑 𝑡 𝑘 = 𝑑 𝜓 𝑖 𝑗 + 1 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 , ( 4 . 1 0 ) which implies 𝜕 𝐱 𝑘 + 1 ( 𝑘 ) 𝜕 𝑡 𝑘 = 𝑑 𝜓 𝑖 𝑘 + 1 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝑡 𝑘 = 𝑑 𝜓 𝑖 𝑘 + 1 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) . ( 4 . 1 1 )
According to (4.9), and we have 𝑑 𝐱 𝑘 + 1 ( ( 𝑘 + 1 ) ) 𝑑 𝑡 𝑘 = Φ 𝑘 + 1 ( 𝑘 + 1 , 𝑘 ) 𝑑 𝜓 𝑖 𝑘 + 1 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) Ω 𝑘 + 1 . ( 4 . 1 2 )
Case ii. ( 𝑗 = 𝑘 + 2 , , 𝐾 ). When 𝑗 = 𝑘 + 2 , , 𝐾 , the following holds: 𝑑 𝐱 𝑗 ( 𝜏 ) 𝑑 𝑡 𝑘 = 𝑑 𝐱 𝑗 ( 𝑗 1 ) 𝑑 𝑡 𝑘 + 𝑡 𝑗 𝑡 𝑗 1 𝜏 𝑗 1 𝜕 𝑓 𝑖 𝑗 𝜕 𝐱 𝑗 𝑑 𝐱 𝑗 ( 𝑣 ) 𝑑 𝑡 𝑘 [ 𝑑 𝑣 , 𝜏 𝑗 1 , 𝑗 ) . ( 4 . 1 3 )
Then, 𝑑 𝐱 𝑗 ( 𝜏 ) 𝑑 𝑡 𝑘 = Φ 𝑗 ( 𝜏 , 𝑗 1 ) 𝑑 𝐱 𝑗 ( 𝑗 1 ) 𝑑 𝑡 𝑘 . ( 4 . 1 4 )
Substituting the term 𝑑 𝐱 𝑗 ( 𝑗 1 ) / 𝑑 𝑡 𝑘 in (4.14) by (4.10), we obtain 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 = 𝑗 𝑙 = 𝑘 + 2 Φ 𝑙 ( 𝑙 , 𝑙 1 ) 𝑑 𝜓 𝑖 𝑙 𝑑 𝐱 𝑙 1 ( ( 𝑙 1 ) ) Ω 𝑘 + 1 . ( 4 . 1 5 )

Theorem 4.2. The derivatives of the objective functional 𝐽 ( ) w.r.t 𝑡 𝑘 , 𝐮 𝑘 and 𝐬 𝑘 are given, respectively, as follows: 𝑑 𝐽 𝑑 𝑡 𝑘 = 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝐬 𝑘 ( 𝑘 ) 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) , 𝐬 𝑘 + 1 ( 𝑘 ) + 𝜆 𝑘 ( 𝑘 ) 𝑇 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) 𝜆 𝑘 + 1 ( 𝑘 ) 𝑇 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 𝛼 𝑇 𝐾 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 𝛿 𝐽 𝛿 𝐮 𝑘 = 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝐾 1 𝑗 = k 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 𝛼 𝑇 𝐾 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐮 𝑘 𝛿 𝐽 𝛿 𝐬 𝑘 = 𝜕 𝐻 𝑘 𝜕 𝐬 𝑘 . ( 4 . 1 6 )

Before proving Theorem 4.2, Lemma 4.3 is firstly given as follows.

Lemma 4.3. For 𝑗 = 𝑘 + 2 , , 𝐾 , 𝑑 𝑑 𝑡 𝑘 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 = 𝜆 𝑗 ( 𝑗 1 ) 𝑇 𝑑 𝐱 𝑗 ( 𝑗 1 ) 𝑑 𝑡 𝑘 𝜆 𝑗 ( 𝑗 ) 𝑇 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 . ( 4 . 1 7 )

Proof. For any 𝑗 = 𝑘 + 2 , , 𝐾 , we have 𝑑 𝑑 𝑡 𝑘 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 𝑑 𝑑 𝑡 𝑘 𝐻 𝑗 𝜆 𝑗 , 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝜆 𝑇 𝑗 𝑓 𝑖 𝑗 = 𝑑 𝜏 𝑗 𝑗 1 𝜕 𝐻 𝑗 𝜕 𝐱 𝑗 𝑑 𝐱 𝑗 𝑑 𝑡 𝑘 + 𝜕 𝐻 𝑗 𝜕 𝜆 𝑗 𝑑 𝜆 𝑗 𝑑 𝑡 𝑘 𝑑 𝜆 𝑗 𝑑 𝑡 𝑘 𝑇 𝑓 𝑖 𝑗 𝜆 𝑇 𝑗 𝑑 𝑑 𝑡 𝑘 𝑓 𝑖 𝑗 𝑑 𝜏 . ( 4 . 1 8 )
Since the following holds by Theorem 3.2, 𝜕 𝐻 𝑗 𝜕 𝐱 𝑗 𝑇 ̇ 𝜆 = 𝑗 , 𝜕 𝐻 𝑗 𝜕 𝜆 𝑗 𝑇 = 𝑓 𝑖 𝑗 , ( 4 . 1 9 ) then 𝑑 𝑑 𝑡 𝑘 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 ̇ 𝜆 j 𝑇 𝑑 𝐱 𝑗 𝑑 𝑡 𝑘 + 𝑓 𝑖 𝑗 𝑇 𝑑 𝜆 𝑗 𝑑 𝑡 𝑘 𝑑 𝜆 𝑗 𝑑 𝑡 𝑘 𝑇 𝑓 𝑖 𝑗 𝜆 𝑇 𝑗 𝑑 𝑑 𝑡 𝑘 𝑓 𝑖 𝑗 = 𝑑 𝜏 𝑗 𝑗 1 ̇ 𝜆 𝑗 𝑇 𝑑 𝐱 𝑗 𝑑 𝑡 𝑘 𝜆 𝑇 𝑗 𝑑 𝑑 𝑡 𝑘 𝑓 𝑖 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 𝑑 𝜆 𝑑 𝜏 𝑇 𝑗 𝑑 𝐱 𝑗 𝑑 𝑡 𝑘 𝑑 𝜏 = 𝜆 𝑗 ( 𝑗 1 ) 𝑇 𝑑 𝐱 𝑗 ( 𝑗 1 ) 𝑑 𝑡 𝑘 𝜆 𝑗 ( 𝑗 ) 𝑇 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 . ( 4 . 2 0 )

Obviously, when 𝑗 = 𝑘 , 𝑘 + 1 , we have 𝑑 𝑑 𝑡 𝑘 𝑘 𝑘 1 𝐿 𝑖 𝑘 𝐱 𝑘 , 𝐮 𝑘 , 𝐬 𝑘 𝑑 𝑑 𝜏 = 𝑑 𝑡 𝑘 𝑡 𝑘 𝑡 𝑘 1 𝐿 𝑖 𝑘 𝑥 , 𝑢 , 𝑠 𝑖 𝑘 𝑑 𝑡 = 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝐬 𝑘 ( 𝑘 ) , 𝑑 ( 4 . 2 1 ) 𝑑 𝑡 𝑘 𝑘 𝑘 + 1 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 , 𝐮 𝑘 + 1 , 𝐬 𝑘 + 1 𝑑 𝜏 = 𝜆 𝑘 + 1 ( 𝑘 ) 𝑇 𝑑 𝐱 𝑘 + 1 ( 𝑘 ) 𝑑 𝑡 𝑘 𝜆 𝑘 + 1 ( ( 𝑘 + 1 ) ) 𝑇 𝑑 𝐱 𝑘 + 1 ( ( 𝑘 + 1 ) ) 𝑑 𝑡 𝑘 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) , 𝐬 𝑘 + 1 ( . 𝑘 ) ( 4 . 2 2 )

Now we prove Theorem 4.2. We are only going to show 𝑑 𝐽 / 𝑑 𝑡 𝑘 for easily reading. The proofs for 𝛿 𝐽 / 𝛿 𝐮 𝑘 and 𝛿 𝐽 / 𝛿 𝐬 𝑘 can be found in Appendix.

Proof. 𝐽 ( 𝜃 , 𝐮 , 𝐬 ) can be formulated as 𝐱 𝐽 ( 𝜃 , 𝐮 , 𝐬 ) = 𝜙 𝐾 + ( 𝐾 ) 𝑘 1 𝑗 = 1 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 + 𝑑 𝜏 𝐾 𝑗 = 𝑘 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 + 𝐾 1 𝑗 = 1 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝐱 𝑗 ( 𝑗 ) . ( 4 . 2 3 )
Since 𝐿 𝑖 𝑗 ( ) is independent of 𝑡 𝑘 for 𝑗 = 1 , , 𝑘 1 , then 𝑑 𝐽 / 𝑑 𝑡 𝑘 can be obtained by 𝑑 𝐽 𝑑 𝑡 𝑘 𝐱 ( 𝜃 , 𝐮 , 𝐬 ) = 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 + 𝑑 𝑑 𝑡 𝑘 𝐾 𝑗 = 𝑘 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 + 𝐾 1 𝑗 = 1 𝜕 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 . ( 4 . 2 4 )
Substituting (4.17), (4.21), and (4.22) into (4.24), we have 𝑑 𝐽 𝑑 𝑡 𝑘 𝐱 ( 𝜃 , 𝐮 , 𝐬 ) = 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 + 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝐬 𝑘 ( 𝑘 ) 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) , 𝐬 𝑘 + 1 ( 𝑘 ) + 𝜆 𝑘 + 1 ( 𝑘 ) 𝑇 𝑑 𝐱 𝑘 + 1 ( 𝑘 ) 𝑑 𝑡 𝑘 + 𝜕 𝜑 𝑖 𝑘 𝑖 𝑘 + 1 𝜕 𝐱 𝑘 ( 𝑘 ) 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝑡 𝑘 𝐾 1 𝑗 = 𝑘 + 1 𝜆 𝑗 ( 𝑗 ) 𝑇 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 𝜆 𝑗 + 1 ( 𝑗 ) 𝑇 𝑑 𝐱 𝑗 + 1 ( 𝑗 ) 𝑑 𝑡 𝑘 𝜕 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 𝜆 𝐾 ( 𝐾 ) 𝑇 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 . ( 4 . 2 5 )
Due to Theorem 3.2 and (4.10), 𝑑 𝐽 / 𝑑 𝑡 𝑘 can be formulated by 𝑑 𝐽 𝑑 𝑡 𝑘 𝐱 ( 𝜃 , 𝐮 , 𝐬 ) = 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝜆 𝐾 ( 𝐾 ) 𝑇 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 + 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝐬 𝑘 ( 𝑘 ) 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) , 𝐬 𝑘 + 1 ( 𝑘 ) + 𝜆 𝑘 ( 𝑘 ) 𝑇 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝑡 𝑘 𝜆 𝑘 + 1 ( 𝑘 ) 𝑇 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) 𝛼 𝑇 𝑘 𝜕 𝑝 𝑖 𝑘 𝜕 𝐱 𝑘 ( 𝑘 ) 𝑑 𝐱 𝑘 ( 𝑘 ) 𝑑 𝑡 𝑘 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 = 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) , 𝐬 𝑘 ( 𝑘 ) 𝐿 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) , 𝐬 𝑘 + 1 ( 𝑘 ) + 𝜆 𝑘 ( 𝑘 ) 𝑇 𝑓 𝑖 𝑘 𝐱 𝑘 ( 𝑘 ) , 𝐮 𝑘 ( 𝑘 ) 𝜆 𝑘 + 1 ( 𝑘 ) 𝑇 𝑓 𝑖 𝑘 + 1 𝐱 𝑘 + 1 ( 𝑘 ) , 𝐮 𝑘 + 1 ( 𝑘 ) 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝑑 𝐱 𝑗 ( 𝑗 ) 𝑑 𝑡 𝑘 𝛼 𝑇 𝐾 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝑑 𝐱 𝐾 ( 𝐾 ) 𝑑 𝑡 𝑘 . ( 4 . 2 6 )

Note that when second-order derivatives are needed, there is no difficulty to obtain the second-order derivatives following the above procedure.

4.2. Parametrization Method

To obtain the numerical solution to optimal control for hybrid systems, continuous control profiles are parameterized on each mode active horizon in this section. Then the numerical solution to optimal controls can be computed based on the obtained sensitivity analysis results. The basic idea behind the proposed method using finite parameterizations of the controls is to transcribe the original infinite dimensional problem, that is, 𝐶 -problem, into a finite dimensional nonlinear programming problem, that is, 𝑃 -problem [25]. Here, the parametrization method that the control profiles are approximated by a family of Lagrange form polynomials is implemented.

Partition each horizon [ 𝑗 1 , 𝑗 ) into 𝑁 𝑗 elements as 𝑗 1 = 𝜏 𝑗 0 < 𝜏 𝑗 1 < < 𝜏 𝑗 𝑁 𝑗 = 𝑗 where 𝜏 𝑗 𝑙 are referred to as collocation points, 𝑙 = 0 , , 𝑁 𝑗 . Let 𝐮 𝑗 𝑙 denote the value of 𝐮 𝑗 at 𝜏 𝑗 𝑙 , 𝑙 = 0 , , 𝑁 𝑗 . Thus, the control variable 𝐮 𝑗 is represented approximately by a Lagrange interpolation profile for 𝑗 = 1 , , 𝐾 , 𝐮 𝑗 ( 𝜏 ) = 𝑁 𝑗 𝑙 = 0 ̂ 𝑙 𝑙 ( 𝜏 ) 𝑢 𝑗 𝑙 [ , 𝜏 𝑗 1 , 𝑗 ) , ( 4 . 2 7 ) where ̂ 𝑙 𝑙 ( 𝜏 ) = 𝑁 𝑗 𝑚 = 0 , 𝑚 𝑙 ( ( 𝜏 𝜏 𝑗 𝑚 ) / ( 𝜏 𝑗 𝑙 𝜏 𝑗 𝑚 ) ) . 𝐬 𝑗 is also parameterized by 𝐬 𝑗 ( 𝜏 ) = 𝑁 𝑗 𝑙 = 0 ̂ 𝑙 𝑙 ( 𝜏 ) 𝐬 𝑗 𝑙 [ , 𝜏 𝑗 1 , 𝑗 ) , ( 4 . 2 8 ) where 𝐬 𝑗 𝑙 is the value of 𝐬 𝑗 at the collocation points 𝜏 𝑗 𝑙 , 𝑙 = 0 , , 𝑁 𝑗 .

As a result, based on the obtained derivatives, the numerical solution of 𝑢 and 𝜃 to optimal control for the hybrid systems can be solved simultaneously and efficiently by adopting gradient-based algorithms as described in Xu and Antsaklis [3] and Egerstedt et al. [6]. Note that the derivatives are functions of costate 𝜆 𝑗 as formulated in Theorem 4.2. When control polynomial profiles are implemented, a multipoint boundary value problem about state and costate expressed by (3.6), (3.7), and (3.10) will be solved, which produces the derivatives.

Although the Lagrange interpolation profiles may cause the state or/and control trajectories violate their constraints, this parameterizations method has been proved useful in practice. Moreover, there are some techniques to decrease the defect [1, 2].

Remark 4.4. Control variable 𝐮 𝑗 can be approximated by several piecewise Lagrange interpolation profiles by further partitioning the element [ 𝑗 1 , 𝑗 ) . More detail of the parameterizations methods can be found in Vassiliadis et al. [1, 2], Kameswaran and Biegler [26], and the references therein. Only one Lagrange interpolation profile is used here to show the process of the proposed method.

5. Some Examples

To illustrate the effectiveness of the developed method, two examples with different situations are presented in the following. Numerical examples are conducted on an ThinkPad X61 2.10-GHz PC with 2G of RAM. The program is implemented using MatLab 7. The order of Lagrange polynomials in the examples is 3.

Example 5.1. The prototype of this example comes from Vassiliadis et al. [1]. The hybrid system consists of two batch reactors as shown in Figure 1. The first reactor denoted by mode 1 is fitted with a heating coil which can be used to manipulate the reactor temperature 𝑢 over time and is initially loaded with 0.1  m 3 of an aqueous solution of component 𝑥 1 of concentration 2000 mol/m3. This reacts to form components 𝑥 2 according to the consecutive reaction scheme 2 𝑥 1 𝑥 2 . After completion of the first reaction, an amount of dilute aqueous solution of component 𝑥 2 of concentration 600 mol/m3 is added instantaneously to the products of the first reactor, and the mixture is loaded into the second reactor denoted by mode 2 where the reaction 𝑥 2 𝑥 3 takes place under isothermal conditions at a fixed temperature. The decision variables are the temperature 𝑢 of the mode 1, and the durations of the two mode over the horizon [ 0 , 1 8 0 ] . The dynamics of the hybrid systems can be described by
Mode 1: ̇ 𝑥 1 = 0 . 0 8 8 8 𝑒 ( 2 5 0 0 / 𝑢 ) 𝑥 2 1 , ̇ 𝑥 2 = 0 . 0 4 4 4 𝑒 ( 2 5 0 0 / 𝑢 ) 𝑥 2 1 6 8 8 9 . 0 𝑒 ( 5 0 0 0 / 𝑢 ) 𝑥 2 , ̇ 𝑥 3 = 0 . ( 5 . 1 )
Mode 2: ̇ 𝑥 1 = 0 , ̇ 𝑥 2 = 0 . 0 7 𝑥 2 8 . 0 × 1 0 5 𝑥 2 2 , ̇ 𝑥 3 = 0 . 0 2 𝑥 2 , ( 5 . 2 ) with 𝑥 ( 0 ) = [ 2 0 0 0 0 0 ] 𝑇 . The system transits once at 𝑡 = 𝑡 1 ( 𝑡 0 < 𝑡 1 < 𝑡 𝑓 ) from mode 1 to 2 with 𝑥 1 ( 𝑡 1 ) = 𝑥 1 ( 𝑡 1 ) / 1 . 7 , 𝑥 2 ( 𝑡 1 ) = ( 𝑥 2 ( 𝑡 1 ) + 4 2 0 ) / 1 . 7 . The OCPHS is to find an optimal mode transition instant 𝑡 1 and an optimal input 2 9 8 𝑢 ( 𝑡 ) 3 9 8 , 𝑡 [ 𝑡 0 , 𝑡 1 ] , to maximize the cost functional m a x 𝑡 1 , 𝑢 𝑥 3 𝑡 𝑓 , ( 5 . 3 ) with 𝑥 3 ( 𝑡 𝑓 ) 1 5 0 must be satisfied.

581040.fig.001
Figure 1: Two batch reactors system.

By using the proposed method, the optimal mode transition instant is 𝑡 1 = 1 0 5 and the corresponding optimal cost is 𝐽 = 1 5 0 . 0 2 8 5 . The corresponding continuous control and state trajectories are shown in Figure 2. In Vassiliadis et al. [1], the transition instants and the optimal cost are 𝑡 1 = 1 0 6 , 𝐽 = 1 5 0 . 2 9 4 , respectively, which are solved by software package DAEOPT.

581040.fig.002
Figure 2: State trajectories and control input of Example 5.1.

Example 5.2. Example 5.2 comes from Xu and Antsaklis [3] and is also reconsidered by Hwang et al. [9]. Different from the example in the two references, the control constraint is imposed. The example can be referred to as autonomous switching hybrid systems with mode invariants. Consider the hybrid system consisting of
Mode 1: 1 1 ̇ 𝑥 = 1 . 5 0 0 1 𝑥 + 𝑢 , ( 5 . 4 )
Mode 2: 1 1 ̇ 𝑥 = 0 . 5 0 . 8 6 6 0 . 8 6 6 0 . 5 𝑥 + 𝑢 , ( 5 . 5 ) with 𝑥 0 = [ 1 1 ] 𝑇 . Assume that 𝑡 0 = 0 , 𝑡 𝑓 = 2 and the system transits once at 𝑡 = 𝑡 1 ( 𝑡 0 < 𝑡 1 < 𝑡 𝑓 ) from Mode 1 to 2 when the state trajectories intersect the linear manifold defined by 𝑚 ( 𝑥 ) = 𝑥 1 + 𝑥 2 7 = 0 . Mode 1 is active with its mode invariant 𝑥 1 + 𝑥 2 7 < 0 and Mode 2 is active with its mode invariant 𝑥 1 + 𝑥 2 7 > 0 . The OCPHS is to find an optimal mode transition instant 𝑡 1 and an optimal input 𝑢 ( 𝑡 ) [ 1 , 1 ] such that the cost functional 𝐽 𝑡 1 = 1 , 𝑢 2 𝑥 1 𝑡 𝑓 1 0 2 + 𝑥 2 𝑡 𝑓 6 2 + 𝑡 𝑓 𝑡 0 𝑢 2 ( 𝑡 ) 𝑑 𝑡 ( 5 . 6 ) is minimized.

By using the method developed here, the optimal mode transition instant is 𝑡 1 = 1 . 1 8 5 7 and the corresponding optimal cost is 𝐽 = 0 . 1 2 4 6 . The corresponding continuous control and state trajectories are shown in Figure 3. In Xu and Antsaklis [3], the transition instants and the optimal cost are 𝑡 1 = 1 . 1 6 2 4 , 𝐽 = 0 . 1 1 3 0 , respectively. The bad performance results from that the optimal control is approximated by polynomial.

581040.fig.003
Figure 3: State trajectories and control input of Example 5.2.

6. Conclusions

The optimal control problem for hybrid systems (OCPHS) with mode invariants and control constraints is addressed under a priori fixed mode transition order. By introducing new independent variables and auxiliary algebraic variables, the original OCPHS is transformed into an equivalent optimal control problem, and the optimality conditions for the OCPHS is stated. Based on the optimality conditions, the derivatives of the objective functional w.r.t control variables, that is, mode transition instant sequence and admissible continuous control functions, are established analytically. As a result, a control vector parametrization method is implemented to obtain the numerical solution by using gradient-based algorithms with the obtained derivatives. Compared with the existing results on the OCPHS with fixed mode transition order, the settings cover not only the control constraints but also the continuous states constraints, which makes the obtained results more general. Note that when no information about the mode transition sequence is known a priori, the discrete model methods formulated in Bemporad and Morari [27], Barton et al. [15], and Song et al. [28] seem appropriate. In addition, when uncertainties are considered in the systems, the reader is referred to Hu et al. [29] and the references therein.

Appendix

For any 𝜏 [ 𝑘 1 , 𝑘 ) , 𝑘 = 1 , , 𝐾 , let 𝐮 𝑘 ( 𝜏 ) 𝐔 𝑖 𝑘 be given and let 𝛿 𝐮 𝑘 ( 𝜏 ) 𝐔 𝑖 𝑘 be arbitrary but fixed. Define a perturbation of 𝐮 𝑘 as 𝐮 𝑘 ( 𝜏 ; 𝜀 ) = 𝐮 𝑘 ( 𝜏 ) + 𝜀 𝛿 𝐮 𝑘 ( 𝜏 ) , ( A . 1 ) where 𝜀 is arbitrarily small such that 𝐮 𝑘 ( 𝜏 ; 𝜀 ) 𝐔 𝑖 𝑘 . For the time being, assume that the other controls, 𝐮 𝑗 , 𝑗 = 1 , , 𝐾 , 𝑗 𝑘 , be given and fixed. For brevity, let 𝐱 𝑗 and 𝐱 𝑗 ( ; 𝜀 ) denote the state trajectories corresponding to 𝐮 𝑘 and 𝐮 𝑘 ( 𝜏 ; 𝜀 ) , respectively. Similarly, let 𝜆 𝑗 and 𝜆 𝑗 ( ; 𝜀 ) denote the costate trajectories corresponding to 𝐮 𝑘 and 𝐮 𝑘 ( 𝜀 ) , respectively, which are the solutions of the costate equations 𝐱 𝑗 ( ; 𝜀 ) = 𝐱 𝑗 ( ) + 𝜀 𝛿 𝐱 𝑗 𝜆 ( ) , 𝑗 ( ; 𝜀 ) = 𝜆 𝑗 ( ) + 𝜀 𝛿 𝜆 𝑗 ( ) . ( A . 2 )

Proof of (4.2) in Lemma 4.1. When 𝑗 = 1 , , 𝑘 1 , obviously in these cases 𝐱 𝑗 is independent of 𝐮 𝑘 , that is, 𝛿 𝐱 𝑗 ( 𝑗 ; 𝜀 ) = 0 , which leads to 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 = 0 , 𝑗 = 1 , , 𝑘 1 . ( A . 3 )
Case i ( 𝑗 = 𝑘 ). Since 𝛿 ̇ 𝐱 𝑘 = 𝑡 𝑘 𝑡 𝑘 1 𝜕 𝑓 𝑖 𝑘 𝜕 𝐱 𝑘 𝛿 𝐱 𝑘 + 𝜕 𝑓 𝑖 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 , ( A . 4 ) with 𝛿 𝐱 𝑘 ( 𝑘 1 ) = 0 , thus we have 𝛿 𝐱 𝑘 ( 𝑘 ) = 𝑘 𝑘 1 Φ 𝑘 𝑡 ( 𝑘 , 𝜏 ) 𝑘 𝑡 𝑘 1 𝜕 𝑓 𝑖 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 𝑑 𝜏 , ( A . 5 ) where Φ 𝑘 is the state transition matrix defined in Section 3. Based on the definition of functional derivative, there exists 𝛿 𝐱 𝑘 ( 𝑘 ) 𝛿 𝐮 𝑘 = 𝑡 𝑘 𝑡 𝑘 1 Φ 𝑘 ( 𝑘 , 𝜏 ) 𝜕 𝑓 𝑖 𝑘 𝜕 𝐮 𝑘 Γ 𝑘 ( 𝜏 ) . ( A . 6 )
Case (ii) ( 𝑗 = 𝑘 + 1 , , 𝐾 ). In this case, 𝛿 ̇ 𝐱 𝑗 𝑡 ( 𝜏 ; 𝜀 ) = 𝑗 𝑡 𝑗 1 𝜕 𝑓 𝑖 𝑗 𝜕 𝐱 𝑗 𝛿 𝐱 𝑗 [ , 𝜏 𝑗 1 , 𝑗 ) , ( A . 7 ) which gives rise to 𝛿 𝐱 𝑗 ( 𝑗 ; 𝜀 ) = Φ 𝑗 ( 𝑗 , 𝑗 1 ) 𝛿 𝐱 𝑗 ( 𝑗 1 ) . ( A . 8 )
At mode transition instant 𝑡 𝑗 , 𝑗 = 1 , , 𝐾 1 , 𝐱 𝑗 + 1 ( 𝑗 ) = 𝜓 𝑖 𝑗 + 1 ( 𝐱 𝑗 ( 𝑗 ) ) holds, which results in 𝛿 𝐱 𝑗 + 1 ( 𝑗 ) = 𝑑 𝜓 𝑖 𝑗 + 1 𝑑 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) . ( A . 9 )
Substituting (A.9) into (A.8), we obtain 𝛿 𝐱 𝑗 ( 𝑗 ) = 𝑗 𝑙 = 𝑘 + 1 Φ 𝑙 ( 𝑙 , 𝑙 1 ) 𝑑 𝜓 𝑖 𝑙 𝑑 𝐱 𝑙 1 ( ( 𝑙 1 ) ) 𝛿 𝐱 𝑘 ( 𝑘 ) . ( A . 1 0 )
According to the definition of functional derivative, we have 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 = 𝑗 𝑙 = 𝑘 + 1 Φ 𝑙 ( 𝑙 , 𝑙 1 ) 𝑑 𝜓 𝑖 𝑙 𝑑 𝐱 𝑙 1 ( ( 𝑙 1 ) ) Γ 𝑘 ( 𝜏 ) . ( A . 1 1 )
This completes the proof.

Before proving the 𝛿 𝐽 / 𝛿 𝐮 𝑘 in Theorem 4.2, Lemma A.1 is firstly given as follows.

Lemma A.1. For any 𝑗 = 𝑘 + 1 , , 𝐾 , 𝛿 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑖 𝑑 𝜏 = 𝜆 𝑗 ( 𝑗 1 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 1 ) 𝜆 𝑗 ( 𝑗 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 ) . ( A . 1 2 )

Proof. Note that 𝛿 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 = 𝛿 𝑗 𝑗 1 𝐻 𝑗 𝜆 𝑗 , 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝜆 𝑇 𝑗 𝑓 𝑖 𝑗 = 𝑑 𝜏 𝑗 𝑗 1 𝜕 𝐻 𝑗 𝜕 𝐱 𝑗 𝛿 𝐱 𝑗 + 𝜕 𝐻 𝑗 𝜕 𝜆 𝑗 𝛿 𝜆 𝑗 𝛿 𝜆 𝑗 𝑇 𝑓 𝑖 𝑗 𝜆 𝑇 𝑗 𝛿 𝑓 𝑖 𝑗 𝑑 𝜏 . ( A . 1 3 )
Since the following holds by Theorem 3.2: 𝜕 𝐻 𝑗 𝜕 𝐱 𝑗 𝑇 ̇ 𝜆 = 𝑗 , 𝜕 𝐻 𝑗 𝜕 𝜆 𝑗 𝑇 = 𝑓 𝑖 𝑗 , ( A . 1 4 ) therefore, 𝛿 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 ̇ 𝜆 𝑗 𝑇 𝛿 𝐱 𝑗 + 𝜆 𝑇 𝑗 𝛿 𝑓 𝑖 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 ̇ 𝜆 𝑗 𝑇 𝛿 𝐱 𝑗 + 𝜆 𝑇 𝑗 𝛿 ̇ 𝐱 𝑖 𝑗 𝑑 𝜏 = 𝑗 𝑗 1 𝑑 𝜆 𝑑 𝜏 𝑇 𝑗 𝛿 𝐱 𝑗 𝑑 𝜏 = 𝜆 𝑗 ( 𝑗 1 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 1 ) 𝜆 𝑗 ( 𝑗 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 ) . ( A . 1 5 )

Obviously, when 𝑗 = 𝑘 , we have 𝛿 𝑘 𝑘 1 𝐿 𝑖 𝑘 𝐱 𝑘 , 𝐮 𝑘 , 𝐬 𝑘 𝑑 𝜏 = 𝜆 𝑘 ( 𝑘 1 ) 𝑇 𝛿 𝐱 𝑘 ( 𝑘 1 ) 𝜆 𝑘 ( 𝑘 ) 𝑇 𝛿 𝐱 𝑘 ( 𝑘 ) + 𝑘 𝑘 1 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 𝑑 𝜏 . ( A . 1 6 )

Proof of 𝛿 𝐽 / 𝛿 𝐮 𝑘 in Theorem 4.2. 𝐽 ( 𝜃 , 𝐮 ( 𝜀 ) , 𝐬 ) can be rewritten by 𝐱 𝐽 ( 𝜃 , 𝐮 ( 𝜀 ) , 𝐬 ) = 𝜙 𝐾 + ( 𝐾 ) 𝑘 1 𝑗 = 1 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 + 𝑘 𝑘 1 𝐿 𝑖 𝑘 𝐱 𝑘 ( 𝜀 ) , 𝐮 𝑘 ( 𝜀 ) , 𝐬 𝑘 + 𝑑 𝜏 𝐾 𝑗 = 𝑘 + 1 𝑗 𝑗 1 𝐿 𝑖 𝑗 𝐱 𝑗 ( 𝜀 ) , 𝐮 𝑗 , 𝐬 𝑗 𝑑 𝜏 + 𝐾 1 𝑗 = 1 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝐱 𝑗 ( 𝑗 ) . ( A . 1 7 ) Applying a 𝛿 -operation to (A.17) leads to 𝛿 𝑑 𝐽 = 𝐽 ( 𝜌 , 𝐮 ( 𝜀 ) , 𝐬 ) | | | | 𝑑 𝜀 𝜀 = 0 = 𝐱 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐱 𝐾 ( 𝐾 ) + 𝑘 𝑘 1 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 + 𝑑 𝜏 𝐾 𝑗 = 𝑘 𝜆 𝑗 ( 𝑗 1 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 1 ) 𝜆 𝑗 ( 𝑗 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 ) + 𝐾 1 𝑗 = 1 𝜕 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) = 𝐱 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐱 𝐾 ( 𝐾 ) + 𝑘 𝑘 1 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 𝑑 𝜏 + 𝜆 𝑘 ( 𝑘 1 ) 𝑇 𝛿 𝐱 𝑘 ( 𝑘 1 ) 𝐾 1 𝑗 = 𝑘 𝜆 𝑗 ( 𝑗 ) 𝑇 𝛿 𝐱 𝑗 ( 𝑗 ) 𝜆 𝑗 + 1 ( 𝑗 ) 𝑇 𝛿 𝐱 𝑗 + 1 ( 𝑗 ) 𝜕 𝜑 𝑖 𝑗 𝑖 𝑗 + 1 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) 𝜆 𝐾 ( 𝐾 ) 𝑇 𝛿 𝐱 𝐾 ( 𝐾 ) . ( A . 1 8 )
Due to Theorem 3.2 and (A.9), 𝛿 𝐽 can be reformulated by 𝛿 𝐱 𝐽 = 𝜕 𝜙 𝐾 ( 𝐾 ) 𝜕 𝐱 𝐾 ( 𝐾 ) 𝜆 𝐾 ( 𝐾 ) 𝑇 𝛿 𝐱 𝐾 ( 𝐾 ) + 𝑘 𝑘 1 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 𝑑 𝜏 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) = 𝑘 𝑘 1 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝛿 𝐮 𝑘 𝑑 𝜏 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛼 𝑇 𝐾 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐱 𝐾 ( 𝐾 ) . ( A . 1 9 )
Then according to the definition of functional derivative, we have 𝛿 𝐽 𝛿 𝐮 𝑘 = 𝜕 𝐻 𝑘 𝜕 𝐮 𝑘 𝐾 1 𝑗 = 𝑘 𝛼 𝑇 𝑗 𝜕 𝑔 𝑖 𝑗 𝜕 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐱 𝑗 ( 𝑗 ) 𝛿 𝐮 𝑘 𝛼 𝑇 𝐾 𝜕 𝜗 𝜕 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐱 𝐾 ( 𝐾 ) 𝛿 𝐮 𝑘 . ( A . 2 0 )
Obviously, the functional derivative of 𝐽 with respect to 𝐬 𝑘 can be directly given by 𝛿 𝐽 𝛿 𝐬 𝑘 = 𝜕 𝐻 𝑘 𝜕 𝐬 𝑘 . ( A . 2 1 )
This completes the proof.

Acknowledgments

The authors would like to express their sincere thanks to Professor Michael Fu at the University of Maryland (College Park) for suggestions on this work. This work was supported by the NSF under grant 60974023 of China, the State Key Development Program for Basic Research of China (2012CB720503), and the Fundamental Research Funds for the Central Universities.

References

  1. V. S. Vassiliadis, R. W. H. Sargent, and C. C. Pantelides, “Solution of a class of multistage dynamic optimization problems. 1. Problems without path constraints,” Industrial and Engineering Chemistry Research, vol. 33, no. 9, pp. 2111–2122, 1994. View at Publisher · View at Google Scholar · View at Scopus
  2. V. S. Vassiliadis, R. W. H. Sargent, and C. C. Pantelides, “Solution of a class of multistage dynamic optimization problems. 2. Problems with path constraints,” Industrial and Engineering Chemistry Research, vol. 33, no. 9, pp. 2123–2133, 1994. View at Publisher · View at Google Scholar · View at Scopus
  3. X. Xu and P. J. Antsaklis, “Optimal control of switched systems based on parameterization of the switching instants,” IEEE Transactions on Automatic Control, vol. 49, no. 1, pp. 2–16, 2004. View at Publisher · View at Google Scholar
  4. P. J. Antsaklis and A. Nerode, “Special issue on hybrid control systems,” IEEE Transactions on Automatic Control, vol. 43, no. 4, 1998.
  5. P. E. Caines and Y.-J. Wei, “Hierarchical hybrid control systems: a lattice-theoretic formulation,” IEEE Transactions on Automatic Control, vol. 43, no. 4, pp. 501–508, 1998. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  6. M. Egerstedt, Y. Wardi, and H. Axelsson, “Transition-time optimization for switched-mode dynamical systems,” IEEE Transactions on Automatic Control, vol. 51, no. 1, pp. 110–115, 2006. View at Publisher · View at Google Scholar
  7. D. Flieller, P. Riedinger, and J. P. Louis, “Computation and stability of limit cycles in hybrid systems,” Nonlinear Analysis, vol. 64, no. 2, pp. 352–367, 2006. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  8. C. G. Cassandras and J. Lygeros, Stochastic Hybrid Systems, chapter 1, 2, 6, 7, Taylor & Francis, Boca Raton, Fla, USA, 2007.
  9. I. Hwang, J. Li, and D. Du, “A numerical algorithm for optimal control of a class of hybrid systems: differential transformation based approach,” International Journal of Control, vol. 81, no. 2, pp. 277–293, 2008. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  10. H. J. Sussmann, “A maximum principle for hybrid optimal control problems,” in Proceedings of the 38th IEEE Conference on Decision and Control (CDC '99), pp. 425–430, Phoenix, Ariz, USA, December 1999. View at Scopus
  11. M. S. Shaikh and P. E. Caines, “On the hybrid optimal control problem: theory and algorithms,” IEEE Transactions on Automatic Control, vol. 52, no. 9, pp. 1587–1603, 2007. View at Publisher · View at Google Scholar
  12. A. V. Dmitruk and A. M. Kaganovich, “The hybrid maximum principle is a consequence of Pontryagin maximum principle,” Systems & Control Letters, vol. 57, no. 11, pp. 964–970, 2008. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  13. A. Bensoussan and J. L. Menaldi, “Stochastic hybrid control,” Journal of Mathematical Analysis and Applications, vol. 249, no. 1, pp. 261–288, 2000. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  14. M. S. Branicky, V. S. Borkar, and S. K. Mitter, “A unified framework for hybrid control: model and optimal control theory,” IEEE Transactions on Automatic Control, vol. 43, no. 1, pp. 31–45, 1998. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  15. P. I. Barton, C. K. Lee, and M. Yunt, “Optimization of hybrid systems,” Computers and Chemical Engineering, vol. 30, no. 10–12, pp. 1576–1589, 2006. View at Publisher · View at Google Scholar · View at Scopus
  16. C. Seatzu, D. Corona, A. Giua, and A. Bemporad, “Optimal control of continuous-time switched affine systems,” IEEE Transactions on Automatic Control, vol. 51, no. 5, pp. 726–741, 2006. View at Publisher · View at Google Scholar
  17. H. Axelsson, Y. Wardi, M. Egerstedt, and E. I. Verriest, “Gradient descent approach to optimal mode scheduling in hybrid dynamical systems,” Journal of Optimization Theory and Applications, vol. 136, no. 2, pp. 167–186, 2008. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  18. E. R. Johnson and T. D. Murphey, “Second order switching time optimization for time-varying nonlinear systems,” in Proceedings of the 48th IEEE Conference on Decision and Control (CDC '09), pp. 5281–5286, Shanghai, China, 2009. View at Publisher · View at Google Scholar · View at Scopus
  19. S. A. Attia, V. Azhmyakov, and J. Raisch, “On an optimization problem for a class of impulsive hybrid systems,” Discrete Event Dynamic Systems, vol. 20, no. 2, pp. 215–231, 2010. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  20. F. Taringoo and P. E. Caines, “Gradient geodesic and Newton geodesic HMP algorithms for the optimization of hybrid systems,” Annual Reviews in Control, vol. 35, pp. 187–198, 2011.
  21. X. Xu and P. J. Antsaklis, “A dynamic programming approach for optimal control of switched systems,” in Proceedings of the 39th IEEE Confernce on Decision and Control (CDC '00), pp. 1822–1827, Sydney, Australia, December 2000. View at Scopus
  22. K. Gokbayrak and C. G. Cassandras, “Hybrid controllers for hierarchically decomposed systems,” in Proceedings of the Hybrid System Control Confernce, pp. 117–129, March 2000. View at Scopus
  23. L. S. Jennings, K. L. Teo, V. Rehbock, and W. X. Zheng, “Optimal control of singular systems with a cost on changing control,” Dynamics and Control, vol. 6, no. 1, pp. 63–89, 1996. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  24. C. Song and P. Li, “Near optimal control for a class of stochastic hybrid systems,” Automatica, vol. 46, no. 9, pp. 1553–1557, 2010. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  25. R. Findeisen, L. Imsland, F. Allgöwer, and B. A. Foss, “State and output feedback nonlinear model predictive control: an overview,” European Journal of Control, vol. 9, no. 2-3, pp. 190–206, 2003. View at Publisher · View at Google Scholar · View at Scopus
  26. S. Kameswaran and L. T. Biegler, “Simultaneous dynamic optimization strategies: recent advances and challenges,” Computers and Chemical Engineering, vol. 30, no. 10–12, pp. 1560–1575, 2006. View at Publisher · View at Google Scholar · View at Scopus
  27. A. Bemporad and M. Morari, “Control of systems integrating logic, dynamics, and constraints,” Automatica, vol. 35, no. 3, pp. 407–427, 1999. View at Publisher · View at Google Scholar · View at Zentralblatt MATH
  28. C. Song, B. Wu, and P. Li, “A hybrid model-based optimal control method for nonlinear systems using simultaneous dynamic optimization strategies,” Journal of Process Control, vol. 22, no. 5, pp. 852–860, 2012.
  29. J. Hu, Z. Wang, H. Gao, and L. K. Stergioulas, “Probability-guaranteed H finite-horizon filtering for a class of nonlinear time-varying systems with sensor saturation,” System and Control Letters, vol. 61, no. 4, pp. 477–484, 2012.