Abstract

We propose a new robust optimal control strategy for flexible spacecraft attitude tracking maneuvers in the presence of external disturbances. An inverse optimal control law is designed based on a Sontag-type formula and a control Lyapunov function. An adapted extended state observer is used to compensate for the total disturbances. The proposed controller can be expressed as the sum of an inverse optimal control and an adapted extended state observer. It is shown that the developed controller can minimize a cost functional and ensure the finite-time stability of a closed-loop system without solving the associated Hamilton-Jacobi-Bellman equation directly. For an adapted extended state observer, the finite-time convergence of estimation error dynamics is proven using a strict Lyapunov function. An example of multiaxial attitude tracking maneuvers is presented and simulation results are included to show the performance of the developed controller.

1. Introduction

Optimal control for spacecraft rotational problems has attracted a great deal of interest. The problem of optimal attitude control has been studied by many researchers (see, e.g., [14]). The main objective of optimal attitude control is to design a controller that stabilizes the attitude of a spacecraft system to an equilibrium state and minimizes some performance criterion for the stabilization process. Generally for linear systems, the linear quadratic regulator (LQR) is able to ensure an optimal and asymptotically stable solution. In [5] the LQR method was extended to a nonlinear control problem but more constraints were required to meet the optimality and stability conditions. Various nonlinear optimal control methods have been proposed for solving the attitude control problem. Sharma and Tewari [6] devised a Hamilton-Jacobi formulation for tracking attitude maneuvers of spacecraft to derive a nonlinear optimal control law. In [7] optimal controllers for a programmed motion of a rigid spacecraft were designed using the optimal Lyapunov approach. In [8, 9] state-dependent Riccati equation (SDRE) techniques were successfully applied to spacecraft attitude control. In [10] a class of globally asymptotically stabilizing controllers was developed for the complete attitude motion of a nonsymmetric rigid body. An inverse optimal control approach was presented in [11] to construct the optimal controller for regulation of the rigid body. Recently, attitude controller designs for rigid spacecraft using inverse optimal control schemes have been developed [12]. The inverse optimal control method incorporates the task of solving a Hamilton-Jacobi-Bellman equation and offers a globally asymptotically stabilizing control law which is optimal with respect to a performance index. Sontag's formula [13] uses the directional information supplied by a control Lyapunov function (CLF). Freeman and Kokotovic [14] have shown that every CLF solves the Hamilton-Jacobi-Bellman (HJB) equation associated with a meaningful cost. In other words, if we have a CLF for a nonlinear system, we can compute the resulting optimal control law without solving the HJB equation.

As extensions of the above studies, optimal control and robust control have been merged to obtain robust optimal control laws. Various methods for developing robust optimal controllers for the attitude control of a rigid spacecraft have been proposed in the literature. Kang [15] used nonlinear control to design a stabilizing feedback control for the spacecraft tracking problem. Luo et al. [16] developed an inverse optimal adaptive controller for attitude tracking of spacecraft. Adaptive control and nonlinear control have also been merged to design robust optimal controllers. Park [17] proposed a robust optimal control scheme for attitude stabilization and used a minimax approach and inverse optimal approach to examine the optimality property of this control law. Due to its lower dependence on model information and its strong capability for estimating disturbance and simple structure, an extended state observer (ESO) [18] has been widely used to deal with various kinds of engineering control problems such as flight control and chemical process control. In [19] the ESO based disturbance rejection control approach has been addressed for attitude tracking of a rigid spacecraft. The vibration effect of flexible appendages was not considered in the attitude control design. An alternative way to design a robust optimal controller is to use an optimal sliding mode controller design scheme. Sliding mode control (SMC) is a very effective approach when applied to a system with disturbances which satisfy the matched uncertainty condition [20]. Pukdeboon and Zinober [21, 22] have developed robust optimal control laws based on the optimal sliding mode control technique for attitude tracking of spacecraft. However, since the attitude control system of flexible spacecraft is quite complicated, optimal sliding mode control has rarely been studied for attitude control design. SDRE-based optimal sliding mode (SM) control and optimal Lyapunov-based SM control approaches have been used in [23] to design optimal controllers for attitude stabilization of flexible spacecraft. However, the optimal controllers developed in that paper contain some drawbacks. The SDRE approach usually provides only local asymptotic stability while for the optimal Lyapunov approach it is a formidable task to choose a Lyapunov function to satisfy the partial differential equation derived from the Krasovskii theorem [24].

In this paper, a novel control methodology for flexible spacecraft attitude maneuvers is proposed in the presence of external disturbances. First, an inverse optimal controller for stabilizing systems is designed based on a Sontag-type formula [25, 26] and a finite-time control Lyapunov function (FTCLF) [27, 28]. Then, the total disturbance is estimated by an adapted ESO which is a modified version of the traditional ESO in [18, 29]. The stability of the traditional ESO [18] has been proved using the self-stable region (SSR) approach [30], but this approach takes many steps and is rather complicated. In this paper the finite-time convergence of an adapted ESO is proved by using a strict Lyapunov function. The proposed new attitude controller for flexible spacecraft enforces tracking motion, robustness, and optimality with respect to a family of cost functionals and achieves disturbance rejection.

The main contributions of this paper are as follows.(i)An inverse optimal control method for flexible spacecraft attitude tracking maneuvers is proposed for the first time in this paper.(ii)A second-order sliding mode based disturbance compensator is developed and combined with the proposed attitude controller. The necessity of a compensator is also discussed.(iii)The hybrid control method is used to develop a controller with complete robustness under the system uncertainty and external disturbances.

This paper is organized as follows. Section 2 introduces some preliminary results which are required for the following discussion. In Section 3 the dynamic equations of a flexible spacecraft and the attitude kinematics [31, 32] are described. Section 4 provides the problem statement and control objective. Section 5 proposes a finite-time inverse optimal control design with the FTCLF concepts. We design an inverse optimal controller that provides the convergence of system states to the desired attitude motion. In Section 6, an adapted ESO method is used to develop an anti-disturbance feedback controller. The finite-time convergence of estimation error dynamics is proved using a strict Lyapunov function. In Section 7, an example of spacecraft attitude maneuvers is presented to illustrate the performance of the developed control law. In Section 8, we present conclusions.

2. Mathematical Preliminaries

2.1. Finite-Time Stability

We now restate the concepts related to finite-time stability [33, 34].

Definition 1 (see [33]). Consider a time invariant system in the form of where is continuous on an open neighborhood of the origin. The equilibrium of the system is (locally) finite-time stable if (i) it is asymptotically stable, in , an open neighborhood of the origin, with and (ii) it is finite-time convergent in ; that is, for any initial condition , there is a settling time such that every solution of system (1) is defined with for and satisfies and , if . Moreover, if , the origin is globally finite-time stable.

Definition 2. Consider a controlled system with . It is finite-time stabilizable if there is a feedback law such that is a (locally) finite-time stable equilibrium of the closed-loop system.

Lemma 3 (see [34]). Consider the nonlinear system described in (1). Suppose there is a function defined on a neighborhood of the origin such that the following conditions hold: (i) is positive definite on and (ii) there are real numbers and , such that Then, the origin of system (1) is locally finite-time stable, with its settling time If and is radially unbounded, then the origin of system (1) is globally finite-time stable.

2.2. Finite-Time Control Lyapunov Function

Based on the definition of a finite-time control Lyapunov function (FTCLF) in [27, 28], we provide the following definitions.

Definition 4 (see [27, 28]). Let be a neighborhood of the origin. A positive-definite and function is called a finite-time control Lyapunov function (FTCLF) of system (3) if there exist real numbers and such that for all , where and are the Lie derivatives of along the solution of system (3).

Note that the Lie derivative of with respect to is defined as the inner product of and the gradient of ; that is, .

Obviously, if (6) holds, then for all , the condition that is a FTCLF of (3) is precisely the statement

In fact, if there exists a positive-definite and function such that (7) holds, then is a FTCLF of system (3). Note that if is a continuous positive-definite function, then and can be seen as Dini derivatives.

Definition 5 (see [27]). is said to satisfy the extended small control property with respect to system (3) if, for all , there exists such that if and , then there exists a certain with , such that , where   and  .

2.3. The Inverse Optimal Control Problem

This subsection considers the design of finite-time inverse optimal control. A feedback control law for the system (3) will be designed such that the closed-loop system is finite-time stable at the equilibrium and minimizes the cost functional where and for all and is the settling time function.

According to the theorem in [11, 27], if is a FTCLF for the system (3), then a stabilizing control law solves an inverse optimal control problem for the system (3) with respect to the cost functional (8).

Moreover, if we choose then is achieved with the control law . Next, substituting into (10), we obtain the HJB equation Then, using the concept in [27], we have that is the solution of the HJB equation (11).

Remark 6. In the inverse optimal approach, a finite-time stabilizing feedback control law is designed first and then it is shown that the feedback law is to find and such that optimizes (8). The problem is inverse because the functions and are a posteriori found by the stabilizing feedback law, rather than a priori selected by the designer.

3. Mathematical Model of Flexible Spacecraft

We now briefly explain the use of quaternions for description of the attitude error. We define the quaternion with and where is the desired reference attitude. The quaternion for the attitude error is with . Using the multiplication law for quaternions, we then obtain subject to the constraint In (13) is given by

Remark 7. A quaternion consists of the scalar and the three-dimensional vector , so it has four components. The scalar term is used for avoidance of singular points in the attitude representation [35]. The quaternion kinematics equation is required to be solved for all four components. However, to indicate the maneuver of the spacecraft, it is sufficient to use only the vector because this vector properly represents both Euler axis and Euler angle. Furthermore, the scalar can be calculated easily using the vector and the condition . For more details of quaternion and other attitude representation see [31, 35].

The kinematic equation for the attitude error can then be expressed as (see, [31, 36]) where with being the identity matrix.

The equation governing a flexible spacecraft is expressed as [32]where is the total inertia matrix of the spacecraft, is the modal displacement, and is the coupling matrix between the central rigid body and the flexible attachments. denotes the control input, represents the external disturbances, and and denote the stiffness and damping matrices, respectively, which are defined as with damping and natural frequency .

We denote by the desired angular velocity and by as the angular velocity error. Let where . The relative dynamic equation can be written as [36] where is the first time derivative of . The matrices , , and are given as Clearly, is a Hurwitz matrix.

If we let then the spacecraft systems (16), (17a), and (17b) can be expressed in the state space form as where

4. Problem Statement

In this work we consider tracking maneuvers. The control objective is to realize desired rotations of flexible spacecraft in the presence of external disturbances and minimize a cost functional. In other words, we shall find a controller subject to (24) such that for all initial conditions the desired rotations are achieved and the cost functional (8) is minimized. Note that, when , we have , due to the constraint relation .

5. Inverse Optimal Controller Design for Flexible Spacecraft

In this section, we first propose a finite-time inverse optimal controller for stabilizing the complete attitude motion of flexible spacecraft in the presence of external disturbances.

In order to design this controller for solving the finite-time inverse optimal problem we first choose a FTCLF for the system (24) as the following candidate positive-definite function: where and are nonnegative constants and is a positive-definite matrix that is a solution of the Lyapunov equation with a positive-definite matrix .

Assumption 8. The desired angular velocity vector and its first time derivative are bounded and satisfy the following conditions: where and are positive constants.

We next show that the function defined in (28) is a FTCLF for the system (24) by using the following lemma.

Lemma 9. In the absence of disturbance vector, under Assumption 8, the positive definite defined in (28) is a FTCLF for the spacecraft tracking system (24).

Proof. Since is symmetric positive definite, we can write as where , and
The conditions for to be positive definite are Also, using (28) we can obtain the following inequalities: where denotes the Euclidean norm of and and denote the minimum and maximum singular values of the matrix .
The first time derivative of can be obtained as Thus, we have Therefore, if , we have
Next, we show that for all , if , then with being a positive constant. Here, can be expressed as Substituting (37) in (38), we obtain Since , (39) becomes where With proper parameters and , one can obtain that is positive definite From (33) we have . Using , one has and it follows that . Thus, (42) becomes where .
We know that there exist proper parameters and such that is positive definite and can be achieved with these parameters. So, if , then for all we can obtain . This guarantees that the candidate is a FTCLF for system (24).

Next, the main results of our proposed anti-disturbance inverse optimal control for the spacecraft model are presented.

Theorem 10. Let Assumption 8 hold. The following dynamic feedback control law with
where is a positive constant, is an estimated value of which will be defined later, , and , finite-time stabilizes the spacecraft system (24).

Proof. We show that the control law in (44) is a stabilizing controller for attitude control system in (24). Consider the smooth positive-definite radially unbounded function in (28) as the Lyapunov function. The time derivative of the is It has been shown that if converges to , then converges to zero in finite time. This means that an appropriate can be selected such that is achieved. By Lemma 3, the finite-time stability of closed-loop system is ensured. This completes the proof.

Next, we show that if the disturbance estimate in (44) and disturbance term in (24) are ignored, then the feedback stabilizing controller in (44) solves the inverse optimal control problem.

Theorem 11. Consider the system (24) for which Assumption 8 holds. Then the following dynamic feedback control law solves the inverse optimal assignment problem for the attitude tracking system (24) by minimizing the cost functional (8).

Proof. With , using the control law one obtains . Letting we also obtain . Next, choosing it can be ensured that . This shows that is positive semidefinite in , , and . Therefore this is a meaningful cost function for the attitude control problem, penalizing on , , and , as well as the control effort . Substituting into the cost functional (8), we obtain the optimal cost for every .

Remark 12. It should be noted that most existing inverse optimal attitude control approaches have been developed for attitude motions of a rigid spacecraft [16, 17]. For a flexible spacecraft the vibration of flexible appendages induced by an orbiting attitude slewing operation may degrade the attitude pointing accuracy. In this paper, we develop an inverse optimal attitude maneuver controller for a flexible spacecraft system which is significantly different from the controllers in [16, 17], where the effects of vibration of flexible appendages were not considered.

6. Anti-Disturbance Inverse Optimal Control with Extended State Observer

Due to the great advances in nonlinear control theory, the observer-based controller is now one of the most common schemes in industrial applications. The extended state observer (ESO) mentioned in [18, 28] has high efficiency in accomplishing nonlinear dynamic estimation. We know that use of the ESO dynamics of the observer error gives convergence into a residual set of zero. The convergence proof has been shown using the SSR approach [30]. However, this method takes many steps and is quite complicated. In this paper an adapted ESO which is a modified version of the traditional ESO is presented and the finite-time stability of the adapted ESO system is investigated using the strict Lyapunov function.

We now consider the coordinate transformation of the spacecraft model into the following form: where is a positive constant. The time derivative of is Substituting (14) and (17a) into (51), we obtain the auxiliary dynamics which can be rewritten as where Here, the new disturbance variable is introduced. Although we need to estimate in (24), it is simpler to first estimate and then use the results to estimate . Thus, we now consider the estimate for in the ESO design.

We next consider the auxiliary system (53) with the adapted ESO technique. The ESO views the system model uncertainties and external disturbances as an added state to be estimated. Using this idea, a nonlinear ESO can be designed for estimating the disturbances . We add an extended state to the state equations to represent the total disturbances . The system (53) then becomes where the function is the estimated derivative of the disturbances .

Assumption 13. The th component of is bounded; that is, , .

Then the adapted ESO for the system (53) is proposed to be as follows: where is the estimation error of the ESO, and are the observer output, , and with and being the observer gains. Here, the function is defined as with .

Theorem 14. Let Assumption 13 hold. Consider the system (55) with the adaptive ESO (56). Then there exist positive observer gains , , and and such that the estimated states and finite-time converge into a residual set of actual states and , respectively.

Proof. To investigate the stability of the ESO system, one must consider an expression for the observer dynamics. We first define the observer errors and . The observer error dynamics are also expressed as
Letting and the observer error dynamics (58) can be transformed to the scalar form as We define . To prove the stability, we select the Lyapunov function where The matrix is positive definite if and are chosen to satisfy the condition Taking the time derivative of and using (60), we obtain which can be further written as After some manipulation, the derivative of can be written as follows: where Letting , one obtains Using (60) we know that . We obtain If , the error system (59) will finite-time converge to the region
It is obvious that the estimation errors are determined by the parameters , , and . Basically, these parameters can be chosen such that (62) is satisfied. This makes the error states converge to region (69) in finite time.

Remark 15. It should be noticed that conditions for the stability of the adapted ESO (56) have been obtained in terms of positive gains ,  , and and in (59) for the estimation errors. When suitable gains are chosen, will be a precise estimate of and the estimation error will converge to region (69) in finite time.

Using the results from the ESO system, the estimated disturbance is determined by . Thus, the proposed anti-disturbance inverse optimal control can be obtained as With suitable control gains defined by the inverse optimal control approach based on FTCLF concept, the optimal feedback controller (70) contains both optimality and robustness performance to attenuate external disturbances.

7. Simulation Results

An example of attitude control of flexible spacecraft is presented with numerical simulations. The performance of our proposed controller (70) is compared with the performances of the optimal Lyapunov sliding mode (SM) controller (4.3) in [23] and robust finite-time controller in [37]. The spacecraft is assumed to have the nominal inertia matrix [38] and coupling matrices respectively. The first four elastic modes that have been considered in the model used for simulating a spacecraft are , , , and with damping , , , and . The initial states of the rotation motion are given by , and . For the FTCLF (29), the parameters and are set as and . The adapted ESO parameters are selected as , , , and . The attitude control problem is considered in the presence of external disturbance . The external disturbances are described as and the desired angular velocity tracking is given by together with

Note that the optimal Lyapunov SM controller (4.3) in [23] was designed for optimal attitude stabilization. To apply this controller to the tracking problem, we define the total disturbance as and use it as the disturbance in the spacecraft model (24). Also, the corresponding parameters in the optimal Lyapunov SM controller (4.3) in [23] are selected. In this paper, to apply controller in [37] to the attitude control problem we use quaternions instead of Euler angles to describe the attitude of the flexible spacecraft. For controller in [37], the control parameters were selected as , , , , and .

Simulation studies have been performed to test all controllers. Figures 1 and 2 show the performance of controller (4.3) in [23]. The responses of quaternion and angular velocity tracking errors reach zero after 40 seconds. The components of angular velocity tracking error vector are smooth. From Figure 3 it can be seen that the optimal Lyapunov SM controller (4.3) in [23] stabilizes the closed-loop system of flexible spacecraft. As shown in Figures 4 and 5 the modal displacements () converge to the neighborhood of zero.

On the other hand Figure 6 shows that controller (70) provides good trajectories of the quaternion error and they reach zero in about 50 seconds. Similarly, from Figure 7 it can be seen that the angular velocity tracking errors reach zero after 80 seconds. The responses of angular velocity tracking errors at the first 20 seconds are smoother when compared with those obtained from controller (4.3) in [23]. As shown in Figure 8 the control torques obtained by controller (70) are quite smooth even though the external disturbances are taken into account. The responses of modal displacements shown in Figures 9 and 10 converge to a smaller region around the zero when compared to controller (4.3) in [23]. Figures 1115 show the simulation results of controller in [37]. As shown in Figures 11 and 12 quaternion and angular velocity tracking errors converge to zero in about 10 seconds. Figure 13 depicts the control torques which approximate the harmonic curves. The responses of modal displacements for controller in [37] are shown in Figures 14 and 15.

A comparison of the simulation results obtained by control law (4.3) in [23], our proposed controller (70), and controller in [37] shows the following. It can be seen that our proposed control law (70) provides smoother attitude velocity tracking error responses and control input responses than those of controller (4.3) in [23]. It can be seen that the vibration of flexible appendages of the spacecraft is obviously reduced by applying the proposed controller (70) rather than through the use of (4.3) in [23]. Next, we compare the performance of (70) and controller in [37]. Although tracking responses obtained by our proposed control law (70) converge more slowly to zero than in [37], smaller values of control torques are required in (70) than in (29). Our proposed control law is based on the inverse optimal control concept that minimizes the performance index, so it usually requires smaller values of control torques. In addition, the proposed controller gives smoother control torque signals. This shows that our controller (70) achieves better disturbance rejection performance than in [37]. In view of these simulation results, controller (70) seems to give the best overall control for practical inverse optimal attitude tracking control of a flexible spacecraft.

8. Conclusion

We have studied a finite-time anti-disturbance inverse optimal controller design of attitude tracking of a flexible spacecraft in the presence of external disturbances. The concepts of inverse optimal control and the FTCLF have been employed to develop a novel finite-time inverse optimal attitude tracking control law. An adapted ESO has been designed by modifying the structure of the traditional ESO. The finite-time convergence of an adapted ESO has been proven using the strict Lyapunov function. A finite-time anti-disturbance inverse optimal controller can be expressed as the sum of a finite-time inverse optimal control and adapted ESO. It has been shown that the developed controller solves the inverse optimal control problem and converges to the reference attitude states in finite time. An example of multiaxial attitude maneuver is presented and simulation results are given and compared with the results from controller (4.3) in [23] and controller in [37] to verify the usefulness of the developed controller.