Abstract

Considering recurrent optimization process in model predictive control (MPC), the model uncertainties and disturbances terms in the missile’s guidance and control model can degrade recursive feasibility, and there are control mutation problems in common MPC algorithm. This paper presents a disturbance rejection model predictive control algorithm for missile integrated guidance and control (IGC). Firstly, a sliding mode observer (SMDO) is designed to estimate the unknown disturbances caused by target maneuvering. Secondly, the method of optimizing control increment is adopted in MPC to avoid the phenomenon of control mutation in the model calculation. By limiting the control increment in each cycle, it ensures the continuity of the control input. Thirdly, by combining the SMDO and MPC, an IGC algorithm is presented, and the stability of the algorithm is proved by using Lyapunov stability theory. Finally, the simulation results with different impact angles verify the effectiveness of the proposed algorithm for intercepting maneuver target.

1. Introduction

In the traditional missile guidance and control system design, the guidance loop and the control loop are designed separately, which leads to the time delay problem. And, at the end of engagement, the two loops couple fiercely, which leads to instability of guidance loop and causes large miss distance. At present, the difficulties and key problems in the integrated design of guidance and control can be summarized in these two aspects: (1) The integrated system of guidance and control has a higher order and more state variables in the mathematical model. And it exists a certain coupling between these state variables. (2) Mathematical model of guidance and control system has a lot of uncertainties. The aerodynamic parameters and the aerodynamic torque parameters are complex function of Mach number, atmosphere density, angle of attack, and deflection of rudder. However, there are uncertainties in the actual atmospheric density, wind speed, and target maneuver. Therefore, how to deal with the uncertainties in the system is also the key problem that requires to be considered in the integrated design of guidance and control system.

In the past decades, integrated guidance and control has been extensively studied in flight vehicle guidance and control fields. References [13] designed the integrated control law based on optimal control method. Reference [1] designed a three-dimensional guidance law using a stable numerical state-dependent coefficient (SDC) method. And the Ricatti equation was solved via the state-dependent Ricatti equation (SDRE) method. Compared with the SDRE, Hamilton-Jacobi-Bellman (HJB) equation was solved by using the method [2], which can achieve an approximate suboptimal solution. In Ref. [3], polynomial expansion of the force and torque coefficients on their respective parameters (angle of attack, sideslip angle, and rudder deflection angle) was utilized; the equation of state can be transformed into an affine linear equation dependent on the state. However, the main problems of these optimal integrated control laws are as follows: in order to get the solvable form of optimal control, the model needs more approximation and simplification; how to solve the optimal control when there are uncertainties; how to select the optimal index weight and so on. In Refs. [48], sliding mode control (SMC) and active disturbance rejection control (ADRC) were used to cope with model uncertainties and external disturbances in the guidance and control system. References [4, 5] considered multiple layers SMC. In Ref. [6], an integral sliding mode surface which can shorten response time was designed without considering the constraints of variables. The bound of the uncertain dynamics was considered in Refs. [7, 8] against stationary targets. Although the uncertainties of flight control dynamics were considered in the design of guidance law in Refs. [48], but in fact, the guidance and control were designed separately. References [919] combined backstepping control (BC) and robust control to design IGC algorithms. In Ref. [9], adaptive neural networks were trained online to compensate for unknown uncertainties were proposed. In Ref. [10], an adaptive nonlinear IGC method was proposed, which made the line-of-sight angular rate stabilized under the uncertainties of target maneuvering and missile model. Reference [11] proposed an IGC algorithm based on adaptive fuzzy sliding mode control, which is robust to system uncertainties and external disturbances. In Ref. [12], a simple adaptive block dynamic surface control algorithm was proposed, which can reduce computation cost. Combined with the extended state observer, the three-dimensional control of dynamic surface control algorithm was proposed in Refs. [1315] to obtain more stable control input. In Ref. [16], an IGC method based on nonsingular terminal dynamic surface control was proposed to obtain better control effect without singular phenomenon. In Ref. [17], the systematic backstepping technique is adopted in IGC law design, and a new smooth second-order sliding mode differentiator is developed to estimate the derivatives of the virtual control laws. In Ref. [18], IGC law is developed by combining backstepping sliding mode approach. An adaptive fuzzy approximator is employed to approximate the system uncertain functions, and a sliding mode is used to compensate the effects of approximate error and disturbances. In Ref. [19], the sliding mode control method and nonlinear extended disturbance observer technique were combined to derive the control accuracy of the impact angle under the nonlinear disturbances. In Refs. [919], backstepping control method was utilized to realize integrated guidance and control, but there is still intermediate delay in this method. References [2024] studied model predictive control-based guidance and control of missile. Only ideal model of missile dynamics was used to design model predictive control algorithm. In Refs. [2022], the target maneuver and other disturbances terms had not been fully considered. Reference [23] proposed a neural network to optimize model predictive control, but considered the acceleration of target as a disturbance. A robust model predictive control was proposed in Ref. [24] against a stationary target. In Refs. [2024], the conditions of target and disturbance are not considered in the actual environment.

Based on the above analysis, to solve the recursive feasibility and control continuity of MPC-based IGC considering the indeterminate and disturbance terms in system, this paper proposes a new IGC algorithm by combining SMDO and MPC. Firstly, considering the uncertainties and interferences problem in recurrent optimization, a new sliding mode observer (SMDO) is introduced and compensates the uncertainties and disturbances in the system to solve the recursive problem in MPC-based IGC. Secondly, this paper adopts the method that optimizes control increment in MPC instead of control input and limits the control increment in each cycle to ensure the continuity of the control command. In brief, based on the new continuous MPC and SMDO composite algorithm, the integrated missile guidance and control algorithm is proposed, and this method is verified under various simulation cases. Simulation results demonstrate that the proposed algorithm is superior to the standard model predictive control. The main contributions of this paper are as follows: (1)A new sliding mode observer (SMDO) is designed to improve the anti-interference capability and algorithm’s stability without measuring the target’s motion information and then solves the recursive problem in optimization(2)An improved model predictive control algorithm (MPC) that optimizes the control increments is proposed instead of the standard model predictive control that optimizes the control command. The proposed MPC algorithm limits the control increment in each cycle and ensures the continuity of the control command, thus avoiding the control mutation in system(3)Combining the model predictive control and sliding mode observer, a new composite MPC algorithm is proposed. The stability of proposed algorithm is proved. And the simulation results show that the proposed MPC method improves the performance of the closed system compared with the standard MPC

The rest of this paper is orgnized as follows: in Section 2, the model of IGC is established; in Section 3, the model predictive control and SMDO-based IGC method is proposed; in Section 4, a new disturbance observer is designed; in Section 5, the stability of the method is proved. Section 6 is the results of simulation. Finally, Section 7 is the conclusion.

2. Integrated Dynamic Model

2.1. Nonlinear Kinematics Model

Suppose that the missile and the target move in the pitch plane; then, the relative motion of missile and target is shown in Figure 1, where and represent the missile and the target, respectively. denotes the relative distance between missile and target, denotes the line-of-sight angle between missile and target (LOS angle). Suppose that the missile and target have constant velocities and , and their accelerations are and , respectively. The flight path angles during flight are and .

The relative motion of the missile and the target is modeled as follows:

Differentiate Equation (1), and it yields the following:

Since the and according to the assumption that the missile and the target have constant velocities, then we have . Equation (2) can be simplified as follows:

2.2. Missile Dynamics Model

The missile dynamics in the longitude plane is modeled as follows: where is the attack angle, is the mass of missile, is gravity constant, is the moment of inertia about the axis, is the pitch angle, is the pitching rate, is the deflection angle of elevator for pitch control, is the lift force, is the control contribution to the angular acceleration, represents the angular acceleration contributions from all other sources, the denotes pitch moment derivative with respect to , and represents the pitch moment derivative with respect to .

The lift and moment terms are defined as follows: where is the dynamic pressure, which equals to , is the missile reference area, is the reference length, is the lift force derivative with respect to , and represents the pitch moment derivatives with respect to .

According to Equation (4), we have ; then, Equation (3) can be rewritten as follows:

Define as , respectively, and set . From (4)–(6), the integrated equation of guidance and control can be derived as where and are defined as follows:

The disturbance contains the unknown target acceleration and the time-varying term relative to system states, while the represents the time-varying disturbance caused by the uncertainties of aerodynamic parameters and gravity, where denotes the disturbance due to aerodynamic uncertainties.

Based on the control problem formulated above, we will design a composite MPC scheme that consists of the nominal input and the estimated disturbances, i.e., to offset the effect caused by the disturbances, where is the estimation of and is obtained by minimizing the cost function of MPC. The idea is shown in Figure 2.

Definition 1. The impact angle is expressed as , which means the angle between the velocity vector of missile and the velocity vector of target at impact time, so . When the interception is realized, it can be obtained that , and the following equation can be obtained from Equation (1).

According to Equation (9), there is always a unique expected terminal line-of-sight angle for a predetermined target with an expected impact angle. Therefore, the control problem of the impact angle can be transformed into the control of the final line-of-sight angle.

Substituting into Equation (9), then it can be given that

From (10) and (11), it can be obtained that

Finally, the control objective is to design a control law that enables the missile to impact the maneuvering target with the required impact angle .

3. IGC Design Based on Improved Model Predictive Control

3.1. Design SMDO for Unknown Disturbances

In this section, we consider the case that the disturbances are unknown. Compared with the first-order sliding mode observer, the second-order sliding mode observer (SOSM) [25] or higher-order sliding mode observer (HOSM) [26] significantly reduces the undesired chattering phenomenon, which is very valuable in practical applications. Based on the previous analysis, in this section, we propose a new SMDO based on SOSM and adaptive control method to estimate target maneuver.

An enhanced nonlinear disturbance observer is proposed to estimate the disturbance of the IGC system. The nonlinear observer is designed as follows: where denotes the estimated value of disturbance , is the internal state of the nonlinear observer, and are the gain coefficient of the nonlinear observer, and .

Set the error as follows:

Similarly, the estimated value is designed as follows: where denotes the estimated value of disturbance , is the internal state of the nonlinear observer, and and are the gain coefficient of the nonlinear observer, and set the error as follows:

Design the sliding mode surface as follows:

And, the reaching law of sliding mode surface is designed as follows:

In order to prove the convergence of the estimated error, firstly define a Lyapunov function:

Then, differentiate (19), and it can be obtained that

Due to , it can be given that

Since , the sliding mode surface is stable based on Lyapunov’s theorem. And then, according to the finite time convergence theory [27], it can be concluded that the estimated error converges to zero in finite time. Finally, the nonlinear observers (13) and (15) are also finite-time stable, which means that the estimated error converges to zero in finite time.

Remark 2. The gain coefficients of the nonlinear observer should be positive constants. And, it needs to be selected carefully to limit the estimated error.

3.2. Linearization of Systems

Since both kinematics model and dynamics model are nonlinear, they should be linearized to utilize linear model predictive control method. Compared to nonlinear model predictive control, linear model predictive control has better real-time performance and is easier to analyze and calculate. The standard MPC algorithms in [22, 23] optimize control command directly. But in this section, the method that optimizes control increment is adopted.

According to the integrated model, the state variables and control variables are as follows:

The corresponding Jacobian matrix is as follows:

The linear time-varying model is derived as follows: where where are the initial values of .

Then, the discrete linear model is obtained as follows: where denotes the current step and denotes the next step. And the output of the system is as follows:

In order to solve control mutation problem of MPC, the method that optimizes control increment in MPC instead of control command is adopted. It is assumed that where is the time. Then, a new state equation can be obtained as follows: where

Through continuous iteration, it can be given that where is the predictive time domain, and the larger the is, the better the predictive effect produces, but more calculation cost will take. Finally, the state and output of the system in the future time are expressed as follows:

Then, it can be simplified as follows: where

The state and output in the predictive time domain can be obtained from the current state quantity and the control increment of the system, which is the predictive function in the model predictive control algorithm. The schematic is presented in Figure 3.

4. MPC with Disturbance Compensation

The purpose of receding horizon optimization is to find the optimal control solution, which is an online optimization. It recalculates the current error at each time and finally achieves desired control performance by optimizing one or some cost functions. Therefore, receding horizon optimization may not obtain the global optimal solution, but it can respond to the state at each time to achieve local optimal. The idea of MPC is presented as Figure 4.

Therefore, it is necessary to design appropriate cost function to obtain the best possible control sequence. The cost function can be written as a quadratic function of state and control input. where and are the weight matrices, and they are positive definite matrices which satisfy . The first term in (35) is to tuning the tracking performance of the reference trajectory, and the second term is the constraint on control consumption.

In MPC method, the missile needs to follow the reference trajectory and satisfies the constraints of states. Therefore, the following aspects should be considered when designing the cost function: (1) maximum limits on angle of attack, (2) maximum limits on pitching rate, (3) maximum limits on elevator deflection angle and the increment, and (4) terminal state of line-of-sight angular rate.

Remark 3. Considering practical application limitations, such as strapdown seeker’s field of view, maneuverability of missile, and limits of actuator’s response, appropriate constraints and parameters should be selected carefully.

After receding horizon optimization, control increment sequence is obtained in each control cycle.

The first control increment of the sequence is applied to the system as the actual control input increment. where is the control input to the system at current time. At the next control time, the system predicts the output in the next period according to the new state information and then obtains a new set of control increments through optimization. Next, repeat until the whole control process is completed. The MPC algorithm is summarized in the following Algorithm 1.

Algorithm: MPC methodology
1: Obtain the initial variables and state matrices of the system through the initial state of the missile.
2: Discrete the original system to obtain (26).
3: Take the iteration processing to obtain (33), and the is the number of iterations.
4: The optimization objective function (35) is determined by (33), and then, the quadratic programming is performed under constraints (37)–(41).
5: Finally, a set of control increments (42) is obtained, and the is imposed on the original system to obtain the minimizing control sequence . And the estimate of the disturbance is solved based on SMDO.
6: Apply the composite control signal to the real system.
7: Update the time of again and return to the first step to restart.

Remark 4. It can be seen from the above algorithm that the states of the system are updated by the actual states of each step, and this problem needs to be solved online. This strategy obtains an optimal control about the current state. Therefore, this method has the same advantages as the online optimization method.

5. Stability Proof

Assumption. Define as a terminal cost, denotes the terminal constraint set, and denotes the time-varying control set. Suppose that satisfy the following conditions:

A1: . is closed-loop, and ( is the state, and state constraint satisfied in ).

A2: ( is the control, and control constraint satisfied in ).

A3: ( is the disturbance, is a disturbance sequence, and is a function about ).

A4: ( represents the change due to increment of , and denotes a phase cost function).

If is a Lyapunov function with robust control, based on these assumptions, one can obtain the following: The above equation holds for all appropriate sets of and all . Therefore, according to [28], the above conditions can finally ensure the stability of the system.

Notation. For any function , represents the change of state from to , and it can be defined as follows:

Proof. Since the system in this paper has complex characteristics such as nonlinear, time-varying, and constrained, the system needs to be discretized firstly and then prove the stability.
First, determine a phase cost function , and then, the discrete system is obtained according to Equation (22). where is a function about . Then, the above equation can be rewritten as follows: When the policy is used, the following specific cost function form is defined as follows: where and . According to (50), the function is defined as where is a set of constrained values for disturbances. The optimal control is now defined as follows: where is a set that satisfies the conditions of optimal control.
To simplify the stability argument, is now assumed. where setting the , , and it can be obtained that where , , and the equation satisfies all , , and.
Next, in order to determine and , Equation (22) is rewritten as follows: where , , , and its realization of the stage cost is follows: Define the stage cost function as follows: When the data is appropriate, especially when is large enough, and utilizing the optimal control strategy , then the following Lyapunov function can be selected as the cost function: Set the cost function to satisfy the following: where all the and satisfy the above equation. If the function is smooth enough, then the has the following set: And satisfying the following inequalities, where all the are satisfied. From the above assumption, the satisfy the A1-A4 conditions, so it can be obtained that where it satisfies all the . Therefore, it is finally proved that the system is stable.☐

Remark 5. It should be emphasized that the stability proof conditions of MPC are limited, which means that the state variables should change within a certain range according to different cases. Therefore, the system is stable only when the set conditions are satisfied. Due to the particularity of the MPC model and the difference of state matrix at each time, this paper only needs to prove the stability of the initial time. And the proof method at the later time is the same, which only needs to change within the boundary.

6. Numerical Simulation

6.1. Simulation Parameters

The initial conditions of missile and target are as follows: the initial positions are set as and; the velocities are and ; the initial flight path angles are set as and ; the initial pitch angle is set as ; the initial pitching rate is set as ; the initial distance and line of sight angle between missile and target are and , respectively. Coefficients of disturbance observer are selected as , , , and . The nominal aerodynamic parameters which are used to generate lift and pitch torque are set as follows:

6.2. Simulation Results

For comparison, two other control algorithms are introduced in the simulation. The first one is the standard model predictive control in literature [20]. The standard MPC usually optimizes control input, not the control increment, which means the control was obtained from the receding horizon optimization of Equation (18). The second one is the reference value, which comes from an integral sliding mode control algorithm. The method is introduced briefly as follows:

According to literature [6], an integral sliding mode surface with all state variables is designed as follows: where , and they are all constants. It can be obtained from Equations (7) and (64).

According to the design method of reaching law, it can be designed as follows:

According to Equations (65) and (66), the control law is obtained. where the coefficients are , , , and .

Three cases are presented to demonstrate the superiority of the proposed IGC algorithm as follows:

Case 1. Target interception with unknown target’s maneuvering.
In this case, assume that the target is maneuvering with acceleration . The LOS angular rate obtained by these two algorithms is given in Figure 5(b). It can be observed that under the standard MPC law, the LOS angular rate diverges at the end of the terminal guidance phase, which results in large LOS angle tracking errors. By contrast, due to introducing SMDO, it achieves high control accuracy under the same conditions. Meanwhile, it can also be seen from Figure 5(b) that the response trend of the two algorithms is closing when  s, which means that the proposed MPC algorithm preserves the performance of the standard MPC algorithm. Figures 5(c) and 5(d) also indicate that the attack angle and pitch angle of the SMDO-MPC algorithm diverge slower than standard MPC. Furthermore, Figures 5(e) and 5(f) indicate that the proposed method needs smaller control input and has satisfactory disturbance estimation performance.

Case 2. Comparison of proposed algorithm with different predictive steps.
To evaluate the influence of predictive level on algorithm’s performance, three different predictive step parameters are used in the simulation, that is, s, 35 s, and 50s. In this case, the parameters of SMDO are designed the same with Case 1. The LOS angle, the attack angle, and the pitch angle are given in Figures 6(b)6(d). From these figures, we can see that those with larger predictive steps have faster convergence speed and more accurate tracking performance. But it may lead to larger guidance and control errors. So it needs to select the appropriate predictive steps. It can be concluded that the predictive step level can significantly affect the performance of integrated guidance and control. Therefore, under certain conditions, this parameter can be designed according to the performance requirement.

Case 3. Validation of SMDO-MPC under different impact angle constraints.
Assume that the target is maneuvering with acceleration , and the impact angles and . Under the proposed IGC law, the simulation results are shown as presented in Figures 7(a) and 7(b). The results indicate that the proposed IGC law has satisfactory performance, and both the LOS angle and trajectory control are within an acceptable range. It means that the proposed IGC law is within a reasonable range of impact angles, and the impact angle constraint is achieved. Figure 7(e) indicates that there is an acceptable field-of-view angle in simulation. Therefore, the proposed IGC law can intercept maneuvering targets with impact angle constraints.

7. Conclusion

To enhance the recursive feasibility of standard MPC-based IGC, this paper proposes an improved MPC-based IGC algorithm by combining the SMDO with MPC. Firstly, an SMDO is designed to estimate disturbance caused by target maneuvering and model uncertainties. And in order to avoid the phenomenon of control mutation in the model calculation, this paper adopts the method of optimizing control increment in MPC and limits the control increment in each cycle, which ensures the continuity of the control command. Then, the new designs of SMDO and MPC are combined into a new composite controller. And the stability of the system is proved by utilizing Lyapunov stability theory. Simulation results indicate that the proposed IGC method has satisfactory strike performance, which ensures interception precision under unknown target maneuvering, and achieves the desired terminal impact angle. Compared with the standard MPC, the proposed method needs smaller rudder deflection input. In the future, the composite algorithm needs to be further analyzed in satisfying more constraints and the stability needs to be proved strictly.

Data Availability

The processed data required to reproduce these findings cannot be shared at this time as the data also forms part of an ongoing study.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by the National Natural Science Foundation of China under Grant No. 62103049.