Abstract

For various threats in the enemy defense area, in order to achieve covert penetration and implement effective combat against enemy, the unmanned aerial vehicles formation needs to be reconfigured in the process of penetration; the mutual collision avoidance problems and communication constraint problems among the formation also need to be considered. By establishing the virtual-leader formation model, this paper puts forward distributed model predictive control and finite state machine formation manager. Combined with distributed cooperative strategy establishing the formation reconfiguration cost function, this paper proposes that adopting the revised quantum-behaved particle swarm algorithm solves the cost function, and it is compared with the result which is solved by particle swarm algorithm. Simulation result shows that this algorithm can control multiple UAVs formation autonomous reconfiguration effectively and achieve covert penetration safely.

1. Introduction

Multiple UAVs formation control has become an important research direction in the field of UAV control in recent years [18], whether in military applications or in civil applications, and it has attracted a lot of attention. Compared with executing the task by single UAV, multiple UAVs formation has significant advantages in terms of time consumption, project selection, task completion rate, and so on. However, as the battlefield environment or formation task changes, the formation or loose degree of UAVs formation needs to be changed, namely, formation reconfiguration. Therefore, it is very important to research a kind of fast, safe, effective, and applicable multiple UAVs formation reconfiguration method [914].

Multiple UAVs formation reconfiguration has been widely researched in domestic and foreign regions currently. In order to solve the problem of autonomous formation reconfiguration under the threat of known radar and missile, the literature [15, 16] proposed a method that achieves the location optimal configuration of different load UAVs by the plan, reaction, and parameters, then reaches the goal which is the inference of the enemies’ radar and missile cooperatively, and ensures the overall safety of the formation, but it did not put forward the effective formation reconfiguration control method for the UAV model. The optimal time control problem of the fixed terminal state constraints has been discussed in detail by the literature [17, 18]. The optimal time control problem of free terminal state constraints has been discussed in [1921], but all of them do not consider the distance constraint and collision avoidance constraint. The mathematical description of UAV has been described by the literature [22], and it adopts the optimal control and differential countermeasure theory to analyze whether the external aircraft can influence the collision avoidance among UAVs of the formation. The literature [23] takes artificial potential field method to control the UAV formation so that the UAV formation can avoid the collision among them, but the design of potential field is very complicated, it is only suitable for the collision avoidance of the fixed formation, and it is difficult to be applied to the process of formation reconfiguration. The literature [24] adopts DMPC to solve the collision avoidance problem among UAVs in three-dimensional space, but this method can only calculate the path of collision avoidance, and it is not suitable for the formation problem with clear goal state. The literature [25] maps the formation onto large rigid graph; when the formation passes the obstacle zone, the big rigid graph will be divided into many small rigid graphs, the formation passes the obstacle zone by subformation, and this method can only analyze this problem from the angle of system stability, but it does not put forward effective control strategy. In order to solve the multiple UAVs search problem, the literature [26] takes the method of distributed model predictive control that transforms the centralized multiple UAVs optimization decision problem on line to each UAV small scale distributed optimization problem and then adopts the algorithm which is based on particle swarm optimization and Nash optimization achieves iterative solution for each subsystem optimization problem. Based on optimal trajectory generator coupled with a modified sliding controller for tracking the trajectory and avoiding collisions, the literature [27] accomplishes a UAV formation reconfiguration control scheme with autonomous collision avoidance system for application in 3D space. When facing the uncertainties and obstacles, the literature [28] adopts the learning based model predictive control (LBMPC) to solve the formation reconfiguration problem for a group of cooperative UAVs forming a desired formation. The literature [29] proposes a distributed linear MPC approach to solve the trajectory planning problem for rotary-wing UAVs, and the simulation results show that this method is valid. Under the constraints of terminal status and of control action energy, the literature [30] puts forward a novel algorithm which is pigeon-inspired optimization to solve the problem of multiple unmanned aerial vehicles formation reconfiguration; compared with particle swarm optimization, this algorithm is better.

This paper elaborates taking distributed model predictive control (DMPC) and finite state machine (FSM) to solve the formation reconfiguration control for multiple UAVs formation which are the mathematic model of UAV, formation model, and different threat constraints and then puts forward revising quantum-behaved particle swarm algorithm (RQPSO) to solve the cost function of DMPC problem, and the particle swarm optimization algorithm solution is compared. The simulation result shows that the algorithm can accomplish formation configuration quickly and efficiently and then achieve covert penetration.

The rest of the paper is organized as follows. Section 2 introduces the UAV motion model and formation model. Main environment threats and constraints will be described in Section 3. The concept of distributed model predictive control and the formation reconfiguration control description is provided in Section 4. Then finite state machine formation management unit and the solution of reconfiguration problem which is revised in quantum particle swarm optimization algorithm are, respectively, presented in Sections 5 and 6. The simulation results and experimental explication are shown in Section 7. Finally, the conclusion and future work are reported in Section 8.

2. UAV Formation Reconfiguration Model

2.1. UAV Motion Model

Assuming that there are UAVs in the UAV formation, then the UAV formation set can be expressed as . All the UAVs can keep the flying height constant; therefore, the UAV centroid motion model after discretization can be described as [31, 32] are, respectively, the coordinate, velocity, and track azimuth of UAV in the earth coordinate system. are, respectively, speed command and track azimuth command of UAV. are, respectively, velocity time constant and track roll angle time constant. is sampling period.

Take the state variable of UAV in time as and the control variable of UAV in time as ; then the motion equation of the UAV can be simplified as (2), and the constraint condition can be expressed as (3).

Assuming that there is a known reference trajectory so that the UAV formation can follow with it, the reference trajectory satisfies the following [33]: are, respectively, coordinates and azimuth angle of reference trajectory in the earth coordinate system. are, respectively, velocity and angular velocity of reference trajectory. Both and are piecewise continuous and uniformly bounded, and they satisfy the following constraint equation:

2.2. Formation Model

The control strategy of multiple UAVs formation mainly includes the following three: leader-follower method, virtual-leader method, and behavior control method [3438]. Because the leader-follower method has the error transmission problem, and it is hard to establish the mathematic model for the behavior control method, this paper selects the virtual-leader method. Assuming there exists a reference point which UAVs can follow it, and all UAVs in the formation can achieve the trajectory of reference point in advance or through wireless communication in real-time and keep the relative distance and angle with reference point, there is no reference error among the UAVs. The trajectory reference point coordinate system is fixedly connected to the reference point (Figure 1); thus, we can obtain the desired position of all UAVs in the formation. are, respectively, the relative distance between the desired position of and trajectory reference point in time.

3. Description of Formation Reconfiguration

3.1. Threat Description

In the process of penetration multiple UAVs formation will face a variety of threats, this paper takes the early warning radar, short-range air defense radar, and no fly zone (including fixed obstacles, fixed threats, and mobile threats), assuming that all the UAVs in formation are able to detect enemy early warning radar and air defense radar, and they carry jamming equipment which is used to interfere in the early warning radar and air defense radar.

3.1.1. Threat of Early Warning Radar

As shown in Figure 2, assuming that the early warning radar’s position is , the early warning radar jamming equipment position is , the effective interference distance of early warning radar jamming equipment is , and the max detection radium of early warning radar is , when a certain UAV is interfering the early warning radar, its detection radium can be described as [39, 40]

3.1.2. Threat of Air Defense Radar

As shown in Figure 3, assuming that the air defense radar position is , the air defense radar jamming equipment position is , the position of is , the effective interference distance of air defense radar jamming equipment is , the max detection radium and angle of air defense radar are, respectively, and , then the condition which UAVs can be in the safe zone is [39, 40]

3.1.3. Nonfly Zone

Once multiple UAVs formation encounters nonfly zones or obstacles, they can only be able to keep away from them, as shown in Figure 4; in order to simplify the problem, the nonfly zones or obstacles are replaced by their minimum circumnavigations in this paper; assuming that the minimum circumnavigations of nonfly zone position and radium are, respectively, and , then the condition which UAVs can be in the safe zone is

3.2. Constraint Condition Construction of UAVs Formation

On the one hand, because UAVs formation faces a variety of threats, only to avoid these threats, the UAVs formation can achieve covert penetration, reach the designated zone, and implement effective attack on enemy targets. On the other hand, because UAVs formation needs to avoid collision among the UAVs in the formation and maintain normal communication in real-time in the process of flight, it only keeps a certain distance between one UAV and another UAV, and then the formation can fly safely. Based on the above situation, the constraints condition can be constructed as follows.

3.2.1. Threat Constraint of Early Warning Radar

Assuming that there are enemy early warning radars detected by UAVs in the formation in time, their position can be expressed as , and the detection radium after interference is , according to Figure 2, the UAV in the formation can ensure that it is not found by the early warning radar only if it is beyond the early warning radar detection distance after interference; thus, we can construct the threat constraint condition when UAV faces the early warning radar as follows:

3.2.2. Threat Constraint of Air Defense Radar

Assuming that there are enemy air defense radars detected by UAVs in the formation intime, their position can be expressed as , and the safe distance and angle cosine after interference are, respectively, and , according to Figure 3, the UAV in the formation can ensure that it is not found by air defense radar only if it is in the safe zone; thus, we can construct the threat constraint condition when UAV faces the air defense radar as follows:

3.2.3. Threat Constraint of Nonfly Zone

Assuming that there are nonfly zone detected by UAVs in the formation in time, and the minimal circumnavigation of nonfly zone can be expressed as , according to Figure 4, the UAVs in the formation can ensure that they are safe only if they do not fly across the nonfly zones; thus, we can construct the threat constraint condition when UAV faces the nonfly zone as follows:

3.2.4. Collision Free Constraint

In order to ensure that UAVs in the formation avoid collision, the distance between every two UAVs in the formation needs to be greater than the safety distance ; thus, we can construct the distance constraint condition as follows:

3.2.5. Communication Distance Constraint

Multiple UAVs formation needs to receive many commands or information during missions, like the command from the upper, change task, all UAVs’ state information, and so on, so they must keep normal communication in real-time, and owing to the communication distance of device being limited, so it is necessary to ensure that the distance between two UAVs is less than the communication security distance ; thus, we can construct the distance constraint condition as follows:

4. Distributed Model Predictive Control (DMPC) Strategy

4.1. Model Predictive Control (MPC)

Model predictive control is also called receding horizon control; its basic principles can be summed up in three aspects: prediction model, rolling optimization, and feedback correction. Since MPC is a finite time domain optimization, not a global optimization strategy, its performance of anti-interference and robustness are very strong [24]. The process of MPC algorithm is shown in Figure 5.

The typical MPC algorithm can be simply described as follows:(1)Assuming that the system state is at time, thus we can forecast the future output of the system based on the past input.(2)Select a certain performance index to solve problem online; then the system open loop optimal control input sequence in prediction of time domain is .(3)Select the first item of the system open loop optimal control input sequence as the input of time , namely, , and then act in the system.(4)Repeat the above steps in the new time domain at time.

4.2. Structure of Multiple UAVs Formation Reconfiguration Based on DMPC

The reconfiguration controller needs to optimize all the control inputs in traditional centralized formation reconfiguration control method, the calculation is very large, and it has no scalability [39]. Because these UAVs in the formation are independent, from the view of dynamic characteristics, they are decoupling, so formation reconfiguration can be achieved by the necessary information; thus, the DMPC control architecture can be adopted. Every UAV will be equipped with a MPC controller which can also achieve the information interaction under DMPC control architecture; thus, the overall formation behavior can be described by all subsystems, so the state equation of formation system can be described as [26, 41, 42]Further, the overall cost function of formation can be expressed as

are, respectively, the step prediction state and step prediction control input of th UAV. are step prediction state set of other UAVs. is weight coefficient. , and can reflect the time cost of the overall formation reconfiguration: namely, demanding that every UAV does not deviate from the final desired position too far; thus, the reconfiguration control task can be completed as soon as possible. Here the optimization problem can be decomposed into small scale local finite time domain optimization problems; thus, the local optimization control model for th subsystem can be expressed as

Assuming that all the UAVs can be able to maintain the normal communication under the communication constraint, other UAVs’ states can be obtained by communication; thus, the local optimization problem of th subsystem is relevant to its own state , the other UAVs’ states , and its own control input , the scale of optimization problem is greatly reduced, and th UAV’s control input can be obtained by solving this optimization problem. Combined with the threat constraints, the overall reconfiguration optimization model of UAV formation can be expressed as

5. Finite State Machine Formation Management Unit

5.1. Finite State Machine (FSM)

Finite state machine is mainly used to describe the conversion process and mechanism of object between different states, and its mathematical definition can be defined as follows: a finite state machine can be expressed by a quintet . is nonempty finite set of state. is the initial state set, . is the final state set, and . is the input alphabet (triggering event set). is mapping function from to , [4346].

5.2. Design of Formation Control Manager Based on FSM

As shown in Figure 6, the adjacent UAVs’ state, environmental threat information, task command, and reference trajectory coordinate will be transferred to FSM formation management unit and DMPC formation controller by communication unit. According to this information, FSM formation management unit will make the formation mode of the next step and transfer it to DMPC formation controller, and then the DMPC formation controller will output specific control signal after handling the above information.

The operation steps of the FSM formation management are as follows:(1)According to the theory of finite state machine and the mission requirement of UAVs formation, we can determine five models of UAV formation as follows: free formation flight ; forming the initial formation ; keeping the formation ; formation reconfiguration ; formation avoidance control .(2)According to the formation mode of the first step, the switching condition (trigger event) between two states is determined: UAVs formation command is ; formation satisfies the constraint condition, that is, ; there exists a fixed obstacle or nonfly zone within the predetermined range, that is, ; there exists no fixed obstacle or nonfly zone within the predetermined range, that is, ; a certain UAV joins into the formation, that is, ; a certain UAV leaves the formation, that is, ; start a cooperative task, such as cooperative attack and cooperative interference, that is, ; end a cooperative task, such as cooperative attack and cooperative interference, that is, ; command of formation dissolution is .(3)According to the first step and the second step, we can draw a state transition diagram (Figure 7).

Based on the DMPC formation controller and the FSM formation management unit, this paper presents an automatic formation reconfiguration control method. When UAVs formation arrives in the target zone and finds the early warning radar threat or air defense radar threat, the cooperative interference penetration should be mainly taken. It is usual to select one or several UAVs to interfere in the early warning radar or air defense radar for cooperative interference penetration; for simplicity, in this paper electronic interference will be carried out only by the UAV whose distance is nearest from radar interfere, and the overall formation transforms the formation based on this UAV and reference trajectory point, while it must satisfy the constraints of formation forming which is required by cooperative interference penetration. The other UAVs in the formation which has not interference task will arrive to the latest desired relative position as soon as possible. And when the UAVs formation detects the nonfly zone, cooperative penetration can only be carried out by them, and the overall formation can achieve formation transformation based on reference trajectory and nonfly zone.

Assuming that the initial state of formation is , when a certain UAV of the formation detects the early warning radar or air defense radar, the formation goes into , and it will select the most suitable UAV which will carry on real-time resolving to obtain the best position to interfere in the early warning radar or air defense radar; the other UAVs in the formation will also carry on real-time resolving to obtain the desired formation parameters; they will justify whether they are in the range of the enemy early warning radar or air defense radar or not until they are in safety [39, 46].

6. Solution of Reconfiguration Problem

6.1. Quantum Particle Swarm Optimization Algorithm (QPSO)

In the model of quantum particle swarm optimization algorithm, the particle motion state can be described by wave function as follows: is probability density function, and it satisfies the normalization condition:

The particle motion satisfies the Schrödinger equation in the quantum space: is Planck constant; is Hamiltonian operator which can be described as follows:

is the particle mass, is particle potential energy, and the model is called potential well. Quantum particle swarm optimization algorithm is mainly based on potential well: assuming that there is a group made up of particles which are represented as problem solution in dimensional object search space: namely, . At time th particle position is , it has no velocity vector, the best individual position in the group can be expressed as , and the best global position in the group can be expressed as . In order to ensure the convergence of the algorithm, each particle converges to an attractor point which can be expressed as follows: are, respectively, random number between 0 and 1.

The average best position is introduced in the quantum particle swarm optimization algorithm, which can be defined as the average of the individual best position from all the particles: namely, is the group scale.

Then the evolution equation of the quantum particle swarm optimization algorithm can be expressed as is a random number between 0 and 1, is the contraction-expansion coefficient, and the selection of parameter will affect the performance of the algorithm directly. Generally, when is reduced from 1 to 0.5, the result is relatively good, so it can be expressed as follows: is the maximum iteration number and is the current iteration number.

Compared with the basic particle swarm optimization algorithm, because the attractor point is between and , the particle will probably appear near for the particles which are gathering in . The quantum particle swarm optimization algorithm has less operator and simplifies the calculation, and to introduce the average best position , there exists a waiting effect among the particles which can greatly improve the cooperative work capability and enhance the global search capability of the algorithm [47, 48].

6.2. Solution of Formation Reconfiguration Based on RQPSO-FSM-DMPC

Although QPSO has been proved to be a global convergence algorithm, it may also fall into premature when it is used to deal with complex optimization problem such as multipeak function. In order to improve this problem, Sun et al. introduced the diversity guidance strategy into PSO and proposed the attraction and repulsion particle swarm optimization algorithm [47]. By controlling the diversity of the group, the attraction and repulsion particle swarm optimization algorithm can achieve the switch between the two modes of global search and local search. The attraction corresponded with the convergence phase, and the repulsion corresponded with the divergence phase.

The paper takes the average Euclidean distance between the particle and the centre point to express the diversity. The group made up by the current position of the particle and the group made up by the individual best position of the particle can be expressed as

Thus, the diversity metric can be expressed as follows: is the length of the longest diagonal in the search space, is the dimension of optimization, and is the scale of particle swarm:

Time is critical for the battlefield, so search speed is more important in the search early and middle phase, but when entering the search later phase, the diversity of particle reduces, and the capability of global search will subsequently be greatly degraded. In order to avoid the premature phenomenon effectively, this paper sets the lower limit of the diversity and the QPSO algorithm which is introduced by the diversity is applied to UAVs formation reconfiguration control.

We can use the diversity metrics to guide the algorithm search. After the initialization of the particle swarm, it will converge. According to (26), we can know that will decrease linearly from 1 to 0.5. In the process of convergence, once is smaller than the preset lower limit , the particle swarm will enter into the divergence state; this is a temporary increase of diversity until is larger than .

When or is less than , the mutation operation can be carried out for the particles in the best position of the group: is the random number according to the standard normal distribution and is the mutation parameter.

By conducting the mutation operation, the distance between the particle individual best position and the average best position can be increased, and then the value of will increase. At the same time, due to the change of the global best position that can make deviate from the current position, the distance between the particle current position and will also increase and then will directly lead to a certain degree of divergence of particles which will increase . Therefore, we can conduct the mutation by increasing two kinds of diversity ( or ); the former is called and the latter is called .

In addition to carrying out the mutation operation for particle swarm which can increase the diversity of group, this paper proposes another measure to increase the diversity of the group by conducting selection operation on which can avoid falling into the local optimal solution. Namely, of particles is no longer determined by the global best position and but is determined by and the individual best position of the other particles which are selected randomly. After selecting some other particle from particle swarm we can carry out the following operation:

When the objective function value which is determined by is better than , then is determined by and ; otherwise, it is determined by and . Therefore, the coordinate of can be expressed as follows:

The individual optimal position of each particle is likely to be selected as , so the particle swarm will tend to it. If some nonglobal best position of the particle becomes the target which is the convergence of the particle swarm, and it is located near the global optimal solution, then the probability of finding the global optimal solution will be increased greatly, so that the global search capability of QPSO algorithm is improved evidently [49, 50].

The revised quantum particle swarm algorithm (RQPSO) is obtained by introducing the diversity and selection strategy, and its process is described as follows.

Step 1. Set the basic parameters of the algorithm, encode the particles (Figure 8), initialize the particle swarm randomly, and establish the mapping relationship between the problems and particles.

Step 2. Perform the following steps when the iteration number is less than .

Step 3. Calculate the average best position of the particle swarm, namely, .

Step 4. Calculate the current fitness value of the particles and compare it with the fitness value of the former iteration; if the current fitness value is less than the former iteration, then the particle position is updated by the current particle position; namely, if , then .

Step 5. Calculate the current global optimal position of the group, namely, .

Step 6. Compare the fitness value of the current global optimal position with the fitness value of the former global optimal position; if the current global optimal position is better, then the global optimal position of the group is updated by it.

Step 7. Select the point for each particle, and calculate depending on (32) for each dimension of the particle.

Step 8. According to (25), calculate the new position of the particles.

Step 9. Calculate the particle swarm diversity, namely, or .

Step 10. Calculate the contraction-expansion coefficient and conduct the convergence model.

Step 11. Justify or ; if it is less than , then carry out the mutation for the global best particle depending on (30) and conduct the divergence mode.

Step 12. According to the quantum particle swarm optimization evolution equation, update the position for each particle of the group.

Step 13. Return to Step 2.

7. Simulation and Analysis

In order to verify the effectiveness of formation reconfiguration control method which is proposed by this paper, we have made procedure simulation under the MATLAB compiler environment, adopt the virtual-leader formation control strategy, and then make UAVs formation reconfiguration control simulation experiment by particle swarm optimization (PSO) algorithm and revise quantum particle swarm optimization (RQPSO) algorithm, respectively. We set the particle swarm scale and the max iteration to be 20 and 400, respectively, in PSO and RQPSO. In PSO, local coefficient and global coefficient are set to 2, and inertia weight is set to 0.5. In RQPSO, the contraction-expansion coefficient is reduced linearly from 1 to 0.5, the lower limit value of diversity is set to 0.0004, and the parameter is set to 0.000012.

Assuming the communication link between UAVs is in good condition, the communication delay need not be considered. The velocity and track azimuth of all the UAVs can be measured, and there is no interference which can interfere in the velocity and track azimuth. The parameters of early warning radar, air defense radar, and nonfly zone and collision avoidance constraints and communication constraints are shown in Table 1; the state transformation is shown in Table 5. The reference trajectory and the UAVs initial motion parameters are shown in Tables 2 and 3. The relative distance between UAVs in the formation and the reference trajectory is shown in Table 4.

The simulation results are shown in Figures 914.

From Figure 9, we can see that the overall UAVs formation does not deviate the reference trajectory too far in the process of flight, and it can automatically switch the formation control law to complete the transformation between formation keeping and formation reconfiguration in accordance with each discrete event. When the UAVs formation detects the enemy early warning radar and carries out penetrating, UAV5 is selected to interfere in it, and the other UAVs regard UAV5 and reference trajectory as reference to transform formation automatically, so that they can be in a safe zone. When the UAVs formation detects the air defense radar tracking threat, UAV3 is selected to interfere in it; at this moment, UAV3 and UAV5 select the optimal interference position cooperatively and automatically, the formation completes the optimum formation solution, and all the UAVs can reach the desired position quickly and realize safe penetration. After successful penetration, according to the latest position and the desired formation, the UAVs formation makes the formation reconfiguration; although it forms the same formation with the former, the relative position among the UAVs has changed; this phenomenon shows that the formation which is made up of five UAVs can transform the formation nimbly under the threats environment. Figures 10 and 11 show the velocity, course angle control input of UAV1 and its actual velocity, and course angle response, and from the figures, we can achieve that the actual velocity and course angle response are within the range of constraint. Figure 12 shows the desired location tracking deviation of five UAVs, and from the figure, we can get that all the tracking deviations are within the range of constraint. Figure 13 shows that the relative distance between one UAV and another meets the constraints; thus, it validates the fact that the formation configuration automatic control method which is designed by this paper is effective. Figure 14 shows that the effect of revised quantum particle swarm optimization algorithm is better than particle swarm algorithm apparently.

8. Conclusion

In this paper, a finite state machine management unit scheme was proposed in the framework of distributed model predictive control. Because the formation adopts the virtual-leader control model and the battle field created the known threat constraints, such as early warning radar, air defense radar, nonfly zone, collision free constraints, and communication distance constraints, the formation configuration is determined by the reference trajectory point and the above threat constrains. This paper regards the cost of formation reconfiguration as the optimization objective, introduces the finite state machine scheme to justify the future state of the UAVs formation, and then adopts the revised quantum particle swarm optimization algorithm to solve the problem, while comparing it with particle swarm optimization algorithm. Simulation results show that the designed method is able to keep and transform the formation along with the desired reference trajectory when avoiding the threat of battlefield, avoiding intervehicle collision, and meeting the communication distance limit. Furthermore, we will explore the results when there exists communication delay or there exists no communication in a short period of time on the designed formation reconfiguration control problem.

Competing Interests

The authors declare that they have no competing interests.