Abstract

Aimed at the disadvantages of constrained processing technology in the cooperative target allocation of multiple unmanned combat aircraft (UCAV), the energy-reserved chemical reaction algorithm (CNCRO) is proposed to solve the constrained optimization problems. On the one hand, convert multiple constraint conditions of the multi-UCAV target allocation into optimization targets to transform the constrained processing problem into a multiobjective optimization problem including allocation goals and constrained optimization goals. On the other hand, the energy-reserved chemical reaction algorithm (CNCRO) is proposed, which introduces the environmental energy-reserved variables into the ineffective collision, combination or splitting reactions between single molecule and multimolecules of CNCRO, which is used to boost energy for the low kinetic energy molecules, so as to reduce its kinetic energy in case of invalid collision of molecules and control it to gradually stabilize and converge. At the same time, the splitting reaction is inspired, to greatly change the structure of molecules, promote its search for more solution space, and improve the splitting reaction ability and its global optimization ability. Finally, the simulation experiment is completed by using MATLAB software. The advantages of CNCRO in accuracy are verified by 8 standard test functions, the influence of the weight coefficients in the cooperative target allocation function of UCAV is studied, which are revenue, loss, and voyage, and the target allocation schemes of traditional constrained processing and unconstrained multiobjective optimization methods based on different attack target number and total target number are investigated, to obtain the control law, which can be used to guide the given parameters with different emphases.

1. Introduction

The cooperative target allocation problem of multi-UCAV is an NP problem of optimization with multiple models, multiple constraints, and increasing complexity [1]. It is used to allocate the attack targets for UCAV, so as to maximize the combat efficiency and minimize the combat cost of the entire UCAV, thus facilitating the rational allocation of force resources to obtain the maximum attack effect. Its main feature is that the constraints are numerous and complex [2]; therefore, the constraint processing technology is the key problem in the process of UCAV cooperative target allocation, not only the constraints of the UAV itself but also the constraints of the cooperative problem should be considered. Currently, the commonly used processing algorithms include genetic algorithm (GA) [3], simulated annealing (SA) [4], ant colony algorithm (ACA) [5]. Among them, literature [6] proposed a genetic algorithm with dynamic crossover rate and mutation rate, which maintained the diversity of the population and avoided premature convergence. Document [7] proposes a hierarchical multiagent optimization algorithm, which combines multiagent optimization algorithm and genetic algorithm (GA) to improve resource utilization. Literature [8] designed a selection algorithm with the centralized iterative interference strategy based on the optimal dynamic response and proposed an allocation algorithm of distributed limited feedback interference resources. Literature [9] enhances the performance of the genetic algorithm by introducing greedy algorithms. The adaptive ant colony algorithm proposed in literature [10] improves the reliability of the algorithm. Literature [11] uses artificial immune systems to solve the problem. Literature [12] focuses on the interference energy ratio of the interference suppression method and changes the proposed model into a convex optimization problem. Document [13] proposes a decision-making algorithm for resource allocation based on hierarchical reinforcement learning of bootstrap experts to solve the problem of interference decisions. In addition, considering long-term benefits and current benefits, the authors of [14] propose a distributed deep compression algorithm and a distributed quick compression algorithm and adopt a distributed framework to complete task optimization. In view of the rapid response characteristics of multiple UAVs in a dynamic environment, the authors of [15] extend the consensus-based bundling algorithm to quickly allocate new tasks without completely redistributing existing tasks. Based on the task and the UAV group, the authors of [16] establish the state information model of the UAV and UAV group and solve the problem by improving the particle swarm algorithm and distributed auction algorithm. However, in the process of solving problems with these algorithms, solutions that do not meet the constraint conditions are inevitably generated, and premature convergence is easy occurring. At this time, it is generally necessary to use the heuristic information in the constraints to check the solutions, or adopt the methods such as penalty functions and constraint tournaments [17, 18]. Nevertheless, repeated attempts to check the solution in real time will affect the solving speed, and it is difficult to select the penalty factor of the penalty function method, which has problem relevance and is difficult for users to grasp. Other algorithms such as the constraint tournament are always designed for specific problems, which are not universal, and their effects are not good and will increase the time consumption. Therefore, for the application of multi-UCAV cooperative target allocation, this paper regards the constraints in target assignment as multiple targets and transforms the constrained optimization problem composed of decision variables, single objective function, inequality constraints, and equality constraints into a nonconstrained multiobjective optimization problem with decision variables and two objective functions, and the energy-reserved chemical reaction algorithm (CNCRO) is used to optimize and to realize the rational allocation of UCAV cooperative goals under complex constraints.

In this paper, we propose the energy-reserved chemical reaction algorithm (CNCRO) and applied it to the process of UCAV cooperative target allocation. The main novelty of the proposed approach is as follows: (1) the constrained optimization problem is transformed into a multiobjective optimization problem for processing, to avoid the repeated trial of the conventional constrained optimization method. (2) The energy-reserved chemical reaction algorithm (CNCRO) is proposed. The environmental energy-reserved variable is introduced into the ineffective collision, combination or splitting reaction between single and multiple molecules of CNCRO to supplement energy for low kinetic energy molecules and inspire the splitting reaction, to greatly change the molecular structure and promote its search for more solution space, improve the global optimization of CNCRO; (3) the influence of the weight coefficients (, , ) of the overall income, the overall loss, and the total flight range in the objective function of UCAV on the allocation scheme is studied through the simulation experiments, and the influence laws of the maximum number of attack targets () and the total number of targets () on the allocation scheme are obtained, which can lay a foundation for the subsequent implementation of target allocation based on different emphases.

The remainder of this paper is organized as follows. In Section 2, we introduce the target allocation model of multi-UCAV, including the problem description and constraint conditions. The processing process of the chemical reaction algorithm for solving the cooperative target assignment of multi-UCAV is introduced in Section 3, including the collision reaction, the splitting reaction, and the combination reaction. In Section 4, the performance of the proposed method is illustrated and compared with some other task allocation algorithms. Section 5 concludes the paper.

2. The Target Allocation Model of Multi-UCAV

2.1. Problem Description

Assuming that the sets of multi-UCAV are , the multiple targets scattered in different locations are , and then, the target assignment problem of multi-UCAV is described as assigning existing targets to all attackers of the system in the shortest time. Taking the minimum loss to obtain the maximum benefit and the minimum flight range as the indexes to evaluate the effectiveness of the attack aircraft, the objective function is given as follows:where is the decision variable which defined as

is the killing probability of the attacker to the target ; is the value of objectives ; is the total benefit obtained by UCAV after attacking all targets; is the overall loss of the entire attack fleet; is the total flight range of UCAV; is the probability of death when the attacker attacks the target ; is the value of the attack aircraft; is the flight range while the aircraft attack the target ; , , and are the weight coefficient of income, loss, and flight range, respectively.

2.2. The Constraint Conditions

The constraint conditions for multi-UCAV are as follows:(1), which represents that each target is assigned to only one attacker(2), which means that each attacker must be assigned one target and attack target at most(3), which means that all targets are assigned to the group of attack aircraft, is the collection of multiple targets

2.3. The Handling Method of Constraint Conditions

The above problem is a single objective function composed of decision variables, which is a constrained optimization problem with multiple constraints. The focus of this paper is to transform it into an unconstrained multiobjective optimization problem and use a chemical reaction algorithm to optimize it.

First, according to the constraint conditions of the cooperative target assignment task of UCAV described above, inequality constraint conditions and equality constraint conditions are constructed, so thatwhere and are specifically defined as

Then,

Thus, the constraint conditions of multi-UCAV target assignment are transformed into a single objective function. So, two-objective vectors are formed by and as follows:

So far, a constrained optimization problem consisting of decision variables, single objective function, inequality constraints, and equality constraints has been transformed into an unconstrained multiobjective optimization problem with decision variables and two-objective functions; therefore, the objective function is redefined as

In this paper, the constrained optimization problem is transformed into a multiobjective optimization problem for processing. The main purpose is to avoid the repeated trial for the real-time check of the solution in the conventional constrained optimization method, as well as the problems such as the difficulty in selecting penalty factors and the increasing of time consumption. However, this method is fundamentally different from the conventional multiobjective optimization problem [18]. For the conventional multiobjective optimization problem, its purpose is to find an optimal solution set with uniform distribution and good diversity, and in this paper, degenerates into a single objective optimization problem , and its optimal solution is still one point, which is simpler and more reliable to deal with.

3. The Chemical Reaction Algorithm for Solving Multi-UCAV Cooperative Target Assignment

Chemical reaction optimization (CRO) is a heuristic optimization algorithm with strong global optimization ability [19], which originates from the basic process of molecular reaction in the chemical field. Based on the continuous changes in molecular structure and energy in chemical reactions and the characteristics of energy conservation law, Lam and other scholars were inspired and proposed the algorithm in 2009. Subsequently, Lam used the Markov chain to prove the convergence of the algorithm under the total energy and molecular reaction [19]. In 2011, Lam et al. proposed a real-coded chemical reaction optimization algorithm (RCCRO) [20] to solve the optimization problem of solution space in the multidimensional real number domain, which make up for the problem that the conventional algorithm can only solve the optimization of solution space in the discrete domain. So far, the CRO algorithm has received the attention of scholars and has been widely used in the fields of the spectrum allocation of radio systems and network computing [21], and its global optimization ability has also been fully confirmed.

After the constraint optimization problem of UCAV is transformed into a multiobjective problem, how to balance the feasible solution and the infeasible solution becomes particularly important. If the optimization population after entering the feasible domain does not have enough diversity, then the search will focus on a part of the feasible domain, so that the whole population is easily trapped in local optimization. By simulating the interaction between molecules in chemical reactions, CRO achieves a low-energy steady state which has the ability of distributed, parallel, large-scale, and fast global search, so it is suitable for solving multiobjective optimization problems [21]. Based on the traditional chemical reaction algorithm, the energy-reserved chemical reaction algorithm (CNCRO) is proposed in this paper. In the reaction process, the energy-driven method (the molecular with large kinetic energy reacts violently and the molecular with small kinetic energy tends to be stable) is used to increase the diversity of molecular structure through four simple chemical reaction operations, mainly including ineffective collision, combination or splitting reactions of single and multiple molecules. In addition, in the ineffective collision and splitting reaction between a single molecule and the container wall, the environmental energy-reserved variables () are used to supplement energy for low kinetic energy molecules, so that when an ineffective collision occurs, is utilized to store energy, and the current kinetic energy of the colliding molecule is reduced and gradually tends to be stable; at the same time, the splitting reaction is inspired by using to greatly change the structure of the single molecule to increase the possibility of drastic changes in the molecular structure, improve its splitting reaction ability, make it search more solution spaces, and improve the global optimization ability of the algorithm. Finally, for the cooperative target assignment task of multi-UCAV, CNCRO is used to effectively deal with the multiobjective optimization constraint problem based on UCAV.

3.1. The Ineffective Collision Reaction between Single Molecule and the Walls of Container

During the movement of a single molecule, the process of rebound after collision with the container wall is defined as the invalid collision reaction between the single molecule and the container wall, which is referred to as the invalid collision reaction with the wall. When an ineffective collision with the wall occurs, some properties of the molecule will change, and the molecular structure will also change at the same time. However, the collision is not too violent, so the molecular structure will not change too much.

Assuming that the structure of the molecule is before the ineffective collision reaction with the wall occurs, and the new molecular structure is after the ineffective collision, the acquisition method of new structure is as follows:where is the domain operation function of the molecule in the potential energy surface.

At the same time, the molecular structure change in this process needs to meet certain energy conditions, which is defined as follows:where and are the molecular potential energy and kinetic energy corresponding to the original molecular structure, respectively; is the molecular potential energy corresponding to the new molecular structure, which are calculated by using the objective function of the optimization object in the algorithm. If the equation (9) is satisfied, the molecular structure is updated. According to the principle of energy conservation, the molecular kinetic energy of the new molecular structure is defined as follows:where is the random decimal number between and 1 ( is the lower limit of kinetic energy loss probability which is variable).

When an invalid collision reaction occurs between a single molecule and the container wall, part of the energy will be lost from the molecular reaction to the container environment, which will affect the generation of the optimal solution. To store the lost energy, the environmental energy-reserved variables of are defined in the CNCRO algorithm, which mainly considers the following reasons, and on the one hand, the energy will be continuously lost in the event of an invalid collision of a single molecule, and its kinetic energy needs to be reduced to make it gradually stable and make the system gradually converge. On the other hand, the energy saved by the variable of is used to start the splitting reaction after the single molecule collides with the container wall, which greatly changes the molecular structure to promote the splitting reaction of the single molecule, to ensure it can search for more solutions in the solution space, and improve the global optimization ability of the algorithm. Specifically, it is defined as follows:

3.2. The Splitting Reaction between Single Molecule and the Walls of Container

The violent collision between a single molecule and the wall of the container will cause great changes in the structure of molecular, and the process of splitting one molecule into two or more molecules is referred to as a splitting reaction. There are huge differences between the structure of the new molecule and that of the original molecule. The specific algorithm is as follows.

Assuming that the structure of the original molecule is , the structure of the new molecule generated by splitting is and , respectively (taking two molecules as an example). If the original molecule has sufficient energy by itself (equation (12)), or has sufficient energy supplemented by the variables of (equation (13)), it can provide sufficient potential energy for the new molecule, and then, the splitting reaction is successful. Otherwise, the algorithm maintains all the properties of the original molecule, such as the molecular structure and energy parameters.

Or

If the equation (12) or (13) is satisfied, it is defined as follows:where .

Otherwise, execute equation:where are all random pure decimals.

There are two purposes for setting four random numbers of in equation (15). First, the energy value of is generally large, so it is necessary to avoid transferring too much energy to a new molecule at one time, which is unfavorable to the convergence of the algorithm. Second, it is necessary to avoid exhausting all the energy at one time, to complete the energy-enlightening effect on other molecules. At this time, is defined as follows:

3.3. The Invalid Collision Reaction between Molecules

The process in which two or more molecules rebound back after colliding with each other is referred to as an invalid collision reaction of intermolecular [20]. Its collision intensity is small, the molecular structure changes little, and the energy changes little. This collision reaction is mainly used to complete the local optimization task, similar to the ineffective collision between a single molecule and the container wall, the changes of molecular structure are obtained by using the neighborhood function , and the molecules in the reaction process meet the energy conservation and do not exchange energy with the container environment; that is, the reaction process is independent of .

Suppose that there is an invalid collision reaction between two molecules, the original molecular structure is and , and the new molecular structure is and . Then, the reaction can proceed only if the energy transfer conditions of equation (17) are satisfied; otherwise, the original molecular structure and energy properties are maintained.

According to the law of energy conservation, the reaction process is as follows:where , is the random decimal between 0 and 1.

3.4. The Combination Reaction after Molecules Colliding with Each Other

The process in which two or more molecules combine into one molecule after a violent collision is referred to as the combination reaction [20]. It is assumed that the molecular structures before the combination reaction are , the new molecular structure generated after the combination . This reaction occurs under the condition of a violent collision, which is similar to the splitting reaction, the molecular structure changes dramatically, and the new molecular structure is very different from the original molecular structure. Similarly, the reaction can be completed only if the energy-changing relationship shown in equation (19) is satisfied; otherwise, the original properties of the two molecules remain unchanged.

Then, the reaction process is as follows:

If and have similar values, the difference of and is large; therefore, the combined molecules have large kinetic energy, to ensure their ability to jump out of the local optimal neighborhood in the subsequent reaction and expand the optimization range of the algorithm.

3.5. The Processing Process of CNCRO

In solving the cooperative target assignment of multi-UCAV, the four reaction processes of the CNCRO are implemented in three stages: initialization phase, iteration phase, and result processing phase. The whole algorithm flow is shown in Figure 1.

The initialization phase is used to complete the initialization of parameters and variables required for the algorithm, including the creation of initial molecules and the initialization of molecular structures and energy values. In the iteration phase, relevant reaction conditions need to be detected to determine the reaction type, and the iteration phase continues until the end conditions are met. The result processing phase is mainly used to read the minimum potential energy molecule and output its molecular structure, which is the global optimal solution.

The pseudocode of the specific processing process is shown in Algorithm 1.

(1)Define and initialize corresponding parameters, such as the number of molecules (), the lower limit of kinetic energy loss rate (), the condition threshold (), and the initial kinetic energy of molecules ()
(2)Create molecules based on the number of molecules ()
(3)For every molecule do
Obtain randomly the molecular structure , calculate with objective function , and initialize with initial value of kinetic energy ()
(4)end for
(5)Initialize the environmental energy-reserved variable ()
Create the molecular pointer (, , )
(6)while the end condition of iteration is not satisfied do
 Generate a random number between 0 and 1 for the temporary variable
(7)Ifthen
 Select a molecule of from the container
(8)If the conditions of classification are met then
  (, , success) = DecomposeOnWall (, )
(9)  If success == TRUE then
  Remove the molecule of from the container while adding the molecules of and ,
(10)  end if
(11)else
  IneffectiveOnWall (, )
(12)end if
(13)else
Select the molecules of and from the container
(14)If the conditions of combination are met then
 (M, success) = SynthesisInter (, )
(15)  If success == TRUE then
   Remove the molecules of and from the container and add the molecule of at the same time
(16)  end if
(17)else
 IneffectiveInter (, )
(18)end if
(19)end if
Determine whether the potential energy of the new molecule is the lowest and preserve the globally optimal molecule
(20)end while
(21)Output the structure and potential energy value of the global optimal molecular, that is the global optimal solution and the objective function value

4. The Simulation Experiment and the Comparison of Results

4.1. The Performance Test of the CNCRO Algorithm

The energy-reserved chemical reaction algorithm (CNCRO) is simulated by MATLAB 2010, and its performance is compared with the particle swarm optimization (PSO) described in document [22], the parallel genetic algorithm with the elite set (PGA) described in document [23], the simulated annealing genetic algorithm (SAGA) described in document [24], the deep intelligent ant colony optimization algorithm (ACA) described in document [25], and the chemical reaction algorithm (CRO) described in document [26]. The algorithm parameters are shown in Table 1. At the same time, eight standard test functions are used to verify the effectiveness of CNCRO. The test functions are shown in Table 2.

In order to make the experimental results more scientific and effective, eight benchmark functions are given in Table 2, which are divided into three types of functions. The first one is composed of four unimodal functions (, , , ) where and are simple unimodal function in high-dimensional space, is a multimodal function, is unimodal functions with random interference. The second type includes three complex multimode functions (, , ), and the last type is the rotation function . The global optimal solution of all test functions is zero.

In the experiment, each problem function is set to 50 dimensions, 6 algorithms are used to optimize 8 standard test functions. Each test function runs independently for 30 times under the same conditions, and the average optimization results and standard deviation are recorded. The experimental results are shown in Table 3.

It can be seen from Table 3 that the CNCRO algorithm can achieve better average value and standard deviation in most functions when solving 50-dimensional problem, especially in the functions of , , , , . For the four unimodal functions, CNCRO can find the global optimal solution of and . For and , the performance of CNCRO is slightly worse than that of PGA but better than other algorithms. For the three multimodal functions, CNCRO can find their optimal solution. Among them, both CNCRO and PGA have the best performance in finding the global optimal solution of . In addition, the optimization effect of CRO for is better, which is second only to CNCRO, and the effect of PGA in the function of is slightly worse than that of CNCRO. For the rotation function, CNCRO can find the global optimal solution of , and the performance of PGA in is slightly worse than that of CNCRO.

In general, CNCRO performs well for most functions except for a few functions. It can be seen that the CNCRO algorithm has strong effectiveness and robustness. By using the environmental energy-reserved variable of to supplement energy for low kinetic energy molecules, the molecular diversity of the CNCRO and the correlation between multiple decision variables are increased, which makes it difficult to fall into local optimization and improve the ability of the new algorithm to solve the complex problems.

4.2. The Simulation Experiment of Multi-UCAV Cooperative Target Allocation Based on CNCRO
4.2.1. The Design of Experimental Parameters

In order to further verify the effectiveness of CNCRO in the cooperative target allocation of multi-UCAV, the following simulation experiments are designed. Assuming that there are 5 attack aircraft (Ua1–Ua5) and 10 targets (Ta1–Ta10), the survival probability of the attacker against the target (1 − ), the flight range (), the killing probability (), the value of target (), and the value of attack aircraft () is shown in Tables 48, respectively.

4.2.2. The Overall Experimental Results

Matlab 2010 is used for the simulation, and the CNCRO algorithm is compared with conventional target allocation methods (CRO, ACA, SAGA, PGA, PSO). The parameters of the algorithms are shown in Table 1, and the parameters of UCAV cooperative target allocation are shown in Tables 48, and the weight coefficients of the income, loss, and flight range in the objective function are set as , and the maximum number of targets that can be attacked by each UAV is set as . Each algorithm has been simulated for 25 times. The curve of the average best solution of the fitness of the objective function is shown in Figure 2. In addition, the comprehensive analysis results after each algorithm has been run for 25 times are shown in Table 9.

It can be seen from Figure 2 that PGA converges to the best solution after 40 iterations, ACA converges after 81 iterations, CRO converges after 110 iterations, and PSO and SAGA converge after 85 and 83 iterations, respectively, while the CNCRO algorithm in this paper converges to the best solution after 28 iterations. That is to say, CNCRO method has faster convergence speed and better real-time performance.

In Table 9, the fitness value of the objective function of CNCRO is the highest. Based on the maximum overall gain, the minimum loss, and the minimum flight range, compared with other algorithms, CNCRO can obtain greater benefits than other algorithms, and its global optimization ability is stronger which can ensure that the optimal solution is obtained in a short time. Finally, obtain the scheme of target decision-making with the value of fitness function of 302 by using CNCRO, which is shown in Figure 3.

Comprehensively analyze the allocation strategy given in Figure 3. First, by comparing the allocation processes of Ua3 and Ua4 to the target of Ta7 and considering the killing probability of Ta7, Ua4 is slightly better than Ua3, but the difference is small (Ua3 is 0.8531, and Ua4 is 0.8625). Considering the survival probability, Ua3 has a large survival probability (Ua3 is 0.8641, and Ua4 is 0.8422). In addition, in terms of the flight range, Ua3 also has great advantages (20) over Ua4 (36), and the value of attack aircraft of Ua4 (100) is far greater than that of Ua3 (80). Therefore, the algorithm allocates Ta7 to Ua3, to ensure the integrity of high-value aircraft of Ua4, avoid its loss, and keep the path shortest. This allocation scheme is reasonable and effective.

Next, the allocation processes of Ua4 and Ua5 to the target of Ta2 are compared. Compared with other aircraft, Ua4 has the best position to target Ta2; that is, it has a higher survival probability (0.8933), and Ua5 is slightly worse than Ua4 (0.8618). However, Ua5 has ammunition advantages; that is, Ua5 has the largest killing probability to Ta2 (0.8941), which is obviously superior to Ua4 (0.8342), and Ta2 has a relatively high value (60). In addition, the flight ranges of Ua4 and Ua5 to the target Ta2 are 30 and 20, respectively. In order to ensure that the aircraft has high survivability, the flight range is as short as possible, and attack the high-value target with the maximum killing probability, Ta2 is allocated to Ua5, to ensure the principle of using superior firepower to attack the target and avoid the high-risk UAV attacking the high-value target at the same time. That is to say, the attack aircraft with the advantages of position and ammunition will be preferentially allocated to high-value targets to increase the attack income. Meanwhile, the targets with the greater threat should be avoided from being allocated to high-value attack aircraft, which is conducive to the self-preservation of the attack aircraft, and can reduce the loss. On the whole, the CNCRO algorithm can provide a better scheme of cooperative target allocation.

4.2.3. The Influence of Weight Coefficient on UCAV Target Allocation

In the function of UCAV target allocation, three factors will affect the final optimization result, the overall gain, the overall loss, and the total flight range. In order to study the control ability of different factors on the target allocation, the weight coefficients of the overall income, the overall loss, and the total flight range are investigated. The control law of the parameters can guide the determination of the task assignment scheme with different emphases. This part has completed the following three experiments. The parameters of the algorithm are shown in Table 1, and the target allocation parameters of UCAV are shown in Tables 48.(1)To investigate the influence of a single factor, the weight coefficients are set as (in consideration of the maximum fitness function). (in consideration of the absolute minimum fitness function). (in consideration of the absolute minimum fitness function). The experimental results are shown in Tables 1012.(2)To investigate the influence of two factors, the weight coefficients are set as , , . The experimental results are shown in Tables 1315.(3)The influence of the three factors is investigated. The weight coefficients are set as . The experimental results are shown in Table 16.

In order to more intuitively investigate the diversity of allocation schemes, the fitness and average error of each experiment are obtained, as shown in Figure 4. The abscissa in the figure represents the target of Ta1–Ta10.

It can be seen from the above charts that when the overall income, the overall loss, and the total flight range are considered separately, the target allocation schemes are quite different, and the performance is relatively divergent with too much diversity. When the two factors are considered at the same time, although the values of fitness function under different combinations are different, the scheme of target allocation has been relatively determined, the aspects considered are more comprehensive, and the algorithm tends to be optimized. When the three factors are considered at the same time, the allocation result is the optimal selection in the previous scheme. Specifically, the allocation schemes of and in Experiment 2 are the same, and the allocation scheme of in Experiment 2 is the same as that of in Experiment 3. The main difference between the two schemes is the allocation of Ta3. The former allocates it to Ua4, and the latter allocates it to Ua5. By analyzing the given parameters of Ta3, it can be seen that the killing probability of Ta3 allocated to Ua4 and Ua5 is 0.8462 and 0.8131, respectively, from the perspective of killing probability, assigning Ta3 to Ua4 is more advantageous. In addition, the flight ranges of Ua4 and Ua5 are 20 and 45, respectively. It can be seen that the flight distance of Ua4 is smaller, and the killing efficiency is higher. From the perspective of survival probability, Ua4 and Ua5 are 0.8937 and 0.8972, respectively. Since the parameters of Ua4 and Ua5 are close, the impact on the allocation result can be ignored. In a comprehensive view, the scheme allocated to Ua4 is more optimized.

So, when the optimization conditions are sufficient, although some of the distribution schemes are the same, relatively speaking, the allocation scheme of the three factors will tend to be optimal and more in line with the reality. In addition, the above allocation results are also related to the larger proportion of the total income in the objective function. In the later practical application process, the parameters of the total income, the total loss, the total flight range, and its weight coefficient can be set according to the specific situation, so as to obtain a strategy of target allocation more in line with the actual situation.

4.2.4. The Comparison between Traditional Constrained Processing Technology and Unconstrained Multiobjective Optimization Method

In order to investigate the role of traditional constraint processing technology and unconstrained multiobjective optimization method in the strategy of UCAV target allocation, the two methods are compared experimentally. In addition, since the target allocation scheme of UCAV is closely related to the constraint conditions of (the maximum number of targets that can be attacked by each UAV) and the total number of target set () adopted, the experiments in this section mainly use CNCRO algorithm to realize the target allocation based on traditional constrained processing technology and unconstrained multiobjective optimization and control the value of and , and the performance of the two methods by using and with different values is investigated. Specifically, it includes two experiments, (1) set as 2, 3, 4, 5, respectively, and the value of the total target number of is 10, that is, , ; (2) the total target set is composed of 5, 6, 7, and 8 targets randomly selected from the original targets, that is, , . In addition, the value of the weight coefficient is , and the other specific parameter settings are the same as above. The performances of CNCRO in constrained processing and unconstrained multiobjective optimization are obtained by executing 30 times in each case, and the number of times of obtaining the optimal solution in 30 times is shown in Table 17.

It can be seen from Table 17, on the one hand, from the maximum number of targets (), which is a constraint condition of the objective function, and it is closely related to the optimization process. The larger the value of , the broader the conditions for controlling the allocation process, the simpler the implementation, the greater the possibility of obtaining the optimal solution, and the shorter the execution time. However, when the maximum number of targets is reached, the processing effect and performance will approach saturation or bottleneck. For example, the number of times of obtaining optimal solutions is very close for and (26 times for constrained processing and 29 and 28 times for unconstrained multiobjective optimization). On the other hand, from the perspective of the total number of sets (), the larger value of will lead to greater difficulty in the implementation process and longer execution time. Therefore, the number of times to obtain optimization will become smaller, which is mainly related to the difficulty of the problem. However, compared with the constrained processing method, the proportion of optimization results obtained by unconstrained multiobjective method will greatly increase.

In addition, the consumption time and the average value of the objective function of the two methods are obtained, as shown in Figure 5.

It can be seen from Figure 5 that the processing time of the unconstrained multiobjective optimization method is shorter than that of the constrained processing method. The main reason is that its optimization process does not need to detect the constraint conditions, and the processing is simpler, more reliable, and with shorter time consumption. In addition, just as the above analysis, the larger the number of maximum targets (), the simpler the implementation process, and the shorter the execution time, while the larger the number of target set (), the greater the optimization difficulty, and the longer the execution time. As for the value of objective function, on the one hand, the larger the value of , the broader the allocation conditions, the more likely it is to obtain the optimal solution. It also can be seen from Figure 5 that the value of objective function is gradually increased except for isolated cases, which is related to the specific allocation scheme; in addition, when it reaches a certain degree, the processing effect is close to saturation, which is also one of the reasons. However, in general, compared to the constrained processing method, the value of objective function obtained by unconstrained multiobjective optimization method is better, and its allocation process is more reasonable. On the other hand, the larger the number of set (), the greater the optimization difficulty. Generally speaking, except for isolated cases, both methods show that the value of objective function gradually decreases with the gradual increase of , and the optimization difficulty gradually increases.

5. Conclusion

The main innovations of this paper are as follows: (1) by transforming multiple complex constraint conditions into optimization objectives, the constrained optimization problem of UCAV cooperative objective allocation is transformed into a multiobjective optimization problem including allocation objectives and constraint objectives; (2) the energy-reserved chemical reaction algorithm (CNCRO) is proposed. The environmental energy-reserved variable of is introduced into the ineffective collision, combination or splitting reaction between single and multiple molecules of CNCRO to supplement energy for low kinetic energy molecules, to reduce the molecular kinetic energy while ineffective collisions have occurred between molecules. At the same time, the splitting reaction is inspired, to greatly change the molecular structure and promote its search for more solution space, and the global optimization ability is improved; (3) the energy-reserved chemical reaction algorithm (CNCRO) is applied to the process of UCAV cooperative target allocation, the influence of the weight coefficients (, , ) of the overall income, the overall loss, and the total flight range in the objective function of UCAV on the allocation scheme is studied, and the influence laws of the maximum number of attack targets () and the total number of targets () on the allocation scheme are obtained, which can lay a foundation for the subsequent implementation of target allocation based on different emphases.

Data Availability

The data used to support the findings of this study are included within the article

Conflicts of Interest

The authors declare no conflicts of interest.