Decentralized Algorithms for Weapon-Target Assignment in Swarming Combat System

Zhao, Peng; Wang, Jianzhong; Kong, Lingren

doi:https://doi.org/10.1155/2019/8425403

Mathematical Problems in Engineering

On this page

Abstract Introduction Conclusions Data Availability Conflicts of Interest Acknowledgments Supplementary Materials References Copyright Related Articles

Research Article | Open Access

Volume 2019 | Article ID 8425403 | https://doi.org/10.1155/2019/8425403

Decentralized Algorithms for Weapon-Target Assignment in Swarming Combat System

Peng Zhao,¹Jianzhong Wang,^1,2and Lingren Kong¹

Academic Editor: Francisco J. Montáns

Received08 Apr 2019

Revised14 Jul 2019

Accepted05 Aug 2019

Published27 Aug 2019

Abstract

Swarming small unmanned aerial or ground vehicles (UAVs or UGVs) have attracted the attention of worldwide military powers as weapons, and the weapon-target assignment (WTA) problem is extremely significant for swarming combat. The problem involves assigning weapons to targets in a decentralized manner such that the total damage effect of targets is maximized while considering the nonlinear cumulative damage effect. Two improved optimization algorithms are presented in the study. One is the redesigned auction-based algorithm in which the bidding rules are properly modified such that the auction-based algorithm is applied for the first time to solve a nonlinear WTA problem. The other one is the improved task swap algorithm that eliminates the restriction in which the weights of the edges on graph must be positive. Computational results for up to 120 weapons and 110 targets indicate that the redesigned auction-based algorithm yields an average improvement of 37% over the conventional auction-based algorithm in terms of solution quality while the additional running time is negligible. The improved task swap algorithm and the other two popular task swap algorithms almost achieve the same optimal value, while the average time-savings of the proposed algorithm correspond to 53% and 74% when compared to the other two popular task swap algorithms. Furthermore, the hybrid algorithm that combines the above two improved algorithms is examined. Simulations indicate that the hybrid algorithm exhibits superiority in terms of solution quality and time consumption over separately implementing the aforementioned two improved algorithms.

1. Introduction

WTA plays an increasing role given the growing requirement for the collaborative engagement of several weapons relative to several targets in modern battlefields [1]. Various extant studies focused on solutions for WTA that are mainly applied in the traditional high-value weapon platforms such as air defense WTA [2–4] or ship target strikes [5, 6]. Compared to expensive equipment, swarming UAVs or UGVs attract the attention of military powers worldwide because of advantages including strong autonomy, low cost, high flexibility, and fast upgrades [7]. For example, the United States of America uses the advantages of a large quantity of low-cost UAVs to synergistically destroy a few key enemy targets [8, 9]. Similarly, WTA is of great significance in the swarming combat system. Given the large-scale and decentralized autonomous framework of the swarming combat system, it is necessary to develop more efficient WTA algorithms to maximize the total damage value of enemy targets in a decentralized manner.

Several centralized algorithms to solve WTA problems are proposed based on the research results of operations research and example include the heuristic algorithm [10], memetic algorithm [11], genetic algorithm [12, 13], particle swarm optimization algorithm [14], bee colony algorithm [15], and game theoretic strategy [16]. A centralized algorithm efficiently obtains an optimal solution if the number of weapons and targets is low. However, in a large system, a centralized algorithm can result in heavy workload with respect to computation/communication because the central controller is typically responsible for collecting, processing, and disseminating information for the entire system. Additionally, the heavy workload can result in low efficiency for the entire system. Furthermore, the failure of the central controller results in the failure of the entire system [17].

In order to cope with the large-scale system and enhance reliability, several distributed algorithms are presented in the research areas of multirobot task assignment [18, 19]. However, damage probabilities of weapons to targets are not additive. The expression for the WTA problem does not typically correspond to a linear sum equation, and thus, it is difficult to immediately apply the distributed algorithms in the areas of multirobot task assignment. In this paper, we introduce two new methods derived from the auction-based algorithm and task swap-based algorithm to optimize the assignment of weapons to targets, given the cumulative effect.

The auction method is explained in terms of economic competition concepts in human’s economic activities [20]. In [21–23], the conflict-free assignment solution is achieved by introducing auxiliary variable termed as task price, and making robots bid a price for tasks via an iterative auction mechanism. The algorithms presented in [24, 25] adopt the auction-based framework in a manner similar to [21–23] and extend the algorithms to problems with more constraints on the tasks, such as adding task deadline constraints or group constraints. However, the extension of the auction-based algorithm to solve the nonlinear WTA problem has rarely been studied. The WTA problem in [1] is similar to our version of the problem where the cumulative damage effect is considered. However, the method in [1] includes the prepared assignment guidance table, which needs a central auctioneer and is not suitable for solving the distributed assignment problem introduced in this paper. This paper further develops the auction-based algorithm, adds a judgment criterion for bidding, and uses the maximum consensus technique rationally. The results show that the solution quality of the redesigned auction method significantly improves while the extra running time is negligible.

This paper uses the task swap method to refine the assignment solution after the auction method. The task swap method is applied well to solve the one-to-one mapping assignment problem where each weapon is assigned with a unique target, and an optimal assignment solution is obtained when it runs to completion. Additionally, the task swap method can start from any initial assignment such that it is easy to refine the suboptimal assignment computed via other methods [26]. Significant attention has been focused on the task swap method such as exchanging tasks between pairs of robots [27] or among two or more robots [28]. Recently, some studies [29, 30] proposed a task swap-based method that combined the duality theory with decentralized task allocation approaches. However, despite its decentralized form, the method assumes that global communication exists at each stage when the algorithm is running. Based on the study in [29], a fully decentralized task swap-based method that combines duality theory with Dijkstra’s algorithm in graph theory was proposed [17], and the method is suitable for scenarios in which robots communicate only with neighbors in their vicinity. However, in [17], the search for a task swap requires satisfying the condition that the weight value of the edge on the graph is positive, which decreases the searching scope and extends the searching time. In this paper, we retrofit the method to search for a task swap loop. The weight value of the edge and loop in graph has no restrictions, which accelerates the optimization process. The simulation results show that the convergence rate of the improved task swap method exceeds that of other task swap-based methods.

Finally, we analyze a hybrid algorithm that combines the redesigned auction-based algorithm and improved task swap algorithm. That is, the redesigned auction-based algorithm is applied to obtain an assignment solution where a reasonable number of suitable weapons are assigned to targets, and the improved task swap algorithm is then applied to refine the assignment solution after the auction-based algorithm. The simulation results indicate that when compared with implementing one of the algorithms separately, the hybrid algorithm exhibits better performance in solution quality, and the convergence rate is not lower than that of the redesigned auction-based algorithm or improved task swap algorithm.

This paper is organized as follows: In Section 2, we describe the background of this paper and model the WTA problem. In Sections 3 and 4, we present a redesigned auction-based algorithm and an improved task swap-based algorithm, respectively. In Section 5, we present the simulation and analyze the results. In Section 6, we summarize the conclusion and outline future research studies.

2. Weapon-Target Assignment Model

The WTA problem in this paper involves determining a proper assignment of weapons to targets in a decentralized manner with the objective of maximizing damage effect of enemy targets. A simple scenario of the weapons used against the targets is shown in Figure 1. The scenario exists in applications of intelligent minefield, i.e., intelligent mines with limited ability of reconnaissance, communication, and movement lurk in the battlefield, and they cooperatively attack the enemy targets in the area. We assume that the weapons in this paper are small unmanned ground vehicles (UGVs) carrying a gun or explosive device, and the targets are soldiers or military vehicles.

We assume that there are weapons and targets . The solution of the WTA problem is based on the following assumptions.

Assumption 1. The capability of the weapons is limited, and thus, the destruction of a target typically requires more than one weapon. The number of the weapons exceeds that of the targets, i.e., . Each weapon is assigned to exactly one target, and each target is attacked by one or more weapons.

Assumption 2. The problem in this paper is related to static weapon-target assignment (SWTA), i.e., all the weapons attack the targets in a single stage.

Assumption 3. Each weapon knows a few basic attributes of the targets, and maintains local information about the damage probability of each target attacked by . reflects the likelihood that target is destroyed by weapon . Given a certain type of target, the damage probability corresponds to a function of relative distance or relative angle between it and a weapon.

Assumption 4. Each weapon is homogenous, and the value of each target is identical. Without loss of generality, we assume that .
The objective is to assign all weapons to the targets such that the total damage value of the targets is maximized, and this is formally stated as follows:subject towhere is a binary variable that indicates whether is assigned to . If is assigned to , then , otherwise, . The constraint (2) guarantees that a weapon is assigned to exactly one target at a time. The constraint (3) states that a target is attacked by at least one weapon. It should be noted that the damage probability in objective function (1) is not additive. Additionally, conventional decentralized algorithms, such as an auction-based algorithm or task swap algorithm, are used to solve the linear sum assignment problem. It is necessary to develop new decentralized algorithms to solve the WTA problem.

3. Auction-Based Algorithm Design

In this section, the redesigned auction-based algorithm is proposed to solve the WTA problem in a distributed manner. The auction-based algorithm operates in a manner similar to a real auction where individuals bid for items by raising their prices via competitive bidding. The total payoffs are almost maximized by individuals iteratively maximizing their own payoffs.

3.1. Conventional Auction-Based Algorithm

The conventional auction-based algorithm is introduced to solve the classic linear sum assignment problem with the one-to-one assignment constraint in this subsection. We assume that the price for target at iteration is , and this is identical to the auxiliary variables termed as task price in [21, 25]. The payoff for weapon relative to target is . We assume that does not change during the iterations because the basic attributes of the weapons and targets change slightly during the assignment. The revenue for weapon used against target at iteration is . Each weapon bids price for the target with the highest revenue. With respect to weapon , if another weapon has bid higher price than that of weapon ’s previous bid for target , then the revenue of changes and is potentially not the highest, and the new desired target with the highest revenue should be selected. Subsequently, weapon makes a new bid for target and increases the price of . Therefore, the iterative biddings gradually maximize total payoff while the one-to-one assignment constraint is satisfied.

The assignment result obtained by the auction-based algorithm is equivalent to an economic equilibrium result, i.e., each individual is satisfied with the goods obtained and no longer possesses an incentive to bid for other goods.

3.2. Redesigned Auction-Based Algorithm

The conventional auction-based algorithm provides an almost optimal solution for the linear sum assignment problem. However, the WTA problem is always formulated as the nonlinear programming model as shown in equation (1). Thus, the algorithm does not work well in the WTA problem. Furthermore, the conventional auction-based algorithm is designed for the problem wherein each target is exclusively assigned to a unique weapon. The method of adding virtual targets or weapons can be used to solve the assignment problem with unequal number of weapons and targets. In contrast to the popular method, we propose a method of adding a criterion to verify whether the damage value is improved. The solution is near optimal for the WTA problem while the constraint of the number of weapons on each target is satisfied.

3.2.1. Price Update Rules

The objective of the WTA problem corresponds to the total damage of the targets, as shown in equation (1), where . We assume that in assumption 4. If the targets are attacked with the higher damage probability by the weapons, then the total damage of targets should be maximized.

We set the damage probability to be the payoff , i.e., . In the auction-based algorithm, each weapon is an economic agent acting in its own best interest. With respect to weapon , let denote the index of the target with the highest revenue:where denotes the set of the candidate targets. The price is an auxiliary variable and is used to resolve the WTA problem to satisfy the constraint (3). When the algorithm terminates, the quality of assignment solution does not depend on , i.e., the output assignment solution is almost optimal in terms of payoff instead of the revenue [24].

Let denote the index of the target with the second highest revenue and is expressed as follows:

In the auction-based algorithm, weapon increases the price and bids for its desired target. By bidding iteratively, the auction-based algorithm gradually ensures that every weapon is assigned to its desired target with the almost highest payoff . At iteration , if required, the price is updated based on a certain rule as follows:

In equation (7), the new price of the target increases by at least . Based on the original idea in an auction-based algorithm, the parameter is constant, and this avoids an endless bidding cycle when two or more weapons repeatedly bid for the same target. Instead of avoiding the algorithm from cycling, we use the parameter to control the rate of convergence because related studies indicate that the higher the , the faster the auction-based algorithm converges. In this paper, we define .

3.2.2. Bidding Criterion

The WTA and linear sum assignment problem have different objective functions. When the sum of the payoffs is maximized, the cumulative damage probability of the targets is potentially not maximized. Thus, extending from the conventional auction-based algorithm, we integrate the additional criterion to verify whether the total damage value is improved. In terms of , when some other weapon exhibits a higher price for ’s previous desired target, then this implies that weapon is more suitable to attack the target. At this time, weapon must select a new target to bid. In the implementation, (1) we select the target with the highest revenue based on equation (5) from the candidate target set . (2) We calculate the new value of the objective function and verify whether the new value is improved when compared with the previous value, if true, i.e.,then the new desired target is selected, wherein denotes the value calculated by using equation (1). Otherwise, we repeat procedures (1) and (2) to continue to determine a new target from the remaining targets until the criterion is true or the candidate target set is empty. The implementation is shown in lines 8 to 23 in Algorithm 1.

	Input: , , ; ; , , , ;
	Output: , , , , ;
(1)	//Update the local assignment information among its neighbors based on the maximum consensus technique.
(2)	;
(3)	;
(4)	; //The value of and denotes each weapon-to-target assignment pair and damage probability of the assignment, respectively. The information is used to calculate the value of the objective function in Step 14.
(5)	;
(6)	;
(7)	//Weapon selects a new desired target and updates the price if another weapon exhibits a bid higher than ’s previous price.
(8)	If or then
(9)	Initialize the desired targets set ;
(10)	;
(11)	;
(12)	;
(13)	;
(14)	The new value of the objective function, , is obtained by using equation (1);
(15)	If or then
(16)	; //Update price
(17)	Assign new value to the variables, i.e., , , ;
(18)	Else
(19)	;
(20)	Retain the previous price and assignment information;
(21)	If then go to step 11;
(22)	End if
(23)	End if

3.2.3. Maximum Consensus Technique

Specifically, each weapon must know the current price for each target and assignment information on each weapon-to-target pair during the bidding procedure. In other words, the assignment information among all weapons must be consensus. A maximum consensus technique [23] is used to satisfy the aforesaid requirement. The idea is that at each iteration , each weapon locally maintains and updates a list of current highest price for each target by communicating with its neighbors, , and uses the highest price among its neighbors as the local price of the targets. This is expressed as follows:

The reason for obtaining each other’s assignment information is to calculate the value of the objective function such that the weapon can use the value to verify whether the solution is improved. Similarly, the same technique is applied such that the assignment information is consensus among all weapons.

Although the objective function value should be identical, different weapons have different values due to the paucity of each other’s recent assignment information in the distributed implementation. Let be the symbol of the local objective function value calculated by each weapon at current iteration . For further explanation of the symbol , the superscript denotes that weapon receives the message from weapon . With respect to weapon , is maintained and updated by communicating with its neighbors, and it uses the highest objective function value among its neighbors as the local value. This is expressed as follows:let denote the index of the target attacked by weapon at iteration . Each weapon locally maintains and updates the lists of the desired targets’ index, , and the corresponding payoff for all the weapons. The superscript denotes that the lists are maintained by weapon .

For the sake of explanation, we define a message for bidding in the following format: , where , , , and . Each weapon locally maintains and updates the value of and from messages with the highest objective function value among its neighbors, i.e.,

Although the price and assignment information may not reach all weapons immediately, accurate information is eventually propagated to all weapons in the connected network.

A single iteration of the redesigned auction-based algorithm for weapon is described in Algorithm 1. In fact, the weapons can run Algorithm 1 simultaneously or asynchronously. Algorithm 1 can be summarized as follows:(1)Weapon receives bidding from its neighbors and updates its own local information such as (see lines 1 to 6).(2)Whether the price of the previous assigned target has been increased by other weapons is verified. If true, then the weapon should select a new desired target based on the additional criterion as previously mentioned. If there exists a new desired target, then its price and the corresponding assignment information are updated, and a new bidding message is then broadcast (see lines 7 to 23).

Algorithm 1 terminates when all weapons are satisfied for their desired targets, i.e., the price does not change for iterations, where is the weapons’ network diameter that indicates the maximal hop of the network.

3.3. Performance Analysis

Algorithm 1 for all weapons terminates in a finite number of iterations with an assignment that satisfies the constraint wherein each target is attacked by at least one weapon. The time complexity of the redesigned auction-based algorithm (Algorithm 1) is , where denotes the running time of Algorithm 1 for each weapon and denotes the maximum number of iterations for all weapons to run Algorithm 1.

In practice, the results show that the redesigned auction-based algorithm generally works better than the conventional auction-based algorithm. The main reasons are as follows.

In order to solve the WTA problem via the conventional auction-based algorithm, the objective function as shown in equation (1) is simplified as equation (12) because the conventional auction-based algorithm is aimed at solving the linear sum assignment problem.where is subject to the constraints (2)–(4). For ease of exposition, we define as the survival probability of the target attacked by weapon . Equation (12) is equivalent to the below equation:

Given the nonlinear cumulative effect, we propose the redesigned auction-based algorithm to solve the nonlinear WTA problem. The objective function (1) is equivalent to minimize the survival probability of the targets as follows:

Based on the arithmetic mean-geometric mean (AM-GM) inequality, for all nonnegative real numbers , we have , equality holds if and only if . Extending from the AM-GM inequality, we assume that there exists a set of nonnegative real numbers , and , and if the variance of the numbers is higher than that of , i.e., , then .

Similarly, we observe equations (13) and (14) and assume that there exists an optimal solution for the linear summation (13); however, the solution is not optimal for the objective function (14), namely, , but , where is any feasible solution. There are potentially more suitable solutions for the nonlinear WTA problem.

In order to find more suitable solutions, the redesigned auction-based algorithm improves by adding a judgment condition to verify whether the solution for the nonlinear WTA problem improves (see lines 15 to 22 in Algorithm 1). Thus, the redesigned auction-based algorithm generally works better than the conventional auction-based algorithm. Furthermore, the simulation results also demonstrate it.

However, it is hard to provide mathematical proof that the solution for the WTA problem is optimal or almost optimal. The main reason lies in that the additional criterion evaluates the new assignments in a “greedy” manner and may impede some potential good solutions.

4. Task Swap-Based Algorithm Design

The weapons start moving and launching an attack on the targets when the assignment is completed by the auction-based algorithm. As the weapons move towards the targets, there is still time to optimize the assignment. The task swap algorithm can be started and interrupted at any time, and thus it is used to further optimize the assignment after the redesigned auction-based algorithm. The main idea of the task swap algorithm is that two or more weapons refine their assignment via exchanging tasks between themselves. Specifically, the term of “perform a task” implies that the target is attacked by the weapon. For further details, the weapons that need to adjust the tasks form a group that is termed as the swap loop. In order to adjust the assignment, each weapon on the swap loop substitutes its task with its successor’s along the closed loop. In [17], the search for a swap loop is used by the primal-dual theory and single source shortest path algorithm such as the Dijkstra algorithm. The process of searching a swap loop involves determining a valid loop on graph (detailed explanation in Section 4.3). In this paper, a new algorithm is proposed, where we remove the restriction that the weights of the edges on a searching loop must be nonnegative. The improved algorithm decreases the searching length on graph and searches for multiple swap loops concurrently, thereby improving the efficiency of the solution.

4.1. Task Swap Method

The scene of weapons relative to targets is convenient to describe the concept of the task swap method, as shown in Figure 2(a). Figure 2(b) shows a matrix of the survival probability of the targets attacked by the weapons. It should be noted that the number of weapons is equal to the number of targets in the example. Specifically, after the assignment by the auction-based algorithm, each weapon is assigned to each target such that every weapon possesses exactly one task to perform, and this case is considered as the one-to-one mapping problem.

(a)

(b)

To decrease the sum of the survival probability of the targets, the weapons exchange their tasks under certain rules, as shown in Figure 3. Let be the assignment vector, i.e., , where denotes the task assigned to weapon . The swap loop is denoted as , where denotes the length of the swap loop. The arrows in Figure 3(c) denote that the task of the previous weapon (e.g., weapon ) is passed to the next weapon (e.g., weapon ). We simplify the description of the swap loop as (12, …, k), for example, the swap loop in Figure 3 is expressed as (123).

(a)

(b)

(c)

4.2. Primal-Dual Problem

The objective of the WTA problem is to maximize the damage probability of the targets, as shown in equation (1). The sum of the damage probability and the survival probability is 1, and thus the objective turns into minimizing the survival probability of the targets, and this is expressed as follows:

The task swap algorithm is used to solve the linear sum assignment problem. As previously mentioned, we integrate an additional criterion to verify whether the solution of the WTA problem improves. For the sake of explanation, a linear sum model is used here without considering the cumulative damage effect as follows:where .

Definition 1. Equations (16) and (17) are denoted as the primal problem in the dual theory.

Definition 2. There are corresponding dual variables and , wherein the dual problem is expressed as follows:where the symbols and are used as dual variables in the rest of this paper and should not be confused with the symbol of the payoff and revenue mentioned earlier.

Definition 3. Reduced costs are defined as follows:Reduced cost is an auxiliary variable used to imply whether the weapon-target assignment is feasible. Based on equation (20), the weapon-target pair is feasible only if .

4.3. Dijkstra’s Algorithm

Let us define graph , as shown in Figure 4. Graph is built from the weapons communication network topology. The node on graph is composed of a weapon and its current assigned target , i.e., . The directed edge from to denotes a new assignment , i.e., weapon passes its task to weapon . The weight of the edge is , which is equal to the element of the nth row and th column in the reduced cost matrix (Figure 5).

(a)

(b)

(c)

(d)

(e)

(f)

Figure 4

A swap loop is found on graph . The dark node denotes the source node, and the shaded nodes denote the nodes on the loop (possibly temporarily); the shaded edges denote the spanning tree of searching for a swap loop. The solid edges denote its weight (see the value beside the edge), and the dashed edges denote . The value in the node is the shortest distance from the source. (a)–(e) show the process of searching for a swap loop on graph by using the Dijkstra algorithm. (e) The edge from to with closes the loop, and the total weights on the loop is . (f) The new assignment after exchanging tasks.

In [17], a novel algorithm is proposed to search for a task swap loop, where the problem is transformed into searching for a loop on graph . Figure 4 shows the process of searching for a task swap loop by using the Dijkstra algorithm. The Dijkstra algorithm is explained in [31] in detail. Based on Definition 3, the edge is feasible only if . It should be noted that the nonnegative weight of the edge also corresponds to a constraint for running the Dijkstra algorithm. Thus, while searching for a task swap loop, each node selects the next candidate node only from outgoing edges with nonnegative weights to join the swap chain. Without loss of generality, let denote the source node, and the other nodes are termed as leaf nodes. A valid task swap loop is formed when a candidate node on the swap chain exhibits an outgoing edge to the source node with the weight , and the total cycle weight . After tasks exchange, the objective as shown in equation (16) decreases by (see [17] for the proof in detail).

4.4. Improved Task Swap Algorithm

Based on the related work, we conclude that the assignment result is improved after tasks exchanging if the total weight of the edges along the swap loop is less than zero. Thus, the search for a swap loop turns into determining a loop with total weight on graph .

As required by the Dijkstra algorithm, the weight of the edges on graph is nonnegative. Otherwise, the leaf nodes cannot obtain the results about the shortest distance and path between the source node and themselves by running the Dijkstra algorithm. Specifically, when there is a loop with negative weight, the distance between the leaf node and source node tends to , i.e., there are no shortest distance and path between the leaf node and source node. The Bellman–Ford algorithm is used to solve the single source shortest path problem and determine whether the graph exhibits a loop with a negative weight. However, the Bellman–Ford algorithm repeatedly requires relaxation operations on each edge, and thus it exhibits a long running time and is not suitable for scenes with large-scale nodes.

This paper further improves over the Dijkstra-based searching process for finding a task swap loop, and this allows edges and loops with negative weight to appear on graph . During the process of searching for a task swap loop, if the shortest distance of a leaf node on the swap chain (such as the shaded nodes in Figure 4) is less than the previous value, then the node may be included in the loop with a negative weight. Then node stops the current search for a swap loop and is used as the new source node, and a new search commences again (see lines 15 to 22 in Algorithm 2). If there exists a loop with between the source node and leaf node, then a task swap loop that reduces equation (16) is determined. However, for the nonlinear WTA problem, the new assignment after exchanging the tasks along the task swap loop is potentially not improved. The weapons should calculate the new value of the objective function based on equation (1) and verify whether the new value improves when compared with the previous value. If true, then a valid task swap loop is formed (see lines 23 to 27). The improved task swap algorithm is described in Algorithm 2.

(1)	Initialize the dual variables , the reduced cost ;
(2)	Let be the set of candidate source nodes;
(3)	While do
(4)	is voted among neighbors as the source node; ; if , the assignment cannot be improved, then Algorithm 2 terminates;
(5)	Let be the set of the candidate nodes to be added to the swap chain, let be the set of the nodes that joined the swap chain (leaf nodes);
(6)	Let be the distance from node to the source node, let be ’s predecessor node; initialize , , and ;
(7)	While do
(8)	;
(9)	; ;
(10)	For each node on outgoing edge of do
(11)	If then
(12)	//This step is termed as the relaxation operation.
(13)	; set ;
(14)	End if
(15)	If and the distance is smaller than its previous value then
(16)	If then
(17)	Change the source node: ; ;
(18)	Go to step 5;
(19)	Else
(20)	Go to step 3;
(21)	End if
(22)	End if
(23)	If there exists a loop with between the source node and a leaf node then
(24)	If then
(25)	The valid task swap loop is found, and Algorithm 2 terminates.
(26)	End if
(27)	End if
(28)	End for
(29)	End while
(30)	End while

After exchanging tasks along the task swap loop, any weapon on the loop adjusts the dual variables as , where subscript denotes the index of the new assigned target to , and resets the reduced cost . It is observed that there is no dependency on the dual variables between the previous and next processes of searching swap loops. Furthermore, only the local weapons on the task swap loop need to adjust the dual variables, and other weapons do not need to adjust the dual variables. The features facilitate asynchronous searching for multiple task swap loops concurrently, and this speeds up the process of optimizing the WTA solution.

The following example is presented to further illustrate the flow of Algorithm 2. Figure 6(a) shows a survival probability matrix of each target attacked by each weapon. Figure 6(b) shows a reduced cost matrix obtained by the survival probability matrix (see line 1, Algorithm 2). Figure 7 shows the process of running Algorithm 2 to search for a task swap loop. In Figure 7(c), the distance from node on the swap chain (the shaded edge) to the source node is , and this is smaller than its previous value of 0.1 as shown in Figure 7(b). Then, we conclude that node is potentially contained on a loop with a negative weight. Based on lines 15–22, Algorithm 2, node becomes the new source node and continues to search for a task swap loop. It should be noted that, at this moment, a loop containing node cannot be considered as a valid swap loop that improves the assignment. This is because a negative weight of an edge on the loop may also result in a reduction of , while the total weight of the loop is positive and the loop does not improve the assignment. Figure 7(e) shows the shortest distance of the source node is , and the total weight of the loop is , i.e., a swap loop that improves the assignment is formed. As shown in the survival matrix (Figure 6(a)), it decreases from 1.7 to 1.6 for the sum of the survival probabilities of weapon-target assignment pairs after exchanging the tasks along the task swap loop. Moreover, the amount of reduction 0.1 is identical to the absolute value of the total weight of the loop .

(a)

(b)

(a)

(b)

(c)

(d)

(e)

(f)

4.5. Performance Analysis

The time complexity of the improved task swap algorithm to search for a single task swap loop is , where in this paper. The analysis is analogous to Dijkstra’s algorithm. In order to decrease the total survival probability of the targets, the weapons run Algorithm 2 repeatedly until no further improvements are possible. The number of times Algorithm 2 is run depends on the initial assignment. When compared with the task swap algorithm in [17], the proposed algorithm is more likely to detect a swap loop with , and this converges faster. This is because Algorithm 2 allows edges and loops with negative weight to appear on graph , and the expansion of the search space contributes to more easy detection of a swap loop with . The following simulation results demonstrate the superiority of the proposed algorithm.

Proposition 1. We assume a case where . Let denotes the weight of the edge on task swap loop, and this also corresponds to the value of the new assignment entries in the reduced cost matrix (see the square entries in Figure 6(b)). Define as , then denotes the difference between the new and old solutions, , where is obtained in equation (15).

Proof. Initialize , and then we obtain .
Node is the predecessor of , i.e., weapon passes its task to weapon , and the value of weapon becomes . Let and . We haveThus, Proposition 1 is concluded.

Based on Proposition 1, if the numbers of weapons and targets are identical, then the solution is improved by every time after exchanging tasks along the task swap loop. Finally, the solution of the objective (15) with an equal number of weapons and targets converges to the optimal solution. Unfortunately, for the same reason given in Section 3.3, it is hard to provide mathematical proof that Algorithm 2 is optimal or almost optimal for the WTA problem with the unequal numbers of weapons and targets.

5. Experiments

In this section, we verify the good performance of the redesigned auction-based algorithm, improved task swap algorithm, and hybrid algorithm of the aforementioned two algorithms. We implemented the algorithms for different numbers of weapons and tasks. We assume that the weapons network topology is fixed and complete. We randomly generated the damage probabilities from a uniform distribution in (0,1). All algorithms are implemented in MATLAB and run on an Intel Core i7 @ 3.60 GHz processor with 16 GB RAM.

Furthermore, the example videos demonstrate the validity of our algorithms implementation via semiphysical experiments and can be found at https://doi.org/10.6084/m9.figshare.7506248. The videos indicate that the algorithms can be performed feasibly in a distributed manner.

5.1. Simulation Results of Auction-Based Algorithm

For ease of description, we name this paper’s redesigned auction-based algorithm AB1 and the conventional auction-based algorithm AB2. The conventional auction-based algorithm implies that the auction-based algorithm is combined with the method of adding virtual targets in [22] such that the WTA problem is reduced to the linear sum problem without considering the cumulative effect. The performances of AB1 and AB2 are compared in terms of solution quality and time consumption. The solution quality is expressed via the mean of the optimized damage value (equation (1)) of 100 simulations. The time consumption is the mean of the runtime of 100 simulations.

The results are shown in Table 1. Table 1 shows that when compared with AB2, AB1 has a higher damage value, and its advantages become more significant when the problem size increases. For up to 120 weapons and 110 targets, the damage value obtained by AB1 is improved by 37% compared to the value obtained by AB2. Although the runtime of AB1 slightly exceeds that of AB2, they are both processed within 0.5 second for up to 120 weapons and 110 targets. The time requirement for WTA is satisfied because ground targets generally move slowly.

To present and analyze the performances of AB1 and AB2 in a more visual manner, the damage value and runtime in Table 1 are plotted in Figures 8 and 9, respectively. Furthermore, the figures reflect the robustness of the algorithms measured via the standard deviation of the damage value and runtime. As shown in Figure 8, the mean and standard deviation of the optimized damage value of AB1 both exceed that of AB2, and the difference between AB1 and AB2 increases when the problem size increases. As shown in Figure 9, AB1 performs slightly worse than AB2 in terms of time consumption.

Figure 10 further shows the performance of algorithms relative to time. Without loss of generality, only Case 4 is shown. We observe that AB1 achieves the optimal value calculated by AB2 earlier than AB2 does. From this perspective, the computational efficiency of AB1 is not worse than that of AB2.

In summary, the proposed redesigned auction-based algorithm significantly improves the solution quality of the WTA problem in a distributed manner while the time consumption is acceptable. Additionally, the proposed algorithm exhibits good performance to solve the large scale of the WTA problem with unequal number of weapons and targets.

5.2. Simulation Results of Task Swap-Based Algorithm

Three task swap-based algorithms are compared: the proposed algorithm in this paper termed as the Dijkstra with less constrained condition (DWLCC) algorithm; Dijkstra algorithm in the literature [17]; and breath-first-search (BFS) algorithm, which searches for the swap loop in a manner analogous to a BFS approach in the literature [29].

The solution quality and time consumption of the three algorithms are shown in Figures 11 and 12, respectively. All statistics are the average values of 100 simulations. The initial assignments for the algorithms are arbitrary: weapons are randomly assigned to targets. With respect to different numbers of weapons and targets, there are four cases as previously mentioned in Table 1.

As shown in Figure 11, the three algorithms almost achieve the same optimal value by the time that the loop with cannot be found anymore [17]. However, the Dijkstra algorithm has the constraint wherein the task swap loops must be searched from the edges with nonnegative weight, and a few potential task swap loops from the edges with negative weight cannot be found. Thus, under the same conditions, the solution quality of the Dijkstra algorithm is slightly worse than that of the proposed DWLCC algorithm and the BFS algorithm. As shown in Figure 12, DWLCC performs better, in terms of time consumption, than the other two algorithms. For up to 120 weapons and 110 targets, the time consumption of DWLCC is 53% less than that of the Dijkstra algorithm, and 74% less than that of the BFS algorithm.

In order to observe in a more visual manner, the performance of algorithms over time is plotted as shown in Figure 13. Without loss of generality, only Case 4 is shown. Figure 13 shows clearly that the proposed DWLCC algorithm has the best solution quality and the fastest convergence speed among the three algorithms. The proposed DWLCC algorithm can also be adapted well to a distributed scenario.

5.3. Simulation Results of the Hybrid Algorithm

The hybrid algorithm proposed in this paper is divided into two stages.(1)First, the AB1 algorithm is implemented. With respect to each weapon , the initialization inputs of the hybrid algorithm are set as the same as that of the AB1 algorithm. The outputs of the hybrid algorithm in this stage are the initial assignments, , which satisfy the constraint conditions (2)–(4). The AB1 algorithm terminates when the price of all targets does not change for iterations.(2)Second, after the AB1 algorithm, the DWLCC algorithm is implemented to refine the assignment obtained by AB1. The initialization input of the DWLCC algorithm is the assignment obtained by AB1, and the output of the DWLCC algorithm is the refined assignment.

Particularly, the number of weapons is greater than the number of targets, but the swap-based algorithm still addresses the problem in this scenario. This is because each weapon holds a “task” after the AB1 algorithm, although the “tasks” performed by different weapons may share the same target. So long as each weapon can hold a task at any time, the weapons can exchange their tasks to improve the assignment. The non-one-to-one mapping scenario (weapon-to-target) becomes one-to-one mapping scenario (weapon-to-task).

The results of the hybrid algorithm, AB1, and DWLCC are shown in Table 2. For different numbers of weapons and targets, there are four cases as previously mentioned in Table 1. All statistics are the average values of 100 simulations.

Table 2 shows that the hybrid algorithm has best solution quality. The hybrid algorithm and the DWLCC algorithm do not converge to the same optimal solution. This makes sense because the assignment result obtained by the first stage AB1 algorithm in the hybrid algorithm optimizes the initial input of the second stage DWLCC algorithm.

Table 2 shows that the time consumption of the hybrid algorithm slightly exceeds that of AB1, but it is evidently less than that of DWLCC. The weapons start moving and attacking the targets when the first stage (the AB1 algorithm) is completed. Thus, to obtain the same solution quality, the hybrid algorithm is completed as fast as the AB1 algorithm.

Thus, the hybrid algorithm is proposed to fully utilize the advantages of both AB1 and DWLCC. Compared to implementing AB1 and DWLCC separately, the hybrid algorithm exhibits better solution quality with no degradation with respect to the time consumption.

6. Conclusions

In order to solve the WTA problem in a decentralized manner, we propose a redesigned auction-based algorithm, an improved task swap algorithm, and a hybrid algorithm. First, given that the cumulative effect is not additive, we use the maximum consensus technique twice and add a judging criterion to solve the WTA problem. Simulations indicate that, when compared to the conventional auction-based algorithm, the proposed redesigned auction-based algorithm (AB1) represents significant improvements in solution quality (37% on average) while the additional running time is negligible. Second, to refine the task swap algorithm, we eliminate the restriction that the weights of the edges on graph must be positive, and the optimization process is accelerated. Simulation results indicate that, in the same conditions, the improved task swap algorithm (DWLCC) converges significantly faster to the optimal solution. The average time-savings over the Dijkstra algorithm and BFS algorithm correspond to 53% and 74%, respectively. Third, the hybrid algorithm is analyzed to fully utilize the advantages of AB1 and DWLCC. Simulations and analyses indicate that the hybrid algorithm is superior to implementing AB1 and DWLCC separately in terms of solution quality and time consumption.

6.1. Future Work

Unfortunately, all the weapons are exhausted when the assignment is completed. This scene is meaningful because small unmanned weapons are typically low cost. However, it is necessary to investigate further maximization of the total damage value while minimizing the weapon consumption in a distributed manner.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by Defense Industrial Technology Development Program of China (JCKY2017602C016).

Supplementary Materials

An example video demonstrates the validity of our algorithms implementation and can be found at . (https://doi.org/10.6084/m9.figshare.7506248 Supplementary Materials)

References

Z. R. Bogdanowicz, A. Tolano, K. Patel, and N. P. Coleman, “Optimization of weapon-target pairings based on kill probabilities,” IEEE Transactions on Cybernetics, vol. 43, no. 6, pp. 1835–1844, 2013.
View at: Publisher Site | Google Scholar
X. Hu, P. Luo, X. Zhang, and J. Wang, “Improved ant colony optimization for weapon-target assignment,” Mathematical Problems in Engineering, vol. 2018, Article ID 6481635, 14 pages, 2018.
View at: Publisher Site | Google Scholar
H. Xu, Q. Xing, and Z. Tian, “MOQPSO-D/S for air and missile defense WTA problem under uncertainty,” Mathematical Problems in Engineering, vol. 2017, Article ID 9897153, 13 pages, 2017.
View at: Publisher Site | Google Scholar
M. T. Davis, M. J. Robbins, and B. J. Lunday, “Approximate dynamic programming for missile defense interceptor fire control,” European Journal of Operational Research, vol. 259, no. 3, pp. 873–886, 2017.
View at: Publisher Site | Google Scholar
O. Karasakal, N. E. Özdemirel, and L. Kandiller, “Anti-ship missile defense for a naval task group,” Naval Research Logistics (NRL), vol. 58, no. 3, pp. 304–321, 2011.
View at: Publisher Site | Google Scholar
Z.-J. Lee, S.-F. Su, and C.-Y. Lee, “Efficiently solving general weapon-target assignment problem by genetic algorithms with greedy eugenics,” IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol. 33, no. 1, pp. 113–121, 2003.
View at: Publisher Site | Google Scholar
N. Jia, Z. Yang, and K. Yang, “Operational effectiveness evaluation of the swarming UAVs combat system based on a system dynamics model,” IEEE Access, vol. 7, pp. 25209–25224, 2019.
View at: Publisher Site | Google Scholar
G. Jia and Z. Hou, “The analysis and enlightenment about the UAV swarming project of the United States Military,” National Defense Science & Technology, vol. 38, no. 4, pp. 53–55, 2017.
View at: Google Scholar
O. Ilaya, C. Bil, and M. Evans, “Control design for unmanned aerial vehicle swarming,” Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering, vol. 222, no. 4, pp. 549–567, 2008.
View at: Publisher Site | Google Scholar
B. Xin, J. Chen, Z. Peng, L. Dou, and J. Zhang, “An efficient rule-based constructive heuristic to solve dynamic weapon-target assignment problem,” IEEE Transactions on Systems, Man, and Cybernetics—Part A: Systems and Humans, vol. 41, no. 3, pp. 598–606, 2011.
View at: Publisher Site | Google Scholar
J. Wang, P. Luo, and J. Zhou, “A memetic algorithm for constrainted weapon target assignment problems,” in Proceedings of the 2017 2nd IEEE International Conference on Computational Intelligence and Applications, pp. 182–188, Beijing, China, September 2017.
View at: Publisher Site | Google Scholar
C. Wang, G. Fu, D. Zhang, H. Wang, and J. Zhao, “Genetic algorithm-based variable value control method for solving the ground target attacking weapon-target allocation problem,” Mathematical Problems in Engineering, vol. 2019, Article ID 6761073, 9 pages, 2019.
View at: Publisher Site | Google Scholar
Y. Li, Y. Kou, and Z. Li, “An improved nondominated sorting genetic algorithm III method for solving multiobjective weapon-target assignment Part I: the value of fighter combat,” International Journal of Aerospace Engineering, vol. 2018, Article ID 8302324, 23 pages, 2018.
View at: Publisher Site | Google Scholar
H. Chen, Z. Liu, Y. Sun, and Y. Li, “Particle swarm optimization based on genetic operators for sensor-weapon-target assignment,” in Proceedings of the 2012 Fifth International Symposium on Computational Intelligence and Design, pp. 170–173, Hangzhou, China, October 2012.
View at: Publisher Site | Google Scholar
T. Chang, D. Kong, N. Hao, K. Xu, and G. Yang, “Solving the dynamic weapon target assignment problem by an improved artificial bee colony algorithm with heuristic factor initialization,” Applied Soft Computing, vol. 70, pp. 845–863, 2018.
View at: Publisher Site | Google Scholar
D. J. Bachmann, R. J. Evans, and B. Moran, “Game theoretic analysis of adaptive radar jamming,” IEEE Transactions on Aerospace and Electronic Systems, vol. 47, no. 2, pp. 1081–1100, 2011.
View at: Publisher Site | Google Scholar
L. Liu, N. Michael, and D. A. Shell, “Communication constrained task allocation with optimized local task swaps,” Autonomous Robots, vol. 39, no. 3, pp. 429–444, 2015.
View at: Publisher Site | Google Scholar
S. Öztürk and A. E. Kuzucuoğlu, “A multi-robot coordination approach for autonomous runway Foreign Object Debris (FOD) clearance,” Robotics and Autonomous Systems, vol. 75, pp. 244–259, 2016.
View at: Publisher Site | Google Scholar
A. Farinelli, L. Iocchi, and D. Nardi, “Distributed on-line dynamic task assignment for multi-robot patrolling,” Autonomous Robots, vol. 41, no. 6, pp. 1321–1345, 2017.
View at: Publisher Site | Google Scholar
R. G. Smith, “The contract net protocol: high-level communication and control in a distributed problem solver,” IEEE Transactions on Computers, vol. C-29, no. 12, pp. 1104–1113, 1980.
View at: Publisher Site | Google Scholar
D. P. Bertsekas, “The auction-based algorithm: a distributed relaxation method for the assignment problem,” Annals of Operations Research, vol. 14, no. 1, pp. 105–123, 1988.
View at: Publisher Site | Google Scholar
D. P. Bertsekas, “Auction algorithms for network flow problems: a tutorial introduction,” Computational Optimization and Applications, vol. 1, no. 1, pp. 7–66, 1992.
View at: Publisher Site | Google Scholar
M. M. Zavlanos, L. Spesivtsev, and G. J. Pappas, “A distributed auction-based algorithm for the assignment problem,” in Proceedings of the 2008 47th IEEE Conference on Decision and Control, pp. 1212–1217, Cancun, Mexico, December 2008.
View at: Publisher Site | Google Scholar
L. Luo, N. Chakraborty, and K. Sycara, “Distributed algorithms for multirobot task assignment with task deadline constraints,” IEEE Transactions on Automation Science and Engineering, vol. 12, no. 3, pp. 876–888, 2015.
View at: Publisher Site | Google Scholar
L. Luo, N. Chakraborty, and K. Sycara, “Provably-good distributed algorithm for constrained multi-robot task assignment for grouped tasks,” IEEE Transactions on Robotics, vol. 31, no. 1, pp. 19–30, 2015.
View at: Publisher Site | Google Scholar
L. Liu and D. A. Shell, “An anytime assignment algorithm: from local task swapping to global optimality,” Autonomous Robots, vol. 35, no. 4, pp. 271–286, 2013.
View at: Publisher Site | Google Scholar
T. Lemaire, R. Alami, and S. Lacroix, “A distributed tasks allocation scheme in multi-UAV context,” in Proceedings of the 2004 IEEE International Conference on Robotics and Automation, pp. 3622–3627, New Orleans, LA, USA, April 2004.
View at: Publisher Site | Google Scholar
X. Zheng and S. Koenig, “K-swaps:cooperative negotiation for solving task-allocation problems,” in Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 373–378, Pasadena, CA, USA, July 2009.
View at: Google Scholar
L. Liu and D. A. Shell, “A distributable and computation flexible assignment algorithm: from local task swapping to global optimality,” in Proceedings of Robotics: Science and Systems, Cambridge, MA, USA, July 2012.
View at: Publisher Site | Google Scholar
Z. Ma, L. Liu, and G. S. Sukhatme, “An adaptive K-opt method for solving traveling salesman problem,” in Proceedings of the 2016 IEEE 55th Conference on Decision and Control, pp. 6537–6543, Las Vegas, NV, USA, December2016.
View at: Publisher Site | Google Scholar
H. C. Thomas and E. L. Charles, Introduction to Algorithms, Massachusetts Institute of Technology, Boston, MA, USA, 3rd edition, 2009.

Copyright

Copyright © 2019 Peng Zhao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1660

Downloads

1106

Citations