Abstract

When searching for multiple targets in an unknown complex environment, swarm robots should firstly form a number of subswarms autonomously through a task division model and then each subswarm searches for a target in parallel. Based on the probability response principle and multitarget division strategy, a closed-loop regulation strategy is proposed, which includes target type of member, target response intensity evaluation, and distance to the corresponding individuals. Besides, it is necessary to make robots avoid other robots and convex obstacles with various shapes in the unknown complex environment. By decomposing the multitarget search behavior of swarm robots, a simplified virtual-force model (SVF-Model) is developed for individual robots, and a control method is designed for swarm robots searching for multiple targets (SRSMT-SVF). The simulation results indicate that the proposed method keeps the robot with a good performance of collision avoidance, effectively reducing the collision conflicts among the robots, environment, and individuals.

1. Introduction

Swarm robotics is inspired by the self-organization behavior of social animals such as ants and bees. It aims to make the system consisting of a swarm of simple agents exhibit the desired intelligent behavior with the advantages of robustness, flexibility, and scalability [1]. Then, the system is used to complete more complex tasks by the coordinated control of agents and the interaction between the agents and the environment [24].

After years of research and development, the swarm robots system has been successfully applied to complete a large number of collaborative tasks as well as the target search tasks [57]. According to the number of targets, the search problem can be divided into single target search problem and multitarget search problem. In terms of single target search, some researchers have focused on the collaborative research among individuals. For example, Ducatelle et al. [8] enhanced the interaction between individual robots by the use of local wireless communication strategies; aiming at the characteristics and limitations of the search space, Dadgar et al. [9] proposed an adaptive particle swarm optimization algorithm for robot control. Hong-qiang et al. [10] abstracted a simplified virtual-force model for swarm robots hunting problem and designed the kinematic control input of the robot based on the model. In addition, some work is also devoted to parameter optimization and system modeling. Doctor et al. [11] extended the particle swarm algorithm to the modeling of the swarm robots system, focusing on the optimization of algorithm parameters. In case of multitarget search task, the situation will be different. The swarm robots must first be divided into several subswarms through the task division strategy and then search for their respective targets collaboratively [12, 13]. According to the target signal strength detected by the robots, a packet strategy is developed by Derr et al. [14] and then the subswarms cooperatively search for their targets by the particle swarm algorithm principle. However, this approach lacks a subswarm size adjustment mechanism. Zhang et al. [15] proposed a dynamic task division strategy with closed-loop regulation for multitarget search of swarm robots, which considered the coordination of the subswarm. However, this strategy removed some advantageous individual members when adjusting the size of the sub_swarm and did not take into account of some underlying behaviors, such as collision and roaming. Manzoor et al. proposed a multirobot coordination and navigation algorithm for intercepting the long distance targets [16]. In order to improve the communication efficiency during the plan consensus, Kim et al. developed a decentralized multi-UAV task allocation algorithm and a consensus-based binding algorithm for grouping the UAVs based on their task preferences [17]. The multitarget particle swarm optimization is an effective tool for solving multitarget optimization problem [18]. In order to consider practical constraints for the collaborative search of robot swarms, Yang et al. proposed a collaborative search method for robot swarms based on constrained particle swarm optimization [19]. Ibraheem et al. combined the improved fruit fly optimization algorithm and the modified adaptive particle swarm optimization algorithm to design a novel neural network-based fractional PID controller for mobile robots [20].

In view of the above problems, the task allocation in this paper is performed based on the probability principle and target response threshold, and a closed-loop regulation strategy is introduced, including target type, target incentive intensity, and distance to communicating individual, so as to reasonably allocate the level of robot resources. For the collision avoidance problem of robots in an unknown complex environment, the simplified virtual-force analysis model of individual robot is firstly abstracted, and then it is introduced into the subswarm alliance and the roaming individuals. The input strategy of robot motion control is designed accordingly. Furthermore, a roaming searching strategy is designed, which aims to keep different search directions with two nearest neighbors and guarantee the largest search area. Simulation experiments are performed in the environment containing convex obstacles with various shapes, and the results demonstrate that the swarm robots searching for multiple targets based on the proposed simplified virtual-force model (SRSMT-SVF) can reasonably configure the robot resource level and effectively avoid collisions among the internal members of the system and between the system and the environment.

2. Model Construction

2.1. Kinematics Model and Related Functions of Mobile Robots

Considering a swarm robot system with multiple identical autonomous mobile robots, the kinematic equation of individual robot R is as follows:where V(t) and ω(t) are the linear velocity and angular velocity of the robot, respectively. And, |V(t)| ≤ Vm, |ω(t)| ≤ ωm; Vm and ωm are the maximum linear velocity and the maximum angular velocity, respectively. The linear acceleration and angular acceleration satisfy and , respectively, where am and ωam represent the maximum line acceleration and the maximum angular acceleration, respectively.

Definition 1. φi and φj denote the angles of the directional line li and lj, respectively, and φij represents the angle from li to lj [21], which is defined asThe analytical expression of the function dvgl (•) is as follows:where

Definition 2. In the search environment, the forcing functions of various objects (e.g., targets, robots, and obstacles) are defined aswhere is the velocity vector of robots at time t+ 1, indicates the gravitational interaction of the target t acting on the robot r (assuming that the movement of the robot to the target is due to the presence of this gravity), d is the distance between two points, and r is mainly used for routing optimization, and cr and cs represent the potential field radius of the robots and obstacles, respectively.

2.2. Multitarget Search Strategy

The multitarget search problem in an unknown environment mainly involves two major issues: the construction of the self-organizing task division model and the coordination of the subswarms and individuals. In addition, it entails the planning of some underlying behaviors is entailed, such as collision avoidance, collaborative search, and roaming strategy. In this subsection, the associated mathematical model is established and elaborated [10].

In an unknown complex environment, it is assumed that the ternary set includes the swarm robots , the search targets , and the static convex obstacles . In the global coordinate system XOY, the position of object p, robot, or obstacle, is denoted as . represent the radius of individual potential field, the maximum collision avoidance distance, the maximum perceived distance, and the maximum effective corresponding distance, respectively. Generally, . Therefore, the static obstacles that the robot needs to avoid can be expressed as , and the individual robot within its potential filed radius and communication range are recorded as

The perceived target is recorded as

3. Self-Organizing Multitask Division Strategy Based on Target Response

3.1. Target Incentive

The swarm robots system is composed of a large number of individual robots based on multisensor structure in this paper. The robot can perceive changes of its surrounding environment and target signals through various sensors equipped. It is assumed that the target can continuously emit perceivable signals with certain characteristics. Due to the distance difference between target and robot, the target response intensity detected by the robot will also be different. Here, the target response function is used to describe this intensity [22, 23]:where P is the constant signal power emitted by the center of the target and Iij is the response intensity of the robot Ri to the target Tj, which is inversely proportional to the square of the distance, d represents the distance between the robot and the target, d0 is the maximum perceived distance of the robot, and m and η() represent the attenuation coefficients of the target signal in the environment and the disturbance of the environment, respectively, and 0 ≤ m ≤ 1.

3.2. Multitask Division Model Based on Response Threshold of the Target

Given a multitarget search task, the swarm robot system needs to be autonomously divided into several subswarms according to the target detection, and then all the subswarms cooperatively search for the corresponding target in parallel [2426]. The overall search efficiency of the system can be greatly improved. Therefore, the primary task is to solve the multitarget problem by constructing a reasonable and effective task division model. Using the existing task division strategy based on the target response threshold, an internal descending sorting principle is introduced in this paper, which combines the target type, target response intensity, and distance to communicating individuals of subswarms. Besides, a multitask division model with closed-loop regulation is proposed. When performing multitarget search task, each target can be treated as a subtask and each robot can participate in a subtask. It is noted that the multiple robots are allowed to participate in a subtask, while a robot can only execute one subtask. Denote the target response threshold as Imin (Imin = I(d0)). If the response value of the target Tj detected by robot Ri satisfies Iij(d) > Imin, Tj will be regard as one of its candidate intentional targets by Ri. Denote P(i, j) as the probability that the robot Ri chooses the target Tj as its own target. P(i, j) can be written aswhere intentioni (i = 1, 2, …, m) is the set of alternative intention targets for robot Ri. Let Randi be a random number uniformly distributed on the interval (0, 1). If P(i, j)> Randi, the robot Ri will choose the target Tj as its own search goal. For an individual robot whose alternative intentional target set is not empty, the detected target response needs to be evaluated and the intent target needs to be determined in the same way.

3.3. Dynamic Closed-Loop and Self-Organizing Task Division Model
3.3.1. Construction of Robot Information Card

In an unknown complex environment, the individual robot can directly detect the target signals through sensors and calculate the target response intensity by equation (10). The directly detectable targets are classified as target I. Meanwhile, the robots can share information such as the target perception through local communication, which expands the cognitive range of individuals. Therefore, the robot can obtain the cognitive information of targets by interacting with other robots within the communication range, and such targets are classified as target II. Obvious, target I has priority over target II. According to the cognitive information acquired by the robot and the state of the robot itself, the structure of individual robot information card is defined as shown in Table 1.

As shown in Table 1, the robot information card includes robot number, search target, task set, target response incentive, communicating individual, distance to the communicating individual, intentional target, and individual state. Since the robot can simultaneously perceive multiple target information and communicate with multiple robots, some elements in the individual robot information card are embodied in the form. If the robot has no the cognition of a certain part, the corresponding remark is expressed as . The individual state information refers to current search state of the robot, including roaming state (indicated by 0) and collaborative search state (indicated by 1).

3.3.2. Formation of Subswarms

Once the intentional target is determined, the robot will carry out a self-organizing task division on the individual level, aiming to guarantee that the robots with the same intentional target collaboratively perform the local search in the form of subswarm alliance, and then the system can execute the task in parallel. When the subswarm is formed, the subswarm information card is defined as shown in Table 2, which can be used to facilitate the acquisition of information within subswarms and the interaction between the subswarms.

Subswarm information card includes its number, subswarm target, leader, member information, size, recruitment number (recruit number), and optimal records. For the part with noncognition, it is denoted as . As the external speaker of the subswarm, the leader is the most dominant individual and the target of the nearest robot in the subswarm. In this paper, the upper limit of subswarm size is Nmax = 6, and M robots can be recruited to a subswarm, where M = Nmax − N and M ≤ Nmax. In addition, each subswarm holds its own optimal record table, including the optimal information of its searching target and other I targets (called alternative targets) acquired by the subswarm during the search process. The noncognition part in the table is denoted by “—.”

3.3.3. Evaluation of Subswarm Scale and Closed-Loop Regulation

Based on the target response threshold and probability principle, the task division model is established, and a closed-loop adjustment strategy is proposed: when performing task division, the level of robot resource allocation is evaluated for the subswarms that exceed the upper limited size. And, the evaluation results are taken as the negative feedback of the task division model to regulate the migration of some robots between different subswarms.

This strategy can effectively handle the serious imbalance of the resource allocation in the swarm robots system, so that the robot resources are rationally and effectively configured. As shown in Section 3.3.1, the targets are classified as type I and type II, and the priority of type I is higher than that of type II. In order to assess whether the size of the subswarms exceeds the upper limit size, it is necessary to prioritize their internal members. Firstly, the internal members of the subswarms are sorted according to the categories of their targets. The position of members with the target type I is superior to those with the type II target. If the target category is the same, the descending sorting of the members is performed according to the target incentive intensity. For robots with the same type II target, they obtain the target information by communicating with the same neighborhood; if the strength of the target incentive is the same, they can be sorted according to the decline of the distance to the communicating individuals. The details are shown in Table 3.

As shown in Table 3, the task type for R1, R5, R7, and R8 is type I and the task type for R2, R3, R4, and R6 is type II. According to the above ranking rules, the dominant position of the robots with type I targets should be higher than that of the individuals with type II targets. In case of the same task target type, the higher the task incentive priority, the higher the dominant position. Hence, R1 has the greatest advantage in the subswarm. For individuals with the same type II target and the same target incentive, the closer the distance to the communicating robot, the more advantage they have. Therefore, R2 has an advantage over R3. Since the upper limit of the subswarm size is set to 6 robots, R4 and R6 do not enter the top 6. They will be out of the subswarm and then trigger the punishment mechanism; then, they cannot participate in the task division for a period of time. After the punishment, R4 and R6 can still join the subswarm. Via the withdrawing the union and the second-join mechanism, the dynamic migration of several robots between different subswarms is realized and the robot resources can be allocated reasonably.

4. Searching Algorithm

Figure 1 shows the flowchart of multitarget search algorithm for swarm robots in an unknown complex environment.

4.1. Simplified Virtual-Force Model

In the global coordinate system XOY, it is assumed that the robot Ri can sense the local information of the two nearest neighbors p1 and p2, which may be individual robots and static convex obstacles, through the sensors. As shown in Figure 2, the coordinate position of the robot at time t is Ri (xi, yi) and the position vector dip1, dip2 are defined as

When moving to at the speed of , robot Ri will be subjected to the gravitational effect of p0 and the repulsive effects of p1 and p2, which are denoted as , , and , respectively. The directed angles from , , and to x-axis positive direction are recorded as , , and , respectively. is the resultant force of the repulsive force, which is received by the robot in a direction perpendicular to the desired direction of the motion, and represents the directed angle from to the positive direction of the x-axis. is the resultant force of Ri.

At the time step t, the resultant force of the robot Ri represents its actual demand speed, which can be defined aswherewhere is the force function calculated by equation (5), acting on the next position of the robot. and are the force functions produced by p1 and p2, respectively, which are calculated by equations (6) and (7). is the deflection angle of and , so is . If is directly equivalent to the velocity perpendicular to the direction of the movement at next step, , .

4.2. Extended Particle Swarm Optimization

The mathematical expression of the swarm robots system model based on the extended particle swarm algorithm [27, 28] is as follows:where Xif(t) and Vif(t) denote the position and velocity of robot Ri at time step t, respectively, and Vie (t + 1) is the desired speed of the robot at next time step, c1 and c2 are the learning factors (usually defined as cognitive coefficient and social coefficient), r1, r2 [0, 1] are random values, and is the inertia weight. The experience of robot Ri in terms of its own best position thus far are recorded as , and the position of the best robot in the entire system is denoted as . is introduced to represent the motion of inertia, and is a stride control factor for the embodiment of consequent control.

4.3. Controlling Input of Individual Robot

Based on the simplified virtual-force model (SVF-model) and extended particle swarm optimization (PSO) algorithm, a control input strategy of individual robots is developed in this paper. The actual demand speed and direction of robot Ri are denoted as and , respectively, and d is the distance to the nearest object. cm is the maximum collision avoidance distance, indicating that the robot has began to enhance avoiding. According to the different states of the robot Ri, the motive control input is presented as follows.

If Ri is in the roaming state, the motive control input can be written aswhere is determined by the program in Figure 2 and is the maximum running speed of the robots. Besides, a strategy is designed for the roaming individual: keep driving at the maximum speed and the direction of the robot should be different from its two neighbors, so as to maintain the maximum search area with its two neighbors.

If Ri is in a cooperative search state, the motive control input can be written aswhere is calculated by equation (15).

4.4. Steps of Search Algorithm

According to the model of multitarget search problem in Section 2 and the individual motion input equations in equations (16) and (17), Figure 3 shows the steps of multitarget search algorithm for swarm robots based on the simplified virtual-force model in an unknown complex environment.

5. Simulation

In order to verify the performance of the proposed method, the search experiment of 7 targets is performed on the swarm robots with different sizes ranging from 30 to 80 in steps of 10 in the same search environment. At the same time, the trajectory of the swarm robots with a size of 30 is recorded, and the collision avoidance performance and search performance are analyzed. The system parameters are summarized in Table 4.

5.1. Simulation Results

Here, a simulation experiment is conducted on the swarm robot system. The proposed SRSMT-SVF method and the extended particle swarm optimization algorithm (EPSO) [11] are employed to solve the randomness of the algorithm, and the two strategies with different sizes are repeated 30 times in the same search environment, respectively. Figure 4 compares the average time of the two tasks.

In order to verity the collision avoidance performance and search performance of the proposed algorithm, the simulation results of a swarm robot system with a population size of 30 are recorded. As shown in Figure 4, the black solid line represents the edges of various convex obstacles in the environment; the solid circle represents the targets to be searched; the individual robot is represented by a hollow circle, and the corresponding number is its serial number. Moreover, the motion direction of the robot is also specified, and the subswarm concluded during the search is denoted as SubT, where T represents the number of the targets. The members of subswarms are joined by a straight line, and the other unconnected individuals indicate that they are in a roaming state.

As shown in Figure 5(a), the swarm robots are initialized in the rectangular region of 20∼80 units, and all robots are in a roaming state at the beginning of the search task. In the case of T = 69, after completing the self-organizing task division on the subswarm level, individuals numbered 22, 7, 19, 29, and 3 have successfully formed Sub1 as shown in Figure 5(b). Accordingly, the search status of the subswarm members change from the roaming state to the cooperative state, while other individuals fail to form a subswarm and continue to search using the roaming strategy.

In the process of searching for target 1, other roaming robots still need to perform the self-organization task division according to the detecting situation of the targets. Moreover, the individuals targeting the same target will form subswarms and search in parallel, greatly improving the search efficiency. In the case of T = 91, the individuals numbered 18, 8, 26, 27, 16, and 28 form Sub3, which searches in parallel with Sub1, and the details are shown in Figure 5(c).

Figure 5(d) shows the case with T = 106, the members of Sub1 declare that the target has successfully reached, and their status changes to the roaming state correspondingly. Then, the subswarm is disbanded. In addition, the robot numbered 19 can successfully avoid the triangle convex obstacles in the environment.

In the case of T = 135, the Sub3 has successfully searched for target 3, and its search process is shown in Figure 5(e). It can be seen that the members can successfully avoid the polygonal shape of convex obstacles in the environment. When T = 338, individuals numbered 21, 7, 19, 22, and 30 form Sub 5 and reach the last target 5. The search process is shown in Figure 5(f). So far, all the targets in the environment have been successfully searched.

5.2. Result Analysis

Observing the entire search process of the swarm robots, it can be seen that the proposed multitask division model with closed-loop regulation strategy can greatly improve the configuration level of robots resource. Meanwhile, the size of subswarm is guaranteed to be within the upper limit. The simulation results indicate that the size of the formed subswarm does not exceed the upper limit in the search process. As shown in Figures 5(d)5(f), via the collision avoidance strategy and the simplified virtual-force model, the individual robot can successfully avoid convex obstacles with various shapes in the environment during the search process. In order to check the collision avoidance between robots, Sub3 subswam is taken as an example to analyze the distance between the two members in the entire search process, as shown in Figure 5. Sub3 includes individuals numbered 18, 8, 26, 27, 16, and 28, which are referred to as 1, 2, 3, 4, 5, and 6 for convenience. The distance from the robot m to the robot n is recorded as dmn. For example, d12 represents the distance from the robot 18 to the robot 8.

As shown in Figure 6, during the entire collaborative search process, the members of Sub3 can maintain the maximum collision avoidance distance (safety distance) above 20. When the distance between individuals is less than the maximum collision avoidance distance, individual can quickly adjust to a safe distance. The inspection of the d46 deviation analysis indicates that, when t = 23 and t = 38, if the distance between the robots is less than or equal to 20, it can be quickly adjusted to a safe distance at the next moment. The simulation results demonstrate that the individual control input based on the simplified virtual-force model can make the robot strictly keep a safe distance from other objects (individual robots or obstacles) in the environment in the process, and the system can successfully search for all targets.

6. Conclusion

This paper aims to handle the issue of multitarget search of swarm robots in an unknown complex environment, and its emphasis is put on the task allocation model and individual collision avoidance. Based on the probability principle and target response threshold, the closed-loop regulation strategy with subswarm scale evaluation is introduced into the allocation model. After a subswarm is successfully established, the members are sorted according to the target type, target response intensity, and distance to the communicating individual. The lower-level members will withdraw from the subswarm and participate in the search of other subswarms, so that some robots can migrate between different subswarms and the robot resources can be reasonably configured. For the problem of robot collision avoidance, a simplified virtual-force model is established in this paper, whereby the motion control method of the robot is designed. Since the individual only needs to consider the information of two neighbors, reasonable motion input can be provided to effectively avoid collision with various convex obstacles and other individuals. Therefore, to a certain extent, the complexity of calculation and various collisions of the system are reduced.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was supported by the Special Project of Engineering Research Center (item no.: Lgy18gz006).