Abstract

With the increasing application of unmanned aerial vehicles (UAVs), UAV-based base stations (BSs) have been widely used. In some situations when there is no ground BSs, such as mountainous areas and isolated islands, or BSs being out of service, like disaster areas, UAV-based networks may be rapidly deployed. In this paper, we propose a framework for UAV deployment, power control, and channel allocation for device-to-device (D2D) users, which is used for the underlying D2D communication in UAV-based networks. Firstly, the number and location of UAVs are iteratively optimized by the particle swarm optimization- (PSO-) Kmeans algorithm. After UAV deployment, this study maximizes the energy efficiency (EE) of D2D pairs while ensuring the quality of service (QoS). To solve this optimization problem, the adaptive mutation salp swarm algorithm (AMSSA) is proposed, which adopts the population variation strategy, the dynamic leader-follower numbers, and position update, as well as -learning strategy. Finally, simulation results show that the PSO-Kmeans algorithm can achieve better communication quality of cellular users (CUEs) with fewer UAVs compared with the PSO algorithm. The AMSSA has excellent global searching ability and local mining ability, which is not only superior to other benchmark schemes but also closer to the optimal performance of D2D pairs in terms of EE.

1. Introduction

Integrating unmanned aerial vehicles (UAVs) into the fifth generation (5G) and beyond cellular networks is a promising technology. UAVs can be used as aerial communication platforms to assist ground communications, such as traffic offloading, recovery after natural disasters, emergency response, and Internet of Things (IoT) [1]. To solve the problem of wireless connectivity for devices that are without infrastructure coverage, such as mountainous areas and isolated islands, especially interruption of communications caused by disasters, many efforts have been made to study wireless communication with UAVs [28]. Compared with terrestrial communications, UAVs are generally easier to deploy, more flexible to reconfigure, and better in communication channel performance [2]. Meanwhile, [3] studies the measurement and models of air access channels between UAVs and base stations (BSs) in typical urban and rural macrocell scenarios, which is helpful to develop more reliable and more efficient communication technologies for UAVs to access the territorial cellular networks. On the other hand, the authors of [4] focus on maximizing the coverage of UAV-based BSs under the constraint of the transmitting power. In [5], a unified framework for a UAV-assisted emergency network in disasters is established. Furthermore, there is also literature combining UAVs with full-duplex mode, 6G, and other technologies. A method of uplink and downlink transmission resource allocation for a UAV-assisted full-duplex nonorthogonal multiple access system is proposed in [6], which guarantees the QoS of users and minimizes power consumption. In [7], the problem of multisource destination pair relay of UAV based on full-duplex is studied. In the aspect of user scheduling based on time division multiple access (TDMA), UAV dynamic trajectory, and UAV transmission power, the authors propose and solve a joint optimization problem to maximize the swallowing performance of the network. However, this paper does not study the mobility of ground users. In [8], using low-altitude platforms (LAPs) and high-altitude stations, the authors propose a 6G heterogeneous Internet of Things based on XAPS and a multi-UAV cluster nonorthogonal multiple access systems. By optimizing spectrum allocation, power control, horizontal coordinates, and hovering height of UAV, the uplink accessibility and speed of the whole system can be maximized. However, existing literature has rarely investigated the UAV deployment problem and the power control and channel allocation of D2D pairs in a D2D-abled UAV-based system. The purpose of this paper is to study a scenario lacking ground BSs (GBS), such as mountainous areas, isolated islands, and disaster areas. Several UAVs that served as aerial BSs can provide communication links for areas lacking infrastructures with the help of D2D communications. However, in these emergencies, reasonable UAV deployment can reduce the interference and improve the communication quality of users; energy-efficient schemes are also required along with improving the quality of communication services.

At present, according to different optimization objectives, many scholars have proposed a variety of deployment methods of UAVs, such as minimizing the number of UAVs, minimizing deployment delay, and maximizing coverage range. To reduce the deployment delay and improve efficiency of UAVs, [9] proposes a rapid UAV deployment scheme based on bandwidth resources. By using the improved Kmeans algorithm, the goal of minimizing the number of UAVs and deployment delay is achieved, and the satisfaction of users is improved. However, the heights of UAVs are just estimated in [9]. In [10], the authors not only consider the optimization of horizontal positions but also optimize the height deployment of UAVs and propose a 3D (three-dimensional) deployment method of UAVs based on particle swarm optimization. In [11], three basic deployment designs of UAVs, namely, the minimum number of UAVs, the optimal deployment location, and the optimal transmission power allocation, are studied in combination with the instantaneous signal to the signal-to-interference-plus-noise ratio (SINR) transmitting power allocation, and the particle swarm optimization- (PSO-) based scheme is proposed. Aiming at the optimization of task completion time and overall energy consumption of multiple UAVs in resource-constrained unsaturated postdisaster areas, [12] studies the problem of data dissemination of multiple UAVs. However, the above literatures do not consider the situation that the locations of users are unknown. The related algorithms based on machine learning methods solve this problem well. The authors of [13] propose a 3D deployment method and 3D dynamic motion acquisition method of UAVs based on -learning. As an agent, each UAV makes its own 3D positioning decision through trial and error learning. In [14], a geographic location information learning algorithm is proposed, which avoids accurate channel calculation, saves a lot of time, fills the blank of quickly establishing multiple UAVs to serve multiple users, solves the problem of how to find suitable base station location for multiple UAVs as soon as possible, and achieves the goal of maximizing the total downlink rate of the communication network. [15] utilizes game theory and machine learning methods to deploy the 3D position of UAV to compensate for interruption and overload of the network. This method does not need to know all the information of the system and a large amount of network information exchange. In order to learn and adapt to time-varying user activities, the authors of [16] study the deployment of dynamic UAVs. Without knowing the users’ time activity distribution parameters, UAV can update the belief about the activity probability of undiscovered users according to the visit history. Finally, the -learning algorithm is applied to develop effective deployment strategies.

Many scholars have studied UAV-aided networks. In [17], the authors propose an energy-efficient multi-UAV coverage model based on spatial adaptive strategy. In [18], the authors try to maximize the energy efficiency (EE) of UAVs by designing the trajectory path of UAVs, considering both throughput and propulsion energy consumption of UAVs. Literature [19] studies the joint optimization of the UAV track, aimed at minimizing the total energy consumption of each node under the constraint that the estimated mean square error is less than the target threshold. To solve the problem of resource allocation and trajectory design for multi-UAV-aided cellular networks with unreachable user location and channel parameters, the authors of [20] propose a multiagent reinforcement learning solution to optimize the overall throughput and fairness throughput. With the exponential growth of data traffic, the use of caching and D2D communication has been recognized as an effective approach for mitigating the backhaul bottleneck in UAV-assisted networks in [21]. The authors of [22] consider a scenario that UAV-BS replaces destroyed traditional GBSs when a natural disaster occurs; a large-scale UAV is deployed to provide wireless service to ground devices with the support of D2D-enabled links. However, one UAV can only serve a small limited range of signal coverage. Meanwhile, in [23], the optimization of power control for D2D communication underlying UAV-assisted access systems is investigated, but channel allocation is not considered. In order to improve the above deficiencies, in this paper, a D2D-abled UAV-based system with multiple UAVs is considered, and the UAV deployment and D2D energy efficiency are optimized in two steps.

To date, existing literature has investigated the UAVs’ deployment and the D2D pairs’ EE on an underlay D2D-enabled UAV-based network. The main contributions of this paper are as follows. (i)We aim to minimize UAVs’ number and deployment to achieve a given SINR requirement, by using the particle swarm optimization- (PSO-) Kmeans algorithm. Under the condition of not exceeding the coverage of UAVs, the less quantity and better deployment of UAVs make cellular user (CUE) less interference while better communication quality(ii)After determining the deployment of UAVs, we improve EE of D2D pairs while considering QoS constraints in a D2D-abled UAV-based system, which is a complicated NP-hard problem(iii)By introducing the population variation strategy and the leader-follower adaptive adjustment strategy, the adaptive mutation salp swarm algorithm (AMSSA) is proposed to solve the optimization problem, such that the variation of nonoptimal individuals can improve the diversity of the population and make the iteration easier to achieve global optimization. Meanwhile, the adaptive leader-follower numbers can have a better balance between exploration and development capabilities, and the adaptive weights updating strategy makes it easier for AMSSA to jump out of the local optimum. Furthermore, the -learning strategy enables the salps to choose the individuals to follow, thus refusing blind following(iv)Finally, we propose an efficient iterative resource allocation algorithm; the problem of optimizing D2D pairs’ EE can be solved by the proposed AMSSA

The rest of this paper is organized as follows. In Section 2, the system modeling and the interference modeling are introduced. In Section 3, we analyse the UAV deployment problem and introduce the PSO-Kmeans algorithm. The model and proposed method of power control and channel allocation are given in Sections 4 and 5, respectively. Simulation results are shown in Section 6, and conclusions are drawn in Section 7.

2. System Modeling

In this section, the system modeling and the interference modeling of CUEs and D2D pairs are illustrated in Subsections 2.12.3, respectively.

2.1. System Modeling

The system model is shown in Figure 1. In some remote areas and extreme cases, such as mountainous areas and isolated islands where there is no GBS, as well as disaster areas where GBSs are out of service, the rapid provision and recovery of communication are significant for people’s production, life, and disaster relief. Therefore, in this paper, several UAVs are deployed as aerial BSs to provide temporary communication connections for users, so as to solve the communication interruption caused by GBS absence. In a short time, we assume that UAVs are still in the air, which can provide stable communication for users within the coverage area. We consider reusing uplink resources because reusing downlink resources is more difficult and less effective than reusing uplink resources in a cellular network [24]. Therefore, this paper focuses on the resource allocation of uplink, and the related research of downlink will be studied in the future.

We consider an interference-limited D2D system for a single-cell uplink scenario in which the ground base station does not exist. The UAVs are temporary aerial BSs. The users are distributed randomly in cells. Assume that each UAV is stationary in a short time; in this case, each UAV-BS controls several CUEs and surrounding D2D pairs, and each CUE or D2D pair can only be controlled by one UAV-BS. The cellular transmissions under the control of a UAV-BS are orthogonal while CUEs in disparate UAV-BSs can reuse the same channels. Similarly, a D2D pair can reuse only one CUE channel of the same UAV, and one CUE channel can be reused by multiple D2D pairs. The UAV-BS contains the complete information of the instantaneous channel state information of links to all UEs, which are assumed to be independent block fading, and each receiver knows the fading characteristics of only its own communication link, whether it is CUE or D2D [22].

2.2. Interference Modeling of CUEs

In our system, the channels that CUEs covered by the same UAV are mutually orthogonal, so there is no interference between them. When CUEs are served by different UAVs as well as multiplex the same channel, there will be interference. Figure 2 shows the interference of CUEs to UAVs.

2.3. Interference Modeling of D2D Pairs

As described in [25], we analyse several complicated interference scenarios in the system. Figure 3 shows four different interference scenarios to D2D receivers (D2DR); those are intracell interference of CUEs towards D2DR, intercell interference of CUEs towards D2DR, intracell interference of other D2D transmitters (D2DT) towards D2DR, and intercell interference of other D2DTs towards D2DR. (1)Intracell Interference of CUEs towards D2DR. As shown in Figure 3(a), in a UAV-BS, the CUE creates interference towards D2DR, which reuses the same channel(2)Intercell Interference of CUEs towards D2DR. As shown in Figure 3(b), this kind of interference, towards D2DR, is generated by CUEs in different UAV-BSs(3)Intracell Interference of D2DTs towards D2DR. As shown in Figure 3(c), the interference is generated by other D2DTs, which reuse the same channel of D2DR in the same UAV-BS(4)Intercell Interference of D2DTs towards D2DR. As shown in Figure 3(d), this scenario shows the intercell interference where D2DTs comes from other UAV-BS interference with D2DR

3. Minimum Number and Optimal Location Deployment Strategy of UAVs

3.1. Problem Model

In our scenario, and index the th CUE and the th UAV, respectively. Otherwise, CUEs covered by the same UAV are orthogonal to each other, and there is no interference, but there is interference between CUEs, which are served by different UAVs while using the same channel.

We assume is theth CUE that is covered by th UAV . 3D locations of and are donated by and , respectively. Thus, the horizontal distance between and is defined as

The vertical distance between and is given by

We consider a UAV-assisted emergency communication network. Most countries stipulate that the UAV flight height is less than 150 meters. Therefore, the path loss channel model proposed in [26] is particularly suitable for the system in this paper. LAPs are below the stratosphere, are easy to be deployed, and are consistent with the broadband cellular concept, because low altitude combines coverage advantages with limited cellular radius [26].

As described in [11, 26], the path loss channel model of LAPs can be written as where is the elevation angle between and , which is defined as Moreover, , , , and are the environment parameters, is the transmitted radio frequency, and is the speed of light.

We focus on the UAV deployment optimization problem, and the small-scale fading is not considered.

Furthermore, the instantaneous signal-to-interference-plus-noise ratio (SINR) of can be expressed as where

means the CUE association variable (i.e., if the th CUE and the th CUE use the same channel, ; otherwise, ). means the CUE-UAV association variable (i.e., if the th CUE in the th UAV, ; otherwise, ). and represent the transmit power of and , respectively. is the aggregate power of noise.

Our UAV deployment problem is aimed at minimizing the number of UAVs and optimizing their locations. Combining (1)–(7), we can formulate the problem as follows: where in (8a) is the minimum limit of the minimum SINR. Equation (8b) ensures that any two UAVs do not hover in the same position, thus effectively avoiding UAV collisions. Equation (8c) limits one CUE to only one UAV. Equation (8d) ensures that the locations of UAVs must be in the feasible area.

This UAV deployment problem is nonconvex and is hard to find the optimal solution. Thus, the PSO-Kmeans algorithm is used to solve the suboptimal solution of the optimization problem.

3.2. PSO-Kmeans Algorithm

The particle position of the PSO algorithm is random. Therefore, this paper first uses the Kmeans algorithm to generate the initial particle positions, which makes the PSO algorithm have good initial particle positions, which is beneficial to further improving the iterative performance.

Firstly, the positions of CUEs are randomly selected as the initial positions of clusters. We define the clusters as . The horizontal distance between and the centroid of the th cluster is defined as

Find the centroid which is the closest to and attribute to the corresponding cluster , and update .

The centroid is updated by

Finally, Equations (9) and (10) are repeated several times until all the centroids are no longer changed, and the initial position of the first-generation UAV is obtained, which is input into the scheme based on the PSO algorithm. The UAV deployment scheme based on the PSO-Kmeans algorithm is shown in Algorithm 1.

read, ;
Initialize positions of UAVs by using Kmeans;
whiledo
Initialize positions of UAVs by Kmeans;
whiledo
  fordo
   
  ifthen
    
    
  end if
end do
  ifthen
   
   
  end if
end while
end while
print;
3.3. Complexity Analysis

At first, the initial deployment of UAVs is obtained by clustering with the Kmeans algorithm. Assuming that there are particles in the PSO algorithm, the Kmeans algorithm needs to be repeated for times to get groups of UAV initial deployment as the initial position of particles. Thus, the time complexity of times Kmeans algorithm is , where means the number of sample points (i.e., the quantity of CUEs), means the number of UAVs, and is the iterations. In addition, the time complexity of the PSO algorithm is . Thus, the time complexity of the PSO-Kmeans algorithm is .

4. Power Control and Channel Allocation of D2D Pairs

In this section, we model the energy efficiency optimization problem of D2D pairs when determining the deployment of UAVs.

Let index the th channel. In the th UAV-BS, let and denote receiver and transmitter of the th D2D pair , respectively.

The intracell interference towards is given by where means the set of D2D pairs in the th UAV-BS. is the channel state indicator variable (i.e., if shares the same channel with , ; otherwise, ). and denote the transmit power for (i.e., transmitters of other interfering D2D pairs in the same UAV-BS) and the CUE at the same channel in the th UAV-BS, respectively. and represent the distance between and , and CUE, respectively. and are the channel gains, regardless of the link being cellular or D2D, and denotes the path loss exponent.

The intercell interference towards is given by where means the set of D2D pairs in the th UAV-BS. represents the channel state indicator variables between and . It should be emphasized that and are in two different UAV-BSs. and denote the transmit power of and CUE that shares the same channel but in another UAV-BS. and are the channel gain, regardless of the link being cellular or D2D, and denotes the path loss exponent.

Furthermore, SINR of can be expressed as where denotes the transmit power of and is the propagation distance of D2D pair . represents the channel gain and denoting the power spectral density of the additive white Gaussian noise (AWGN) at the receivers.

Therefore, the spectral efficiency (SE) (bit/s/Hz) of is given by

The total power consumption of , which concludes transmit and circuit power, is given by where is the circuit power of D2D pair . The circuit power needs to be multiplied by 2, because both and must be taken into account.

The EE of D2D pairs (bit/J/Hz) is given by

Our objective is to maximize the EE of all D2D pairs in the system. Combining (11)–(16), we formulate the joint power control and channel allocation problem as an optimization problem as follows: where in (18a) is the minimum SE of , which satisfies the QoS requirement. Equations (18b) and (18c) mathematically model our assumption that one D2D pair can only reuse one channel. Equation (18d) ensures that the transmit powers of cannot go beyond the maximum limit.

We define as the channel allocation matrix for all D2D pairs, in the same way, as the power control matrix. This optimization problem is a complicated NP-hard problem, and no efficient polynomial-time solutions exist, as the complexity may increase. Therefore, we propose the quantity-weight adaptive salp swarm algorithm to solve the problem.

5. Adaptive Mutation Salp Swarm Algorithm

In this section, firstly, we introduce the salp swarm algorithm (SSA) and the adaptive salp swarm algorithm (ASSA). Then, we propose the adaptive mutation salp swarm algorithm (AMSSA), which is aimed at solving the power control and channel allocation problem.

5.1. Salp Swarm Algorithm and Adaptive Salp Swarm Algorithm

In [27], the authors propose the first model of the salp chain. The salp chain can be divided into two groups: leader salp and follower group. The leader salp is at the front of the chain, while the other salps are regarded as follower salps.

A two-dimensional matrix is defined as the position of all salps, and an -dimensional search space includes a food source .

The leader salp position updates can be expressed: where . is the position of the leader in the th dimension. is the food position in the th dimension. represents the upper bound of the th dimension, and represents the lower bound of the th dimension. , , and are the constants of the interval , respectively. Meanwhile, they dictate the next position in the th dimension as well as the step size.

Furthermore, the update positions of follower salps are defined as where and shows the th follower position in the th dimension.

Thus, the salp chain can be simulated by (19) and (20).

It can be seen that there are two optimization matrices for the power control and channel allocation problem. Meanwhile, the channel allocation matrix is discrete, while the power control matrix is continuous. Thus, the continuous SSA cannot be directly used to solve the optimization problem of power control and channel allocation for maximum energy efficiency in the D2D-abled UAV-based system. The authors in [28] propose the ASSA, which can solve the above problem.

The leader salp position updates can be described as where and are the leader salp position. and are defined as the current optimal channel allocation and power control matrix in the iterative process, respectively. , , and are the same as , , and in SSA. represents the maximum power constraint.

The follower salp positions are defined as where and are follower salp positions. and are initial weight and final weight, respectively. represents the fitness function, which is used to evaluate the current positions of follower salps.

However, the number of leaders and followers is fixed, which limits the searching abilities of these two algorithms. In order to overcome the above shortcomings and further improve the capability of global searching and local mining, we propose the AMSSA scheme.

5.2. Adaptive Mutation Salp Swarm Algorithm

We introduce the population variation strategy and the leader-follower adaptive adjustment strategy based on the ASSA. Firstly, the randomly generated locations of salps are mutated by the population variation strategy. Then, leader-follower adaptive adjustment strategy is used to update the position of the variation salps. Finally, the -learning strategy is considered to allow salps to gain autonomous following ability, which enables the individual of the salps to choose followers independently and avoid falling into local optimum due to blind following.

When the population of the salp group is fixed, there are more leaders and less followers in the initial iteration, which means AMSSA focuses on the global search. As the iteration continues, the number of leaders decreases, and the number of followers increases. In the later stage of iteration, more followers make the AMSSA focus on local search, so the AMSSA is hard to fall into local optimum. (1)Population Variation Strategy. In each iteration, the randomly generated power control matrix is mutated by where represents the power control matrix after mutation, means the current number of iterations, and ; and represent any two power control matrices in the current iteration, and . means the success rate of the th iteration and is defined as where means the salp number and represents the success value.If the fitness value of the th salp in the th iteration is greater than that in the th iteration, the th iteration of the th salp is considered to succeed. Thus, the success value is defined as (2)Leader-Follower Adaptive Adjustment Strategy. The calculation formula of the proposed leader-follower number is where means the weight and is calculated by where is the scale factor of leader-follower and and represent the current iteration and maximum iteration, respectively. is the perturbation deviation factor, and is an interval of .The leader salp positions are updated by where , and are leader salp positions. is defined as the current optimal power control matrix in the iterative process. , and are intervals of , respectively. represents the maximum power constraint.The follower salps’ positions are defined as where , and are follower salp positions. and is salp position that salp chooses to follow according to the -learning strategy. and are weight value, which and . is represented for the fitness function that .(3)-Learning Strategy. -learning strategy includes state, action, reward, and value. The current position of the salp (whether it is a follower salp or a leader salp) is the state , and the current positions of all salps constitute the state space . The movement of the salp following another salp is an action , and all the actions of the salp swarm constitute the action space . The maximum value is the strategy of salp action selection. Following the action, the state of each salp transitions to a new state receives a reward . (i)Agent. Salp .(ii)State. The current position of the salp .(iii)Action. The movement of the salp follows another salp . means the salp can follow the salp ; enotes the salp cannot. Meanwhile, a leader salp is only allowed to choose another leader with the highest value to follow.(iv)Reward. The reward function is formulated by the fitness function . If the action that the salp carries out can improve the current maximum fitness value of the salp swarm, the action gets a -point reward. If the action that the salp carries out can improve the fitness value compared with the last iteration, the action gets a -point reward; the agent receives a -point reward if otherwise. Thus, the reward function can be expressed as (v)Value. The value function is defined as long-term return, which can be regarded as a cumulative reward. The value function can be expressed as where means the discount factor.

The AMSSA algorithm is shown in Algorithm 2.

read
whiledo
 update randomly and generate corresponding to
 f ordo
  
  
 end do
fordo
  
  
  
  
 end do
fordo
  
ifthen
  
  
  
  end if
end do
ifthen
  
  
  
 else
  
  
  
 end if
end do
print, and

6. Simulation Results

In this section, simulation results verify the theoretical analysis and the effectiveness of the proposed approaches. In the simulations, a community of 1000 square meters without GBSs is considered. Therefore, UAV-BSs are used to provide communication for users in the cell. In addition, assumed that in a short time, the UAV-BSs are in a static state and can provide communication for users within the signal coverage range stably. D2D pairs are generated by the distance between two CUEs (i.e., if the distance between two CUEs is less than a constant value, the two CUEs are carried out D2D communication). Firstly, we use the PSO-Kmeans clustering algorithm to minimize the UAV number in our system and deploy the UAVs’ positions. Moreover, the air to ground models described in [11] are used to model the path loss with , , , and . Secondly, AMSSA is used to solve the D2D energy efficiency maximization scheme in the D2D-abled UAV-based system. Default parameters in the simulations are given in Table 1.

Figure 4 shows the change of the minimum SINR of CUEs in the system with the increasing number of UAVs. As can be seen from Figure 4, to reach the SINR threshold , UAVs need to be deployed by the PSO-Kmeans algorithm, but more UAVs need to be deployed through the PSO algorithm. Under the same distribution of CUEs and the same number of UAVs, the PSO-Kmeans algorithm can find better UAV deployment than that of the PSO algorithm, which makes CUEs get better communication quality. Compared with the random initialization, Kmeans clustering initialization can get a better initial point, which leads to the deployment of fewer UAVs to achieve higher SINR. With the increase of deployed UAVs in the system, the minimum SINR of CUEs increases. When the total number of CUEs is constant but the number of UAVs increases, the number of CUEs covered by each UAV decreases, and each cellular user can allocate more transmission power. The rate of transmission power increase is greater than that of interference increase, so the minimum SINR increases and the communication quality of CUEs improves.

Figure 5 shows the 3D position update process of seven UAVs with the SINR threshold . We can find that the UAVs’ positions converge quickly only after a few generations.

Figure 6 shows the performance of the PSO-Kmeans algorithm and PSO algorithm under the condition of a constant number of UAVs. The minimum SINR of CUEs decreases with the increase of the total number of CUEs. The more the number of CUEs in the system and the more the number of CUEs covered by each UAV, the smaller the transmission power allocated to each CUE, and the smaller the minimum SINR in the system. In other words, to achieve the required SINR threshold, the more cellular users, the more UAVs are needed. It can be seen from Figure 6 that when the number of CUEs is equal to 150 and the number of UAVs is equal to 7, the UAV deployment optimized by the PSO-Kmeans algorithm can meet the minimum threshold of SINR, but the PSO algorithm cannot meet.

Figure 7 shows the fitness convergence curves of the SSA, the ASSA, the QWASSA, and our proposed AMSSA. The minimum rate is set to . Compared with the other three algorithms, AMSSA can significantly improve the efficiency of D2D pairs. AMSSA can quickly converge to a better value in a shorter time. Therefore, AMSSA is not easy to fall into local optimum, and its search accuracy is higher. This is because the AMSSA adopts population variation strategy and adaptive follower position updating and leader-follower number updating, which has strong local mining ability and global searching ability.

Figure 8 compares the performance of D2D pairs EE of the SSA, the ASSA, the QWASSA, and our proposed AMSSA for different D2D pair distances, but the deployment of UAVs in the system keeps the same. The minimum rate is set to . With the increase of the distance between D2DR and D2DT, the EE of D2D pairs in the system increases monotonically due to the increase of the number of D2D pairs. However, compared with the other three algorithms, the AMSSA can make the system more energy-efficient for different distances.

Figure 9 shows the average EE at different distances for the same UAV deployment. Although the total D2D EE increases, which is shown in Figure 9, the average EE decreases. The minimum rate is set to . The more D2D pairs that each UAV-BS controlled, the greater intercell and intracell interference between D2D pairs, and the lower average EE. However, compared with the SSA, the ASSA, and the QWASSA, the D2D EE of the proposed AMSSA decreases slower.

Figure 10 shows the relationship between D2D EE and the numbers of UAV in the system, and the distribution of D2D pairs keeps the same. The minimum rate is set to . As the number of UAVs increases, the D2D EE decreases monotonously. It means that when the numbers of D2D pairs are fixed and the numbers of UAVs increase, the D2D pairs will produce more intercell interference, resulting in a decrease of D2D EE in the system. Although the system D2D EE of the AMSSA scheme decreases with the increase of the UAV number, its performance remains superior to the other three baseline algorithms.

7. Conclusion

In this paper, we investigate the UAV deployment, the channel allocation, and power control problem of D2D pairs in the D2D-abled UAV-based system. Firstly, we minimize UAVs’ number and deployment to satisfy a given SINR requirement by using the PSO-Kmeans algorithm. After determining the UAV deployment in the system, we aim to maximize the energy efficiency of D2D pairs while guarantying the QoS of networks. The problem of D2D pairs’ channel allocation and power control is a complicated NP-hard problem. To solve this problem, we propose the AMSSA algorithm. The population variation strategy mutates the nonoptimal individuals, which can increase the possibility that they become the optimal individuals in the next iteration and improve the ability of the algorithm to jump out of the local optimal. The adaptation of leader-follower numbers leads to faster convergence speed and more excellent local mining capability of the AMSSA. Meanwhile, the position update mode of leader salps makes iterations more difficult to fall into local optimization. The AMSSA performs the best in terms of D2D pair energy efficiency in the D2D-abled UAV-based system compared with the other benchmark schemes.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by the China Postdoctoral Science Foundation under Grant No. 2018M640032, the Beijing Postdoctoral Science Foundation under Grant No. ZZ2019-73, the National Natural Science Foundation of China under Grant Nos. 61571021, 61701010, and U1633115, and the Seed Fund for International Scientific Research Cooperation under Grant No. 2021-A02.