Abstract

Tradition wireless sensor networks (WSNs) transmit data by single or multiple hops. However, some sensor nodes (SNs) close to a static base station forward data more frequently than others, which results in the problem of energy holes and makes networks fragile. One promising solution is to use a mobile node as a mobile sink (MS), which is especially useful in energy-constrained networks. In these applications, the tour planning of MS is a key to guarantee the network performance. In this paper, a novel strategy is proposed to reduce the latency of mobile data gathering in a WSN while the routing strategies and tour planning of MS are jointly optimized. First, the issue of network coverage is discussed before the appropriate number of clusters being calculated. A dynamic clustering scheme is then developed where a virtual cluster center is defined as the MS sojourn for data collection. Afterwards, a tour planning of MS based on prediction is proposed subject to minimizing the traveling distance to collect data. The proposed method is simulated in a MATLAB platform to show the overall performance of the developed system. Furthermore, the physical tests on a test rig are also carried out where a small WSN based on an unmanned aerial vehicle (UAV) is developed in our laboratory. The test results validate the feasibility and effectiveness of the method proposed.

1. Introduction

In recent years, the prosperous development in the Internet of Things has been validated where wireless sensor networks (WSNs) become ubiquitous. WSNs are widely used in the fields, such as intelligent transportation, agriculture, medical treatment, aerospace exploration, and other emerging applications, whereas they can contain a large number of static and mobile nodes in a self-organizing way [13]. Traditionally, a sink node in a WSN is fixed where the other sensor nodes (SNs) transmit data to the sink through either single-hop or multihop communication. Thus, the SNs close to the sink tend to carry more data transmission or forwarding tasks which leads to unbalanced energy consumption and results in poor data delivery in networks. The issues of energy holes and data collision may occur in such applications [47].

Mobile data gathering is regarded as a promising solution to tackle the problems aforementioned. In applications, mobile sinks (MSs) as data collectors gather sensing data in an efficient manner [815]. The issues associated with the tour planning of MS have been actively discussed in the existing literature that is crucially important to determine the performance of networks. The concern of energy consumption due to long-distance transmission is discussed in [16]. A maximum cache mechanism is proposed to enhance the transmission capability by adopting MS. In [17], an architecture of single-hop with single MS for mobile data gathering has been investigated. The proposed scheme has the benefit of energy-saving in line with the optimal tour trajectory of MS. In the study, a heuristic optimization algorithm is employed. A joint scheme for both charging and tour planning in a MS-based WSN is proposed in [18]. Energy consumption of each sensor is leveraged which in turn mitigates energy exhaustion. To strike the balance between energy consumption and latency of data gathering, an algorithm to find the optimal number and position of cluster head nodes for data collection is proposed in [19]. The simulation results verify the effectiveness of the scheme in favor of adopting the optimal path length of MS. In order to create a robust WSN, clustering techniques are also important for energy-saving and delay reduction where SNs are partitioned into subnetworks. Taking into account the kinematic constraints of mobile nodes of similar vehicles, in [20], a mobile data acquisition algorithm based on clustering Dubins smooth curve is proposed. Aiming at the problem of sensor node data aggregation and node energy imbalance, as well as the space problem being often ignored, a heuristic search algorithm (HLSA) is proposed in [21]. A dynamic clustering algorithm to divide the SNs into clusters is employed in [22]. By using the clustering mechanism, the energy efficiency and packet reception rate are improved. Our previous work has also shown the importance of sensor clustering in WSNs [23, 24]. A WSN with respect to mobile data gathering where the work has a special focus on clustering mechanism before mobile data gathering is committed by the MS is developed in [23]. A data gathering scheme by using multiple mobile sink nodes is proposed in [24]. The average path length is dramatically shortened in this way, thereby reducing energy expenditure.

As an extension of the recent work, this study includes the main contributions as follows:(1)We investigate the data gathering mechanism by considering energy constraints in a WSN system where the path length is minimized.(2)The network coverage is discussed and formulated to an optimization problem. As a result, suitable node numbers are defined to strike the balance of the WSN scale and network coverage.(3)A predicted trajectory is developed to assist the tour planning of a MS. In this manner, the overall data gathering is divided into several stages. This mechanism gives a chance to the MS to be recharged at the end of each stage to prevent energy exhaustion for a MS with insufficient power while heavy data load being required.(4)Besides the extensive simulation on MATLAB, the trail tests are committed to verify the effectiveness of the proposed scheme where an unmanned aerial vehicle (UAV) serving as a MS is employed in a WSN.

The rest of this paper is organized as follows. Section 2 presents the systematic configuration with extensive discussion on energy constraints associated with MS traveling, activities, and the network architecture. Section 3 develops the optimization formulation for maximising network coverage while minimizing energy expenditure and path length for each cycle. Simulation setup and trail tests are both presented in Section 4 following the outcome demonstration and analysis corresponding to them. The key conclusions are drawn in Section 5.

2. Problem Formulation

The sensor network consists of a base station, a certain number of SNs, and a MS. After the sensors acquire data from the sensing field, they wait to be polled by the MS to deliver the information in each round. The WSN is shown in Figure 1.

It is assumed that there are N SNs in the network, N = {1,......,n}, ; V represents a virtual cluster, V = {1, …, }; and trajectory between the virtual cluster head nodes is M = {V (i, j) }. Let represent the Euclidean distance between the two virtual cluster head and ley indicate whether V (i, j) is included in the tour of the MS. We have the following equation:

A MS gathers data periodically. The overall time in each cycle contains two segments: traveling and sojourn time [25]. Traveling time also includes two parts. One is in line with traversing time from one cluster to another. The second part is relevant to returning to the base station. Moreover, since the MS has the duty to poll the cluster head nodes, it stays on the cluster head nodes until completing data collection.

2.1. Traveling Time

In one cycle, the tour length can be calculated by

The total moving time is then calculated bywhere is the moving speed.

2.2. Sojourn Time

In a cycle, the MS reaches sensor node ; the amount of data is ; is staying time at each rendezvous point; G represents the transmission rate. Therefore, there is a relationship as follows:

According to equations (3) and (4), the period of a cycle is obtained by

The tour planning is formulated to an optimization problem with the following equations:

Equations (7) and (8) ensure that each cluster head node is accessed only once per round. Equation (9) guarantees tour continuity. Equation (10) guarantees that the MS starts from the base station. Equation (11) guarantees that the final destination of MS is the base station. Equation (12) is the time limit per round. Equation (13) is the limit of tour length per round, which in turn determines the energy limit.

Since the MS consumes the most energy on traversing through the sensing field, the energy cost is closely related to the tour range. This paper mainly takes energy expenditure on traveling into account. Therefore, the objective function in optimization is replaced by the target to minimize the tour length as follows:

3. Tour Planning

3.1. Coverage Issue with an Adjacency Matrix

The coverage issue is essential for a WSN. On the one hand, visiting fewer nodes will reduce the MS burden for data collection. On the other hand, a WSN needs enough nodes to achieve the expected monitoring quality. Thus, there is a balance to strike between the node number and the communication quality. In this paper, the network coverage with nodes deployment in random order is concerned. For simplicity purpose, the optimal density of the deployment is determined by the communication radius, which is the basis for calculating the number of deployment nodes.

This paper uses an adjacency matrix graph to define the connectivity of deployed nodes [2226]. We consider the network as a graph G = (V E) where V refers to the SNs and E represents the edge between the two SNs which can communicate with each other.

In some conditions, we can convert G into a square matrix with M-order () if . Therefore, the adjacency matrix G can be represented by matrix A.

Furthermore, we define , . If , , and , there are routes from the node via to . For instance, demonstrates there are two paths that can start from via an intermediate node to the node . Therefore, if , there is no path communication between the node and the node .

Therefore, we use a new matrix . If all the elements in matrix are nonzero elements, then G is a fully connected graph. Otherwise, if there is only an element in matrix S, then G is a disconnected graph.

3.2. Cluster Forming

We uniformly deploy the nodes in the region where they are grouped into clusters. In this way, the SNs can have balanced energy consumption since they can share the workload in clusters s [27]. Moreover, it ensures that each cluster head has almost the same energy expenditure in each cluster. There are many useful and advanced algorithms to realize the deployment, for example, neighbor clustering and fuzzy clustering. In order to implement real-world tests, we want a simpler and efficient algorithm as our solution. Therefore, the K-means algorithm is adopted. We use K-means based dynamic clustering algorithm to partition the nodes into the monitoring area [24].

The K-means algorithm aims to minimize the total distance between the SNs and the center of the cluster. The distance can be presented by where is cluster ; is the center of the cluster ; and is the sample number in the cluster . Since the selection of the cluster center should make extremely small [1], then  = 0. Thus, equation (14) can be rewritten by

Therefore,

The algorithm is committed in the following way:Step 1: initializing the clusters with randomly picked nodes. A cluster head is selected to be as the center. All centers are recorded by .Step 2: allocating the rest of the SNs in cluster around the centers according to equation (17) where Euclidean distance is applied.Step 3: recalculating , j = 1, 2, ..., k for each cluster.Step 4: judging if , the clustering ends. Otherwise, return to Step 2 and regroup the sample iteratively.

Afterwards, the calculation completes while the results are recorded. Now, the nodes are partitioned to k clusters where k virtual cluster centers are generated at the same time. Via k-means algorithm, the overall distance from the SNs to the cluster centers reaches the least. Since the intercluster communication uses single-hop transmission, the communication costs the least energy consumption through this arrangement. The nodes are then manually placed to the locations of these virtual cluster centers where the cluster head nodes play the roles of the center. After a head node being defined, a broadcast message is sent to the members in the cluster to confirm the location of the cluster head node. Now, the location of these virtual cluster centers will be a rendezvous point for the MS. The MS starts to gather data by the members in the cluster after the MS reaches the location.

3.3. Energy Consumption Prediction

Due to the limited energy capacity of the MS, it is difficult for a single MS to complete all the data gathering without interruption in one cycle. Therefore, the whole tour planning is divided into several segments where the division method is defined by the prediction result based on the ant colony system (ACS) optimization. The algorithm follows the steps below:Step 1: to determine the accessible order of each cluster where the access nodes are defined by . These orders are determined by the ant colony algorithm [827]. The calculation is based on equations (18)–(20).Step 2: judgement. If the length of the whole tour is within the maximum range of MS, the MS completes the data gathering at once. Otherwise, the tour is cut into multiple segments and executed in turn.Assuming j represents the next hop point from the current i point, the possibilities of visiting point j are s that can be presented by the following equations [27]:where represents pheromone variables; is the heuristic desirability; represents the relative strength of heuristic factors; is the initial parameters; and is a random number. Local update strategy is to make the selected edges have a better influence on the later ants, as shown in the following formula:where is the pheromone decay parameter within (0, 1) and is a constant. The global update strategy is to find the shortest path for traversing the rest of the points and return to the base station within the MS energy capability that belongs to the edges on the optimal path, as shown in the following equation:where is the added pheromone on the trail from point to ; represents the evaporation coefficient, ; and is the optimal path length.Step 3: the total energy carried by the MS is . At the beginning of the cycle, the MS arrives at the first sensor node for data collection. Afterwards, the MS measures its own remaining energy for the next stage. If the remaining energy is enough to be consumed in the next stage, the MS travels to the next node for data gathering. Otherwise, it returns directly to the base station for recharging. The energy consumption in the next stage is expected to be sufficient for visiting the remaining nodes, data collecting, and returning to the base station.(1)Energy consumption for visiting:It refers to the energy consumed over the traveling from node i to node j:(2)Energy consumption for collecting data from node j: is the time to arrive at node j, and is the starting time of the cycle.(3)Energy consumption for returning to the base station:The total energy consumption for the next stage is predicted at node i expressed by the following equation:Assuming the remaining energy when the MS reaches the node i is equal to q, the remaining energy of the MS after the completion of the data gathering is . So,If when MS travels to node j for data collection, MS goes on its journey to the next node. Otherwise, the MS returns to the base station for recharging.Step 4: tour replanning.After the MS completes the recharge, the tour planning to the remaining nodes in the next subcycle will be recalculated. The new trajectory and visiting order are calculated. Afterwards, the data gathering continues according to the updating results from Step 2.Step 5: repeat the above process until all the cluster fields are traversed. The remaining nodes not polled are abandoned. In order to simplify the algorithm, the maximum tour length is used to represent the energy limit of the MS.

4. Results and Analysis

4.1. Simulation Environment

To verify the effectiveness of the algorithm, the simulation is carried out on a MATLAB platform. The key simulation parameters are summarized in Table 1.

4.2. Outcomes and Analysis

First, the problem of node coverage is simulated. Assuming the node number is 90 and the communication radius is 15 m, the coverage area is shown in Figure 2. After a large number of experiments being committed, the relationship between node number, communication radius, and coverage are obtained. The results are shown in Figure 3. The relationship between node density and coverage is shown in Figure 4. By synthetically analyzing these outcomes, we define the communication radius and the node density as 15 and 0.007, respectively.

According to the coverage plan, 70 nodes are randomly placed in the sensor field as shown in Figure 5. After applying the dynamic clustering algorithm, 10 virtual cluster head nodes are generated as shown in Figure 6.

Afterwards, the ant colony optimization is used to plan the shortest path over the ten cluster head nodes, and the node path is shown in Figure 7. However, the shortest length on the tour is 348.63 according to the measurement from Figure 8. It exceeds the energy limit of the MS. The access order can be obtained as shown in Figure 9. Nevertheless, the node starts the data gathering for the first subcycle from the base station (node 0).

The MS starts the tour. However, when it reaches node 3, it is found that the remaining energy is not enough to support it to complete the next stage of data collection tasks; that is, it reaches node 4 for data collection tasks and returns to the base station. Thus, it returns to the base station for energy replenishment to continue the second subcycle. After the MS is fully charged, the tour planning for traversing the remaining nodes is recalculated as shown in Figure 10. The shortest path is shown in Figure 11. At this time, traversing the remaining nodes costs more energy than the MS capacity. There will be a third subcycle for the MS. The updated access order in the second subcycle can be found in Figure 12.

Again, the MS arrives at the head node of the first virtual cluster in the second subcycle. After node 3 being poll, the MS needs more energy to reach node 4. Therefore, it returns to the base station for preparing the third subcycle.

After being fully charged, the remaining tour is reorganized. The third subcycle is then defined as shown in Figure 13. The searching map for the optimal trajectory is shown in Figure 14. The shortest length over the field cannot be completed by the MS in one cycle. Therefore, the fourth subcycle is needed. The calculation for the third subcycle with the optimal is executed and the result is shown in Figure 15.

The plan for the traversing, the predicted path length, and the node access order of the fourth subcycle are shown in Figures 1618

In the fourth subcycle, it is slightly different from the previous subcycles. After polling node 2, the MS cannot reach node 3. It returns to the base station to be recharged. However, the energy cannot support it traveling to node 3. Therefore, node 3 is abandoned. The tour planning, trajectory, and the visiting order for the fourth subcycle are demonstrated in Figures 19 and 20.

The simulation results have shown the initial energy of the MS cannot support completing data collection. Therefore, the initial energy is then increased to 300. By repeating the process above, the tour planning in the final two periods can be obtained as shown in Figure 21. The path lengths of each subcycle are shown in Figure 22. With the updated initial energy, the MS can complete the data gathering by traversing all cluster head nodes over the sensing field. Therefore, the optimal energy capacity of the MS is determined by 300 m.

The simulation is carried out with the optimal tour planning with the shortest length where energy constraints and efficient data gathering are considered as a priority. Through the simulation, we also find the process of data gathering is closely related to the initial energy of the MS. Therefore, a comparative evaluation for the relationship between the tour planning and battery capacity of the MS is carried out. The key indicators are summarized in Table 2.

It is noted that the larger initial energy can lead to a smaller number of subcycles. Moreover, the shorter length means the shorter latency over data gathering.

In practice, the energy capacity of a MS is limited. Therefore, consideration is needed to strike the balance between the delay and the initial energy.

4.3. Trail Tests in a Small WSN

Trail tests are carried out in the laboratory to verify the reliability of data transition based on a small scale WSN while only the first subcycle is tested. Three SNs are placed as the terminal nodes to collect data. The MS is responsible for polling the SNs and completes data gathering. The device in the experiment includes a UAV carrying the MS and three DHT11 temperature-humidity sensors as the SNs. The images of these devices are shown in Figure 23. Test results are summarized in Table 3.

Furthermore, three rounds of experiments are committed. The results are summarized in Table 4. The results show that the experimental system can effectively collect temperature and humidity data, and the data transmission is reliable.

5. Conclusion

This paper presents a prediction-based tour planning for data gathering in WSNs. The scheme primarily aims to mitigate the challenge when a single MS cannot complete data gathering or traverse all cluster head nodes at once. First, the issue of the network coverage is discussed in favor of a decent arrangement with the optimal number of sensor nodes across the field. Afterwards, a dynamic clustering algorithm is proposed to generate sensor clusters where the cluster head nodes are defined as the MS sojourns. The methodology of the predictive trajectory is developed for finding the optimal tour before a MS traverses the sensing field. In the study, the single tour of data gathering is divided into several subcycles to prevent energy exhaustion. Both the MATLAB simulation and physical tests are committed to demonstrating the whole procedure for data gathering under the proposed scheme. The results have proved the effectiveness and reliability of the methodology. The study outcomes provide a useful solution for tour planning in energy-constrained WSNs.

Data Availability

The data used to support the findings of this study are included within the article. If other data or programs used to support the findings of this study are needed, they can be obtained from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported partly by the National Natural Science Foundation of China (61861007).