Cyber-Physical Systems in Manufacturing and Service SystemsView this Special Issue
A Topology Evolution Model Based on Revised PageRank Algorithm and Node Importance for Wireless Sensor Networks
Wireless sensor network (WSN) is a classical self-organizing communication network, and its topology evolution currently becomes one of the attractive issues in this research field. Accordingly, the problem is divided into two subproblems: one is to design a new preferential attachment method and the other is to analyze the dynamics of the network topology evolution. To solve the first subproblem, a revised PageRank algorithm, called Con-rank, is proposed to evaluate the node importance upon the existing node contraction, and then a novel preferential attachment is designed based on the node importance calculated by the proposed Con-rank algorithm. To solve the second one, we firstly analyze the network topology evolution dynamics in a theoretical way and then simulate the evolution process. Theoretical analysis proves that the network topology evolution of our model agrees with power-law distribution, and simulation results are well consistent with our conclusions obtained from the theoretical analysis and simultaneously show that our topology evolution model is superior to the classic BA model in the average path length and the clustering coefficient, and the network topology is more robust and can tolerate the random attacks.
Wireless sensor network (WSN) consists of spatially distributed autonomous sensors to monitor physical or environmental conditions, such as temperature, sound, vibration, pressure, motion, or pollutants, and to cooperatively transmit their data through the network to a sink node. Today WSNs are more and more widely used in variety of industrial and consumer applications, such as industrial process monitoring and control, machine health monitoring, environment and habitat monitoring, health care applications, home automation, and traffic control .
In military and warfare applications, WSNs are deployed in hostile monitoring environment, and the sensor node is of limited energy support. Energy exhaustion and natural damage of some sensor nodes often result in the failure of the whole networks. As a vital technique, the construction and control of the topology play an important role in conquering these problems in WSNs. The main purpose of construction and control of topology is to achieve a higher communication quality, energy utilization efficiency, and strong robustness topology.
Complex network model becomes increasingly popular in the complex communication networks topology control and performance optimization. Complex network is the abstract description of the complex systems which emphasize characteristics of topology . A complex system contains a mass of units which are considered as nodes, and the forces among units are considered as edges . To achieve better performance and more robust topology against the attacks in the network, people need to better understand the complex network theory and the possible application scenarios in these fields . With the development of WSN, small-world model  and scale-free model  are widely used to optimize the WSN’s topology.
As one classical complex network model, small-world model has the shorter average path length and the larger clustering coefficient and also a few of short cuts, so this kind of network can be easily destroyed under the targeted attacks . As another classical complex network model, scale-free model has the power-law node degree distribution characteristic and the better robustness against random attacks, because its physical topology is tightly associated with the evolution process, which depends on the mechanism of its growth and preferential attachment, and lots of researchers have contributed their works for improving the performance of the network topology [6, 7]. In this paper, our purpose is to propose a new topology control method which is based on the scale-free model, and its evolution process includes two aspects as follows.(1)Growth: the scale of the network is expanding.(2)Preferential attachment: the newly joined node is more inclined to join those nodes with higher degree. Namely, the rich become richer.
It is generally known that the preferential attachment is the most important one because it provides the principal rules about how a new node connects to the previous network topology and which node should be the potential candidate node to be connected. To achieve better performance of the network, we aim to revise the preferential attachment method and to consider more details for selecting the candidate node from the existing topology and connecting to the new node. PageRank algorithm is a method for node ranking in a network and also for evaluating the node’s importance in some application scenarios, so some revised PageRank algorithms were proposed to evaluate the node importance and select the important candidate nodes, and also some detailed rules were considered in the proposed algorithms [8–10]. However, we can see that these preferential attachment rules only concern more node’s degree.
In some challenging applications, there are lots of nodes with same degree, which is not sufficient to make a rational decision. So some of the background information, such as location and flow, was taken into account to design the PageRank-based algorithms. Dorogovtsev showed that main properties of scale-free evolving networks may be described in frames of a simple continuous approach and also obtained the scaling relations for networks with nonlinear, accelerating growth and described the temporal evolution of arising distributions . Unlike the well-studied models of growing networks, where the dominant dynamics consists of insertions like nodes and connections, and rewiring of existing links, Sarshar studied the scale-free evolving networks application in ad hoc network, where one also has to contend with rapid and random deletions of existing nodes (and the associated links); the dynamics discovered in the paper can be used to craft protocols for designing highly dynamic peer-to-peer networks and also to account for the power-law exponents observed in existing popular services . However, the above algorithms missed some inner relationships, and the transform possibility matrix holds the same value for every link, which will be revised in our model.
In this paper, we introduce node degree, centrality, and betweenness to revise the PageRank algorithm to evaluate the node importance of the WSNs, and a dynamic transmission matrix is built up; these will be helpful to form the new preferential attachment rules to select a suitable candidate node to be connected to the newest node. Thus, the robustness and survivability of WSNs are enhanced to some extent.
The rest of the paper is organized as follows. Section 2 introduces basic definitions of complex networks model and the limitations of the existing PageRank Algorithm. Con-rank node importance evaluation algorithm is proposed in Section 3; Section 4 proposes the new rule for topology evolution based on Con-rank algorithm; finally, the simulation results and conclusions will be shown in Section 5.
2. Complex Networks Model and PageRank Algorithm
In this section, we formally describe the complex networks and the PageRank algorithm in detail, which is the basis of our node importance evaluation algorithm and topology evolution.
2.1. Complex Networks Model
Complex networks can be modeled by graph , where is a nondirection connected graph with nodes and edges. is the set of nodes and represents the set of edges. is adjacent matrix of with rows and columns, and the detailed definition of in is shown as follows:
At the same time, some terms to be used to evaluate the node importance are listed as follows.(i)Degree . It is the number of edges connected to node .(ii)Shortest Distance . It is the number of hops along the shortest path between node and node .(iii)Centrality . It is the reciprocal of the total cumulative sum of the distance from all other nodes to node ; namely, .(iv)Betweenness . It is a ratio of the number of the shortest paths through the node to the total number of the shortest paths which include all the node pairs of a network.(v)Largest Effective Component. It is the effective subgraph with the largest size in a network.
2.2. PageRank Algorithm
According to PageRank algorithm, a weight of each web page can be computed through iteration, which depends on weights of the pages connecting to this page. Here is the PageRank equation:where indicates the set of pages destined for page , and surfers connect to another page with probability , and follow the links in current page with probability .
3. Con-Rank Node Importance Evaluation Algorithm for WSNs
It is known that PageRank algorithm is proposed for node ranking and importance evaluation in Internet, and its limitation is that it only depends on hyperlink relationship. We have improved the original PageRank algorithm and proposed a so-called “Con-rank algorithm,” which is based on node similarity and the characteristics of WSN. First, a transformation probability matrix is built up, namely, the probability of each node transforming data to its neighbors, in which the more similar the nodes are, the higher probability they will have. Second, the normalized centrality is used to revise the uniform distributed parameter in PageRank algorithm.
3.1. Basis of Node Contraction
Here, we use node contraction algorithm proposed in  to calculate contraction value and is further used to compute the node importance. Because the node position and the degree are taken into account at the same time, the node importance in the network can be obtained.
Definition 1. Node contraction: there are nodes, which are directly connected with node , and these nodes are all replaced by a new node . One extreme example is that it becomes one-node network when the central node of a star network is contracted.
Definition 2. The cohesion degree of is that , where and is the shortest distance from node to node , when , . Obviously , and gets the maximal value 1 when there is only one node in the network. So network cohesion degree is decided by connection ability of nodes in network, and one assumes that the average shortest distance between a couple of nodes is . In addition, the number of nodes can also influence network cohesion degree.
Definition 3. The contraction value of node is , where indicates the graph after contraction of node .
3.2. Node Contraction Algorithm
Detailed node contraction procedure is listed in Algorithm 1, in which the contraction value of node can be obtained.
In Algorithm 1, we firstly have to get the direct distance matrix from the adjacent matrix of :
Secondly, we should update the shortest distance between any node pair . Let be the set of all the neighbor nodes and itself and the contraction of equal to , . The corresponding update operations are listed as follows:(1) and ;(i); that is, is in the shortest path between and ; then ;(ii); that is, is in the suboptimal path between and ; then ;(iii); then ;(2) or ; then ;(3); then .
Finally, all the nodes in are replaced by a new node but the links between any pair of nodes remain, and we can get by deleting the relative rows and column of in . When calculating , only pairs of nodes need to be calculated because is an undirected graph.
3.3. Computing Transformation Probability Matrix
is the similarity of and , where and are the contraction value of and , respectively, and can be calculated according to (3). Furthermore, we can compute the transformation probability matrix , and and should be normalized further.
3.4. Calculating Node Importance
To calculate the node importance in a network, we have to improve (2), where is the adjustment parameter to control the degree of adjacent nodes. When , node importance is totally decided by the importance of its adjacent nodes; when , all the node importance is , where , and there is no difference between all the nodes. Obviously, it is not reasonable because node importance should be related to the characteristics of adjacent nodes and itself. Only degree and betweenness are related to it due to the simplification of the contraction equations.
Here, we use the normalized centrality to revise the parameter in PageRank algorithm and also insert the above matrixes into (5) to revise node importance :Here, is the transformation probability matrix as mentioned before, and is the influencing weight of the neighbor node importance on itself, and is the influencing weight of the centrality of the network, which is calculated as described in Section 2.1. Periodically, the current node importance is updated by .
3.5. Con-Rank Node Importance Evaluation Algorithm
More detailed Con-rank node importance evaluation algorithm procedure is shown as Algorithm 2. Lines 1–3 are the initialization of the algorithm, line 2 shows how to compute the value of contraction of , lines 4–7 are mean to update the topology without , lines 8–11 are the calculation of , lines 12–15 are used to obtain the transmission matrix, and both line 16 and line 17 represent the computation of the central degree and the node importance, respectively.
4. Con-Rank Topology Evolution for WSN
In the real world, the way for a node to connect other nodes is not global preferentialness as described in BA (Barabasi and Albert) scale-free network. In WSNs, nodes’ communication radius limits them to connect all the other nodes in networks. What is more, global preferentialness is impractical because it will cost vast energy of wireless sensor nodes which is also energy limited. So local optimum is needed. Finally, if the scheme only cares about nodes’ degree it may cause some nodes’ expiration quickly and shorten WSNs’ lifetime. Hence, this paper applies the Con-rank algorithm to form the new preferential attachment rules and to select the candidate node connecting to the newest node. The topology evolution process is shown as follows.
4.1. Topology Evolution Process
(i) Initialization. Initialize the network with a small scale topology which consists of nodes, edges, and a sink node.(ii) Growth and Preferential Attachment. Let a new node join the network with edges from the node to previous ones. And the probability of the previous node being chosen is where is the value calculated by Con-rank equation.(iii)Use the Con-rank equation to update the and cycle until the total number of nodes reaches .
4.2. Dynamic Analysis of Topology Evolution
Theory 1. Con-rank evolution topology agrees with power-law distribution.
Proof. According to mean-field theory , we assume that is continuous, and the probability of connecting to a node can be interpreted as a continuous rate of change of ,where and is the largest communication area.
According to , , where and are fixed parameter.
And ; then where and are constants.
Substitute (8) into formula (7); we get Because every edge contributes 2 degrees, then . And let be the average of ; then . So Assume that when node enters into the network at every new node has initial degree . Meanwhile, let and ; we get the answer which fits this initial condition:where and .
The probability of the degree of the node which enters at is less than at ; that is, Assume that the new node is added to the network regularly, and random variable agrees with the uniform distribution in , and then the probability is Substitute (13) into formula (12); we getThen the degree distribution of Con-rank topology evolution is And then Take logarithm of both sides of formula (16): From the above formulas, the degree distribution of node is independent of time. When , we get Hence the degree distributions of all nodes nearly agree with power-law distribution with an exponent-.
5. Simulation Results
According to Con-rank topology evolution model, we use MATLAB to simulate the topology process and get several pieces of related data to perform the analysis. We will compare several characteristics in both Con-rank model and BA model, such as degree distribution, average path length, and clustering coefficient.
5.1. Degree Distribution
Degree distribution is the basic topology characteristics in complex network. Here, we will analyze the distribution probability upon the different node degree by the numerical simulation, and we set the total number of nodes , the initial nodes , the number of inserting nodes , and the inserting link length be 1, which is with probability . The mean of the data is based on 20 repeated experiments and relationships of node degree and distribution probability are showed in Figure 1, where the abscissa denotes the node degrees and the ordinate refers to the degree distribution probability.
From Figure 1, in which nodes show empirical results and small circles represent theoretical results, we find degree distribution allows the power laws and the two results fit each other well. In addition, when random variables , , and make according to the theoretical analysis, this model becomes the classic BA model.
5.2. Clustering Coefficient
Clustering coefficient, which comes from the proportion of a man’s friends who are also friends with each other, is used to show the nodes connection condition. And it is also a key parameter to measure the density of a network.
Let and and calculate the clustering coefficient in the two models. The mean of the data is based on 20 repeated experiments and the clustering coefficient comparison are shown in Figure 2, where the abscissa denotes the numbers of nodes from 100 to 500 and the ordinate refers to the clustering coefficient.
From Figure 2, the clustering coefficient by Con-rank is larger than that by BA model in the same size; that is, it is more likely to see nodes gathering in a small district in Con-rank model.
5.3. Average Path Length
Communication delay is one of the vital factors influencing the functions of networks, and path and diameter are the main parameters, which are not used to predict the whole networks. So average path length is introduced to reflect the whole networks’ characteristics. Set these parameters as above and get the mean of the data from 20 repeated experiments and relationships are showed in Figure 3, where the abscissa denotes the numbers of nodes from 100 to 500 and the ordinate refers to the average path length.
From Figure 3, the average path length in Con-rank is shorter than that in BA model, which fits the small-world model better.
In addition, the Con-rank evolution model has shorter average path length and larger clustering coefficient and can construct a robust topology to defend both random and targeted attacks.
In this paper, we build a new node importance evaluation model, Con-rank evaluation model, which is based on PageRank and node contraction algorithms. It evaluates node in multiple angles and fits the real network better. In addition, limited energy and robustness are the main problems in WSNs; thus it is a key to enhance lifetime and keep scale-free characteristics for a scale-free network. So this paper proposes Con-rank evolution model to build a superior scale-free network and gives a dynamic analysis of topology evolution. Finally, a simulation is performed to evaluate the model characteristics and network performances. Simulation results agree with theoretical analysis and show that Con-rank evolution model has shorter average path length and bigger clustering coefficient.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This work is partially supported by the National Natural Science Foundation of China under Grant nos. 71271165 and 61373174, Guangxi Key Laboratory of Trusted Software under Grant no. kx201416, the State Key Laboratory of Complex Electromagnetic Environmental Effects on Electronics and Information System under Grant nos. CEEME2012k0207B and CEEME2014k0302A, and Grant no. 2011C14024 from Science and Technology Department of Zhejiang Province Program and Grant no. 2010R50041 from Key Innovation Team of Science and Technology Department of Zhejiang Province.
M. E. J. Newman and D. J. Watts, “Scaling and percolation in the small-world network model,” Physical Review E—Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, vol. 60, no. 6, pp. 7332–7342, 1999.View at: Google Scholar
G. Feng, T. Liu, Y. Wang et al., “AggregateRank: bringing order to websites,” in Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '06), pp. 75–82, ACM, Washington, DC, USA, 2006.View at: Google Scholar
H. Nakakubo, S. Nakajima, K. Hatano, J. Miyazaki, and S. Uemura, “Web pages scoring based on link analysis of web page sets,” in Proceedings of the 18th International Workshop on Database and Expert Systems Applications (DEXA '07), pp. 269–273, Regensburg, Germany, September 2007.View at: Publisher Site | Google Scholar
D. Cai, X. He, J.-R. Wen, and W.-Y. Ma, “Block-level link analysis,” in Proceeding of the 27th Annual International ACM SIGIR Conference, pp. 440–447, July 2004.View at: Google Scholar
S. N. Dorogovtsev and J. F. F. Mendes, “Scaling properties of scale-free evolving networks: continuous approach,” Physical Review E, vol. 63, no. 5, Article ID 056125, 21 pages, 2001.View at: Google Scholar
Y.-J. Tan, J. Wu, and H.-Z. Deng, “Evaluation method for node importance based on node contraction in complex networks,” Systems Engineering—Theory and Practice, vol. 25, no. 11, pp. 79–84, 2006.View at: Google Scholar