Abstract

We investigate diverse random-walk strategies for searching networks, especially multiple random walks (MRW). We use random walks on weighted networks to establish various models of single random walks and take the order statistics approach to study corresponding MRW, which can be a general framework for understanding random walks on networks. Multiple preferential random walks (MPRW) and multiple simple random walks (MSRW) are two special types of MRW. As search strategies, MPRW prefers high-degree nodes while MSRW searches for low-degree nodes more efficiently. We analyze the first passage time (FPT) of wandering walkers of MRW and give the corresponding formulas of probability distributions and moments, and the mean first passage time (MFPT) is included. We show the convergence of the MFPT of the first arriving walker and find the MFPT of the last arriving walker closely related with the mean cover time. Simulations confirm analytical predictions and deepen discussions. We use a small random network to test the FPT properties from different aspects. We also explore some practical search-related issues by MRW, such as detecting unknown shortest paths and avoiding poor routings on networks. Our results are of practical significance for realizing optimal routing and performing efficient search on complex networks.

1. Introduction

A random walk on a network [1] is precisely what its name says: a walk operated in a certain random fashion. Starting a random walk at , its next node, , is selected at random from among the neighbours of ; then is a random neighbour of , and so on. Namely, the stochastic sequence of nodes selected this way is a random walk on the network, which mainly depends on the network structure.

Random walks on complex networks are a fundamental dynamic process, which can be used to model various dynamic stochastic systems in physical, biological, or social contexts such as diffusive motion [2], traffic flow [3], synchronization [4], animals movement [5], and epidemic spread [6]. It could also be a mechanism of transport and search in real-world networks [79], when no global information of the underlying networks is available. For instance, in some networks, such as wireless sensor networks [10], ad hoc networks [11], and peer-to-peer networks [12], data packets traverse the networks in a random-walk fashion. It has been demonstrated that the structural properties of networks, such as the large connectivity heterogeneity and the short average distance, play an essential role in determining the scaling features of the random-walk search for complex networks [13]. Recent studies show that random walks on complex networks can also reveal a variety of characteristics of the network [14].

First passage time (FPT) and cover time (CT) are two important characteristics of random walks on complex networks, which heavily rely on the underlying network topology [1517]. FPT, denoted by , is the number of steps the random walker takes to arrive at the destination node for the first time after starting from the source node . Correspondingly, the mean FPT (MFPT), that is, the expectation of is denoted by or . For a random walk on a network, the vertex CT is the number of steps the random walker takes to reach every node, and the edge CT is the number of steps the walker takes to traverse every edge. Let be the expectation of vertex CT (or the mean vertex CT), and let be the expectation of edge CT (or the mean edge CT) if the walk starts at node . FPT and CT of a single walker or multiple walkers in complex networks have been studied in a number of articles [1622]. These results are frequently based on the spectral properties of the adjacency matrix of the graph and of the transition matrix of the random walk [13, 21, 22].

However, thus far, the comparison of random-walk strategies for searching complex networks remains less understood, especially MRW strategies. Furthermore, a uniform analysis of various MRW strategies is still lacking. In this paper, based on random walks on weighted networks, we establish various models of single random walks and then construct corresponding MRW. By means of the order statistics, we mainly investigate the behaviors of the first and last arriving walkers when all the walkers start from the same source simultaneously and head to the same destination randomly, that is, MRW on networks. We focus on the FPT and the MFPT of MRW, which are closely related to the distance between a selected pair of nodes and the CT of corresponding single random walks. Considering the MRW as the search strategy, the FPT of the first arriving walker, to a certain extent, characterizes the search efficiency while that of the last arriving walker portrays network searchability [7]. In future, the MRW search strategy could be used to fast detect the unknown shortest path and avoid the worst routing between a pair of nodes.

2. Single Random Walks on Networks

First, we take a look at single random walks on an arbitrary finite network (or graph) . The network consists of nodes (or vertexes, sites) and links (or edges) connecting them. We assume that the network is connected; that is, there is a path between each pair of nodes; otherwise, we simply consider each component separately. To implement a random walk, each node needs to calculate the transition probability from the node to each of its adjacent ones. In this paper, we mainly perform preferential random walks (PRW) on networks by the following rule. The rule is that a walker, wandering on the network, randomly selects a neighboring node as its next dwelling point according to the degrees of neighboring nodes. So the probability of jumping to any neighboring node is , where denotes the degree, the number of connected neighbors, of a node , and denotes all the connected neighbors of node . Transforming the into , we have defined a Markov chain on the network with a transition matrix as follows:

We make comparative studies with simple random walks (SRW), of which the transition matrix is

In fact, as (1) and (2) show, if SRW and PRW being search strategies on networks, PRW prefers the high-degree node, while SRW searches for the relatively low-degree node more efficiently.

More generally, for the network , we assign a positive weight to link ; otherwise, if the link is absent, we attach weight . Writing for the function , we have obtained the weighted network . Define a random walk on the weighted network as a sequence of random variables , each taking values in the set of nodes. And the sequence is such that if , namely, at time , the walk is at node ; then with the transition probability the walk is at node at the next time ; that is to say, the walk randomly selects a neighboring node as its next dwelling point according to link weights. Such a walk is named a single random walk on the weighted network or a single weighted random walk. Clearly, the sequence is a Markov chain with the finite space , whose transition matrix satisfies [15] where .

Thereby, we can use a uniform method to study PRW and SRW on networks. If we attach weight to each link , then the random walk on the weighted network is a PRW. Whereas, if we assign weight to each link , then the one is an SRW that we always say. Both of them are irreducible and reversible Markov chains.

Random walks mentioned below include SRW and PRW unless particularly noted. Now we randomly select a pair of nodes on the network, one node as source , the other as destination . Let be the number of steps the walker takes to arrive at the destination for the first time after starting from the source , namely, FPT. is a nonnegative integer-valued random variable. The probability distribution is denoted by where is being the distance (or the shortest path length) between the source and the destination; here we assume .

In this paper, we investigate the FPT and MFPT for single random walks by analysis of the transition matrix. We define a matrix [22] associated with the transition matrix . is got from by setting th column of to zero; that is, Thus,

By the way, in our another paper unpublished, by using the stopping time technique, we derive an exact formula of the MFPT, denoted by or , for the PRW on networks. Considering the PRW on the finite network, the MFPT of node from node is where thus, the average MFPT over all node pairs    is

3. Multiple Random Walks on Networks

3.1. Formation of Multiple Random Walks

Based on the above discussion, we are now able to further consider parallel, independent single weighted random walks that start from the same node, which form the multiple weighted random walks with walkers, abbreviated as MRW. We mainly analyze the FPT problem of MPRW and MSRW, which are two special types of multiple weighted random walks and, respectively, correspond to PRW and SRW. Suppose there are walkers wandering independently from the same source simultaneously to the same destination on the network. FPTs of walkers are, respectively, denoted by They are independent random variables with the same distribution of . Let be the th order statistics [23] given by where they are arranged from ’s independent observations from the discrete distribution of . That is to say, , , and denote FPTs of the first, last, and th arriving walkers, respectively.

3.2. FPT of Multiple Random Walks

In this subsection, we explore the FPT of the first and last arriving walkers and present the corresponding equations of the probability distribution, the joint distribution, and moments as well. Further, we study the relationship between the MFPT of the first arriving walker and the distance between the selected two nodes, as well as the relationship between the MFPT of the last arriving walker and the mean CT of corresponding single random walks. Taking advantage of the order statistics technique and using (6)–(9), we can easily get the following formulas, of which the detailed proofs are omitted. Note that the results in this subsection generally hold for multiple weighted random walks, including MPRW and MSRW. We would like to stress that some time ago similar calculations were only done for the FPT of the first arriving walker for MSRW on networks. Results presented in our paper encompass the results of [24] as a special case.

3.2.1. Probability Distributions of and

, where and are given by (7) and (8), respectively,

3.2.2. Joint Distributions of and Distributions of

(1) Joint distributions of . When ,

When , where .

(2) Distributions of . When ,

When ,

3.2.3. Moment Analysis of FPT

Consider the following:

We can investigate general cases of and , respectively, in a similar way. Distributions, joint distributions, moments, and more characteristics of these FPTs could be easily got, the range as well. The discrete-time results obtained here can also be compared with those for multiple continuous-time random walks. We leave these problems to our future work.

3.3. MFPT of the First Arriving Walker

Since there is no path with a length shorter than and at least one path with a length equal to from the source to the destination, the FPT of single random walk on networks clearly satisfies , , Then, combining with (15) and (21), it is not difficult to obtain the following results:

That is to say, the first arriving walker of MRW will take the shortest path or near shortest path with high probability if the walker number is large enough.

3.4. MFPT of the Last Arriving Walker

How long does one have to walk before he sees all nodes or traverse all edges in a network? The answer to this question is the vertex CT and edge CT. In the following, we mainly consider the coverage problem of single random walks on a simple network by using numerical simulation. For a simple random walk on a simple general network, the better upper bound of the mean vertex cover time is [25], which is well verified by the simulations.

An interesting problem is what the number of walkers will arrive at when the MFPT of the last arriving walker has a consensus with or exceeds the mean cover time of corresponding single random walks. To address this problem, the paper that only gives the simulation analysis. We find the MFPT of the last arriving walker will be significantly more than the mean cover time and , when walker number is network size of several orders of magnitude, that is, .

4. Simulations and Discussions

In this section, for both MPRW and MSRW, we make numerical simulations to deepen our discussion as well as to confirm analytic results. We use a small random network, generated by the Erdös-Rényi model [14], to test the FPT properties of the first and last arriving walkers, respectively, and their relationship as well. If the MRW is being as the search strategy, the FPT of the first arriving walker, to a certain extent, characterizes the search efficiency while that of the last arriving walker depicts network searchability. We demonstrate that wandering walkers are able to utilize and detect the shortest paths and avoid the worst routings. In addition, we explore the cover time of single random walks, which is closely related to the MFPT of the last arriving walker and can be regarded as a performance indicator of network search.

We define a small random graph as labelled nodes and every pair of the nodes is being connected with probability by using the ER model. The average degree of the simple random network is , namely, . We choose a pair of nodes as the source and the destination; the source is marked as and its degree and the destination, and . The distance between them is 6 steps, . The walkers start from a node to reach a given node on the network according to different random-walk strategies. We perform MPRW and MSRW on the simple random network, respectively. Numerical data presented in the figures have been averaged over runs using different number of wandering walkers. The walker number increases from 5 to 400. Moreover, we carry out single PRW and single SRW starting at on the simple random network. Simulations state that the mean vertex cover time and the mean edge cover time for the single PRW, while and for the single SRW. In the following diagrams of the probability distributions, the vertical coordinate unit is percentage (%).

4.1. MPRW on a Simple Random Network

We record the FPT of the wandering walker, that is, the number of steps before the destination node is visited, starting from the source node. If all the walkers reach the destination, a round of the trial is over. We carry out rounds of trials and then analyze the statistical data. We calculate frequencies of the FPT of the first and last arriving walkers for MPRW with 10 walkers on the simple network, which are depicted in Figure 1. We can explore MPRW with 30 walkers in the same way; the numerical result is shown in Figure 2. We can also get the corresponding theoretical result for MPRW with 10 walkers and 30 walkers using (15) and (16) under the same condition. Simulation results confirm our analytic predictions.

We observe that for MPRW with 30 walkers, the probability of the first arriving walker utilizing the shortest path is , while for MPRW with 10 walkers the probability . The obvious gap can be seen in Figures 2(a) and 1(a). There is a significant increase in the probability of the first arriving walker taking the shortest path. If we use more walkers, the probability will be higher. This agrees with (27). Comparing with Figure 1(a), we can observe in Figure 2(a) that the distribution curve of first passage time peaks up towards and mainly concentrates in the interval , else , the probability for MPRW with 30 walkers. This implies that the first arriving walker will utilize the shortest path or near shortest paths with high probability. In fact, we can calculate the probability of the first arriving walker taking near shortest paths with a length by (15), also as shown in Figure 2(a). That’s to say, if we put 30 walkers on the network, the first arriving walker will almost surely take paths with a path length not more than 10. This observation has deeper meanings. On one hand, we can mark the nodes and the paths of the first arriving walker of MPRW visited, then we can detect the unknown shortest paths between two selected nodes. This is practical significance of realizing optimal routing on networks, especially the mobile ad hoc sensor networks. On the other hand, with a specified source and a walker number large enough, we conjecture that the wandering walkers will cover the vast majority of the network within steps with high probability because we can treat any other node as the destination. is the diameter of the network, which is the maximum distance measured on all pairs of nodes in the network. Consequently, based on the local information of the degrees of current node’s neighbours, the MPRW could be a mechanism of performing efficient search for networks, particularly for the highly heterogeneous small-world networks [14].

Considering the MRW as the search strategy, the FPT of the first arriving walker characterizes the search efficiency while that of the last arriving walker depicts network searchability. For MPRW with 10 walkers, we can see that the FPT of the last arriving walker is mainly distributed in the interval , namely, in Figure 1(b) and the corresponding MFPT is in Figure 3(b). For MPRW with 30 walkers, we observe that in Figure 2(b), in Figure 3(b), and the sample standard deviation of . These results can also be computed by (23) and (24). From Figure 3(b), we can see that as the walker number increases from 5 to 30, the MFPT of the last arriving walker increases fast, whereas is greater than 30, the MFPT increases slowly. The phenomenon that its growth rate is very small reflects the distribution of FPT which tends to be relatively centralized. At the same time, when the walker number is more than 30, with increasing, the sample standard deviation of tends downwards, as shown in Table 1. Further, we also operate rounds of trials with walkers and analyze the distribution of according to the statistical data. We find that only takes 7 numbers such as 655, 664, 697, 699, 710, 727, and 871, and its sample standard deviation is only 40.338. Moreover, is concentrated in 697, 699, 710, and 727, and the sum of the probabilities taking these four numbers is up to . This observation agrees with the significant increase in the probability of the first arriving walker taking the shortest path with a small increment in the walker number. We can also record traces of the last arriving walker and the traces containing bad cases of routings. Then, for instance, when designing the routing algorithm for one mobile wireless sensor network, based on local information, we can avoid poor routings of the last arriving walker and improve the reliability of the routing algorithm by MRW.

We can compute by (21) the MFPT of the first arriving walker for MPRW with a different walker number ; see Figure 3(a). From Figure 3(a), we can see that with the walker number increasing from 5 to 400, decreases from 16.537 to 6.03. monotonically decreases with walker number increasing, and it will converge to the distance 6 when . This further confirms the analytic prediction of (28). It can also be seen that decreases sharply in the interval . That is to say, a small increment in the number of walkers will enable the MFPT to decline substantially in the initial stage. This observation also holds in the case of MSRW [24, 26], which highlights its obvious practical meaning. For example, in [26], for one unstructured peer-to-peer network, the authors propose a MSRW query algorithm that resolves queries almost as fast as the flooding method while reducing network traffic much more in many cases. In the algorithm they use only 16–64 walkers while the network size is 4736–10,000. Their experimental practice is in accordance with our analysis here. The difference is that the query here may be faster since we conduct MPRW on networks.

We can also calculate by (23) the MFPT of the last arriving walker for MPRW with a different walker number ; see Figure 3(b). From Figure 3(b), when is more than 30, the MFPT increases slowly with increasing. This has been mentioned above when discussing the sample standard deviation of and is consistent with that the MFPT of the first arriving walker is declining substantially in the initial stage . In addition, we can compute by (25) the range ; see Figure 3(c). From Figure 3, we can see that the range mainly depends on . Unlike , we cannot determine the convergence of . However, we can explore the relationship between it and the mean cover time of single PRW, as well as the bounds on . Actually, hardly exceeds or , unless . From Figure 3(b), when , almost 20 times the size of network , is significantly larger than and . To some extent, the FPT and MFPT of the last arriving walker characterize network searchability [7]. In fact, for the complex networks with the same sizes and different topologies, with a specified source and a walker number under the same conditions, we carry out single and multiple random walks on them. The greater the quantities of and the mean CT, the more difficult to search for the network.

4.2. MSRW on a Simple Random Network: Comparison to MPRW

We make comparative study of MSRW and MPRW in the following aspects. Numerical results of the FPT of the first and last arriving walkers for MSRW with 10 and 30 walkers are demonstrated in Figures 4 and 5. The corresponding analytic results can be obtained by (15) and (16).

From Figures 5(a) and 4(a), we see that for MSRW with 30 walkers, the probability of the first arriving walker taking the shortest path is , while for MSRW with 10 walkers the probability is . This observation is similar to the case of MPRW, but the gap is not so apparent here. The first arriving walker will take the shortest path or near shortest paths with high probability ; see Figure 5(a). However, comparing with MPRW, the probability here is much smaller than ; see Figure 2(a).

We are able to compute by (21) the MFPT of the first arriving walker for MSRW with a different walker number ; see Figure 6(a). From Figure 6(a), we can see that with the walker number increasing from 5 to 400, decreases from 22.706 to 6.595. monotonically decreases with walker number increasing, and it will converge to the distance between the selected two nodes when . It can also be seen that decreases sharply in the initial interval . The results are consistent with MPRW. The difference is that the decreasing rate and the convergence speed of in the case of MSRW are much smaller than MPRW; see Figures 6(a) and 3(a).

As shown in Figures 4(b) and 5(b), for the distribution of of MSRW, it is relatively concentrated in the case of , comparing with . Numerical simulations show that when the walker number is more than 50, with increasing, the sample standard deviation of tends downwards. Comparing Figures 4(b) and 5(b) with Figures 1(b) and 2(b), with the same walker number , we find that the distribution of of MSRW is more decentralized than MPRW, while the MFPT of MSRW is much greater than MPRW; see Figures 6(b) and 3(b).

We can also calculate by (23) the MFPT of the last arriving walker for MSRW with a different walker number ; see Figure 6(b). When is more than 50, the MFPT increases slowly with increasing. In addition, we can compute by (25) the range and find that the range mainly depends on ; see Figure 6(c). From Figure 6(b), when , ; when , almost times the size of network . is just significantly greater than and . These tell that it is at lower cost to understand network searchability by MPRW than by MSRW.

In short, the MPRW search strategy here is more effective and feasible than MSRW. This is due to the fact that the MPRW strategy is prone to searching for high-degree nodes. In other words, if we start from a periphery (or low-degree) node and search for a hub (or high-degree) node on networks, we need only a small number of wandering walkers and a small number of steps by the MPRW strategy, which also offers one efficient way of detecting hub nodes in scale-free networks [14].

5. Conclusions

Some theoretical and technological problems with relation to MRW on networks have been investigated in this paper. These problems mainly come from random-walk theory and network search application. We mainly discussed the behaviors of the first and last arriving walkers of MRW on networks. We made use of random walks on weighted graphs to study single random walks on networks and took advantage of the order statistics approach to investigate MRW on networks, which can be a framework of random walks on networks. We analyzed the FPT and MFPT of MPRW on networks, making comparative studies with MSRW. We explored the FPT of the first and last arriving walkers and presented the corresponding equations of the probability distributions and moments. The MFPT of the first arriving walker monotonically decreases with walker number increasing, and it will converge to the distance between source node and destination node. Furthermore, a small increment in the number of walkers will enable the MFPT of the first arriving walker to decrease sharply in the initial stage and the distribution curve of the FPT of the first arriving walker to peak up towards the distance between the selected two nodes as well. Thus, if the walker number is properly set, the first arriving walker will take the shortest path or near shortest paths with high probability. With the walker number increasing, the MFPT of the last arriving walker also increases substantially in the initial stage. The MFPT of the last arriving walker is closely related with the mean CT of the corresponding single random walk. It could be seen that the MFPT of the last arriving walker is difficult to exceed the mean cover time unless .

Numerical simulations of an ensemble of random walkers moving on a paradigmatic network model confirmed analytical predictions and deepened discussions. In addition, by MRW on networks, we investigated some practical search-related issues. The MRW regarded as the search strategy, then the FPT of the first arriving walker characterizes the search efficiency while that of the last arriving walker portrays network searchability. Accordingly, the mean CT of single random walks can also be considered as a performance indicator of network search. We can detect unknown shortest paths by the first arriving walker and avoid poor routings by last arriving walker, which is of practical significance for realizing optimal routing and performing efficient searching on networks. MSRW and MPRW being search strategies on networks, MPRW prefers the high-degree nodes, while MSRW searches for the relatively low-degree nodes more efficiently. This inspires us to develop proper MRW for identification of influential spreaders in complex networks [27], including hub nodes in the highly heterogeneous networks and congestion bottlenecks in the urban traffic networks. We leave these interesting problems to our future work. Finally, we point out that MPRW and MSRW could complement each other in web search, routing, communication transmission, transportation, and other applications, thereby increasing the efficiency and reliability of the search-related processes.

Acknowledgments

This work is supported by Shanghai Municipal Education Commission Foundation (Grant nos. shgcjs020 and B-8938-11-0440), The National Natural Science Foundation of China (Grant no. 11001169), Connotative Construction Project of Shanghai University of Engineering Science (Grant no. nhky-2012-13), Shanghai Municipal Education Commission Discipline Construction Foundation (Grant nos. 11XK11, 2011X33, and 2011XY34), and Scientific Research Foundation of Shanghai University of Engineering Science (Grant no. A-0501-10-019).