Abstract

This paper focuses on the modeling of a rumor spreading in heterogeneous networks. Using the probability generating function method and pair approximation method, the current research obtains nonlinear differential equations to describe the dynamics of rumor spreading. The comparison between numerical simulations and Monte Carlo simulations confirms the accuracy of our model. Furthermore, the threshold condition is also obtained in this paper. The numerical simulation results show that the heterogeneity of the network accelerates the outbreak of rumors but reduces the maximum density of spreader and the scale of rumors. The present study also examines the effects of parameters on rumor transmission and the differences between rumor transmission recovery mechanisms and disease transmission recovery mechanisms.

1. Introduction

Rumor is an important form of human communication. It can be interpreted as an infection of the mind, which is also defined as a typical social phenomenon which runs through the whole evolution of mankind [1]. Its spreading can cause damage to personal reputations, affect the financial markets, and cause social panic and instability [2, 3]. Traditionally, rumors are propagated by word of mouth [4]. Nowadays, with the emergence of the Internet, people can spread rumors with instant messengers, emails or microblogs, and so on, which speeds up the dissemination of rumors virtually and makes rumors more influential than ever before [5]. The research on rumor propagation will offer us a thorough understanding of this issue and allow us to predict and reduce the possible harms caused by rumor propagation.

The study of the spread of the rumor began in the 1960s. Daley and Kendall [6, 7] first proposed the classical DK model of rumor spreading in which the total population is divided into three groups: the people who are spreading the rumor (spreaders), those who have never heard the rumor (ignorants), and the ones who have heard the rumor but do not spread it (stiflers), and they denoted them as , , and , respectively. Afterwards, Maki and Thomson [8] developed another classical MK model in 1973, which focused on the analysis of the rumor spreading based on mathematical theory via direct contact between spreaders and others. After that, more and more scholars paid attention to the spread of rumors. On one hand, more refined rumor spreading models are derived to model more detailed processes; on the other hand, the impacts of network structure on the rumor spreading are noticed.

To describe the process of rumor spread more carefully, the population is divided more finely. Xia et al. [9] extended the classic SIR rumor spreading model to a new SEIR rumor propagation model considering hesitating mechanism in complex social networks. Exposed (E) are those who have been infected, in the hesitating state, and do not spread rumors. In addition, in real life, there may be some people who do not agree with the rumor. They may persuade others to resist the rumor propagation. So Zan et al. [10] presented a susceptible-infected-counter-susceptible (SICS) rumor spreading model with counter mechanism in complex social networks. Then they derived mean-field equations to describe their dynamics in homogeneous networks and study the steady-state. In many cases, a rumor that concerns one event does not spread along, as a rumor cannot always spread without being affected by another rumor. To investigate the effects of interactions between two rumors, Jie et al. [11] built a model of a two-rumor interaction in which each individual has two states, which correspond to the attitudes toward each of the two rumors, and they built the corresponding ODE system. Gu et al. [12] revealed that the forgetting and remembering mechanism has great influence on the spread of rumors by extensive simulations. In view of this, Zhao et al. [13] added a new compartment, Hibernators, to the classic SIR model and they derived a new rumor spreading model, Ignorant-Spreader-Hibernator-Stifler (SIHR) model in homogeneous networks. They [13] investigated the final size of the rumor spreading under various spreading rates, stifling rates, forgetting rates, and average degrees of the network. All the above work studied rumor propagation in homogeneous networks, and the models were mean-field equations. Since the model loses a lot of information on the network, the threshold, below which the rumor final size is necessarily proportional to the fraction of initial speaker nodes but above which it is proportional to a fraction of the population, and the rumor final size may be missing in such case.

As early as in 1994, Lefevre and Picard [14] revealed the importance of network structure to rumor spreading. In fact, the topology of social networks shows highly complex connectivity in which each individual has a random number of connections to other individuals [15]. Zenette [16, 17] and Buvna et al. [18] performed a series of simulations in both static and small-world networks and showed the existence of a critical threshold of rumor spreading. So the influence of network structure on rumor spreading had been paid more and more attention to. Nekovee et al. [19] extended the former dynamic model to scale-free networks with assortative degree correlations and noticed the effect of degree correlation on rumor spreading. Moreno et al. [20] took into account that not only could nodes be in three different states (, , ), but also they belong to different connectivity classes and built a mean-field equations which is degree-dependent. Zhao et al. [21] also extended their work in [13] to describe rumor propagation by means of the degree-dependent mean-field equation. See more similar work in [2227].

Heterogeneity makes it difficult to derive differential equations to describe the epidemic. In fact, the heterogeneity of the network is not only reflected in the heterogeneity of degree distribution, but also in the locality of rumor spreading along the edge. So we try to find a better way to describe rumor spreading, rather than mean-field theory. Volz et al. [28] resolved this dilemma by showing the SIR-epidemic dynamics with a system of three nonlinear ODE’s which involves the PGF. See more similar work in [29, 30]. In this work, we try to describe the rumor spreading in [13, 21] by the method based on PGF. There are still differences between rumor spreading and disease spreading mechanism. In the process of disease transmission, the recovery of the infected persons is not related to other nodes. The recovery is not dependent on the network structure. However, since the spreading wants to recover, he has to contact a node that is not an ignorant, so, the rumor recovery mechanism depends on the network structure. What is more, the population can move between the compartment of the hibernators and the spreaders in the rumor spreading. For these reasons, the establishment of rumor propagation model based on PGF encounters irresolvable difficulties. So we have to combine the PGF and the pair approximation method to model this process. Our results show that this method is indeed a better way to describe the rumor propagation. Then we also examine the effects of parameters on rumor transmission and the differences between rumor transmission recovery mechanisms and disease transmission recovery mechanisms.

The remainder of this paper is organized as follows. We derive the nonlinear ODE’s model to describe the dynamics of the rumor spreading in Section 2. Furthermore, we obtain the rumor spreading threshold. In Section 3, we test this model in Poisson network and a refined scale-free network and compare the numerical simulations with Monte Carlo simulations, which shows that the numerical simulations of our model more consistently fit in with the Monte Carlo simulations. Then numerical simulations are conducted to further investigate the properties of parameters on the spread of the rumor. Finally, conclusions and discussions are given.

2. SIHR Model in Random Networks

This section introduces a low-dimensional system that models the percolation of an SIHR-rumor spreading in a heterogeneous network. It also obtains the disease thresholds.

2.1. Rumor Propagation Mechanism

This paper uses the rumor spreading rules proposed in Zhao et al. [13], which can be summarized as follows. When a spreader contacts with an ignorant, the last one becomes a spreader with probability , namely, spreading rate. When an ignorant contacts a spreader, the ignorant becomes a stifler with probability directly, namely, refusing rate. The hibernators reflect the repeatability of rumor spreading. The spreaders could forget the rumor and then remember it. Spreaders spontaneously forget the rumor and switch their state to hibernators at the rate as a result of the forgetting mechanism, namely, forgetting rate. The remembering mechanism also exists for hibernators. Hibernators spontaneously remember the rumor and become the spreaders at the rate that is called spontaneous remembering rate. Besides, when a Hibernator contacts a spreader, the hibernator switches to be a spreader with probability , namely, wakened remembering rate. When a spreader contacts another spreader or a hibernator or a stifler, only the initiating spreader turns into a stifler with probability , namely, stifling rate.

When a rumor spreads through a network, the nodes can be in any of the exclusive states: ignorants , spreaders , hibernators , and stiflers,, respectively. We use , , , and to denote the fraction of nodes in the sets , , , and , respectively. In addition, they satisfy the normalization condition: . The SIHR rumor spreading process is shown in Figure 1.

2.2. Our Model

To model the above rumor spreading progress in a heterogeneous network, we consider a closed and mixed population with N individuals, where is the set of vertices representing the individuals and is the set of edges representing the connections among individuals. We also assume that contacts are symmetric, that is, if an edge connects to , then an edge also connects to . Although the network is undirected (i.e., rumor can spread from each other), we wish to keep track of who spread. Therefore, for each edge , we define two arcs as the ordered pairs and . The first and second elements in the ordered pair are frequently called the ego and the alter, respectively [28]. Let be the degree distribution. Then is the proportion of nodes having network contacts; i.e., , which can reveal the structure of this network. Then the probability generating functions (PGFs) of the degree distribution are given by Let and be the averaged degree and averaged second-moment. Then they are and , respectively.

Since the network is heterogeneous, we use , , , and to denote the density of spreaders, ignorants, hibernators, and stiflers with degree k at time t, respectively. In addition, they satisfy the normalization condition: . Since our model is an extension of the network SIR model of Volz [28], we also need some notations in [28]. Let represent the fraction of degree one nodes that remain ignorant at time and represent the probability that an arc with an ego in state has an alter in state , where and may be , , , or . Given , it is easy to determine the fraction of nodes which remain ignorants at time t; i.e., The model is formulated as follows:It should be noted that since the spreaders and the hibernators can be transformed into each other, the model cannot be established by simply using the methods in the literature [28], which is the difficulty of the modeling process. The modeling process is given in detail in Appendix.

2.3. Initial Conditions and Spreading Threshold

If the probability of a node with degree selected uniformly at random and initial spreader is , we can anticipate the following initial condition.(1), , .(2), (3), (4), (because there is no hibernator initially).(5), which can be interpreted as the fraction of nodes remaining ignorant.

Applying the initial conditions above into the and considering giveFor (4), giving up the higher order items gets Because , the condition to guarantee is Therefore, the threshold is

On the basis of above theoretical analysis, a nonzero rumor diffusion threshold would be present. If , the rumor diffuses and necessarily occupies a fraction of population even as . And the rumor only spreads an infinitesimally small number of spreaders which approximates to zero compared with large population while is below threshold . Next, take different values for as , and . Equation (5) shows that the choice of does not affect the outbreak of the rumor. However,Hence, the selection of initial spreader nodes will not affect the threshold, but will affect the speed of rumor expansion.

We should also note that there is a threshold for rumor spreading in this paper, which is absent in the literature [13, 21]. Furthermore, the threshold depends not only on averaged degree but also on the fluctuations of the connectivity distribution .

3. Numerical Simulations and Monte Carlo Simulations

In this section, we carry out Monte Carlo simulations and numerical simulations to verify the above analytical results and further investigate the effects of parameters on the rumor spreading model. For Monte Carlo . The detailed process can be seen in [31]. If no otherwise specified, the numerical simulations of model (3) are performed with . Since model (3) involves and , (), to simplify the problem, we truncated the network to a maximum degree of 30. The initial condition of the numerical simulations of the model is . For the sake of conveniences, we define the cumulative rumor incidence as the fraction of nodes spreaders, stiflers, or hibernators. The peak value of spreader density can be used to measure the maximum rumor influence.

In Figure 2, the black solid lines show the result of 100 Monte Carlo simulations for an SIHR rumor model in a network with the Poisson degree distribution with . The red solid lines and green star lines represent numerical simulation trajectories based on the mean-field systems in [13] and [21], respectively. The blue dashed lines represent numerical simulation trajectories based on system (3). Figure 2(a) exhibits the trajectories of the density of spreaders and (b) exhibits the cumulative rumor incidence (densities of spreaders, stiflers and hibernators). And obviously, the blue dashed lines which cut through the dense mass of simulation trajectories are the analytical trajectories based on model (3). This implies that the model of (3) can well describe the spread of rumors in complex networks.

In order to further verify the reliability of model (3), we perform Monte Carlo simulations and numerical simulations of model (3) in a refined Power-law network. The network considered here can be generated by assigning a random number of lines to a set of nodes according to the distribution , where Then by the Configuration Model (CM) process [31], we can get a new network which is shown to have characteristic of Power-law network in Figure 3. Figure 3 shows the degree distribution of this new network in double logarithmic coordinates system.

In Figure 4, the Monte Carlo simulations for the network with the refined Power-law degree distribution in Figure 3 and the numerical simulation of model (3) and the mean-field system in [13, 21] are performed with and in Figure 3. The black solid lines show the results of 100 Monte Carlo simulations for the Power-law network. The red solid lines and green star lines represent numerical simulation trajectories based on the mean-field systems in [13] and [21], respectively. The blue dashed lines represent a numerical simulation trajectories based on model (3). Obviously, the blue lines which nearly cut through the dense mass of simulation trajectories are the analytical trajectories based on system (3). This implies that the model of (3) can well describe the spread of rumors in complex networks.

Figure 5 compares the spread of SIHR rumors in networks with the Poisson degree distribution and in the network with the refined power-law degree distribution. The results show that the final scale of rumors in the network with the Power-law degree distribution is smaller than that with the Poisson degree distribution. Furthermore, the density of spreaders reaches its maximum earlier, which is smaller than that with the Poisson degree distribution. Although heterogeneity accelerates the spread of rumors, it reduces the scale of spread.

The simulation results in Figure 6 show the evolution of rumor diffusion over time with different spreading rates , , and , respectively. In Figure 6(a), it is clear to see that the densities of the spreaders have sharp increases until they reach their peaks and then decrease to zero. Meanwhile, we can see that, with the increase of the spreading rate , the density of spreaders reaches its maximum earlier and larger. Figure 6(b) shows the cumulative rumor incidence increases with the increase of spreading rate .

Figure 7 displays how the evolution of rumor diffusion changes over time under different refusing rates , where we set , , and , respectively. In Figure 7(a), it is clear to see that the maximum of spreader density decreases with the increase of refusing rate . If more people have the ability to see through the rumor or do not spread rumor (the direct link from the ignorants to the stiflers), the maximum rumor influence would become smaller. Figure 7(b) shows the cumulative rumor incidence decreases with the increase of refusing rate .

Figure 8 illustrates how the evolution of rumor diffusion changes over time under different stifling rates , where we set , , , and , respectively. Figure 8 shows that both the peak value of the spreader and the cumulative rumor incidence decrease with the increase of stifling rate when is small, while they increase with the increase of when is bigger. However, in this modeling process, we assume that the probability of a spreader node with m nonignorant neighbors recovered in unit time is if the stifling rate is , but, actually, it should be . That is, we use the following approximation: , which is obviously unreliable with larger .

From Figure 9, we can see that the maximum of the density of the spreader decreases with the increase of the forgetting rate and it has almost no effects on the cumulative rumor incidence. Figures 10 and 11 show that the remembering rate and the wakened remembering rate have weak effects on the rumor spreading process. From Figure 11(a), we can see that the peak value of the spreader density increases with the increase of the wakened remembering rate .

Although the rumor spreading in social networks is similar to the disease spreading in biological networks, there are still differences between them. In the process of disease transmission, the recovery mechanism of the infected persons depends on themselves. However, a spreader turns into a stifler that needs to meet other spreaders, hibernators, or stiflers. So the recovery of spreader relies on the structure of network in the rumor spreading model. In order to compare the difference between the two mechanisms, we do the numerical simulations of SIHR model under two mechanisms, which are depending on the structure of network and depending on themselves. In Figure 12, we can see that the maximum rumor influence and the cumulative rumor incidence are lower under the mechanism depending on the structure of network than those under the mechanism depending on themselves.

4. Conclusion

This paper combines the PGF method and the pair approximation method to obtain rumor propagation model in heterogeneous networks. We obtained the following results.

(I) The comparison between numerical simulations and Monte Carlo simulations confirms the accuracy of our model.

(II) Different from the literature results, we get the threshold condition of rumor propagation, which only depends on the characteristics of the network and does not depend on the choice of the initial spreader node, which further confirms the fact that the fluctuations of the connectivity distribution cannot be neglected even for finite size systems. In fact, the choice of the initial spreader node only affects the spread speed of rumors.

(III) The rumor recovery mechanism, different from the disease recovery mechanism, is dependent on the network. This makes its recovery rate not a constant for the rumor propagator. In the early stage of rumor spreading, the neighbors of a rumor spreader are ignorant, so the recovery time of this rumor spreader is prolonged; however, in the middle and later stage of rumor spreading, the number of the ignorant neighbors of a rumor spreader decreases, so that the rumor spreaders recover faster and faster. Therefore, this recovery mechanism will reduce the scale of transmission.

The appendix shows how the model is obtained in detail. This systemic method can be generalized to study other propagation in complex networks.

Appendix

The Dynamics of SIHR Model in Random Networks

In this section, we develop an SIHR rumor spreading model involving the variables , , , , , , , , and . The method used is similar to that [28]. To model the rumor spreading process, we need more notations given in Table 1.

To calculate the dynamics, we choose a random ignorant ego node with degree k at time t. Then there will be a set of k arcs , corresponding to the ego node. We will assume that for each arc there will be a uniform probability that is spreader. There is an expected number arcs such that alter is spreaders. In a time dt, an expected number of these will be such that the ignorant ego received rumor becoming a spreader and of these will be such that the ignorant ego, who received a rumor without spreading it, becomes a stifler. Now let represent the fraction of degree k nodes that remain ignorant at time t. Consequently, the hazard for the fraction of ignorant nodes with degree k becoming spreaders or stiflers at time t is Using (A.1), we have For convenience, we will use the symbol to denote . From (A.2), it is clear that .

Given , it is easy to determine the fraction of nodes which remain ignorants at time t. Therefore, we have where is the defined probability generating function of the degree distribution , which greatly simplifies subsequent equations. Similarly, the change rate of the can be calculated directly as follows: Since from (A.3), we have This does not completely specify the dynamics of , which also depends on the variable . The dynamics of is derived as follows:Our goal is to put (A.6) in terms of the variables , , , and . Thus we have Using (A.7), can easily be obtained as follows:Next consider and in time dt. Firstly, is easily placed in terms of these variable. So we have For calculating , it requires careful consideration of the rearrangement of arcs among sets and . As an ignorant contacts a spreader, the ignorant will become a spreader with a certain probability or will become a stifler directly with a certain probability . So is reduced at a rate . And in time dt, spreaders forget the rumor, so is reduced at a rate . is also increased at the rate because nodes become spreaders due to remembering spontaneously. On the other hand, in addition to its own side, the edge from the new spreader node to other ignorant or spreader neighbors will also be moved into or removed from the collection . Therefore, as an ignorant has links to a spreader, the ignorant becomes a spreader. The newly spreader has on average arcs connected to ignorant nodes, where will represent the average number of arcs from ego to nodes in set not counting one arc to . So also increase at the rate . As nodes leave set in time dt, the fraction of arcs between and , is reduced by the fraction of arcs from spreaders to the newly spreaders or stiflers. Therefore, is reduced at the rate . The quantity is also increased; as the hibernator has links to spreaders, the stimulated hibernator will become a spreader and restart spreading the rumor. The new spreader will have on average arcs to ignorants, so is increased at a rate . Meanwhile, when a spreader contacts another spreader, a hibernator, or a stifler, the initiating spreader becomes a new stifler. The newly stifler will have on average arcs to ignorants, so is decreased at a rate . To summarize, the derivation of the dynamics of follows:To calculate we need to first calculate the number of ordered triples in which the second node has degree . We have Obviously, we have. Just like (A.11), we have Meanwhile, , can be represented by , separately. So there are and From (A.8) and (A.11)(A.14), we have Now applying (A.7)(A.9) and (A.15), we solve for in terms of and PGF as follows: This equation makes use of the variable which changes in time. Whose calculation is very similar to Firstly, we derive the dynamics of So we can get and Therefore, we have Similarly, applying the above mathematical method, the calculation of can be derived. So we have and From (A.20) and (A.21), we have Now, it is necessary to determine the derivation of to complete the model. In the same way, we have For calculating , we need to obtain the following: andFrom (A.23)(A.26), we have This equation makes use of the variable which changes in time. Deriving the dynamics of this variable will complete the model. So we have the following results. Next consider ; we have For the goal of , we must arrange the change of arcs ; we have From (A.25), (A.28), and (A.29), we have Finally, it is necessary to determine the time derivative of . Next consider in time dt. Because , we have From (A.32), we have For the goal of , we must arrange the change of arcs . So we have From (A.32)(A.34), we have

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that no conflicts of interest exist with regard to the publication of this paper.

Acknowledgments

The research is supported by the National Natural Science Foundations of China under Grant snos. 11571210, 11331009, 11501339, 11101251, 11001157, 11471197, and 11701348 and the Youth Science Foundation of Shanxi Province no. 2010021001-1.