Abstract

The network-based cooperative information spreading is a widely existing phenomenon in the real world. For instance, the spreading of disease outbreak news and disease prevention information often coexist and interact with each other on the Internet. Promoting the cooperative spreading of information in network-based systems is a subject of great importance in both theoretical and practical perspectives. However, very limited attention has been paid to this specific research area so far. In this study, we propose an effective approach for identifying the influential latent edges (that is, the edges that do not originally exist) which, if added to the original network, can promote the cooperative susceptible-infected-recovered (co-SIR) dynamics. To be specific, we first obtain the probabilities of each nodes being in different node states by the message-passing approach. Then, based on the state probabilities of nodes obtained, we come up with an indicator, which incorporates both the information of network topology and the co-SIR dynamics, to measure the influence of each latent edge in promoting the co-SIR dynamics. Thus, the most influential latent edges can be located after ranking all the latent edges according to their quantified influence. We verify the rationality and superiority of the proposed indicator in identifying the influential latent edges of both synthetic and real-world networks by extensive numerical simulations. This study provides an effective approach to identify the influential latent edges for promoting the network-based co-SIR information spreading model and offers inspirations for further research on intervening the cooperative spreading dynamics from the perspective of performing network structural perturbations.

1. Introduction

The influence of information spreading on human society is increasingly profound. Promoting the spreading of some typical information, for instance, the vaccination guidance, technological innovation, commercials, and political propaganda can benefit all the aspects of socio-economic systems [15]. Therefore, researchers from multiple fronts, including statistical physics and network science, have been paying substantial attention to the specific subject of modeling and promoting the information spreading [610].

The complex network theory plays an important role in the study of information spreading. Under the framework of complex networks, a large amount of studies on information spreading dynamics have been gained. For instance, various network-based information spreading models such as the susceptible-informed-susceptible (SIS) model [11, 12], susceptible-informed-recovered (SIR) model [13, 14], threshold model [1519], and many of their extensions [2024] have been proposed to describe the various spreading dynamics of a single piece of information. The theoretical study of network-based spreading dynamics with a single-information has a long and successful history. However, cooperative information spreading widely exists in the real world [25, 26]. For example, the transmission of epidemic outbreak news may promote the spreading of epidemic prevention information. Recently, considering the fact that the spreading phenomena in the real world are often influenced by cooperativity effects, researchers go further to study on the cooperative spreading dynamics. Many efforts have been made to uncover and understand the interaction mechanisms [27], evolution patterns [28, 29], phase transition [30], and critical dynamics [31] of the cooperative spreading dynamics [32].

In addition to understanding and modeling of the cooperative spreading dynamics, the problem of how to effectively intervene the cooperative dynamics evolution is also of great importance in both theoretical and practical perspectives. In the study of intervening a single dynamics, researchers have developed numerous strategies from the perspective of designing effective transmission strategies [3338], identifying vital nodes [3946] and performing network structural perturbations [3, 4749]. However, the study of intervening cooperative spreading started to attract attention only recently. Min et al. [50] adopted the message-passing approach to find the influential spreaders by predicting the probability that an extensive coinfection outbreak initiated by two singly infected seed nodes.

Given the limited number of studies so far, no attention has been paid in intervening the cooperative information spreading from the perspective of performing network structural perturbations, to the best of our knowledge. To raise the interest of researchers to fill up this research blankness, in this study, we propose an effective approach to identify the influential latent edges which can promote the cooperative susceptible-infected-recovered (co-SIR) dynamics [50] if added to the original networks. The co-SIR model is first proposed to study the cooperative epidemic spreading. In this study, we extend it to describe cooperative information spreading. Specifically, we first employ the message-passing approach to predict the probabilities of each node being in different node states. Next, based on the obtained state probabilities of nodes, we come up with an indicator for measuring the influence of each latent edge in promoting the cooperative dynamics. Finally, the influential latent edges can be identified by ranking all the latent edges according to their quantified influence. Note that, the indicator we proposed in this study incorporates both the network topology information and dynamics information in the network-based co-SIR dynamical system. This study will verify the rationality and superiority of the proposed indicator in identifying the influential latent edges.

This paper is organized as follows. Firstly, Section 2 will give a detailed description of the co-SIR model. Secondly, Section 3 will show how to identify the influential latent edges in the network-based co-SIR dynamical system. Further, Section 4 will present the extensive numerical simulations to verify the effectiveness of our approach in identifying the influential latent edges. Finally, Section 5 will give a conclusion about this study.

2. Model Description

We consider a discrete-time cooperative co-SIR information spreading model [50] in this study. The co-SIR dynamics runs on a complex network of nodes and edges. Denote the adjacency matrix of network as ; thus, when there is an edge between node and , we have ; otherwise, . To be specific, the co-SIR model merges two classic SIR dynamics, that is, and . Each node of the network can be in three distinct states with respect to each one of the two dynamics. For instance, when dynamics A alone is considered, a node can be in susceptible , informed and able to transmit the information , or removed state. Therefore, when we take both dynamics and into consideration, there should be overall nine possible node states, that is, , , , , , , , , and .

Besides, the cooperativity between dynamics and is defined as follows: for a node informed with both dynamics, the probability to transmit the information by a single contact increases. Recently, Min and Castellano [50] introduced this kind of cooperative mechanism to describe the cooperativity of epidemics, considering the biological rationale that an infection with a pathogen may cause an individual to pass from the latent state to a fully infective one with respect to another pathogen. Similarly, when it comes to the spreading of information, being informed with message may encourage an individual to be more active in transmitting another message . For instance, when an individual is informed of an epidemic outbreak, he/she may be more willing to transmit the information about epidemic prevention.

To be specific, the dynamic process is as follows. Initially, all the nodes of network are set to be in the state. Next, we selected a small fraction of nodes to be the spreading seeds with respect to dynamics and , respectively. At the beginning of each discrete-time step, for each node and each one of its neighbor , if is informed with dynamics (or ) alone and susceptible with respect to the other, then will transmit the information (or ) to with probability (or ). That is to say, in this case, the state of node will be updated according to the following transition rules:

Otherwise, if is in the recovered state with respect to the other dynamics, the transmission probability will be (or ). Accordingly, the updating rules of the state of node are as follows:

Similarly, when is in the informed state and is not susceptible (i.e., is in the informed or recovered state) with respect to the other dynamics, the transmission probability should be (or ) as well. The corresponding updating rules are shown as follows:

Note that when the state of node and is and , respectively, node may transmit information (or ) with probability (or ) first and then transmit information (or ) with probability (or ) during the same time step. Specifically, the state of the susceptible node is updated according to the following transition rules:

After the transmission process, all the informed nodes recover with probability (or ) with respect to dynamic (or ); and the time step is over. The recovery rules are as follows:

Finally, the spreading dynamics will be terminated once there is no node that can further transmit information.

In the rest of the paper, we consider only the symmetric case, where and , and leave the nonsymmetric case open for future work. Besides, for the sake of simplicity, we assume the recovery probability .

3. Identifying Influential Latent Edges

Now, we turn to the identification of the influential latent edges of network , which can promote the spreading of the co-SIR model. For convenience, we call the nodes that are informed with both dynamics as the co-informed nodes. Denote the cooperative spreading prevalence of information and (that is, the fraction of co-informed nodes) as . Specifically, we are interested in finding the influential latent edges, which, if added to the original network , can maximize the cooperative spreading prevalence .

To identify the influential latent edges, we need an indicator to measure the influence of each latent edge in promoting the cooperative spreading prevalence . A direct indicator for measuring the influence of latent edge is the expected incremental number of co-informed nodes observed after adding an edge between node and . However, this indicator expects a large analytical and computational cost. Inspired by the recent references [7, 51], we narrow our focus to the node set , which is composed of node , , and their neighbors. It will be much more convenient to calculate the expected incremental number of co-informed nodes in the node set . Denote the probability of node being in the state after the termination of dynamics as . Note that we should have in the symmetric case. After adding an edge between node and , consider the information is transmitted from node to ; then, according to the transmission rules in Section 2, node becomes a new co-informed node with probability:

The first term on the r.h.s accounts for the case when the final states of node and are and , respectively; the second term takes into account the case when the final state of node is and the final state of node is or ; and the third term accounts for the case when and or and . Next, the co-informed node may transmit both information and to its neighbors. Therefore, we can approximately get the expected incremental number of co-informed nodes generated through this way of transmission initiated at node as

Similarly, we can get the expected incremental number by symmetry. Take both cases of and into consideration; therefore, we can define an indicator to measure the influence of latent edge as

Hereafter, the problem reduces to solving Equation (8), i.e., finding the probabilities of nodes being in different states after the termination of dynamics.

According to Section 2, the steady state behavior of the co-SIR model considered in this study can be predicted analytically by a message-passing approach [50]. Denote as the probability that node has not been informed by node with respect to information . In the symmetric case, we have , and the message-passing equations for these quantities arewhere is the set of neighbor of node and represents the set of neighboring nodes of node excluding . The first term on the r.h.s. takes into account the case that node is informed with one piece of the information by node that is in the co-informed state at the end of the dynamics. Meanwhile, the second term accounts for the case that node is informed with one piece of the information by node that is informed with only one piece of information.

Solving Equation (9) by iteration, we can get the probability that node being informed with both information and at the end of the dynamics as follows:

Similarly, the probability that node being informed only with one piece of the dynamics at the end is

Besides, the fraction of co-informed nodes after the termination of the dynamics is

Combining Equations (10) and (11), we can calculate the influence of each latent edge according to Equation (8) and then identify the influential latent edges for promoting the spreading of the co-SIR model by ranking all the latent edges with respect to the corresponding values of .

4. Simulation Results

This section will show the results of extensive numerical simulations on both synthetic and real-world networks to verify the effectiveness of our approach in identifying the influential latent edges. The detailed structural information of the networks employed in this section can be found from Table 1. Note that the message-passing approach [50] mentioned in Section 3 can accurately predict the Monte Carlo simulations; thus, to avoid large computational cost, we obtain the numerical value of cooperative spreading prevalence by solving Equation (12) instead of using the Monte Carlo simulation approach.

In our approach, we identify the influential latent edges by ranking the values of , which incorporates the information of both network structure (i.e., the adjacency matrices ) and spreading dynamics (i.e., and ). For comparison, we also test two additional approaches that only rely on network topology. On the one hand, we consider the approach to rank all the latent edges by which denotes the product of the degree centrality of nodes connected by the given latent edge. On the other hand, we employ the strategy to rank all the latent edges by , which is the product of betweenness centrality of nodes connected by the given latent edge. With the help of numerical simulations, now we are going to show the effectiveness and advantage of our approach in identifying the influential latent edges step by step.

Firstly, to verify the rationality and superiority of employing indicator to rank the influence of latent edges in promoting the spreading of co-SIR dynamics, we investigate the correlations between and the incremental cooperative spreading prevalence . The dynamics runs on a scale-free (SF) network with degree distribution , where denotes the degree exponent. More detailed information about the network can be found in Table 1. Figures 1(a)1(c) show versus when the cooperativity parameters are equal to 1, 1.5, and 2, respectively. Note that the transmission probabilities are set to be in all the numerical simulations of Figures 1(a)1(c). As can be seen, the values of and are almost linearly correlated in all the cases studied; thus, it is rational to use the indicator in ranking all the latent edges. Next, we go further to investigate the correlations between and for all the parameter regions by calculating the Spearman rank correlation coefficient [52, 53] between them. The Spearman rank correlation coefficient is defined as follows:where and are the ranks of latent edge scored by and , respectively, and is the total number of latent edges. For convenience, we refer to the latent edge rank scored by as numerical rank . Figures 1(d)1(f) show the correlation versus transmission probability when , , and , respectively. Note that we only consider the situation when , where is the threshold value of the dynamics; since in the case when , the cooperative spreading prevalence should be theoretically equal to 0. The theoretical values of are given by the inverse of the principal eigenvalue of the nonbacktracking matrix [50], where is a matrix with elements:

The specific values of of the networks can be found in Table 1. We can observe that the values of stay close to 1 (that is to say, the latent edge ranks scored by and are strongly correlated) in all the parameter regions. For comparison, we also calculate the Spearman rank correlations between the latent edge ranks scored by degree centrality product or betweenness centrality product and . As shown in Figures 1(d)1(f), the latent edge ranks scored by or are positively correlated with the numerical rank only when is small. However, when becomes large, the underlined correlations become negative. To sum up, Figures 1(a)-1(f) demonstrate that using the proposed indicator to identify the influential latent edges is more rational than using the centrality product or .

Secondly, considering the problem of identifying the influential latent edges, we are particularly interested in finding the optimal latent edge which can maximize the cooperative spreading prevalence; thus, we are going to show that the indicator proposed in this study performs well in identifying the optimal latent edge. Denote , , and as the optimal latent edges selected according to indicators , , and , respectively. Let us continue with the co-SIR dynamics on the SF network . Figures 2(a)2(c) present the numerical rank of the optimal latent edge versus transmission probability when , , and , respectively. It can be seen that for most values of regardless of the values of cooperative parameter . That is to say, the indicator can well predict the optimal edge which can maximize the cooperative spreading prevalence . Denote as the normalized numerical rank, where smaller indicates higher rank. Figures 2(d)2(f) further show the normalized ranks of optimal latent edges , , and versus when , , and , respectively. Visually, , , and all rank high when is slight above the threshold value, but once becomes large, the ranks of and fall quickly. Therefore, we can draw a conclusion that the strategy of using the indicator which incorporates both information of network topology and dynamics is better than using those structural indicators (i.e., and ) in finding the optimal latent edge.

Thirdly, to better understand the specific influence of the optimal latent edges , , and in promoting the co-SIR dynamics, we calculated the corresponding incremental cooperative spreading prevalence after adding those optimal latent edges to the original networks. For network , Figures 3(a)3(c) show the incremental cooperative spreading prevalence versus when , , and , respectively. The results demonstrate that adding the optimal latent edge selected according to the proposed indicator can effectively promote the cooperative spreading prevalence for most of the values of . However, adding the optimal or to has little influence in promoting for almost all the cases studied. Note that, on the network , when is small, the cooperative spreading prevalence ; thus, the incremental spreading prevalence after adding any single edge to the network. Similarly, when becomes too large, ; thus, as well. We also test the effectiveness of optimal latent edges , , and in promoting the spreading dynamics on three real-world networks: (a) Jazz musicians [54]; (b) physicians [54]; and (c) inf-USAir97 [55]. Some structural information of these real-world networks can be found in Table 1. Figure 4 shows the incremental cooperative spreading prevalence versus on the three real-world networks with different cooperativity parameter . The results are consistent with those of the SF network , i.e., using the proposed indicator in this study can effectively identify the influential latent edge to promote the co-SIR dynamics.

Finally, we investigate how the structural properties (i.e., degree, betweenness, and eigenvector centrality) of optimal latent edge change with . Denote , , and as the normalized latent edge rank scored by the degree centrality product , betweenness centrality product , and eigenvector centrality product , respectively. Figures 5(a)5(c) show the normalized rank of the optimal latent edge versus on different networks when , , and , respectively. Similarly, the corresponding results about normalized rank and are shown in Figures 5(d)5(f) and Figures 5(g)5(i), respectively. As can be demonstrated from Figure 5, when is small, the centrality ranks of the optimal latent edge are high for most of the cases. In other words, adding connection between nodes with high centrality can effectively promote the spreading of co-SIR dynamics when the transmission probability is small. This can be understood by the fact that when is near, the threshold should be small, and adding an edge between nodes with high centrality can help to keep the cluster of co-informed nodes, thus, promoting the co-SIR spreading dynamics. However, when becomes large, the centrality ranks of fall quickly in most of the cases studied. This is because nodes with high centrality ranks will have a higher probability to be co-informed when is large; thus, it is unnecessary to add connection between them. The results in Figure 5 give an intuitive explanation for the failure of indicator and in identifying influential latent edges when is large.

5. Conclusions

Cooperative information spreading on networked systems is a common phenomenon in the real world. The study of promoting the network-based cooperative spreading dynamics is of both theoretical and practical importance. In this study, we proposed an effective approach to identify the influential latent edges for promoting the spreading of the co-SIR model on complex networks.

To be specific, we first employ the message-passing approach to obtain the probabilities of each nodes being in different node states. Next, given the obtained state probabilities of nodes, we proposed the indicator to measure the influence of each latent edge in promoting the spreading of the co-SIR model. Then, we can rank all the latent edges by the indicator to identify the influential latent edges for promoting the co-SIR dynamics on the complex networks. Note that the indicator incorporates both the information of network structure and the co-SIR dynamics; and the numerical simulations verified that the indicator outperforms those structural indicators (i.e., the degree centrality product and betweenness centrality product ) in identifying the influential latent edges on both synthetic and real-world networks. Finally, we investigated how the structure properties of the optimal latent edges change with the transmission probability . It is surprising that the centrality ranks of the optimal latent edges fall quickly when becomes large in most of the cases studied. This finding gives an intuitive explanation for the failure of those structural indicators in identifying the influential latent edges when is large.

This study provides an effective approach for identifying the influential latent edges for promoting the network-based co-SIR information spreading dynamics. The research findings offer inspirations for further studies on intervening the cooperative spreading dynamics from the perspective of performing network structural perturbations.

Data Availability

The network data used to support the findings of this study have been deposited in the KONECT: the Koblenz Network Collection repository (10.1145/2487788.2488173) and the Network Data Repository with Interactive Graph Analytics and Visualization (http://networkrepository.com).

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant 11975071 and 62006122, in part by the Scientific Research Foundation of Shantou University under Grant NTF19015, and in part by the 2020 Li Ka Shing Foundation Cross-Disciplinary Research under Grant 2020LKSFG09D.