Analysis of the Dynamic Influence of Social Network Nodes

Yin, Hong-Jian; Yu, Hai; Zhao, Yu-Li; Zhu, Zhi-Liang; Zhang, Wei

doi:https://doi.org/10.1155/2017/5046905

Scientific Programming


Research Article | Open Access

Volume 2017 | Article ID 5046905 | https://doi.org/10.1155/2017/5046905


Academic Editor: Fabrizio Riguzzi

Received 21 Oct 2016

Accepted 05 Jan 2017

Published 31 Jan 2017

Abstract

In recent years, with the development of social network theories, finding the most significant node in a social network in order to understand or control information dissemination has become a hot topic, and a series of effective algorithms have been presented. In this paper, a new scheme to measure the dynamic influence of the nodes in a social network is proposed, in which the sum of the trust values of the propagation nodes is used. Simulations have been carried out, and the results show that our scheme is stable and accurate.

1. Introduction

In the past decades, revolutionary developments in communication tools have significantly changed people's social relationships. In the 1960s, Milgram's small-world experiment showed that the average distance between any two people on Earth is about six steps, a phenomenon referred to as six degrees of separation [1, 2]. In 2011, an analysis of the friend networks of 750 million active Facebook users showed that the average distance between Facebook network nodes was only 4.74 degrees [3, 4]. In social network analysis, it is important to find out which node has the largest impact. Therefore, many measures have been proposed to calculate the importance of a node from different perspectives, including degree centrality, betweenness centrality, closeness centrality, k-shell centrality, eigenvector centrality, and the PageRank algorithm.

Degree centrality, proposed by Linton Freeman, reflects local properties of the network: it mainly considers the node itself and the properties of its neighbors. Although degree centrality is simple to calculate, it has some deficiencies [5–7]. Betweenness centrality, closeness centrality, and eigenvector centrality reflect global properties [8] of networks. Among them, betweenness centrality [9, 10] mainly considers the shortest paths that pass through the node. Closeness centrality [11] measures how difficult it is to reach the other nodes. Eigenvector centrality [12, 13] mainly considers status and prestige in the network, using the reputation of the other nodes to reflect the influence of a node on the entire network. K-shell centrality uses a node's location within the network to measure its communication capacity [14, 15]. In addition, the PageRank algorithm [16] is also used to measure the impact of network nodes.
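To make two of these measures concrete, the following is a minimal sketch of degree centrality and closeness centrality in their common textbook normalizations. The function names and the small `path_graph` example are hypothetical and are not taken from the paper; the sketch assumes an undirected, connected graph with at least two nodes, stored as an adjacency dictionary.

```python
from collections import deque


def degree_centrality(adjacency):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(adjacency)
    return {node: len(neighbors) / (n - 1) for node, neighbors in adjacency.items()}


def closeness_centrality(adjacency, node):
    """(n - 1) divided by the sum of shortest-path distances from `node`.

    Assumes the graph is connected, so every node is reachable.
    """
    distance = {node: 0}
    queue = deque([node])
    while queue:  # breadth-first search gives shortest hop counts
        current = queue.popleft()
        for neighbor in adjacency[current]:
            if neighbor not in distance:
                distance[neighbor] = distance[current] + 1
                queue.append(neighbor)
    return (len(adjacency) - 1) / sum(distance.values())


# Hypothetical three-node path a - b - c, used only for illustration.
path_graph = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
print(degree_centrality(path_graph))
print(closeness_centrality(path_graph, "b"))
```

In this toy graph the middle node scores highest on both measures, which illustrates why purely topological measures cannot distinguish nodes whose trust relationships differ.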

Currently, most measures are based on statistical properties of the network topology and do not take into account how mutual trust among the nodes changes during information dissemination. In this paper, a new scheme to measure the dynamic influence of the nodes in a social network is proposed. In this new scheme, the modification of a node's trust value during information propagation plays a significant role. Furthermore, the cumulative change of the nodes' trust values is also considered.

2. The Measure of Dynamic Influence

2.1. The Model of Information Dissemination

The SI, SIS, and SIR models, among others [17, 18], were originally used to study the spread of disease [19–21]. In these networks, people and their relations are considered as nodes and edges, which can be represented by $G = (V, E)$, where $V$ is the set of nodes and $E$ is the set of connected edges. All nodes can be divided into three categories: class $S$ (susceptible) refers to those who are not sick but, because they lack immunity, are susceptible to infection after contact with an infected person; class $I$ (infective) refers to those who have the infectious disease and can spread it to class $S$ members; and class $R$ (removal) refers to those who are isolated or who have had the illness and acquired immunity.

In addition, suppose the number of nodes is constant , and at each step the number of nodes to be removed is a constant proportion of the total number of . The average propagation period is . The dissemination number is . Figure 1 shows the procedure from the susceptible state to the removal state.

The SIR model is defined by the following equation:
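For reference, the classical SIR system is commonly written in the form below. The contact rate $\beta$, the removal rate $\gamma$, and the total population $N$ are symbol names assumed here; they may differ from the authors' notation and parameterization.

```latex
\begin{aligned}
\frac{dS}{dt} &= -\beta S I, \\
\frac{dI}{dt} &= \beta S I - \gamma I, \\
\frac{dR}{dt} &= \gamma I, \\
S + I + R &= N .
\end{aligned}
```

Summing the three rate equations gives zero, which is consistent with the assumption that the total number of nodes stays constant.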

If the SIR model is used to describe information dissemination in network $G$, $S$ is considered as a node that can receive information, $I$ represents a node that has received information and has the ability to disseminate it, and $R$ represents a node that has received information but does not have the ability to disseminate it.
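The three node states can be illustrated with a small discrete-time simulation on a graph. This is a minimal sketch under stated assumptions, not the authors' model: the function name `simulate_sir`, the fixed per-round probabilities `p_spread` and `p_remove`, and the toy graph are all hypothetical.

```python
import random


def simulate_sir(adjacency, source, p_spread=0.5, p_remove=0.3, rounds=50, seed=0):
    """Run a simple round-based S/I/R information spread from `source`.

    S: can receive information; I: has it and can pass it on;
    R: has it but no longer passes it on.
    """
    rng = random.Random(seed)
    state = {node: "S" for node in adjacency}
    state[source] = "I"
    history = []
    for _ in range(rounds):
        newly_informed, newly_removed = set(), set()
        for node, status in state.items():
            if status != "I":
                continue
            # Each spreading node tries to inform its susceptible neighbors.
            for neighbor in adjacency[node]:
                if state[neighbor] == "S" and rng.random() < p_spread:
                    newly_informed.add(neighbor)
            # A spreading node may stop spreading at the end of the round.
            if rng.random() < p_remove:
                newly_removed.add(node)
        for node in newly_informed:
            state[node] = "I"
        for node in newly_removed:
            state[node] = "R"
        counts = {k: sum(1 for s in state.values() if s == k) for k in "SIR"}
        history.append(counts)
        if counts["I"] == 0:  # nothing left to spread
            break
    return state, history


toy = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}
final_state, history = simulate_sir(toy, source=0)
print(history[-1])
```

State changes are collected first and applied at the end of each round, so a node informed in one round only starts spreading in the next.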

2.2. The Measure of Node Dynamic Influence

In this section, to further investigate the relationship between the sender node and the receiver node in a complex network, the nodes are stratified according to their distance from the information source node. The layered network is shown in Figure 2.
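Stratifying nodes by their distance from the source can be done with a breadth-first search. The sketch below is illustrative only: `layer_nodes` and the example graph are hypothetical names, and the source is placed in layer 0 here, which is a convention chosen for the example rather than one stated in the text.

```python
from collections import deque


def layer_nodes(adjacency, source):
    """Group nodes by hop distance from `source` using breadth-first search."""
    distance = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in adjacency[node]:
            if neighbor not in distance:
                distance[neighbor] = distance[node] + 1
                queue.append(neighbor)
    layers = {}
    for node, hops in distance.items():
        layers.setdefault(hops, []).append(node)
    return layers


# Hypothetical graph with source "a".
graph = {
    "a": ["b", "c"],
    "b": ["a", "d"],
    "c": ["a", "d"],
    "d": ["b", "c", "e"],
    "e": ["d"],
}
print(layer_nodes(graph, "a"))
```

Nodes that cannot be reached from the source are simply absent from the result, so the layering is only complete on a connected graph.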

The dynamic influence index of a social network is represented by . The number of rounds in which the information source node transmits information over the network is represented by . In addition, the trust value is the cumulative effect of the information spreading, where represents the degree to which a node accepts a push message from another node, namely, trust, .

In Figure 2, node a is considered as the information source node, and node a disseminates information to a set of neighbors with a certain probability. Node and node are connected by edge ; is the trust value on edge . When variable changes, the value of is also changed. This feature can be considered as a dynamic influence.

Use to represent the number of valid pushes and to represent the number of invalid pushes. With representing the trust value on edge between any two nodes and in the network at of dissemination, the value is

The values of at and of dissemination are

Meanwhile, value is calculated by

If , the message pushed along the edge is an invalid recommendation, while means that it is a valid recommendation in (3) and (4), where is the probability that the current node propagates the message to its neighbor class nodes and is the number of neighbor nodes of the current node. It is important to note that represents the outdegree in a directed network, while in an undirected network it denotes the degree of the node.

Probability is determined by the level of the node, the information lost during dissemination, and the effect of the cumulative dissemination history on the current dissemination. In Figure 2, according to the information attenuation principle, probability is inversely proportional to variable $l$ (when , the default value of $l$ for each node is 1), where $l$ indicates the distance from the current node to the source node. Probability is also inversely proportional to (when , the default value for each node is 0), which indicates the number of times the network has gone through the information dissemination process. Probability is proportional to trust , which is derived from the edge connecting the current node and its parent node.

The definition of probability is as shown in

It is apparent that probability decreases as the layer gets deeper, according to (3). Here, a new variable is introduced to balance and limit the range of the value.

After -rounds of information dissemination, the number of edges is , indicating the edge count along the information route. In this case, the influence of the source node is defined by the following equation:

where denotes the edge count and is the influence of the source node after of the information dissemination.

2.3. The Detail of the Algorithm

The new algorithm aims to explore the relationship between the influence of the social network nodes and accumulation effects of information transmission. In this section, the stabilization of is used to eventually measure the influence of the node. A flow chart of the detailed algorithm is shown in Figure 3.

The spreading node selection algorithm consists of two parts: (1) when a node pushes a message, most of its neighbors hold a trust value toward it; (2) one neighbor node is chosen to receive the information.

The trade-offs of value determine the value of probability : a larger indicates a larger . The algorithm selects the node that has the largest value, which is .

The push target node selection strategy is as follows: when a node is able to push a message, it examines its neighbor nodes and selects the neighbor with the maximum value as the next node on the push path.
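The selection strategy described above can be sketched as a greedy walk that always moves to the neighbor with the largest value. This sketch assumes that the value being maximized is the trust value stored on each edge; the names `select_push_target` and `push_path`, the `trust` dictionary, and its numbers are hypothetical, and the rule for updating trust between rounds is deliberately left out.

```python
def select_push_target(trust, node, visited):
    """Return the unvisited neighbor of `node` with the largest trust value."""
    candidates = {
        neighbor: value
        for neighbor, value in trust.get(node, {}).items()
        if neighbor not in visited
    }
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


def push_path(trust, source, max_hops):
    """Follow the greedy choice from `source` for at most `max_hops` pushes."""
    path, visited, current = [source], {source}, source
    for _ in range(max_hops):
        target = select_push_target(trust, current, visited)
        if target is None:  # no neighbor left to push to
            break
        path.append(target)
        visited.add(target)
        current = target
    return path


# Hypothetical trust values on directed edges, e.g. trust["a"]["c"] = 0.8.
trust = {
    "a": {"b": 0.6, "c": 0.8},
    "b": {"d": 0.5},
    "c": {"d": 0.4, "e": 0.9},
    "d": {},
    "e": {},
}
print(push_path(trust, "a", max_hops=3))  # ['a', 'c', 'e']
```

Tracking visited nodes keeps the walk from looping back on itself; whether the original algorithm allows revisits is not stated in the text.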

3. Simulation Results

The initial values of the coefficients in our simulation are set to , , , , . All the data are from the public database [22, 23] shown in Table 1.

The datasets include directed networks, undirected networks, a theoretical network, real social networks, and the like. The BA scale-free network [24] is a theoretical network model proposed by Barabási and Albert as a mechanism that produces a power-law degree distribution. Because the original online social network datasets contain isolated nodes, the isolated nodes are removed from the raw data in this experiment, and the largest connected subgraph [25] is used in our simulations.
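The preprocessing step can be sketched as follows. This is an illustrative sketch that assumes an undirected graph stored as a symmetric adjacency dictionary; the function name `largest_connected_subgraph` and the example data are hypothetical.

```python
def largest_connected_subgraph(adjacency):
    """Return the adjacency dictionary restricted to the largest connected component.

    Isolated nodes form components of size one, so they are dropped
    whenever a larger component exists.
    """
    seen, best = set(), set()
    for start in adjacency:
        if start in seen:
            continue
        component, stack = {start}, [start]
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            for neighbor in adjacency[node]:
                if neighbor not in component:
                    component.add(neighbor)
                    stack.append(neighbor)
        seen |= component
        if len(component) > len(best):
            best = component
    return {node: [nb for nb in adjacency[node] if nb in best] for node in best}


# Hypothetical raw data: a path a-b-c, an isolated node d, and a pair e-f.
raw = {"a": ["b"], "b": ["a", "c"], "c": ["b"], "d": [], "e": ["f"], "f": ["e"]}
print(sorted(largest_connected_subgraph(raw)))  # ['a', 'b', 'c']
```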

In the simulations, the data of () are 7.2594, 33.9749, 4.8111, 32.3878, and 9.989, respectively. The simulation results are listed in Table 2. In Table 2, represents the average degree [26] of the network, is the average path length [27] of the network, is the diameter of the network [28], is the density [29] of the network, is the weighted average of the network, and is the average clustering coefficient [30, 31] of the network.
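Two of the statistics named above have simple closed forms for an undirected graph with n nodes and m edges: the average degree is 2m/n and the density is 2m/(n(n-1)). The sketch below uses these standard definitions; the function name `basic_stats` and the example graph are hypothetical, and the values it prints are not related to Table 2.

```python
def basic_stats(adjacency):
    """Return (average degree, density) of an undirected graph."""
    n = len(adjacency)
    # Each undirected edge appears in two adjacency lists.
    m = sum(len(neighbors) for neighbors in adjacency.values()) / 2
    average_degree = 2 * m / n
    density = 2 * m / (n * (n - 1)) if n > 1 else 0.0
    return average_degree, density


# Hypothetical graph: a triangle a-b-c with a pendant node d attached to c.
example = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"], "d": ["c"]}
print(basic_stats(example))
```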

In Figure 4, we selected the highest ranking node from the five network datasets. The ranking is according to the dynamic influence of 1000-round dissemination. In Figure 5, similarly, we selected the lowest ranking node from the five network datasets; the ranking is according to the dynamic influence of 1000-round dissemination. The horizontal axis is the number of dissemination rounds; the vertical axis is the trust value that represents the dynamic influence, where the actual results of trust were normalized so that the results of different networks can be compared in the same coordinate system.

In the Facebook network, node number 67 has the highest influence, which rises rapidly within the first 100 rounds of the dissemination phase and then increases slowly from 100 to 200 rounds. It finally stabilizes at approximately 400 rounds, with a trust value of 0.7911. Node number 157 has a very low influence, which jitters slightly within the first 50 rounds of the dissemination phase and then declines rapidly from 50 to 200 rounds. It finally stabilizes at approximately 784 rounds, with a trust value of 0.355.

According to the simulation results, although the networks have different statistical properties, they follow the same pattern: the influence of the highest-ranking node increases quickly as the number of dissemination rounds increases and eventually becomes stable at approximately 300 rounds, whereas the influence of low-ranking nodes decreases quickly and eventually becomes stable at approximately 500 rounds.

In addition, the proposed algorithm (DI) is compared to degree centrality (DC), betweenness centrality (BC), closeness centrality (CC), eigenvector centrality (EC), and PageRank (PR) to verify the accuracy and validity of the algorithm.

In Table 3, the top four nodes are listed according to the different measurement algorithms. The results are roughly the same. Finally, the top 10% of nodes by importance in each dataset are used for analysis and comparison, as shown in Table 4.

In the Facebook network, represents the collection of 10 elements that the five classical algorithms jointly determine as top 10% nodes (their intersection); represents the overlap of this collection with the top 10% nodes of the DC algorithm, namely, the intersection hits: = 100%. Similarly, represents the union of the top 10% nodes under the five classical algorithms, and the number of elements in this set is 55; represents the overlap of the union with the top 10% nodes of the DC algorithm, namely, the union hits: = 100%. The simulation results show that the proposed algorithm has good accuracy and effectiveness.
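A comparison of top-ranked node sets can be sketched as below. The exact definition of the hit rate is not recoverable from the text, so this sketch substitutes the overlap coefficient, the size of the intersection divided by the size of the smaller set, as one plausible reading; that choice, the function names, and the toy scores are all assumptions rather than the authors' method.

```python
def top_fraction(scores, fraction=0.10):
    """Return the set of nodes whose scores rank in the top `fraction`."""
    k = max(1, int(len(scores) * fraction))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return set(ranked[:k])


def overlap_coefficient(a, b):
    """|a & b| / min(|a|, |b|); 1.0 means the smaller set is fully contained."""
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


# Hypothetical scores from two ranking methods on the same four nodes.
method_x = {"n1": 0.9, "n2": 0.7, "n3": 0.2, "n4": 0.1}
method_y = {"n1": 0.8, "n2": 0.3, "n3": 0.6, "n4": 0.1}
top_x = top_fraction(method_x, fraction=0.5)
top_y = top_fraction(method_y, fraction=0.5)
print(overlap_coefficient(top_x, top_y))
```

With several classical rankings, the same function can be applied to the intersection or the union of their top sets, which mirrors the two comparisons described in the text.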

4. Conclusion

In this paper, a new scheme for judging the dynamic influence of social network nodes is proposed. The new measurement takes into account how trust values change during the information dissemination process, which is an improvement over the traditional algorithms. Finally, we analyze the influence of nodes with respect to the topology and statistical properties of the network and compare the scheme with several classical algorithms to verify its validity and accuracy.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (Grant nos. 61374178 and 61402092), the Online Education Research Fund of MOE Research Center for Online Education, China (Qtone Education, Grant no. 2016ZD306), and the Ph.D. Start-up Foundation of Liaoning Province, China (Grant no. 201501141).

References

[1] S. Milgram, “The small world problem,” Psychology Today, vol. 2, no. 1, pp. 60–67, 1967.
[2] D. J. Watts and S. H. Strogatz, “Collective dynamics of 'small-world' networks,” Nature, vol. 393, no. 6684, pp. 440–442, 1998.
[3] L. Backstrom, P. Boldi, M. Rosa, J. Ugander, and S. Vigna, “Four degrees of separation,” in Proceedings of the 4th Annual ACM Web Science Conference, pp. 33–42, 2012.
[4] N. A. Christakis and J. H. Fowler, Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives-How Your Friends' Friends' Friends Affect Everything You Feel, Think, and Do, Little, Brown and Company, New York, NY, USA, 2011.
[5] R. Albert, H. Jeong, and A.-L. Barabási, “Error and attack tolerance of complex networks,” Nature, vol. 406, no. 6794, pp. 378–382, 2000.
[6] R. Pastor-Satorras and A. Vespignani, “Epidemic spreading in scale-free networks,” Physical Review Letters, vol. 86, no. 14, pp. 3200–3203, 2001.
[7] R. Cohen, K. Erez, D. Ben-Avraham, and S. Havlin, “Breakdown of the internet under intentional attack,” Physical Review Letters, vol. 86, no. 16, pp. 3682–3685, 2001.
[8] L. Jian-Guo, R. Zhuo-Ming, G. Qiang, and W. Bing-Hong, Node Importance Ranking of Complex Networks, 2013.
[9] L. C. Freeman, “A set of measures of centrality based on betweenness,” Sociometry, vol. 40, no. 1, pp. 35–41, 1977.
[10] N. E. Friedkin, “Theoretical foundations for centrality measures,” American Journal of Sociology, vol. 96, no. 6, pp. 1478–1504, 1991.
[11] G. Sabidussi, “The centrality index of a graph,” Psychometrika, vol. 31, no. 4, pp. 581–603, 1966.
[12] K. Stephenson and M. Zelen, “Rethinking centrality: methods and examples,” Social Networks, vol. 11, no. 1, pp. 1–37, 1989.
[13] S. P. Borgatti, “Centrality and network flow,” Social Networks, vol. 27, no. 1, pp. 55–71, 2005.
[14] L.-L. Ma, C. Ma, and H.-F. Zhang, “Identifying influential spreaders in complex networks based on gravity formula,” Physica A, vol. 451, pp. 205–212, 2016.
[15] S. Carmi, S. Havlin, S. Kirkpatrick, Y. Shavitt, and E. Shir, “A model of Internet topology using k-shell decomposition,” Proceedings of the National Academy of Sciences of the United States of America, vol. 104, no. 27, pp. 11150–11154, 2007.
[16] L. Page, S. Brin, R. Motwani, and T. Winograd, “The pagerank citation ranking: bringing order to the web,” Stanford Digital Library Technologies Project, 1999.
[17] L. J. S. Allen, “Some discrete-time SI, SIR, and SIS epidemic models,” Mathematical Biosciences, vol. 124, no. 1, pp. 83–105, 1994.
[18] B. Shulgin, L. Stone, and Z. Agur, “Pulse vaccination strategy in the SIR epidemic model,” Bulletin of Mathematical Biology, vol. 60, no. 6, pp. 1123–1148, 1998.
[19] H. W. Hethcote, “The mathematics of infectious diseases,” SIAM Review, vol. 42, no. 4, pp. 599–653, 2000.
[20] M. J. Keeling and P. Rohani, Modeling Infectious Diseases in Humans and Animals, Princeton University Press, Princeton, NJ, USA, 2008.
[21] J. Heesterbeek, Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation, vol. 5, John Wiley & Sons, New York, NY, USA, 2000.
[22] N. Blagus, L. Šubelj, and M. Bajec, “Self-similar scaling of density in complex real-world networks,” Physica A: Statistical Mechanics and its Applications, vol. 391, no. 8, pp. 2794–2802, 2012.
[23] L. Tang and H. Liu, “Relational learning via latent social dimensions,” in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09), pp. 817–826, ACM, Paris, France, July 2009.
[24] A.-L. Barabási and R. Albert, “Emergence of scaling in random networks,” Science, vol. 286, no. 5439, pp. 509–512, 1999.
[25] P. Erdős and T. Gallai, “On maximal paths and circuits of graphs,” Acta Mathematica Academiae Scientiarum Hungaricae, vol. 10, no. 3-4, pp. 337–356, 1959.
[26] M. O. Jackson, Social and Economic Networks, Princeton University Press, Princeton, NJ, USA, 2008.
[27] A. Fronczak, P. Fronczak, and J. A. Hołyst, “Average path length in random networks,” Physical Review E, vol. 70, no. 5, Article ID 056110, 2004.
[28] A. Barreiras, “Diameter constrained network,” in Proceedings of the World Congress on Engineering, vol. 2, 2009.
[29] L. Kowalik, “Approximation scheme for lowest outdegree orientation and graph density measures,” in Proceedings of the 17th International Conference on Algorithms and Computation (ISAAC '06), pp. 557–566, Kolkata, India, December 2006.
[30] J. Saramäki, M. Kivelä, J.-P. Onnela, K. Kaski, and J. Kertész, “Generalizations of the clustering coefficient to weighted complex networks,” Physical Review E, vol. 75, no. 2, Article ID 027105, 2007.
[31] T. Schank and D. Wagner, “Approximating clustering coefficient and transitivity,” Journal of Graph Algorithms and Applications, vol. 9, no. 2, pp. 265–275, 2005.

Copyright

Copyright © 2017 Hong-Jian Yin et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
