Mobile Recommendation Based on Link Community Detection

Deng, Kun; Zhang, Jianpei; Yang, Jing

doi:https://doi.org/10.1155/2014/259156

The Scientific World Journal

On this page

Abstract Introduction Related Works Evaluation Conclusion Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2014 | Article ID 259156 | https://doi.org/10.1155/2014/259156

Mobile Recommendation Based on Link Community Detection

Kun Deng,¹Jianpei Zhang,¹and Jing Yang¹

Academic Editor: Wanneng Shu

Received04 Jun 2014

Revised20 Jul 2014

Accepted22 Jul 2014

Published26 Aug 2014

Abstract

Since traditional mobile recommendation systems have difficulty in acquiring complete and accurate user information in mobile networks, the accuracy of recommendation is not high. In order to solve this problem, this paper proposes a novel mobile recommendation algorithm based on link community detection (MRLD). MRLD executes link label diffusion algorithm and maximal extended modularity (EQ) of greedy search to obtain the link community structure, and overlapping nodes belonging analysis (ONBA) is adopted to adjust the overlapping nodes in order to get the more accurate community structure. MRLD is tested on both synthetic and real-world networks, and the experimental results show that our approach is valid and feasible.

1. Introduction

Along with the rapid development of the mobile communication technology, mobile devices have gradually become one of the most important platforms for people to acquire information. Compared with the traditional internet PC terminal, mobile devices have a lot of advantages such as strong mobility, good portability, and easy access, which allow people to get service and information on mobile networks at any time. Meanwhile, as the information in mobile networks is increasing constantly, the amount of information is now beyond the acceptability of the people. Moreover, the performances of mobile devices in terms of interface display, terminal processing, as well as input and output are still limited and have led to serious information overload issues for people [1]; thus, the utilization ratio of network resources is severely influenced. Therefore, how to identify the information which attracts the users most from vast amounts of information has become an urgent issue in mobile networks personalized service.

In recent years, personalized information recommendation service provided by mobile recommendation systems has become an effective solution to mobile information overload issues. However, in consideration of their privacy and security, mobile users are reluctant to provide precise and complete information to the mobile recommendation systems, which influenced the accuracy of the mobile recommendation systems greatly.

Currently, the community detection technology in complex networks provides powerful support to solve the above problems. The reason for this is that the community structure in complex networks often possess the feature of “dense intracommunity connections and sparse intercommunity connections” [2], in which intracommunity usually represents a group of people who share some similar characteristics; for example, in mobile networks, people often form their own mobile communication communities according to their professions and interest. Because of the same background and interest, the users in the same community are likely to have the same information demands (e.g., the same kind of movie, music, news, etc.). Therefore, the utilization of community detection technology could divide all mobile users into different communities, and then according to the property of every community that the users usually have the same information demands, it could provide accurate information service for the whole community with only a small part of user information.

To sum up, excellent community detection method is essential in completing a recommendation task accurately by community detection technology in mobile networks. However, traditional community detection method is to divide the network into some disconnect communities, and every node belongs to only one community, while in mobile networks, communities are incomplete independent and are overlapped, which means some nodes could be members of different communities at the same time [3]. Therefore, overlapping community detection in mobile networks is usually of more practical significance. Although previous researches on community detection mainly focused on identifying the communities of nodes, studies have shown that [4] link community detection usually has more advantages in detecting overlapping communities, that is, because a node may carry multiple identities, which means it belongs to multiple communities at the same time. For instance, a person may belong to multiple communities at the same time, such as families, friends, and coworkers. In contrast, a link in a network usually indicates the unique relationship between a pair of nodes; it has one single role and belongs to only one community. When all the communities are determined, the corresponding overlapping nodes will naturally belong to multiple communities. Therefore, link community has obvious advantage in detecting overlapping communities and provides powerful technical support for the improvement of the accuracy of mobile recommendation.

Based on the above analysis, an algorithm is proposed in this paper, called mobile recommendation based on link community detection (MRLD), which could perform accurate group recommendation to mobile users without having to get all the user information in mobile networks. MRLD algorithm first initializes the label of every link to the node label which possesses the higher degree when connected by the link and gets the links according to the degree of the nodes (the nodes are connected by the links) in descending order. Then executes triangle label diffusion to aggregate the links with the same label to one community, thus, to form an original community structure with dense intracommunity connection; secondly, we combine the obtained communities with maximal extended modularity (EQ) [5] of greedy search to obtain the optimal overlapping community structure; thirdly, overlapping nodes belonging analysis (ONBA) is adopted to adjust the overlapping nodes to avoid “excessive overlapping” which had happened in link community detection; finally, on the basis that overlapping communities are detected precisely, accurate information targeted on every community in mobile networks is recommended to the users.

The rest of this paper is organized as follows. Section 2 surveys the related work. Section 3 explains and justifies the design of our algorithm, named MRLD. In Section 4, we discuss the experiments of MRLD algorithm. Section 5 concludes this paper.

2.1. Mobile Networks Recommendation

As mobile devices could approach users much easier than traditional internet PC terminals, mobile users are likely to have more realistic social behaviors in mobile networks; therefore, many recommendation approaches based on social relationships in mobile networks are proposed. For example, reference [6] built a collaborative filtering algorithm based on the friend relations of social websites and accomplished the collaborative recommendation task, and the experimental results suggested that the friend relations in social websites could improve the accuracy of collaborative filtering algorithm; reference [7] decomposed the obtained score matrix of the users and discovered the potential factors to influence the scores, meanwhile, it took the influence of the trust among friends into consideration and accomplished the group recommendation; reference [8] combined collaborative filtering algorithm with social trust networks and discovered the neighbors of the users through the score of social trust networks, thus, recommendation is well performed by collaborative filtering algorithm; reference [9] obtained the mobile networks in which users interact with each other by the frequency and duration of the interaction, in other words, it used the frequency and duration of the interaction to figure out the strength ties between the users, meanwhile, it analyzed the distance (the distance refers to the sum of the link weights of the pathway between any two nodes, and the weights of links represent the strength of the users’ relationship) between two nodes to estimate the possibility of two nodes being friends. Finally, the nodes that have stronger connection strength with a user are recommended to the user as potential friends.

2.2. Community Detection

Recently, various kinds of community detection algorithms have been proposed, for example, GN algorithm [10], proposed by Girvan and Newman, considers that the edge betweenness connected between the communities should be higher than the edge betweenness connected inside the community. The edge betweenness of an edge can be defined as “the number of the shortest paths in the network which pass through the edge and connect any two nodes.” GN algorithm obtained a dendrogram concerning community structure by deleting iteratively the edge with the biggest edge betweenness, and Newman and Girvan defined modularity function [2] to cut the dendrogram and obtained the optimal community structure. After that, optimization algorithm based on modularity function has become one of the main stream algorithms in community detection. However, these algorithms are to decompose the networks into several disconnected communities and every node must belong to only one community. But in real-world networks, communities are incomplete independent and are overlapped. As a result, many overlapping community detection algorithms have been proposed, for example, CPM algorithm based on clique percolation [3] considers that the edges inside the community are likely to form big complete subgraphs, while the edges between the communities are almost impossible to form big complete subgraphs; hence CPM algorithm uses clique percolation to detect the community structure; algorithms based on local optimization [11, 12] constantly expand the local community where different seeds are located by optimizing fitness functions, thus, the global optimal community structure is obtained; algorithms based on label propagation [13–15] first initialized a unique label with belonging coefficient for every node. Then they, repeatedly, modify the labels by summing and normalizing the belonging coefficients of the neighbor nodes which have the same labels. Finally, nodes with the same labels belong to the same community, and the overlapping nodes possess multiple labels.

Research has shown [4] that link community detection is usually more advantageous in detecting overlapping communities. Therefore, many link community detection approaches are proposed. For example, reference [16] converted the original network structure into line graph structure and then employed random walk method to detect the overlapping community structure; reference [4] proposed a hierarchy clustering algorithm based on link similarity. This algorithm first generated a dendrogram of link community structure, then it use partition density function to cut the dendrogram, through which the final community structure is obtained; reference [17] extended the map equation method of node communities to link communities and modified the coding rule of random walk in order to complete link community detection.

This paper takes advantage of the superior capability of link community in detecting overlapping communities and proposes a community detection algorithm which could accomplish information recommendation task by precisely detecting overlapping communities in mobile networks.

3. The Algorithm

MRLD algorithm first adopts link label diffusion algorithm to create the original community structure; secondly, it combines the communities with maximal extended modularity (EQ) of greedy search to obtain the optimal overlapping community structure; thirdly, the algorithm analyzes and adjusts the obtained link community structure through overlapping nodes belonging analysis (ONBA), and then gets the ultimate overlapping community structure; finally, based on the acquired densely connected overlapping community structure, recommendation to all the users of the community could be implemented.

3.1. Link Label Diffusion Algorithm

This section takes links as research object and proposes an algorithm, named link label diffusion algorithm (LLDA), to acquire the original link community structure, and before the detail description of the algorithm, the following definitions and theorem should be provided first. Assume a social network , here denotes the set of nodes and denotes the set of edges (or links) of .

Definition 1 (neighbor link). Assume , then the neighbor link of link could be defined as the set of links which connect node or , and do not belongs to this set, that is: and and , where and represent the sets of the neighbor nodes of and , respectively.

Definition 2 (triangle). Assume and , , are the three vertexes of triangle . Then triangle could be defined as any two of the three vertexes , , are connected by links, that is: .

Definition 3 (partition density function, see [4]). Assume in community , is the number of links and is the number of nodes, then the partition density function is defined as the following formula: where when , assume . Formula (1) suggests that in , more links and fewer nodes indicate a denser intracommunity connection.

Definition 4 (link label). Assume link belongs to community , and is the community identifier of , if , then is called the link community label of link (link label in abbreviation).

Theorem 5. Given nodes , links , is a community, is the number of links in , and is the number of nodes, where , , if join in community to constitute the community , then the partition density of (denoted ) is equal or larger than the partition density of (denoted ).

Proof. The partition density of community can be defined as follows:
If join in to constitute a community , then the partition density of community can be defined as
The number of links is added by one after the joining of into community , so . Because , nodes is connected by and , so nodes are already members of community . That is to say, the joining of into community to form a new community would not change the number of nodes, that is, , then formula (3) could also be defined as
Therefore, . This concludes this proof.

Theorem 5 shows that if two links of a triangle belong to the same community, then the joining of the third link would increase the partition density of the community. Based on this theorem, LLDA algorithm is proposed in this paper. Research has shown [18] that the node with higher degree has stronger influence in the local range, and it could absorb the surrounding nodes to form a community. So LLDA algorithm first initializes the label of every link to node label which possesses the higher degree when connected by the link, thus, to form an original community with tree structure. Then, in this network, get links according to the degree of the nodes (the nodes are connected by the links) in descending order. Meanwhile, search triangle set which includes , if the other two links of triangle in set have the same label, then the label of is modified to the label of the other two links. This process is also called triangle label diffusion. At this moment, if there is still link which does not belong to any community, then the label of is modified to the most frequent label of its neighbor links. After the above process, links with the same label are attributed to the same community. Based on this idea, LLDA algorithm could be expressed as shown in Algorithm 1.

Input: mobile network .
Output: link community structure .
Begin
(1) for each is the number of links in
(2) node label with higher degree in the two nodes connected by
(3) initialized label modification marker of links
(4) end for
(5) labelList the set of nonredundant labels ranged by the number of links with the same label in descending order
(6) for each labelList
(7) dnEdges select a set from the links whose labels are in the network
(8) for each dnEdges and
(9) st search the triangle set which include link , and extract the set of nodes in without nodes and .
(10) for each
(11) if
(12) ; ; ;
(13) end if
(14) end for
(15) end for
(16) end for
(17) for each and
(18) the most frequent labels of the neighbor links of link
(19) end for
End

In Algorithm 1, steps are the initialization process of link labels, where is the label modification marker of link , and its purpose is to make sure the label of every link could only be modified once after the initialization, thus, to ensure every link is in the same community with the node which possess the highest degree in the local range. Step is to get links according to the degree of the nodes (the nodes are connected by the links) in descending order. The aim is to make sure that in this diffusion process, link labels diffuse from the links with higher node degree to the links with lower node degree. Steps are to perform triangle label diffusion to the labels of the links. According to Theorem 5, this operation is reasonable and a dense link community structure is obtained by the end of this process. By this time if there are still links with label modification marker , then it means that these links are not detected to any community. Steps are followed to modify the labels of the links whose label modification marker to the most frequent labels of their neighbor links. The fundamental of this process is that the number of the neighbor links which belong to the same community with a link could reflect the tendency of the link to a community, For example, Figure 1 (Figure 1(b) is the line graph structure correspond to the network topology structure of Figure 1(a)) shows that link has a denser connection with community A, so it is more reasonable to attribute link to community A.

(a) Topology structure

(b) Line graph structure

From Algorithm 1 we can see that the time complexities of step and steps are the highest. Obviously, the time complexity of step is . Steps are to execute triangle label diffusion to the links in the networks, so, labelList multiplies dnEdges is equal to . And because the number of triangles which include a specific link in the networks is , therefore, the running time of steps is . However, is usually constant, so the time complexity of this operation is . To sum up, the time complexity of LLDA is .

After the execution of LLDA, links with the same label are aggregated to the same community, indicating the accomplishment of original community construction phase. This process have shown that link labels diffuse from the links with higher node degree to the links with lower node degree, thus, aggregated the densely connected links to one community in the local range.

3.2. The Combination of Link Communities

Though multiple communities with dense intraconnections are achieved after the execution of LLDA algorithm, the corresponding community structure may not be the global optimal structure, so it is still necessary to combine the link communities to achieve the global optimal community structure. This paper employed the maximal extended modularity (EQ) of greedy search to combine the obtained communities in order to achieve the optimal overlapping community structure.

Definition 6 (extended modularity, EQ). Assume is the number of links in the network, is the th community in the community structure, is the number of belonging communities of node is the degree of node , and is the corresponding adjacent matrix of the whole network, then the EQ can be defined as follows:
Here, if nodes , is connected by a link, then , otherwise . The implication of EQ is that, a higher EQ value indicates a denser intracommunity structure, that is, a more qualified community structure.
According to the definition of EQ, the EQ value of the community structure is the accumulation of the EQ value of every community, if the combination of two communities increases the EQ value, then it means that the combined community structure is superior to the community structure before the combination. Hence we can do the following modification to the function of EQ, given two communities and , then the functions of and can be defined as follows, respectively:
If add the EQ values of and , then
If the new community obtained after the combination of and is denoted by , then the EQ value of can be demonstrated as the following formulation:
At this moment, the increment of to (denoted ) can be defined as
According to the above analysis, if the value of is higher, it means that the combined community makes more contribution to the value of EQ. Based on this, the ideology of community combination is to compute the of every pair of adjacent communities, and then to combine the communities with the largest value. Iterate the above process until the of every pair of adjacent communities is less than or equal to zero. Thus, it can be seen that the final community structure after the iteration correspond the optimal EQ value.
It can be easily seen that, if LLDA algorithm divide the whole community into original communities, and the average number of nodes in every community is , then the time complexity of every combination is . In the worst case, the algorithm would combine the communities into one community and the time complexity is . Since is far lower than , and is constant, the time complexity of this phase is far below .

3.3. Overlapping Nodes Belonging Analysis

Since the detection of link communities is the detection of communities that the links belong to, so if a node is connected by multiple links and one of the links belongs to a different community, then this node is to be identified as an overlapping node. As is illustrated in Figure 2, nodes 7, 8, 9, 10 have already form a dense community, but each node of the nodes 7, 8, 9 is connected to another community by a link, so nodes 7, 8, 9 will be identified as overlapping nodes in the detection process. Therefore, we can come to the conclusion that in link community detection algorithms, nodes have higher probability of being identified as overlapping nodes, so the detected communities are likely to be “excessive overlapping” which decreased the accuracy of community detection.

In order to solve the problem of “excessive overlapping” in link community detection, this paper proposes overlapping nodes belonging analysis (ONBA). Before discussing ONBA in detail, we will introduce the definition of community belonging value.

Definition 7 (community belonging value). Given link community structure , in which represents any community in , is the number of communities in , and is the set of overlapping nodes, then the belonging value of any overlapping node belongs to community can be defined as follows:
Here, is the number of links which connect node with the nodes in , is the degree of node , and is the belonging value of node which belongs to community ; then we can use to represent the set of belonging values that node belongs to , and is the number of the adjacent communities of node .
Based on the definition of community belonging value, this paper proposes overlapping nodes belonging analysis (ONBA). The basic ideology of this analysis is: if a node connect densely with some communities, but connect sparsely with the others, then the overlapping node has the tendency of belonging to the communities which are densely connected with it, that is, the node belongs to the communities; if an overlapping node has no obvious tendency to any community, in other words, the number of links which connected the node with each of its adjacent communities have no obvious difference, then no adjustment is made to the node. According to this ideology, the details of ONBA algorithm are demonstrated as shown in Algorithm 2.
From Algorithm 2 we may know that if every is lower than belonging threshold , and then the community of node remains unchanged, if there exist the situation that the belonging value of node (denoted ) belongs to community is higher or equal to , then delete node from the communities whose community belonging value is lower than . After the above operations, the ultimate overlapping community structure is achieved.
It can be seen that in ONBA algorithm when the value of is lower, the number of nodes which has tendency is relatively larger, so the overlapping nodes have higher possibility of being adjusted, but the tendency of an overlapping node to a community can easily be satisfied, so there still will be many overlapping nodes in the detected community structure. However, as the value of increases, the tendency of a node to a community will be more difficult to be satisfied, which declines the number of the detected overlapping nodes. Of course, when the value of is higher, there will be less nodes with tendency, so only a few nodes will be adjusted, thus, the community structure obtained at last will still have a lot of overlapping nodes. Overall, plays an essential role in judging a node is overlapping node or not. Therefore, during the analysis of the experiment, the value of will be analyzed in detail.
In ONBA algorithm, the time complexity of calculating the community belonging value of every overlapping node is the highest. The time needed in calculating the community belonging value of overlapping nodes is , where is the average degree of the nodes in the networks. Since mobile networks are usually of sparse structures, and is constant, the time complexity of ONBA is . And because is far less than which is the number of nodes in the networks, so the time complexity of ONBA is far lower than .
In MRLD algorithm, the time complexity of LLDA is . The running time of the combination phase and the running time of ONBA algorithm are far shorter than . Therefore the time complexity of MRLD is .
With the accurate overlapping community structure obtained in the mobile networks, we can perform recommendation to the users of the whole community according to the limited user information.

Input: link community structure obtained by LLDA
Output: the ultimate community structure obtained by MRLD
Begin
(1) overLapNode the set of overlapping nodes derived from C
(2) Set
(3) for each overLapNode
(4) comList obtain the label list of communities which are connected with node
(5)
(6) for each comList
(7) compute the belonging value of node v belongs to
(8) if >
(9)
(10) end if
(11) end for
(12) for each comList
(13) if and
(14) Remove node v from
(15) end if
(16) end for
(17) end for
End

4. Experiments and Evaluation

In order to evaluate the performance of the MRLD algorithm, we use synthetic networks and real-world networks to test and analyze it. Five classic algorithms: CPM [3], LC [4], LFM [11], COPRA [13] and GANET+ [19] are selected to compare with MRLD aiming at proving the validity and the feasibility of the algorithm.

4.1. Evaluation Criteria

In order to test the performance of every algorithm, three evaluation criteria are adopted to evaluate the advantages and disadvantages of each algorithm in three aspects: the accuracy of community detection, the accuracy of overlapping nodes detection, and the density of intracommunity connections.

The first criterion is extended normalized mutual information (NMI) [11], it is to test the accuracy of overlapping community detection. Its definition is as follows: where and are community structures, represents the normalized condition entropy of a cover with respect to .

The range for the value of NMI is from 0 to 1; if , it means the detected community structure is completely the same with the true community structure; if , then the detected community structure is entirely different with the true community structure. That is, the value of NMI increases when the accuracy of the detected community structure is improved.

The second criterion is -score [20], which is adopted to evaluate the accuracy of overlapping nodes detection, and its function is defined as follows: where recall denotes the number of correctly detected overlapping nodes divided by the real number of overlapping nodes, and precision denotes the number of correctly detected overlapping nodes divided by the total number of detected overlapping nodes. The range for -score is from 0 to 1, and the value of -score is higher when the accuracy of overlapping nodes detection is higher.

The third criterion adopted is EQ (see Section 3.2). It is to evaluate the density of intracommunity connections, in which higher EQ values indicate denser intracommunity connections of the community structure.

4.2. Synthetic Networks

LFR benchmark networks [21] proposed by Lancichinetti et al. have extremely similar statistical property with real-world networks. So this paper chooses LFR benchmark as the experimental dataset of MRLD. The parameter settings for the LFR benchmark networks are as follows: the network size is set to 200, the average node degree is set to 10, the maximum node degree is 30, and minimum community size is 20, the maximum community size is set to 50, is the number of overlapping nodes, and it is set to either 20 or 100, the number of communities each overlapping node belongs to (denoted ) is set from 2 to 6, the mixing parameter varies from 0.1 to 0.4 (the community structure is clearer when the value of is lower). Because there exist real overlapping community structure in LFR benchmark networks, this paper adopt NMI and -score respectively in this dataset to assess the similarity of the community structure obtained by each algorithm with the true community structure, with the purpose of assessing the detection accuracy of each algorithm.

4.2.1. The Analysis of Belonging Threshold Value

The purpose of this section is to find out which threshold value could ensure the optimal accuracy of MRLD. Figure 3 shows the comparison of -score in different network structures, when the threshold value of MRLD varies from 0 to 0.9. As we can see, when , the -score values are higher than the corresponding -score values when . This indicates that ONBA is effective. Besides, we can also see that if , only when and or 3, the -score values of the detected communities are lower than the corresponding -score values when . In other cases, when all -score reached their optimal values. So in this paper, the threshold value of MRLD is set to 0.5.

(a) and

(b) and

4.2.2. Comprehensive Analyses of MRLD

Figure 4 shows the comparison of MRLD with other algorithms in terms of NMI values. As we can see, no matter in the networks with lower overlapping density presented in Figures 4(a), 4(c), and 4(d) or in the networks with higher overlapping density presented in Figure 4(b), the NMI values of MRLD is obviously superior to the other algorithms. Therefore, we can come to the conclusion that MRLD possesses a higher accuracy in detecting overlapping communities.

(a) and

(b) and

(c) and

(d) and

During process of detecting overlapping nodes with LC and GANET+ algorithms, there exists the phenomenon of excessive overlapping nodes detection, which makes it unable to evaluate accurately the accuracy of the two algorithms in detecting overlapping nodes. Therefore, this paper only analyzes the accuracy of overlapping nodes detection of MRLD and the other three algorithms.

Figure 5 shows the comparison of MRLD with other algorithms in terms of -score in networks with lower overlapping degree. As we can see, only when , , the -score value of the detected communities is lower than the -score of CPM, in other cases, the -score values of MRLD are obviously superior to the -score values of the other algorithms compared. This could prove the high accuracy of MRLD in detecting overlapping nodes.

(a)

(b)

(c)

Figure 6 shows the comparison of MRLD with other algorithms in terms of recall, precision, and -score values when . The comparison has shown that in networks with higher overlapping density, MRLD could still detect overlapping nodes accurately.

(a) -score

(b) Recall

(c) Precision

By analyzing NMI, we know that the community structure detected by MRLD is more accurate, and by analyzing -score we can come to the conclusion that MRLD is also more accurate in detecting overlapping nodes. To sum up, MRLD has excellent ability to detect overlapping community structure in networks.

4.3. Real-World Networks

4.3.1. Traditional Real-World Networks

As the topological property of real-world networks is different from the topological property of synthetic networks, we use some well-known real-world networks to further evaluate the performance of MRLD. Table 1 illustrates the datasets of real-world networks used in this paper.

In real-world networks, there are few networks whose community structure is already known, and the networks is usually not overlapped, so it is hard to execute good accuracy analysis. Therefore, this paper adopts EQ as the evaluation criterion to assess the density of intracommunity connections of the communities detected by each algorithm.

Table 2 is the comparison of MRLD with other algorithms in terms of EQ. As we can see, the algorithms are performed in eight real-world networks, although in network Lesmis, Football, and Jazz, MRLD does not obtain the optimal EQ value, in the other five networks, the EQ value of MRLD is obviously superior to the EQ values of the other algorithms. This means that MRLD could uncover community structure with dense intraconnections.

4.3.2. Real-World Mobile Networks

In order to give a comprehensive display of the performance of MRLD algorithm, we chose 66 students as the study object. All these students are from the same major of a university and they often communicate with the mobile software “Wechat.” The 66 students are presented as nodes in the network and have links between the students who often communicate with each other, 163 links in all. Investigations show that the students are interested in five kinds of news, namely, political news, entertainment news, sports news, military news, and social news. Figure 7 shows the detection result obtained by MRLD in this network, in which we have five geometric figures: parallelogram, circle, rectangle, hexagon, and rhombus, representing the favorite kind of news of a student in the order of entertainment news, sports news, military news, social news and political news, respectively. As we can see, the network is divided into four communities by MRLD algorithm, and in each community most students have the same favorite news. That is to say, the users in the same densely connected community are more likely to have the same interest. It also proved that in mobile networks, MRLD could aggregate users with the same interest into the same community, that is, it is feasible to perform group recommendation in mobile networks using community as the unit.

5. Conclusion and Future Work

In consideration of their privacy and security, mobile users are reluctant to provide precise and complete information to the mobile recommendation systems, which influenced the accuracy of the systems greatly. Focusing on this problem, we propose MRLD algorithm. MRLD first initializes the label of every link to node label which possesses the higher degree when connected by the link, then get the links according to the degree of the nodes (the nodes are connected by the links) in descending order and execute triangle label diffusion to aggregate the links with the same label to one community, thus, to form a dense original community structure. Next, it combines the obtained communities with maximal EQ of greedy search to obtain the optimal overlapping community structure. Afterwards, overlapping nodes belonging analysis (ONBA) is adopted to adjust the overlapping nodes in order to avoid “excessive overlapping” which had appeared in link community detection. On the basis of detecting the overlapping communities precisely, MRLD could improve the accuracy of recommendation by performing targeted recommendation to every community in mobile networks. MRLD algorithm has been tested both on synthetic and real-world networks, and the evaluation results have shown the validity and feasibility of MRLD.

Currently, utilizing community detection technology to perform mobile recommendation has not been studied much. Thus, in the future, we will explore new approaches of mobile recommendation on the basis of community detection to improve the speed and accuracy of mobile recommendation.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work is sponsored by the National Natural Science Foundation of China (nos. 61370083, 61073043, and 61073041), the National Research Foundation for the Doctoral Program of Higher Education of China (nos. 20112304110011 and 20122304110012), the Natural Science Foundation of Heilongjiang Province (no. F200901), and the Harbin Outstanding Academic Leader Foundation of Heilongjiang Province of China (no. 2011RFXXG015).

References

L. C. Wang, X. W. Meng, and Y. J. Zhang, “A cognitive psychology-based approach to user preferences elicitation for mobile network services,” Acta Electronica Sinica, vol. 39, no. 11, pp. 2547–2553, 2011.
View at: Google Scholar
M. E. J. Newman and M. Girvan, “Finding and evaluating community structure in networks,” Physical Review E, vol. 69, no. 2, Article ID 026113, 2004.
View at: Google Scholar
G. Palla, I. Derenyi, I. Farkas, and T. Vicsek, “Uncovering the overlapping community structure of complex networks in nature and society,” Nature, vol. 435, no. 7043, pp. 814–818, 2005.
View at: Google Scholar
Y. Y. Ahn, J. P. Bagrow, and S. Lehmann, “Link communities reveal multi-scale complexity in networks,” Nature, vol. 107, no. 31, pp. 761–764, 2010.
View at: Google Scholar
H. W. Shen, X. Q. Cheng, K. Cai, and M. B. Hu, “Detect overlapping and hierarchical community structure in networks,” physical A: Statistical Mechanics and Its Applications, vol. 388, no. 8, pp. 1706–1712, 2009.
View at: Google Scholar
G. Groh and C. Ehmig, “Recommendations in taste related domains: collaborative filtering vs. social filtering,” in Proceedings of the International ACM Conference on Supporting Group Work, pp. 127–136, November 2007.
View at: Google Scholar
H. Ma, I. King, and M. R. Lyu, “Learning to recommend with social trust ensemble,” in Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 203–210, July 2009.
View at: Google Scholar
W. H. Huang, X. W. Meng, and L. C. Wang, “A collaborative filtering algorithm based on users’ social relationship mining in mobile communication network,” Journal of Electronics & Information Technology, vol. 33, no. 12, pp. 3002–3007, 2011.
View at: Google Scholar
D. Quercia and L. Capra, “FriendSensing: recommending friends using mobile phones,” in Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys '09), pp. 273–276, October 2009.
View at: Publisher Site | Google Scholar
M. Girvan and M. E. J. Newman, “Community structure in social and biological networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 12, pp. 7821–7826, 2002.
View at: Publisher Site | Google Scholar | MathSciNet
A. Lancichinetti, S. Fortunato, and J. Kertész, “Detecting the overlapping and hierarchical community structure in complex networks,” New Journal of Physics, vol. 11, Article ID 033015, 20 pages, 2009.
View at: Publisher Site | Google Scholar
A. Lancichinetti, F. Radicchi, J. J. Ramasco, and S. Fortunato, “Finding statistically significant communities in networks,” PLoS ONE, vol. 6, no. 4, Article ID e18961, 2011.
View at: Publisher Site | Google Scholar
S. Gregory, “Finding overlapping communities in networks by label propagation,” New Journal of Physics, vol. 12, Article ID 103018, 2010.
View at: Publisher Site | Google Scholar
Z. H. Wu, Y. F. Lin, S. Gregory, H. Y. Wan, and S. F. Tian, “Balanced multi-label propagation for overlapping community detection in social networks,” Journal of Computer Science and Technology, vol. 27, no. 3, pp. 468–479, 2012.
View at: Publisher Site | Google Scholar | MathSciNet
J. R. Xie, B. K. Szymanski, and X. Liu, “Slpa: uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process,” in Proceedings of the 11th IEEE International Conference on Data Mining Workshops (ICDMW '11), pp. 344–349, December 2011.
View at: Publisher Site | Google Scholar
T. S. Evans and R. Lambiotte, “Line graphs, link partitions, and overlapping communities,” Physical Review E, vol. 80, no. 1, Article ID 016105, 2009.
View at: Google Scholar
Y. Kim and H. Jeong, “Map equation for link community,” Physical Review E, vol. 84, Article ID 026110, 2011.
View at: Google Scholar
J. P. Bagrow and E. M. Bollt, “Local method for detecting communities,” Physical Review E, vol. 72, no. 4, Article ID 046108, 2005.
View at: Publisher Site | Google Scholar
C. Pizzuti, “Overlapped community detection in complex networks,” in Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference (GECCO '09), pp. 859–866, Montréal, Canada, July 2009.
View at: Publisher Site | Google Scholar
J. R. Xie, S. Kelley, and B. K. Szymanski, “Overlapping community detection in networks: the state of the art and comparative study,” ACM Computing Surveys, vol. 45, no. 4, p. 43, 2013.
View at: Google Scholar
A. Lancichinetti, S. Fortunato, and F. Radicchi, “Benchmark graphs for testing community detection algorithms,” Physical Review E, vol. 78, no. 4, Article ID 046110, 2008.
View at: Publisher Site | Google Scholar
W. W. Zachary, “An information flow model for conflict and fission in small groups,” Journal of Anthropological Research, vol. 33, no. 4, pp. 452–473, 1977.
View at: Google Scholar
D. Lusseau, “The emergent properties of a dolphin social network,” Proceedings of the Royal Society B: Biological Sciences, vol. 270, S2, pp. S186–S188, 2003.
View at: Publisher Site | Google Scholar
D. E. Knuth, “The Stanford GraphBase: A Platform for Combinatorial Computing,” 1993, http://www-cs-faculty.stanford.edu/~uno/sgb.html.
View at: Google Scholar
M. E. J. Newman, “Modularity and community structure in networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 23, pp. 8577–8582, 2006.
View at: Publisher Site | Google Scholar
M. Girvan and M. E. J. Newman, “Community structure in social and biological networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 12, pp. 7821–7826, 2002.
View at: Publisher Site | Google Scholar | MathSciNet
P. Gleiser and L. Danon, “Community structure in jazz,” Advances in Complex Systems, vol. 6, no. 3, pp. 565–573, 2003.
View at: Google Scholar
L. A. Adamic and N.Glance, “The political blogosphere and the 2004 US election: divided they blog,” in Proceedings of the 3rd International Workshop on the Weblogging Ecosystem, pp. 36–43, 2005.
View at: Google Scholar
R. Guimerà, L. Danon, A. Díaz-Guilera, F. Giralt, and A. Arenas, “Self-similar community structure in a network of human interactions,” Physical Review E, vol. 68, no. 6, Article ID 065103, 2003.
View at: Google Scholar

Copyright

Copyright © 2014 Kun Deng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1171

Downloads

829

Citations

The Scientific World Journal

Mobile Recommendation Based on Link Community Detection

Abstract

1. Introduction

2. Related Works

2.1. Mobile Networks Recommendation

2.2. Community Detection

3. The Algorithm

3.1. Link Label Diffusion Algorithm

3.2. The Combination of Link Communities

3.3. Overlapping Nodes Belonging Analysis

4. Experiments and Evaluation

4.1. Evaluation Criteria

4.2. Synthetic Networks

4.2.1. The Analysis of Belonging Threshold Value

4.2.2. Comprehensive Analyses of MRLD

4.3. Real-World Networks

4.3.1. Traditional Real-World Networks

4.3.2. Real-World Mobile Networks

5. Conclusion and Future Work

Conflict of Interests

Acknowledgments

References

Copyright