Abstract
With the rapid development of the Internet and communication technologies, a large number of multimode or multidimensional networks widely emerge in realworld applications. Traditional community detection methods usually focus on homogeneous networks and simply treat different modes of nodes and connections in the same way, thus ignoring the inherent complexity and diversity of heterogeneous networks. It is challenging to effectively integrate the multiple modes of network information to discover the hidden community structure underlying heterogeneous interactions. In our work, a joint nonnegative matrix factorization (JointNMF) algorithm is proposed to discover the complex structure in heterogeneous networks. Our method transforms the heterogeneous dataset into a series of bipartite graphs correlated. Taking inspiration from the multiview method, we extend the semisupervised learning from single graph to several bipartite graphs with multiple views. In this way, it provides mutual information between different bipartite graphs to realize the collaborative learning of different classifiers, thus comprehensively considers the internal structure of all bipartite graphs, and makes all the classifiers tend to reach a consensus on the clustering results of the targetmode nodes. The experimental results show that JointNMF algorithm is efficient and wellbehaved in realworld heterogeneous networks and can better explore the community structure of multimode nodes in heterogeneous networks.
1. Introduction
Community structure is an important feature of realworld networks as it is crucial for us to study and understand the functional characteristics of the real complex systems. With the fast growth of Internet and computational technologies in the past decade, many data mining applications have advanced swiftly from the simple clustering of one data type to the multiple types, which usually involved high heterogeneity, such as the interrelations of users, videos, pictures, and web page in web networks (shown in Figure 1). Those networks with multiple modes/dimensions are called heterogeneous network in this work. Unlike homogeneous networks that only contain one kind of nodes and have explicit community structure, the community structures of heterogeneous networks are usually obscure and complicated, which are owing to the coexistence of multimode or multidimensional interactions. Therefore, it is challenging to effectively integrate the information of multiple dimensions/modes to discover the hidden community structure underlying heterogeneous interactions.
There are a number of problems for traditional clustering methods to mine the community structure in heterogeneous networks. First, heterogeneous networks contain different types of nodes and relationships. Processing and interpreting them in a unified way present a major challenge. Second, various data types are related to each other. Tackling each type independently will lose the mutual information between those interactions, which are essential to gain a full understanding of heterogeneous networks. Consequently, matrixfactorizationbased clustering has emerged as an effective approach for clustering problems in highdimensional datasets. In [1], it is shown that nonnegative matrix factorization outperforms spectral methods in document clustering, achieving higher accuracy and efficiency.
In this paper, we adopt multiview learning as a tool to reveal the communities in heterogeneous networks, because it has powerful interpretability and applicability for data clustering. So we present a joint nonnegative matrix factorization (JointNMF) solution to detect community in heterogeneous networks. To summarize, the main contributions of this work includes the following: (1) we construct the bipartite graph model of heterogeneous networks, which efficiently incorporates the multimode or multidimensional information, to enhance the community detection in heterogeneous networks. (2) We propose an optimal algorithm for the iterative procedure of joint matrix factorization. Computationally, JointNMF clustering is more efficient and flexible than graphbased models and can provide more intuitive clustering results. In particular, it provides mutual information between each network graph to realize the collaborative learning of different classifiers and makes all the classifiers tend to reach a consensus on the clustering results of heterogeneous networks.
The remainder of the paper is organized as follows. In Section 2, we demonstrate the related work about corresponding domain and define the bipartite graph model of heterogeneous networks. In Section 3, we formulate the multiview method via JointNMF for community detection in heterogeneous networks and present an optimal algorithm to achieve fast convergence. Then we test our algorithm on a variety of real heterogeneous networks and present the experimental results in Section 4. Finally, Section 5 concludes the paper.
2. Preliminary
2.1. Related Work
In the past years, the research of community discovery in heterogeneous networks has attracted more and more attention of researchers. Among these methods, matrix factorization effectively reflects the community structure of the networks and promises a meaningful community interpretation that is independent of the network topology. In addition to a quantification of how strongly each node participates in its community, nonnegative matrix factorization (NMF) does not suffer from the drawbacks of modularity optimization methods [2], such as the resolution limit [3]. Nguyen et al. [4] used nonnegative matrix factorization with Idivergence as the cost function and introduce two approaches which are, respectively, applied to the directed and undirected networks. Based on the importance of each node when forming links in each community, He et al. [5] use nonnegative matrix factorization to form a generative model, taking it as an optimization problem to discover the structure of link communities. Chen et al. [6] presented a semisupervised community discovery algorithm based on NMF. It introduced the a priori knowledge as constraints into the heterogeneous networks and reconstructed the feature matrix to detect communities. Tang et al. [7] introduced the concept of modularity optimization into the heterogeneous network and integrated the network snapshots in a time period and then obtained the community partition with maximum at that moment. Wang et al. [8] took the internal connections as the graph regularization constraint and utilize triNMF model to improve the performance of community detection in bipartite networks.
Due to multidimensional nodes and special link patterns in heterogeneous networks, it is more suitable to mine the special community structure via semisupervised methods. However, most community detection methods still focus on homogeneous networks and might not work well in heterogeneous networks. There are two main reasons: first, heterogeneous data contain different types of relations. Processing and interpreting them in a unified way present a major challenge. Second, the special link patterns of heterogeneous networks greatly limit the effectiveness of these methods, which tend to cluster the multimode nodes by constructing the node or edge similarities of them, as they did in the homogeneous networks. But, for heterogeneous networks, the similarities among onemode nodes sometimes can only be defined by the nodes of the other mode. That made these methods unable to keep working well in heterogeneous networks. In summary, most works are derived based on the graph model, which requires solving eigenvalueproblem. Computationally, they are inefficient and inapplicable to largescale datasets. Moreover, they are completely unsupervised and ignore the inherent complexity and diversity of heterogeneous networks.
Recent works [9–11] have shown that multiview learning in multimode datasets can effectively improve the clustering performance in the sense that clustering can make full use of the dual interdependence between multiple nodes to discover certain hidden community structures. In this work, we present a semisupervised method (JointNMF) to incorporate multimode/multidimension information for unity discovery. In the proposed methodology, users are able to provide constraints on the target mode, specifying the multiple connecting relationships in each bipartite graph. Our goal is to improve the quality of community structure by multiview learning in all modes of nodes and linking. Using an optimal iterative procedure, we then perform jointfactorizations of the bipartite graph matrices to obtain the consensus of network partition and finally infer the targetmode clusters while simultaneously deriving the communities of related feature nodes. In addition, due to the fact that NMFbased methods often require the community numbers of networks to be specified beforehand, several methods [10, 12, 13] have been developed to solve this problem. Due to the simplicity and practicability of the existing method in [4], here we choose it to get the community numbers.
2.2. Model Formulation
Both the multimode and multidimensional networks can be modeled as bipartite graphs, which completely describe the diverse properties and characteristics of the multimode connections. In this way, the heterogeneous network can be regarded as a comprehensive depiction of multimode nodes, described by a series of subbipartite graphs. Each bipartite graph represents the relationship between a special kind of node and the target nodes and contains the structural feature of the heterogeneous network in its own perspective. Compared to processing each subgraph independently, combining multiple bipartite graphs would undoubtedly be more effective and accurate for community discovery. For example, the document can be clustered through both semantics and reference relationship, and multimedia resources can be through the content annotation and the user preferences.
By means of the multiview learning, we treat each bipartite graph as one independent feature set of the target nodes. In this way, heterogeneous networks can be depicted in multifaceted, different perspectives simultaneously. As shown in Figure 1, the web network can be expressed with the triple vector. Assuming users as the target mode, pages and tags are the twodimensional feature space that reflects the community structure of user nodes. Specifically, the bipartite graph shows the users’ feature in the picture dimension, and indicates the users’ feature in the video dimension. Based on the bipartite graph model (shown in Figure 2), heterogeneous network can be decomposed into a series of bipartite graphs , where indicates the relationships between mode 1 and mode .
(a)
(b)
In this work, we take the core mode nodes as target nodes and give priority to the community division of target nodes. Due to the core position of target nodes in heterogeneous networks, community distribution of the other nodes is exclusively conducted and decided by the community structure of target nodes. Through grouping the target nodes in different communities, our method simultaneously divides the connecting nodes of the other mode into the corresponding communities. Through the bipartite graphs model, each mode’s nodes can naturally be split into different graphs, any of which suffices for mining knowledge. Observing that these bipartite graphs often provide compatible and complementary information, it becomes natural for one to integrate them together to obtain better performance rather than relying on a single view. Based on the bipartite graph model, we can use multiview learning for seamlessly integrating multiple node information to discover the underlying community structure in heterogeneous networks.
3. Multiview Algorithm via JointNMF for Community Detection in Heterogeneous Networks
In this section, based on multiview learning, we propose a joint nonnegative factorization matrix algorithm for community detection. This method adopts semisupervised learning to integrate multiple bipartite graphs in heterogeneous networks and extends semisupervised learning from single graph to multiview graphs. Based on the collaborative learning between different modes (graphs), the multiview learners finally obtain the consensus on the clustering results of the target nodes, thus jointly promoting the performance of community discovery. For convenience, we present in the Notations the important notations used in this paper.
3.1. Objective Function of Multiview Learning via JointNMF
For the original NMF framework, it just considers the intertype information of 1mode nodes. Such formulation assumes each subnetwork to be independent and fails to model the heterogeneous networks in a unified way. Recently, some researchers [9, 10] have found that multiview learning on multiple bipartite graphs is well applied to heterogeneous networks for community clustering, because it can promote the performance of the intrinsic structure discovery in multimode networks. As a result, by constructing the bipartite graphs of heterogeneous networks, the optional intertype information of different modes of nodes is incorporated into JointNMF. More importantly, we can exploit the mutual information from multidimensional spaces to group likeminded nodes from different graph perspectives, thus strengthening the community detection in heterogeneous networks.
For multiview learning, the bipartite graphs of different mode have conditional independence. It means that all the independent learners under different graphs can not make the wrong decision in the same time. Therefore, our semisupervised learning method can effectively integrate bipartite graph information and unlabeled information and adopt cooperative learning between multiview graphs to surmount the obstacle of complexity and diversity in heterogeneous datasets. In this way, our method realizes the information complementation of different information graph, thereby enhancing the overall performance of community discovery in heterogeneous networks.
In order to group the relevant targetmode nodes and the corresponding nodes of other mode into the same community, the following objective function is used to measure the accuracy and smoothness of clustering results:where and , respectively, denote the basis matrix and coefficient matrix decomposed from graph . is associated with the node number of mode 1, and is the node number of model , and let be the preset community number. In particular, it is worthwhile to note that mode 1 is regarded as the target node in this paper. The internal connection matrix has also been incorporated for nonnegative matrix factorization.
For each single bipartite graph (Figure 3), according to the principle of NMF, we can minimize the objective function to obtain the th dimensional clustering results of the target nodes. And, for heterogeneous network, it is indispensable to minimize the sum of fitting errors for revealing the community structure of target nodes in the multidimensional/multimode space.
However, JointNMF is also subject to several problems such as slow convergence and large computation. Moreover, heterogeneous networks have more complicated connecting relationships between multiple modes/multiple dimensions, which further limits the application and effectiveness of NMF to explore the hidden community structures. Aiming at these problems, our work mainly optimizes the iterative solutions of NMF from the two aspects:(1)To simplify the iterative procedure, a matrix is introduced that satisfies the condition(2)Inspired by [14, 15], we incorporate the idea of multiview learning into joint matrix factorization, which can effectively extend semisupervised learning from single graph to multiple graphs, from single mode to multiplemode nodes. Moreover, our method adds some essential constraints on the matrix factorization, which can ensure the uniqueness and accuracy of the results in network partition.
3.2. Optimization Algorithm of JointNMF
Assuming mode 1 as the target mode in heterogeneous network, the other modes are all connected with the target mode. To achieve an accuracy network partition with joint nonnegative matrix factorization, it must satisfy the conditionwhere denotes the coefficient matrix of the target nodes, which is concluded under the multiview learning of all the modes . indicates the final consensus of community partition on the target node and the other nodes.
First, we construct the objective function of JointNMF, which mainly comprises two terms: the first term is the standard NMF approximation of the objective function, and the second one is a penalty term about the deviation from the consensus . In particular, by means of semisupervised learning in multiple bipartite graphs, it is realized that the collaborative learning between multiple modes or multiple dimensions is applied to community structure mining in heterogeneous networks. In this way, the target nodes can be clustered as consistent as possible, and then we automatically obtain the cluster results of the other nodes with maximum consistency.
Applying the regularizations (see (3)) in (1), the objective of our JointNMF approach is transformed to minimizewhere mode 1 is the target nodes and the objective combines the internal connection of mode 1 and all other connecting graphs related to mode 1. is mainly used to adjust the weight of bipartite graph , and it also reflects the importance of the corresponding linking mode in networks, . When , (4) is transformed into an unsupervised NMF function.
In order to optimize the procedure of matrix factorization, we construct a special diagonal matrixwhere denotes the diagonal matrix operations. According to (4), the objective function can be transformed into the following minimization problem:
Since JointNMF (see (4)) is a nonconvex function about factor matrices , , it is difficult to obtain the global optimal solution directly. As a result, we propose an alternative iterative update solution, which iterates with the following two steps sequentially to reach the fast convergence.
Step 1. Fixing , calculate and to minimize the objective function.
Step 2. Fixing and , calculate to minimize the objective function.
In this way, the alternative iterative update solution fixes one relevant variable with the latest value, and thus the minimizing objective (see (6)) is transformed into a convex optimization problem about some single variable. By means of alternating iterative update, we would get the local extremum solution or stable solution finally. Thus, our multiplicative update procedure can effectively be applied to multiple matrix factorization and speed up the convergence process.
To minimize the objective function, it is necessary to decompose each bipartite graph separately to converge. For one single graph , its specific objective function is regarded aswhen is fixed, for any given graph , the results of and do not rely on the calculation of other graphs. Assuming as the Lagrange multipliers that constrain , the Lagrangian function of (6) can be simplified as , and denotes the matrix trace. Accordingly, can be rewritten aswhere includes the regularization term . Introducing the diagonal matrix constructed in (5), can be rewritten as
By setting the derivative of with respect to , we obtain
According to KarushKuhnTucker (KKT) condition in [13], set the derivative of with respect to :
Based on KKT optimization condition, the solution is obtained byIf the initialization , it can be concluded that will remain nonnegative in the subsequent iteration.
Fixing and , use the diagonal matrix to normalize the column vector of matrix , and then we obtainwhere the normalization process does not change the numerical value of , . Assuming as the Lagrange multipliers that constrain , the Lagrangian function of (8) can be rewritten as
Similarly, according to KarushKuhnTucker (KKT) condition in [13], set the derivative of with respect to :
Hence, the iterative update solution of is regarded as
After getting the matrices and , according to KTT condition, set the derivative of with respect to :
To minimize the objective , the iterative update solution of is regarded as
Repeat the above iteration of matrix factorization, and update the factor matrixes , , and continuously, until the objective function tends to converge or reaches the maximum iteration number.
After iterations, we can infer the community membership of multiple nodes based on the JointNMF results. For simplicity, the community indices are determined by taking the maximum of each column in (the target nodes) and (the other mode nodes). Note that once we obtain the consensus matrix , the cluster label of mode could be computed from . The detailed procedure is illustrated in Algorithm 1.

3.3. Algorithm Convergence and Complexity
Here we prove the theoretical convergence of JointNMF algorithm.
Proposition 1. Given a bipartite graph and its initialization factor matrices and , the objective function (9) decreases monotonically under the alternative iterative update rules (see (12) and (18)).
Proof. The proof of the proposition is similar to the convergence proof of nonnegative matrix triplefactorization in [16]. Moreover, recent studies [17] found the following: despite the alternating iterative update may fail to converge to a stable point, but the improved iterative rules (see (12) and (18)) can guarantee JointNMF algorithm can converge to a local extremum point.
In our algorithm, and are sparse matrices, and the computation of them only involves vector norm enumeration without matrix multiplication, and thus it is more computationally efficient. Moreover, instead of minimizing each matrix factor optimally with timeconsuming multiplications of large matrices, JointNMF transforms the original heterogeneous networks into some bipartite graphs requiring much fewer matrix multiplications and effectively optimizes the iterative procedure with faster convergence and lower computational complexity.
The running time of our algorithm is mainly consumed in the alternative iterative procedure. For single network graph , the complexity of matrix factorization is , where is the iterative number of algorithms and is the preset community number. As a result, the computational complexity for JointNMF method is .
4. Experimental Results
In this section, the experiments use a series of real networks to validate the algorithms’ performance. Real networks are always more irregular and various than synthetic networks and have more complex community structures. Here we choose 4 popular real heterogeneous networks in different sizes: WebKB [18], Newsgroups [19], Cora [20], and Last.fm [21]. For all the networks, we compare the experimental results with other 4 wellknown algorithms of community detection: Kmeans [12], NMF [13], SSNMF [6], and PMM [22]. All the experiments are performed on an Intel Core2 Duo 2.0 GHz PC with 2 GB RAM, running on Windows 7.
In the following tests, different measures are introduced to evaluate the partition quality of the classical algorithms for community detection in heterogeneous networks. Since the structures of real networks are almost unknown, we adopt two standard measures widely used for clustering: normalized mutual information (NMI) [23] and clustering accuracy to quantify the partition quality of the community detection methods. For NMFbased methods, the weight parameters are set to 0.1, thus making all the bipartite graphs with the same weight. In addition, we obtain the community numbers from the method as suggested in [4], which has been shown to well predict the number of network communities. In our experiments, we repeat each method with 50 times on all the networks and compute the average results.
The average execution times found by different algorithms are shown in Table 1. We can see that Kmeans costs much less time than NMFbased algorithms, as it does not need the matrix factorization iterations. For all the real heterogeneous networks, JointNMF effectively accelerates the convergence speed of nonnegative matrix factorization and converges in fewer iterations and CPU seconds than other NMF methods. Because the network scales are quite different, the corresponding performances of JointNMF are different, too. For the larger networks, JointNMF has a greater competitive advantage than other methods. Our method is only slower than Kmeans and NMF, which, however, has much worse clustering performance.
Tables 2 and 3, respectively, show the clustering accuracy and NMI values found by different algorithms. The methods using semisupervised learning, including SSNMF, PMM, and JointNMF, generally achieve better clustering results. Therefore, we can conclude from the experiment results that multiview learning gives full consideration to the multimode/multidimension information, and it is a better choice for mining the community structure in heterogeneous networks.
JointNMF method attains the maximum NMI and clustering accuracy in community structure for most test cases, which means that our method has better partition quality, and achieves accuracy community structure on the real heterogeneous networks. More importantly, our method does not suffer from the problems of modularity optimization methods and makes full use of the duality information of multimode nodes, which can greatly enhance the performance of clustering algorithms. Therefore, compared to the other 4 methods, we can conclude that JointNMF has competitive clustering performance in terms of both accuracy and partition quality against popular community detection methods.
5. Conclusions
In this work, we introduce a multiview learning algorithm of community discovery based on nonnegative matrix factorization. In order to reveal the underlying community structure embedded in heterogeneous networks, we divide the datasets into some relational bipartite graphs and require those graphs learnt from factorizations with multiple views towards a common consensus. To achieve this, we introduce multiview learning in the heterogeneous data mining with matrix factorization and finally make all the learners reach a consensus about network partition. Moreover, we design an optimal iterative procedure to ensure the matrix factorization is simple and meaningful in terms of clustering. Through multiview learning, we are able to discover the hidden global structure in the heterogeneous networks, which seamlessly integrates multiple data types to provide us with a better picture of the underlying community distribution, highly valuable in most realworld applications.
Different form the traditional methods, our work is an instructive attempt to discover the multimode or multidimensional structure in heterogeneous networks. Actually, our JointNMF framework jointly takes intertype and intratype information of target nodes into considerations, thus makes the partitioning results more reasonable and effective, and detects communities with high accuracy and quality. Experimental results on four realworld datasets show that our algorithm is a competitive method to explore community structures in heterogeneous networks.
Notations
Heterogeneous networks dataset  
The bipartite graph describing the relationships between mode 1 and mode  
Objective function a heterogeneous network  
The count mode in heterogeneous networks  
The coefficient matrix factorization from all the bipartite graphs  
Auxiliary matrix for simplifying the iterative procedure  
The th basis matrix factorization from  
The th coefficient matrix factorization from . 
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This work was supported by the National Natural Foundation of China under Grant no. 61271253.