Topological Influence-Aware Recommendation on Social Networks
Users in online networks exert different influence during the process of information propagation, and the heterogeneous influence may contribute to personalized recommendations. In this paper, we analyse the topology of social networks to investigate users’ influence strength on their neighbours. We also exploit the user-item rating matrix to find the importance of users’ ratings and determine their influence on entire social networks. Based on the local influence between users and global influence over the whole network, we propose a recommendation method with indirect interactions that makes adequate use of users’ relationships on social networks and users’ rating data. The two kinds of influence are incorporated into a matrix factorization framework. We also consider indirect interactions between users who do not have direct links with each other. Experimental results on two real-world datasets demonstrate that our proposed framework performs better than other state-of-the-art methods for all users and cold-start users. Compared with node degrees, betweenness, and clustering coefficients, coreness constitutes the best topological descriptor to identify users’ local influence, and recommendations with the measure of coreness outperform other descriptors of user influence.
As the amount of information available online increases exponentially, it becomes more difficult for users to find the relevant information or contents in which they are interested, thereby resulting in an information overload problem. Recommender systems play an important role in tackling the problem of information overload and have attracted more attention in both academia and industry in recent years [1–7]. Such recommender systems have been used in many domains, including product recommendation on Amazon and movie recommendation on Netflix.
Collaborative filtering (CF) is one of the most popular techniques in recommender systems. Some CF methods that use only user-item ratings for recommendation confront cold-start problems. More specifically, for a new user in such recommender systems, because he/she has given few ratings, these CF algorithms perform poorly for the user. The situation is similar for a new item in recommender systems.
Given the rapid increase of online social networks and applications, users participate in online activities and produce a lot of social relationships, such as social friendships and trust relationships. In the real world, we always ask our trusted friends for movie and book recommendations. Social relationships provide an independent source of information for recommender systems in addition to user-item rating information. Social relationships among users can be incorporated into memory-based CF methods and matrix factorization methods in the recommendation process. Social influence theory in  indicates that correlations exist between two socially connected users; therefore, social relationship networks can be used for recommender systems. Social relationships provide opportunities to handle new users and improve the performance of recommender systems.
Some work has been done to exploit social relationship networks of users in recommender systems and improve the performance of recommender systems [9–14]. These methods assume that a user’s taste can be affected by his/her trusted friends, and they take advantage of trusted relationships and friendships for recommendations in addition to user-item ratings. These methods incorporate direct trust relationships among users into matrix factorization to learn user preference vectors and item feature vectors. However, few works explore user influence in view of social network structure. Tang et al.  adopted PageRank algorithm on social networks to determine the reputations of users. Current social recommendation methods often use the rating similarity between users to measure the influence that a user exerts on his/her neighbours. However, it is not sufficient to consider the influence strength simply as the rating similarity. These methods do not take into account users’ different roles in the network, which efficiently indicate users’ influence in the process of information sharing through social networks.
In this paper, we investigate users’ influence on their neighbours and the entire network, and we incorporate the influence into recommender systems. We exploit the topology of social networks to determine the local influence of each user and determine his/her global influence based on the number of his/her ratings. Both local and global influences are applied in the matrix factorization framework. In addition to the effect of trusted friends, we consider indirect interactions from users that have a high reputation. Experimental results demonstrate that our proposed algorithm outperforms other state-of-the-art recommendation models. Within this framework, the main contributions of this work include the following.(i)We explore the topological influence of users according to their roles on social networks and incorporate the topological influence into recommendations.(ii)In addition to the influence between socially linked users, we also consider the influence of indirect interactions among users, which can improve the recommendation performance.(iii)Our proposed recommendation framework reduces recommendation errors, particularly for cold-start users.
The rest of this article is organized as follows. In Section 2, we provide an overview of several major approaches for recommender systems. In Section 3, we detail our proposed recommendation framework. The experimental results and empirical analysis are presented in Section 4. Section 5 presents the conclusions and suggestions for future work.
2. Related Work
In this section, we review several approaches for recommender systems, including collaborative filtering systems that use only the user-item rating matrix and social-based recommender systems that have attracted lots of attention recently.
Collaborative filtering is a widely used recommendation method. Generally, it is based on the assumption that similar users have similar preferences on common items. CF contains memory-based collaborative filtering [1, 15, 16] and model-based collaborative filtering [17, 18]. Memory-based CF methods use user-item ratings to calculate similarities between pairs of users or items and to identify neighbours of a target user or item and then make the prediction based on the weighted sum of ratings from neighbour users or items. However, memory-based methods are not efficient for highly sparse data as it is difficult to estimate the similarities accurately from the data. Model-based collaborative filtering methods train a predictive model based on patterns recognized from the known user-item ratings and then make recommendations via the predictive model. Among different model-based methods, low-rank matrix factorization (MF) techniques have attracted much attention [17–19] due to the advantages of scalability and accuracy.
Based on the assumption that users’ tastes can be represented by a small number of latent factors, the MF method in  decomposes the user-item rating matrix into a user feature matrix and an item feature matrix , shown aswhere and are the number of users and items, respectively. The estimated rating of item given by user is . The matrices and can be learned by minimizing the sum-of-squared-errors objective function in where is the indicator function that equals 1 if user has given a rating to item and 0 otherwise. The regularization terms and are used to avoid overfitting. The optimization of (2) is generally solved by performing stochastic gradient descent.
Recently, some social recommendation methods based on matrix factorization techniques have been proposed to directly use trust relationships among users to provide better recommendations. These methods show substantial improvements [9, 12, 20–23] and relieve the cold-start problem [24–26]. One effective method for integrating social relationships into the MF model is to jointly factorize the user-item rating matrix and user-user social relationship matrix by sharing a common user feature matrix . According to the same user feature matrix, information from social relationships can be transferred to improve the recommendation performance. However, an experimental analysis shows that this method is suitable for the membership links, but not very capable of handling the friendship links .
One more effective way to utilize social relations, as discussed in , is to introduce social regularization into the MF framework . Social regularization constrains the difference between a user’s latent vector and that of his/her friends. It is based on the assumption that a user’s taste is similar to his/her directly connected friends. Experiments show that social regularization is more suitable for incorporating social relationships into MF than the joint factorization models. These recommendation models do not reflect the real-world process that users make decisions based on their own tastes and friends’ influence. In , the authors incorporated the real-world decision-making process into recommendations and simultaneously fused the target user’s interest and the interests of his/her trusted friends to predict the missing ratings. Using this method, the influence strength of friends is treated as the same. Recently, trust-ware recommender systems have focused on both online and offline trust relationships. A new trust-ware recommendation method USBN  combined online and offline social trust to improve the personal recommendation performance.
A few works have been done based on heterogeneous friends’ influence in social recommendations [10, 29–31]. Friendships on social networks are different from trust relationships because friendships are bidirectional, and the interests of friends are heterogeneous . In , the authors tried to treat friends separately according to rating similarities. They extended the regularization model by weighting each social link regularization term with the rating similarity between users. There are many kinds of relationships and interactions on online social networks [33, 34]. Therefore, it is not sufficient to consider the influence strength simply as rating similarities. For example, in a microblog network, direct interactions include users’ mentions, reposts, and comments. In , Li and Xiong explored multiple direct interactions between users in a microblog network. They extended the regularization model by considering the direct interactions, and they inferred the influence by the number of mentions, reposts, and comments. In addition to user-item ratings and user-user relationships, some additional information from online user behaviours can also be combined with recommender systems. In , the authors considered different influences of friends and different levels of willingness to be influenced in social recommendations. The influence of friends and different levels of willingness to be influenced are generated by using a social influence propagation method on social networks.
3. Proposed Recommendation Framework
3.1. Local Social Influence
In the majority of cases, the influence between two users only takes effect at the local scale . We define this type of influence as local influence (LI). In , the authors analysed the most influential nodes on social networks and found that coreness of nodes can represent the centrality and influence of nodes in network graphs more accurately than degrees and betweenness of nodes. Based on this theory, we first analyse the topological influence with respect to coreness and how this kind of influence is distributed in the chosen datasets; we then propose the calculation method of users’ local influence according to their coreness in the network topology. In graph theory, a k-core of a graph G is the maximal connected subgraph of G in which degrees of nodes are at least k. Equivalently, it is one of the connected components of the subgraph of G formed by repeatedly deleting all nodes if their degrees are less than k. A node u has coreness c if it belongs to a c-core but not to any (c+1)-core.
Based on this definition of k-core, a node with a larger coreness means that it is at a location closer to the centre of the network, which implies that it may influence more users of the network and the influence may be strong because of its location in the network. If a hub exists at the edge position of the network, it will have a minimal impact in the influence propagation process of the network, whereas a less connected node placed in the core of the network will have a significant effect in the influence propagation process in the whole network. Thus, in this paper we define a node’s influence proportional to its coreness. Figures 1(a) and 1(b) show the distribution of coreness of the two chosen datasets described in Section 4.1.
In the Epinions dataset, most nodes’ coreness is smaller than 10, and the proportion is 94.75%. This distribution may be because this network fits the power-law distribution and a large long tail of users has a very small coreness. The smallest coreness is 0 and the biggest coreness is 26 in the Epinions dataset. Unlike the Epinions dataset, in the Ciao dataset, the proportion of nodes in which coreness is smaller than 10 is 65.05%, implying different network hierarchies and influence distributions of the two chosen datasets. The smallest coreness is 0 and the biggest coreness is 32 in the Ciao dataset.
We define a user ’s coreness as .We normalize coreness before calculating the values of the local influence of the nodes in trust networks so that the values of local influence are in the range. We use the notation to represent the normalized value of user ’s coreness in trust networks. We use the notation to represent the value of user ’s local influence in trust networks. It is obvious that should vary monotonously with . We define in
If user has a larger coreness, his/her friends are affected more strongly by him/her, leading to larger . The parameter is an offset used to make the values of local influence of users above 0. The hyperbolic tangent function is chosen to map the coreness to the value of local influence nonlinearly and limit in the range. We also investigate other generating functions of local influence and compare their effects on recommendations in Section 4.5.
3.2. Global Influence
A user’s global influence indicates his/her reputation in the whole network. The user’s reputation is a sort of status that gives additional powers and capabilities in recommender systems . In the physical world, the user’s reputation plays an important role in recommendation . In , the authors found that suggestions from people with high reputations positively affect consumers’ adoption of a brand. Massa  found that, in the online world, recommendations from users with high influence in entire networks are more likely to be trustworthy and reliable. In this paper, we use the notation to represent user ’s global influence over the entire social network. On online shopping websites, we are more likely to trust those users who have bought more items or rated more items. Based on this intuition, in our proposed recommendation method, we use the number of ratings given by a user to measure the user’s global influence. We define as the number of ratings given by user . We normalize , and the expression of is presented in (4), analogous to (3):where represents the normalized value of and is an offset to make the values of stay above 0. We investigate other generating functions of global influence in Section 4.5.
3.3. Indirect Interactions
In the real world, in addition to asking our friends for suggestions, we tend to take into account the suggestions of some persons who have high reputations in the community, even if they are not our friends and we have no direct interactions with them. In online networks, a celebrity’s opinion is likely to affect the actions of other users even if they do not follow the celebrity. Indirect interactions between users will also affect users’ actions, so indirect or implicit user connections should be emphasized . For example, on Twitter, a celebrity’s tweet that recommends a book may be retweeted by some users that are both the celebrity’s followers and friends of the target user. When the target user sees the tweet, he/she may adopt the suggestion. Based on such a tendency, in our proposed recommendation method, we take into account indirect interactions between users. More specifically, if user is not a trusted friend of user but has high influence on the social network, user ’s behaviours are likely to be influenced by user . Therefore, for user , those users who are not his/her trusted friends but have top-K coreness are considered to make recommendations for him/her.
3.4. Recommendation Approach
With and as previously defined, we present our method of recommendation with direct and indirect social influence (RDISI). Based on the idea that a user’s taste is close to that of his/her trusted friends, we introduce a regularization term as shown in Eq. (5), which constrains a user’s latent vector in terms of friends’ local influence. The notation denotes the set of friends of user :
The parameter is used to control the importance of this regularization term. In terms of indirect interactions, as discussed in Section 3.3, users with high local influence can also affect other users’ behaviours. Therefore, a user’s taste is affected by such users and, in our method, a user’s feature vector should also be close to that of the users with high local influence. In addition to the regularization term in (5), we also introduce another regularization term as shown in (6). is the top-K users in terms of the value of local influence of users who are not user ’s trusted friends:
As previously discussed, recommendations from users with high global influence are more likely to be trustworthy and reliable. Therefore, we use the values of users’ global influence to weight the importance of their recommendations so as to incorporate global influence into MF. In the MF framework, the weight of in (2) is determined by , which is equal to 0 or 1. In RDISI, we also consider the global influence of user , so we define the new weight for as . Therefore, the importance of is controlled by , and parameter is used to control the importance of global influence.
The method incorporates two kinds of social influence into recommendation to improve the performance of recommender systems. The optimization problem minimizes the sum-of-squared-error objective function shown in the following:
A local minimum of the objective function given by (7) can be found by performing gradient descent in feature vectors and , as shown in the following:where notation means user ’s in-link friends and is the set of users among whose nontrusted users user is in the top-K list in terms of coreness.
3.5. Training and Prediction
Three steps are designed to train our proposed model.
Step 1. It is to generate the user-user trust matrix with trust relationships and then calculate coreness of each node in the trust network from the trust matrix. We normalize the values of coreness and then calculate the values of local influence according to (3).
Step 2. It is to generate the user-item rating matrix with rating data. We use the user-item rating matrix to calculate the number of ratings for each user. We then normalize the numbers of users’ ratings and calculate the global influence using (4).
Step 3. It is to use stochastic gradient descent to find the optimal user feature matrix and item feature matrix .
The details of the steps are shown in Algorithm 1.
3.6. Complexity Analysis
The main cost in learning and is computing the loss function and its gradients against feature vectors of users and items. Because of the sparsity of rating matrix R and trust relationships matrix T, the computational complexity of evaluating the loss function is , where is the dimensionality of the feature vectors of users and items, is the number of users, and and are the numbers of nonzero entries in matrices R and T, respectively. The computational complexities for gradients and in (8) and (9) are and , respectively. Therefore, the total computational complexity in one iteration is , which indicates that the computational time of our method is linear with respect to the number of users and the number of observations in the two sparse matrices. This complexity analysis shows that our proposal approach is very efficient and can scale to very large datasets. Moreover, the main cost of computing the influence is performing the k-core decomposition of the network to compute each node’s coreness. The computational complexity of k-core decomposition is . Thus, if we compute the influence periodically because of the emergence of new influences, the computational cost is linear with the number of social relationships.
4. Experimental Analysis
In this section, we conduct several experiments to compare the recommendation qualities of our approach with other state-of-the-art recommendation models.
We chose two real-world datasets to evaluate our proposed method: Epinions and Ciao. Each dataset has a trust network. The two datasets were collected from the websites http://www.epinions.com/ and http://www.ciao.co.uk/, respectively.
Epinions is an online product review website where users can read reviews about a variety of products (such as books, articles for daily use, cars, and home appliances) to help them make decisions on what to purchase. Users can also post a review after rating a product with integer scores from 1 to 5. Every member of Epinions establishes social relationships (i.e., trust relationships) with others to show his/her attitude to other users.
Ciao is an online shopping portal website in Europe. The site provides a network platform where registered users can review items and share their opinions on various products to help others make decisions. These reviews are available to the general public. Each user on Ciao also maintains a trust list to indicate his/her attitude to others.
The two datasets are crawled by Jiliang Tang et al. from two popular product review sites Epinions and Ciao in the month of May, 2011. The raw Epinions dataset contains 27 categories of items and the Ciao dataset contains 28 categories of items. These two datasets are published at Jiliang Tang’s homepage at “https://www.cse.msu.edu/~tangjili/”. Each of the two datasets is randomly extracted from the corresponding raw dataset so that they are not biased. The two chosen datasets have widely been used for performance evaluation.
Some statistics of these datasets are presented in Table 1. Rating data and social relationship data are both very sparse for the two datasets. The density of ratings is calculated by The density of social relationships is calculated byThe statistics in Table 1 show that sparsity is quite noticeable in the two datasets, both for user-item rating matrices and user-user trust relationships. The rating data in the Ciao dataset is slightly sparser than those in the Epinions dataset. However, the social relationship density of Epinions is much sparser than that of Ciao. We divide users’ rating data into the training set and test set. More specifically, for each rating dataset, we randomly choose 80% as the training set and the remaining 20% as the test set. We generate 5 random data splits of the training and test set and report the average result over the 5 splits.
We choose four well-known metrics to measure the performance of our proposed approach in comparison with other collaborative filtering and trust-aware recommendation models. They are mean absolute error (MAE), root mean square error (RMSE), precision, and recall. The metric MAE is defined aswhere denotes the rating that user has given to item , denotes the predicted rating that user gives to item , and denotes the number of test ratings. The metric RMSE is defined asFrom the definitions, we can see that a smaller MAE or RMSE value means a better performance.
The metric precision is defined aswhere is the number of recovered items in the recommendation list for user and L is the length of recommendation list. The recommendation list for user consists of the items with the highest predicted score generated by the recommendation algorithm.
The metric recall is defined as where is the number of items collected by user in the testing set. Assuming the length of recommendation list, , is fixed, a greater precision or recall means a better performance.
In this section, to show the effectiveness of our proposed recommendation approach, we compare our recommendation method RDISI with the following representative models.
Probabilistic Matrix Factorization (PMF) : This studies a low dimensional user feature matrix and item feature matrix to predict the ratings. This method only uses the user-item rating matrix for recommendation.
RSTE : This method linearly combines a basic matrix factorization model and a trust-based neighbourhood model together and simultaneously fuses the user’s interest and interests of his/her trusted friends to predict the missing ratings.
SoRec : This method is based on matrix factorization and exploits local social context by performing a factorization on the social relationship matrix. The method jointly factorizes the user-item rating matrix by and the social relationship matrix by , where is the user feature matrix and is the factor feature matrix with no realistic implications.
SocialMF : This method takes into account the interests of trusted friends by incorporating a regularization term into the objective function. The regularization term controls the distance between user ’s feature vector and the combinational feature vector of his/her trusted friends.
SoReg : This method incorporates the social regularization term into MF by weighting each social link regularization with the rating similarity between users. User similarities are used to control the distance of feature vectors between user and his/her trusted friends.
4.4. Parameter Settings
For our method, we select optimal parameters for both datasets. Because the two datasets have different data statistics, different parameters are needed for training. The parameter is determined through cross-validation. For the Epinions dataset, we determine the parameters and . Parameter is set as 3. For the Ciao dataset, we set the parameters , , and . The learning rate is 0.0004 and in our proposed method for both datasets. The parameters of the compared recommendation methods are shown in Table 2. We set the number of latent factors as and set the length of recommendation list for all experiments.
4.5. Experimental Results
We randomly select 80% of data for each dataset as training data to verify our proposed method. The experiment results are shown in Table 3. The percentages in Table 3 are the improvements of our RDISI method over the corresponding approaches.
As shown in Table 3, in the Epinions dataset, SocialMF does not perform better than SoRec; SocialMF even has larger RMSE than PMF in Epinions. There may be two reasons for this: the Epinions dataset has some noises, and SocialMF fails to deal with these noises well, and the social relationship density of Epinions is very small, as shown in Table 1, so that SocialMF cannot take advantage of trust relationships well. From the results, we can observe that our proposed recommendation method performs better than the comparison partners.
As discussed in Section 3.4, the definition of is . If γ is extremely large, the weight of a rating dominates in determining users’ feature vectors. A very small γ means that local influence dominates in that process. To determine the best value of , we fix parameter and observe the performance versus . In each dataset, we change the ratio of training data to 60%, 70%, and 80%. The parameters are the same for different training ratios in each dataset. In the Epinions dataset, we set . Awe then adjust the parameter in the range. The result is illustrated in Figure 2(a). In the Ciao dataset, we set . The result is illustrated in Figure 2(b). Moreover, when we set the parameter at 4 in the Epinions dataset and 4.5 in the Ciao dataset, the values of MAE are both infinite, indicating that a very large weight of GI greatly damages the recommendation performance. The results show that the variations of MAE are similar in both datasets with different ratios of training data, and MAE achieves the lowest value in the interval of . Therefore, the optimal value of does not closely correlate with the datasets, and the complexity of our method can be reduced.
Parameter indicates the importance of trustees and celebrities. We also use three ratios of training data (i.e., 60%, 70%, and 80%) in each dataset. First, we set the other parameters at optimal values in both datasets and then adjust parameter in the range. The results are shown in Figure 3. The results demonstrate that an appropriate combination of influence of trustees and celebrities can improve the recommendation performance. The best value of is similar in both datasets, implying that the parameter can be easily determined independently of the datasets.
To verify whether coreness is more effective than other indicators (e.g., node betweenness, node degrees, and clustering coefficients) on determining local influence in recommender systems, we conduct experiments to compare the performance of RDISI using betweenness, degrees, clustering coefficients, and coreness to determine users’ local influence, respectively. In these comparison experiments, we merely replace in (3) with normalized node betweenness, node degrees, and node clustering coefficients, respectively. The comparison results are shown in Table 4.
The results in Table 4 clearly indicate that coreness is more effective than node betweenness, node degrees, and node clustering coefficients. Therefore, we use coreness to represent users’ local influence in our recommendation method RDISI.
In the Epinions and Ciao datasets, some users have rated lots of items, but most users have rated only a few items. We select those users who have rated no more than 10 items in the training set as cold-start users. We conduct experiments to verify whether our method RDISI performs better than other state-of-the-art recommendation models. In addition to coreness, node’s h-index can also represent its local influence. According to Hirsch , a scientist who has index h means that h of his/her Np papers have at least h citations each, and the other (Np-h) papers have fewer than h citations each. Based on this original definition, we define the user’s h-index in trust networks. A user i has index h if h of his/her in-link friends (i.e., the users who trust i) have at least h in-link friends each, and the other in-link friends of i have fewer than h in-link friends. Inspired by this, we propose a variant of RDISI named RDISI-H in which the user’s local influence is determined by his/her h-index on social networks. We verify the performance of RDISI-H on cold-start users here.
The comparison results for MAE and RMSE on cold-start users are shown in Figure 4, which indicates that our recommendation methods RDISI and RDISI-H outperform other models on cold-start users. Among the comparison models, SoReg performs the best in both datasets. In Epinions, the improvements in terms of MAE for RDISI and RDISI-H are 2.27% and 2.41%, respectively. In Ciao, the improvements in terms of MAE for RDISI and RDISI-H are 4.52% and 4.29%, respectively. Although the improvements are from the view of cold-start users, it seems that the h-index is an effective indicator of influence of users in the recommendation process.
Inspired by , we consider that local influence of nodes is not linear with coreness of nodes. Local influence initially increases rapidly with the increase of coreness, but it gradually becomes relatively stable. We also consider a similar relationship between global influence and the number of user ratings. In this paper, we limit the values of user influence in the range. Based on these assumptions, we choose the hyperbolic tangent function or the variant of logistic function as the generation function of local influence and global influence. Even if a user’s coreness is very small, he/she can still contribute to the information propagation to a certain extent; therefore, the user’s local influence cannot be ignored. Even if a new user has not rated any item, he/she can still take part in online activities through which he/she still affects other users’ preferences. Therefore, when the number of ratings given by a user is very small or even equal to 0, the value of his/her global influence is not 0. Therefore, we adjust the two preceding functions to meet this condition. The new functions are and , where is the offset used to make the values of users’ influence above 0. We first choose as the generation function of local influence and global influence. We replace either the generation function of local influence or global influence with the variant logistic function while fixing the other one. The experiment results are shown in Table 5. The results indicate that recommendations using perform better. Therefore, we choose as the generation function of local influence and global influence.
In trust networks, in general, relational information is not static. The effect of the emergence of new influencers and new trends should be discussed. Specifically, the number of the users in a social network is increasing, and new trust relationships among users have been emerging. To verify the performance of our proposed algorithm with the effect of the emergence of new influencers and new trends, we conduct a new experiment. In this experiment, we use a larger Epinions dataset which is named Epinions_ext dataset. The Epinions_ext dataset contains 390732 ratings of 13209 users for 14027 items and 145927 trust relations among users. Specifically, the Epinions dateset we used and described in Section 4.1 is a subset of Epinions_ext dataset, which means that there are new uses and new trust relationships in addition to that of the Epinions dateset we used and described in Section 4.1. We also select optimal parameters for the Epinions_ext dataset. The experiment results compared with the comparison methods are shown in Table 6.
When we conduct experiments on the Epinions_ext dataset, we get smaller MAE and RMSE, which means that our proposed model even performs better when the social relationships and the number of users in the network are increasing. It is also noticeable that our proposed model gets larger improvement than the comparison methods when using the Epinions_ext dataset and our proposed model is scalable with the size of the dataset.
Our proposed model is focusing on rating prediction so that we select the metrics MAE and RMSE to evaluate the performance of our proposed recommendation model. To verify whether our proposed method is effective in the ranking, which is another task of recommender systems, we conduct extended experiments to verify the performance of our proposed method in the ranking task by using the metrics precision and recall that are both defined in Section 4.2. We also select optimal parameters for the experiments in this section. The experiment results are shown in Table 7.
Our proposed model and the comparison methods in this paper are focusing on rating prediction so that these methods do not perform well in the ranking task of recommender systems, which can be verified by the values of precision and recall. However, our proposed method performs better than the comparison partners in ranking task.
With the popularization of online social networks, exploiting social relationships provides a reliable source that can be utilized to improve the performance of recommender systems. In this paper, we exploited users’ trust relationships and calculated each user’s coreness, which determines the user’s local influence on social networks. A user’s global influence is determined by the number of ratings he/she has given. Incorporating local and global influence, we propose the recommendation method RDISI. In addition to direct influence, the method also considers indirect interactions between users who do not have direct links. Experimental results from the real-world datasets Epinions and Ciao demonstrate that our method performs better than some state-of-the-art social recommendation models. Moreover, as analysed beforehand and shown in Figure 1, the structures of user relations in the two datasets are different, but our method managed to improve the performances in both cases. Coreness constitutes the best topological descriptor for identifying users’ local influence, and recommendation using coreness outperforms that using node degrees, betweenness, and clustering coefficients.
In this paper, we only investigate how trust relationships affect users’ preferences and how they can be fused into the MF recommendation model to make better recommendations. However, distrust relationships in social networks are also critical in the social recommendation process. Even very few distrust links can have a great impact on social recommendations. Thus, it is worth conducting research using a dataset that contains both trust and distrust relationships as some networks allow users to express distrust of others.
The data used to support the findings of this study are available from “http://www.cse.msu.edu/~tangjili/trust.html”.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
This work has been supported by the National Natural Science Foundation of China under Grant 61872033, the Humanity and Social Science Youth Foundation of Ministry of Education of China under Grant 18YJCZH204, and the Beijing Natural Science Foundation under Grant 4184084.
B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, “Item-based collaborative filtering recommendation algorithms,” in Proceedings of the 10th International Conference on World Wide Web, pp. 285–295, 2001.View at: Google Scholar
H. Ma, H. Yang, M. R. Lyu, and I. King, “SoRec: Social recommendation using probabilistic matrix factorization,” in Proceedings of the 17th ACM conference on Information and knowledge mining, p. 931, USA, October 2008.View at: Google Scholar
J. Tang, X. Hu, H. Gao, and H. Liu, “Exploiting local and global social context for recommendation,” in Proceedings of the 23rd International Joint Conference on Artificial Intelligence, pp. 2712–2718, August 2013.View at: Google Scholar
G. Karypis, “Evaluation of item-based top-N recommendation algorithms,” in Proceedings of the 10th International Conference on Information and Knowledge Management, pp. 247–254, November 2001.View at: Google Scholar
J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl, “An algorithmic framework for performing collaborative filtering,” in Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 230–237, August 1999.View at: Google Scholar
R. Salakhutdinov and A. Mnih, “Probabilistic matrix factorization,” in Proceedings of the International Conference on Neural Information Processing Systems, pp. 1257–1264, December 2007.View at: Google Scholar
H. Ma, I. King, and M. R. Lyu, “Learning to recommend with explicit and implicit social relations,” ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, 2011.View at: Google Scholar
T. Zhao, C. Li, M. Li, Q. Ding, and L. Li, “Social recommendation incorporating topic mining and social trust analysis,” in Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013, pp. 1643–1648, USA, November 2013.View at: Google Scholar
Q. Zhang, J. Wu, H. Yang, W. Lu, G. Long, and C. Zhang, “Global and local influence-based social recommendation,” in Proceedings of the the 25th ACM International on Conference on Information and Knowledge Management, pp. 1917–1920, Indianapolis, IN, USA, October 2016.View at: Publisher Site | Google Scholar
R. Reshma, G. Ambikesh, and P. Santhi Thilagam, “Alleviating data sparsity and cold start in recommender systems using social behaviour,” in Proceedings of the 2016 International Conference on Recent Trends in Information Technology, ICRTIT 2016, April 2016.View at: Google Scholar
Q. Yuan, L. Chen, and S. Zhao, “Factorization vs. regularization: Fusing heterogeneous social relationships in top-N recommendation,” in Proceedings of the 5th ACM Conference on Recommender Systems, RecSys 2011, pp. 245–252, October 2011.View at: Google Scholar
X. Yang, H. Steck, and Y. Liu, “Circle-based recommendation in online social networks,” in Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, pp. 1267–1275, August 2012.View at: Google Scholar
Q. Zhang, J. Wu, P. Zhang, G. Long, I. W. Tsang, and C. Zhang, “Inferring latent network from cascade data for dynamic social recommendation,” in Proceedings of the 16th IEEE International Conference on Data Mining, ICDM 2016, pp. 669–678, 2017.View at: Google Scholar
H. Wang, J. Wu, C. Zhou, Z. Ji, and J. Wu, “Mining subcascade features for cascade outbreak prediction in big networks,” in Proceedings of the 2016 International Joint Conference on Neural Networks, IJCNN 2016, pp. 3942–3949, Canada, July 2016.View at: Google Scholar
J. Tang, C. Aggarwal, and H. Liu, “Recommendations in signed social networks,” in Proceedings of the International Conference on World Wide Web, pp. 31–40, April 2016.View at: Google Scholar