Abstract

Recommender systems suggest items to users based on their potential interests, and they are important for alleviating the search and selection pressure caused by the ever-growing amount of item information. Classical recommender systems mainly focus on the accuracy of recommendation. However, as users' demands become more diverse, modern recommender systems, especially personalized ones, have to consider multiple metrics that may conflict with each other. In this paper, we design a personalized recommendation system that considers three conflicting objectives, i.e., accuracy, diversity, and novelty. Then, to let the system provide more comprehensive recommended items, we present a multiobjective personalized recommendation algorithm using extreme point guided evolutionary computation (called MOEA-EPG). The proposed MOEA-EPG is guided by three extreme points, and its crossover operator is designed to better satisfy the demands of users. The experimental results validate the effectiveness of MOEA-EPG when compared to some state-of-the-art recommendation algorithms in terms of the accuracy, diversity, and novelty of recommendation.

1. Introduction

With the rapid development of the Internet and Web 2.0, consumers are facing a huge amount of advertising and shopping information. To alleviate the search and selection burden on consumers, recommender systems (RSs) [1, 2] aim to suggest suitable items by inferring users' interests from item information. Nowadays, RSs have been successfully used in many applications; e.g., e-commerce websites recommend commodities [3, 4], social websites suggest potential friends [5, 6] and infer links [7], and e-learning educational institutions assist students in choosing courses and learning activities [8].

In recent years, many RSs have been proposed [9–12], including content-based RSs, collaborative filtering based RSs, knowledge-based RSs, and hybrid RSs. Content-based RSs suggest similar items by extracting content features from the profiles of items and users; collaborative filtering based RSs exploit community data (e.g., feedback ratings, tags, or clicks from other users) to make recommendations, while knowledge-based RSs use knowledge bases and knowledge models (e.g., ontologies) for recommendation. Hybrid RSs combine the above approaches. A comprehensive survey of state-of-the-art RSs can be found in [13, 14].

In general, most traditional RSs [15–19] mainly focus on the accuracy of recommendations. However, with the increasingly diversified demands of users, personalized RSs often have to consider additional metrics, such as diversity and novelty, which may conflict with accuracy. As pointed out by the studies in [20, 21], it is relatively easy to guarantee the accuracy of recommendation, but it is difficult to maintain its diversity and novelty. Generally, increasing the diversity or novelty of an RS will decrease its accuracy. Therefore, personalized RSs are often required to consider accuracy, diversity, and novelty simultaneously when making recommendations.

Some studies have tried to balance the accuracy and diversity of recommendations [18, 22, 23]. In [22], this balancing problem was modeled as a quadratic programming problem with a control parameter that determines the importance of diversity in the recommendation lists, and several optimization strategies were accordingly designed to optimize the target model. Similarly, in [23], a hybrid recommendation algorithm was proposed that combines the heat-spreading (HeatS) algorithm and the probabilistic spreading (ProbS) algorithm to improve the diversity and accuracy of recommendations. Moreover, in [18], some item ranking techniques were presented for giving diverse recommendations with comparable levels of accuracy. However, these algorithms [18, 22, 23] only pursue recommendations with high accuracy and good diversity, ignoring novelty.

Multiobjective optimization problems (MOPs) try to optimize a set of conflicting objectives simultaneously and arise in various research fields, such as engineering design [25] and route planning [26]. When solving an MOP, a number of nondominated solutions are returned, each of which is a tradeoff among the different objectives [27]. Based on the studies in [28, 29], there is a tradeoff among multiple conflicting objectives (i.e., accuracy, diversity, and novelty) when modeling RSs as MOPs. Recently, multiobjective evolutionary algorithms (MOEAs) have been proposed to solve the MOPs modeled for RSs. For instance, in [30], the personalized recommendation problem was first modeled as an MOP with two objectives (accuracy and diversity), and then an MOEA with the ProbS technique was proposed. Following this work, two novel MOEAs [31, 32] were designed based on the frameworks of MOEA/D [32] and NSGA-II [24] for recommendations with long-tail items. These works [30–32] focus on balancing the accuracy and diversity of recommendations. However, these MOEAs consider only the two conflicting objectives (accuracy and diversity) and do not exploit algorithm-specific knowledge in their designs.

In this paper, a multiobjective personalized recommendation algorithm with extreme point guided evolutionary computation, called MOEA-EPG, is proposed. In MOEA-EPG, the accuracy, diversity, and novelty of recommendations are chosen as the three conflicting objectives, and the aim of our algorithm is to optimize the resulting MOP for personalized recommendation. The prior knowledge of RSs is extracted from the MOP by finding the extreme value of each objective, which is then incorporated into MOEA-EPG. As revealed by our experiments, this knowledge can effectively guide the population's evolution and thus improve the search efficiency. Moreover, the crossover operator is designed to better satisfy the demands of users. To conclude, the main contributions of this paper are summarized as follows.
(1) The recommendation problem is modeled as an MOP with three conflicting objectives: accuracy, diversity, and novelty. The first objective measures the accuracy of recommendations, i.e., whether the predicted ratings are close to the true ratings of users; the second optimizes the diversity of recommendations by calculating the recommendation coverage; and the last reflects the capability of RSs to recommend unknown items to users.
(2) An extreme point guided method is proposed based on the prior knowledge of RSs, which is used to enhance the performance of personalized recommendation. For each objective, we choose the solution with the maximal value over the available items as the extreme point. The chosen extreme points are then embedded into the initialization of the population to guide the evolutionary search toward the Pareto-optimal front.
(3) In the crossover process, the similarity between users' interests is first evaluated, and then users with a higher similarity are selected for crossover with a larger probability. This helps to generate offspring solutions of good quality.
(4) The experiments on two classical data sets (Movielens and Netflix) show the superiority of MOEA-EPG over the existing MOEAs [30, 33] in terms of accuracy, diversity, and novelty.

The rest of this paper is organized as follows. Section 2 introduces the related background, such as the definition of RSs, MOPs, and classical recommendation algorithms. Section 3 presents the proposed MOEA-EPG algorithm, including the modeled MOP for RSs, individual representation, and the extreme point guided MOEA. Section 4 provides the experimental comparisons of MOEA-EPG with other competitive recommendation algorithms. At last, conclusions and future work are given in Section 5.

2. Background

2.1. Definition of RSs

RSs aim to predict the potential likes and interests of users based on historical data about the users' evaluations of items. Items in RSs may represent any consumed content, such as books, movies, news, and music. Let U be a set of M users, and let I be a set of N items. Based on the historical data, a rating matrix R is generated to record the users' evaluations of items [34]. The aim of an RS is to push a list of personalized recommended items to each user. To make recommendations, ratings of all the items should be available for all users. However, in practice, some users may not have evaluated all items, and these missing evaluations need to be predicted. Let R denote the true ratings of the M users on the N items, and let R̂ represent the predicted ratings. Then, the L items with the highest predicted ratings for a given user are added to its recommendation list, where L is the length of the recommendation list. Figure 1 gives a simple RS with four users and four items, which records the ratings of users to movies. Please note that each rating in this example is either "like" or "dislike", while the solid and dashed lines indicate the true and predicted ratings, respectively.
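
To make the top-L recommendation concrete, the following short Python sketch builds a recommendation list of length L from a matrix of predicted ratings while skipping items a user has already rated; the matrix values are purely illustrative and are not taken from Figure 1.

import numpy as np

def top_l_recommendations(predicted, rated_mask, L=2):
    # predicted: (M, N) predicted ratings; rated_mask: True where the user already rated the item
    scores = np.where(rated_mask, -np.inf, predicted)   # never re-recommend already rated items
    return np.argsort(-scores, axis=1)[:, :L]           # indices of the L highest-scored items per user

# Toy example with 4 users and 4 items (values are hypothetical)
predicted = np.array([[0.9, 0.1, 0.7, 0.4],
                      [0.2, 0.8, 0.6, 0.3],
                      [0.5, 0.5, 0.9, 0.1],
                      [0.4, 0.7, 0.2, 0.8]])
rated = np.zeros_like(predicted, dtype=bool)
rated[0, 0] = True                                      # user 0 has already rated item 0
print(top_l_recommendations(predicted, rated, L=2))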

2.2. Multiobjective Optimization Problems

Multiobjective optimization problems (MOPs) consist of several conflicting objectives, where improving one objective may worsen the others. Thus, optimizing all the objectives does not yield a single optimal solution but a set of solutions, each of which is a best tradeoff among all the objectives [35–37]. Generally, an MOP can be modeled as

maximize F(x) = (f1(x), f2(x), ..., fm(x)), subject to x ∈ Ω,   (1)

where x = (x1, x2, ..., xn) is a decision vector with n dimensions, Ω is the decision space, and m is the number of objectives. The MOP in (1) aims to maximize all the objectives simultaneously. In the following, some definitions from Pareto optimality theory [38] are given.

Definition 1 (Pareto-dominance). A decision vector x dominates a decision vector y (denoted as x ≻ y) if and only if fi(x) ≥ fi(y) for every i ∈ {1, ..., m} and fj(x) > fj(y) for at least one j ∈ {1, ..., m}.

Definition 2 (Pareto-optimal). A solution x* ∈ Ω is Pareto-optimal if and only if there is no other solution x ∈ Ω that dominates x*.

Definition 3 (Pareto-optimal set, PS). This set PS includes all the Pareto-optimal solutions.

Definition 4 (Pareto-optimal front, PF). This set PF contains all the values of the objective functions related to the Pareto-optimal solutions in PS.
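
The following Python sketch illustrates Definitions 1-4 for a maximization MOP: a dominance test and the extraction of the nondominated (Pareto-optimal) set from a finite set of objective vectors. The function names and sample values are illustrative only.

from typing import Sequence, List

def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    # True if objective vector a Pareto-dominates b (all objectives maximized)
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def nondominated(points: List[Sequence[float]]) -> List[Sequence[float]]:
    # Return the points not dominated by any other point (an approximation of the PF)
    return [p for p in points
            if not any(dominates(q, p) for q in points if q is not p)]

# Three objectives (accuracy, coverage, novelty): the first two points are
# incomparable tradeoffs, the third is dominated by the first.
print(nondominated([(0.30, 0.10, 4.2), (0.25, 0.20, 5.0), (0.28, 0.05, 4.0)]))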

2.3. Classical Recommendation Algorithms

User-based collaborative filtering: This is a classical RS method [39, 40] which predicts the ratings of a user for items based on the preferences of users who show similar interests. For a given user u, its K most similar users are denoted by S(u, K). Let Iu represent the items rated by the users in S(u, K) but not by user u. The predicted rating of user u for an item i in Iu is computed as

r̂ui = Σ_{v∈S(u,K)} suv · rvi / Σ_{v∈S(u,K)} |suv|,

where suv is the similarity between users u and v, |suv| is the absolute value of suv, and rvi is the rating of user v to item i. Then, the L items in Iu with the highest predicted ratings are recommended to user u, where L is the length of the recommendation list. There are many well-known methods for computing the similarity suv between users, such as Cosine Similarity (CS) [34], Pearson Correlation Similarity (PCS) [41], and Adjusted Cosine Similarity (ACS) [15]. In this paper, the CS method [34] is chosen due to its simplicity, which computes the similarity as

suv = (ru · rv) / (|ru| · |rv|),

where ru is the rating vector of user u over all items and |ru| is its length (norm).
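
As an illustration of the above formulas, the following Python sketch predicts ratings with user-based CF and cosine similarity on a toy binary rating matrix; it is a minimal reading of the method in [39, 40], not the authors' implementation.

import numpy as np

def cosine_similarities(R):
    norms = np.linalg.norm(R, axis=1, keepdims=True)
    norms[norms == 0] = 1.0                      # avoid division by zero for empty users
    Rn = R / norms
    return Rn @ Rn.T                             # s_uv for all user pairs

def predict_ratings(R, K=2):
    M, N = R.shape
    S = cosine_similarities(R)
    np.fill_diagonal(S, -np.inf)                 # exclude the user itself from S(u, K)
    pred = np.zeros((M, N))
    for u in range(M):
        neighbors = np.argsort(-S[u])[:K]        # S(u, K): the K most similar users
        w = S[u, neighbors]
        denom = np.abs(w).sum() or 1.0
        pred[u] = w @ R[neighbors] / denom       # weighted average of neighbor ratings
    pred[R > 0] = 0.0                            # only items not yet rated by u are candidates
    return pred

R = np.array([[1, 0, 1, 0],
              [1, 1, 0, 0],
              [0, 1, 1, 1]], dtype=float)
print(predict_ratings(R, K=2).round(2))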

MOEA with ProbS (MOEA-ProbS) [30]: This algorithm proposed a multiobjective recommendation model to handle the conflict between accuracy and diversity. In MOEA-ProbS, the accuracy is evaluated by the ProbS method [42], while the diversity is measured by the coverage of the recommendations. MOEA-ProbS can provide multiple recommendations for users in a single run, so that users can choose what they want from the multiple recommendations according to their preferences. However, the predicted ratings in [30] are not as accurate as those of the traditional collaborative filtering (CF) algorithm, due to the lack of prior knowledge of RSs.

Probabilistic MOEA (PMOEA) [33]: This algorithm introduced a new diversity indicator and a multiparent probabilistic crossover to obtain better recommendations. PMOEA was validated to achieve a good balance between the two objectives (accuracy and diversity) in recommendations.

Following the MOEAs-based recommendation algorithms in [30, 33], we propose a multiobjective personalized recommendation algorithm using extreme point guided evolutionary computation to enhance the performance of RSs, which shows some advantages in terms of accuracy, diversity, and novelty.

3. Our Algorithm: MOEA-EPG

In this section, we first model the MOP for RSs with three conflicting objectives (accuracy, diversity, and novelty) and then present the MOEA-EPG algorithm to solve it.

3.1. Optimization Objectives

Three optimization objectives, i.e., accuracy, diversity, and novelty, are used as the objectives for personalized recommendations.

The accuracy of recommendations measures how close the predicted ratings are to the true ratings of users. In fact, the true ratings cannot be obtained in the training stage. Therefore, the predicted ratings of the recommended items are used as the optimization objective. The mean predicted rating of the items recommended to all the users in one cluster C is computed as follows:

f_acc = (1 / (|C| · L)) Σ_{u∈C} Σ_{i∈Lu} r̂ui,   (8)

where |C| is the number of users in the cluster C, L is the length of the recommendation list, and Lu represents the list of items recommended to user u. In MOEA-ProbS [30], the predicted ratings are obtained by the ProbS algorithm [42]. Here, following the work in [39], the predicted rating r̂ui is computed as follows:

r̂ui = Σ_{v∈S(u,K)} suv · rvi / Σ_{v∈S(u,K)} |suv|,   (9)

where S(u, K) denotes the K most similar users to user u.

The diversity of recommendations reflects the difference between the items in the recommendation lists. Classical diversity metrics [43], such as interuser diversity, intrauser diversity, and coverage, can be used in our algorithm. Here, the coverage is chosen to evaluate the diversity of recommendations, which is computed as follows:

f_cov = Nd / N,   (10)

where N is the total number of items and Nd is the number of nonduplicated items in the recommendation lists of the users in the same cluster. Generally, RSs with a high coverage in (10) are valuable to users, as many choices can be provided for their decisions.

The novelty reflects the degree to which unknown items are recommended to users. Here, we choose the most common index, self-information [30, 33], to evaluate the novelty of recommended items. Let di represent the popularity degree of an item i, i.e., the number of users who have collected it. The self-information of item i is defined as follows:

SIi = log2(M / di),   (11)

where M is the total number of users. Generally, items with a low popularity degree are novel to users. For better recommendations, the average novelty N(L), i.e., the mean self-information of all the items in the recommendation lists, is used, which is computed as follows:

N(L) = (1 / (|C| · L)) Σ_{u∈C} Σ_{i∈Su} SIi,   (12)

where Su is the recommendation list of user u.
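
The three objectives can be evaluated with a few lines of code. The following Python sketch computes them for one cluster under the formulas above, assuming a list of per-user recommendation lists, a matrix of predicted ratings, and the popularity degree of each item (all names and values are illustrative).

import numpy as np

def accuracy_obj(rec, pred):
    # Mean predicted rating of all recommended items in the cluster
    return np.mean([pred[u, i] for u, items in enumerate(rec) for i in items])

def coverage_obj(rec, n_items):
    # Fraction of distinct recommended items over all N items
    return len({i for items in rec for i in items}) / n_items

def novelty_obj(rec, degrees, n_users_total):
    # Mean self-information log2(M / d_i) of the recommended items
    si = np.log2(n_users_total / np.maximum(degrees, 1))
    return np.mean([si[i] for items in rec for i in items])

rec = [[0, 2], [1, 3]]                                        # 2 users in the cluster, L = 2
pred = np.array([[0.9, 0.1, 0.7, 0.4], [0.2, 0.8, 0.6, 0.9]])
degrees = np.array([50, 5, 20, 2])                            # item popularity over all M users
print(accuracy_obj(rec, pred), coverage_obj(rec, 4), novelty_obj(rec, degrees, 100))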

3.2. Our Method
3.2.1. Extreme Point Guided Method

To accelerate the convergence of recommendations, an extreme point guided method is used, which incorporates the prior knowledge of RSs. In this method, three extreme points, each of which has the maximal value for one objective, are used to extend the Pareto-optimal front.

The accuracy of recommendations in (8) will be affected by the coverage in (10) and the novelty in (12). In our approach, an optimal value of (8) is found in the initialization phase and treated as an extreme point to guide the evolutionary process, aiming to search the extreme region of the accuracy objective. For each user u in one cluster C, the available items in Iu are sorted in descending order of their predicted ratings. Then, the first L items are assigned to the recommendation list Lu and recommended to user u. This extreme point has the maximal accuracy:

E_acc = (1 / (|C| · L)) Σ_{u∈C} Σ_{i∈Lu} r̂ui,   (13)

where Lu here contains the L available items with the highest predicted ratings for user u.

Similarly, the second extreme point is found by optimizing only the coverage. The optimal value of the coverage should cover diverse categories of items. Assume that the users in a cluster C are denoted by u1, u2, ..., u|C|, where |C| is the number of users in the cluster C. To enhance the coverage, the items that have been recommended to previous users are removed and no longer pushed to the remaining users. This method ensures the diversity of items and reduces duplicates in the cluster C. Thus, this extreme point attains the maximal coverage:

E_cov = Nd / N,   (14)

where Nd is the number of distinct items recommended to all the users in the cluster.

The third extreme point is obtained by optimizing only the novelty of recommendations. Similarly, for each user u in a cluster C, the L items in Iu with the largest self-information in (11) are first found, and then they are assigned to the recommendation list of user u. Based on (11), items with a low degree have high self-information. Thus, the L available items with the smallest degrees are actually recommended to each user u, and they are recorded in the list Su. This extreme point has the maximal novelty:

E_nov = (1 / (|C| · L)) Σ_{u∈C} Σ_{i∈Su} SIi,   (15)

where SIi is the self-information of item i as defined in (11).

These three extreme points are used to accelerate the convergence of our algorithm toward the entire Pareto-optimal front and also to enhance the population’s diversity.
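
A minimal sketch of how the three extreme solutions could be constructed for one cluster is given below; it follows our reading of this subsection, and the helper names and the fallback when too few fresh items remain are assumptions, not the authors' code.

import numpy as np

def accuracy_extreme(avail, pred, L):
    # For every user, take the L available items with the highest predicted rating
    return [sorted(items, key=lambda i: -pred[u, i])[:L] for u, items in enumerate(avail)]

def coverage_extreme(avail, L):
    # Greedily skip items already recommended to earlier users in the cluster
    used, rec = set(), []
    for items in avail:
        fresh = [i for i in items if i not in used][:L]
        if len(fresh) < L:                       # fall back to duplicates if needed
            fresh += [i for i in items if i not in fresh][:L - len(fresh)]
        used.update(fresh)
        rec.append(fresh)
    return rec

def novelty_extreme(avail, degrees, L):
    # For every user, take the L available items with the smallest degree
    return [sorted(items, key=lambda i: degrees[i])[:L] for items in avail]

avail = [[0, 1, 2, 3], [0, 1, 2, 3]]
pred = np.array([[0.9, 0.1, 0.7, 0.4], [0.2, 0.8, 0.6, 0.9]])
deg = np.array([50, 5, 20, 2])
print(accuracy_extreme(avail, pred, 2), coverage_extreme(avail, 2), novelty_extreme(avail, deg, 2))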

3.2.2. Solution Encoding

The encoding scheme in [30] is used to convert the recommendation lists into solutions of the MOP. In RSs, recommendation results should be provided to all the users in the same cluster, so a solution represents the recommendation lists of all users in that cluster. Given M users in a cluster and a recommendation list of length L, a solution is encoded as a matrix of size M×L, which consists of all the items recommended to the users in the cluster. The solution encoding is shown in Figure 2, in which each row contains the L items recommended to one user. Note that an RS is not allowed to recommend the same item to a user twice.
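
For illustration, a random valid solution under this encoding can be generated as follows, where each row samples L distinct items from the corresponding user's available items (an assumed initialization, shown only to make the encoding concrete).

import random

def random_solution(avail, L):
    # avail[u] is the list of available (unrated) items of user u
    return [random.sample(avail[u], L) for u in range(len(avail))]

solution = random_solution([[0, 1, 2, 3, 4], [1, 2, 3, 4, 5]], L=3)
print(solution)   # e.g., [[4, 0, 2], [5, 3, 1]] -- no duplicates within a row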

3.2.3. Evolutionary Operators

The uniform crossover [30] is used in our approach due to its simplicity. As shown in Figure 3, in each crossover operation, two similar users are chosen to execute the crossover. To avoid generating invalid solutions (i.e., an item recommended to a user twice or more), an additional operation first propagates the elements shared by the two parents to the offspring. For the remaining positions, the offspring inherits from the first parent if a randomly generated value in [0, 1] is larger than 0.5; otherwise, it inherits from the second parent [30]. The above crossover is executed on all pairs of parent solutions.

A mutation operator is then performed on each individual generated by the crossover, which works as follows: if a gene of the solution matrix is selected for mutation, it is replaced by an item randomly selected from the available items of the corresponding user. To avoid invalid solutions, the new item is chosen among the available items that do not already appear in that user's recommendation list.
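
The following Python sketch shows one possible, duplicate-safe implementation of the uniform crossover and mutation described above; it reflects an assumed reading of [30] rather than the exact operators.

import random

def uniform_crossover(p1, p2):
    # Row-wise uniform crossover that never puts the same item twice in a row
    child = []
    for row1, row2 in zip(p1, p2):
        new_row = list(set(row1) & set(row2))            # shared items are always inherited
        for a, b in zip(row1, row2):
            if len(new_row) == len(row1):
                break
            pick = a if random.random() > 0.5 else b     # inherit from one of the parents
            if pick not in new_row:
                new_row.append(pick)
        for c in row1 + row2:                            # fill up if positions remain
            if len(new_row) == len(row1):
                break
            if c not in new_row:
                new_row.append(c)
        child.append(new_row)
    return child

def mutate(sol, avail, pm=0.1):
    # Replace a gene with a random available item not already in that row
    for u, row in enumerate(sol):
        for k in range(len(row)):
            if random.random() < pm:
                candidates = [i for i in avail[u] if i not in row]
                if candidates:
                    row[k] = random.choice(candidates)
    return sol

print(uniform_crossover([[0, 1, 2]], [[1, 3, 4]]))       # e.g., [[1, 0, 3]]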

3.2.4. Complete Algorithm of MOEA-EPG

The flowchart of MOEA-EPG is shown in Figure 4. MOEA-EPG includes two main parts: data processing and the evolutionary algorithm. Moreover, the framework of MOEA-EPG is provided in Algorithm 1, where N is the population size, T is the maximum number of generations, K is the ratio of similar users, P records the recommendation lists for all users, and L is the length of the recommendation list.

 Input: the training data, the test data, and a binary rating matrix R
 Set g = 0, L = 10, N = 100, T = 3000, K = 0.5
 Cluster all the users U in the training data into four clusters U1, U2, U3, and U4
 Set the recommendation lists P1, P2, P3, and P4 to empty sets
 for j = 1 to 4
    Compute the similarities of different users in Uj based on Eq. (7)
    Set g = 0
    for k = 1 to |Uj|
       Compute the available items of user uk, as recorded by the set Iuk
       for each item i in Iuk
          Compute the predicted rating of user uk to item i by Eq. (9)
    EPOP = Initialization(L, N, R, pr)
    while g < T
        Epa = Objective_Func(EPOP)
        Epa = Fast_Non_Dominated_Sort(Epa)
        EPOP = Selection(EPOP, Epa)
        EPOP = Uniform_Crossover(EPOP)
        NPOP = Mutation(EPOP)
        POP = EPOP ∪ NPOP
        EPOP = Update(POP, Epa, N)
        g = g + 1
    Pj = EPOP
 Return P = {P1, P2, P3, P4}

In the data processing phase, the users U in the training data are divided into clusters by K-means clustering [44]. In this paper, the users are divided into four clusters U1, U2, U3, and U4, and the four recommendation lists P1, P2, P3, and P4 are initialized as empty sets, as shown in Algorithm 1. After that, the available items Iu of each user and the predicted ratings of these items are calculated according to the similarities of different users. The evolutionary part of MOEA-EPG then works as follows.

First, the three extreme points E_acc, E_cov, and E_nov are calculated using (13)-(15), and the algorithm then executes the initialization of the population. In each generation, the function Objective_Func(.) calculates the objective values of the individuals in the population EPOP based on (8), (10), and (12) and records the results in Epa. After that, the function Fast_Non_Dominated_Sort(.) in Algorithm 2 is executed on the evolutionary population to sort the solutions into nondominated fronts based on Epa. Then, the function Selection(.) selects parent solutions from the population EPOP by binary tournament [45]. The functions Uniform_Crossover(.) and Mutation(.) execute the crossover and mutation operations of Section 3.2.3 on the parent population EPOP, producing the offspring population NPOP. Finally, the function Update(.) chooses N individuals from the combined population POP (the union of EPOP and NPOP) for the next generation based on the elitist selection strategy [24], which is composed of a fast nondominated sorting and a crowding distance comparison. This evolutionary loop is repeated until the current generation number g reaches the predefined maximal generation T, after which the algorithm returns the recommendation results P for all users.

Input the population Epa
for each p ∈ Epa
     Sp = ∅, Mp = 0
     for each q ∈ Epa
          if (p ≻ q) then      // If p dominates q
               Sp = Sp ∪ {q}   // Add q to the set of solutions dominated by p
          else if (q ≻ p) then
               Mp = Mp + 1     // Increment the domination counter of p
     if Mp = 0 then            // p belongs to the first front
          prank = 1
          F1 = F1 ∪ {p}
i = 1                          // Initialize the front counter
while Fi ≠ ∅
     H = ∅                     // Used to store the members of the next front
     for each p ∈ Fi
          for each q ∈ Sp
               Mq = Mq - 1
               if Mq = 0 then  // q belongs to the next front
                    qrank = i + 1
                    H = H ∪ {q}
     i = i + 1
     Fi = H
Return all the fronts F1, F2, ..., Fi

The function Fast_Non_Dominated_Sort(.) in Algorithm 2 works as follows. First, the solutions on the first level are found by the nondominated sorting strategy. Second, the solutions on the first level are removed, and the nondominated sorting strategy works on the remaining solutions to find the solutions on the next level. These steps are repeated until every solution has been assigned to a ranking level, so that all individuals are divided into multiple levels. A detailed description of Fast_Non_Dominated_Sort(.) can be found in [24]. Here, a simple example is given in Figure 5 for the three conflicting objectives (accuracy, coverage, and novelty). The tradeoff among these three objectives can be clearly observed, and the solutions plotted as red circles on the first level are preferred, as they approach the PF most effectively.

4. Experimental Results

In this section, experiments on two data sets are executed to examine the performance of MOEA-EPG. First, the experimental settings are introduced, including the experimental data, the performance metrics, and the parameter settings. Second, MOEA-EPG is compared with two state-of-the-art recommendation algorithms (i.e., User-based-CF [39] and MF [46]) and two competitive MOEAs for RSs (i.e., MOEA-ProbS [30] and PMOEA [33]) to demonstrate its effectiveness in the accuracy, diversity, and novelty of the recommendations. Finally, the effectiveness of the extreme point guided method and the impact of some parameters are analyzed.

4.1. Experimental Data

In our experiments, two classical benchmark data sets, Movielens and Netflix, are used. The Movielens data set includes 943 users and 1682 movies and can be downloaded from the website of GroupLens Research (http://www.grouplens.org/). The original ratings in this data set take values from 1 to 5. Here, a binary rating system with "like" and "dislike" is used: an item is assigned the binary rating "like" by a user when the original rating is larger than 3 [30]. Then, 80% of the data are randomly selected as the training data with known information for recommendations, while the remaining data are used as the test data. To make a fair comparison, similar to [30], the users in this data set are divided into four clusters by the K-means algorithm [47], each of which is used as a test data set. The properties of these data sets are presented in Table 1 [23].
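
For readers who want to reproduce the preprocessing, the sketch below binarizes ratings (larger than 3 becomes "like") and splits the known ratings 80/20 into training and test sets; the random matrices only stand in for the real Movielens data, which must be loaded separately.

import numpy as np

rng = np.random.default_rng(0)
ratings = rng.integers(1, 6, size=(943, 1682))        # placeholder for the Movielens ratings
observed = rng.random((943, 1682)) < 0.06             # placeholder sparsity pattern

likes = (ratings > 3) & observed                      # binary "like" matrix
idx = np.argwhere(likes)
rng.shuffle(idx)
split = int(0.8 * len(idx))
train_idx, test_idx = idx[:split], idx[split:]

train = np.zeros_like(likes)
train[train_idx[:, 0], train_idx[:, 1]] = True        # known information for recommendations
test = np.zeros_like(likes)
test[test_idx[:, 0], test_idx[:, 1]] = True           # held out for evaluating precision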

The Netflix data set was used for the Netflix Prize and can be downloaded from the contest website (http://www.netflixprize.com/). Here, a random subset of the Netflix data set is used, containing 155177 ratings of 1000 users on 3952 movies. The properties of these data sets are shown in Table 2.

4.2. Performance Metrics

Here, the hypervolume (HV) metric [27] is adopted to evaluate both the convergence and the diversity of solutions; it is widely used for the evaluation of MOEAs [36, 37]. Let z^r = (z^r_1, ..., z^r_m) be a reference point in the objective space that is dominated by all vectors on the true PF. The HV metric calculates the size of the objective space dominated by the solutions in a set S and bounded by z^r:

HV(S) = Leb( ∪_{x∈S} [z^r_1, f1(x)] × ... × [z^r_m, fm(x)] ),

where Leb(·) denotes the Lebesgue measure. In this paper, the reference point z^r is set to (0, 0) and (0, 0, 0) for the biobjective and three-objective problems, respectively. A larger HV indicates a solution set of better quality. In this experiment, for each algorithm the result with the highest HV over 30 runs is selected for comparison.
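
For the biobjective case with reference point (0, 0), the HV of a solution set can be computed with a simple sweep, as in the sketch below (illustrative code, not the implementation used in the experiments).

def hypervolume_2d(points, ref=(0.0, 0.0)):
    pts = sorted(points, key=lambda p: p[0], reverse=True)   # sort by f1 descending
    hv, prev_f2 = 0.0, ref[1]
    for f1, f2 in pts:
        if f2 > prev_f2:                                     # only nondominated points add area
            hv += (f1 - ref[0]) * (f2 - prev_f2)
            prev_f2 = f2
    return hv

# Two tradeoff points: 0.4*0.2 + 0.3*(0.5-0.2) = 0.17
print(hypervolume_2d([(0.4, 0.2), (0.3, 0.5)]))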

Moreover, three performance metrics are used, i.e., precision, coverage, and novelty. Precision is the most important metric for evaluating RSs. For a given user u, its precision is defined as the ratio of relevant items that are correctly recommended:

Precision(u) = Hu / L,

where Hu is the number of items appearing both in the recommendation list of user u and in the items preferred by user u in the test data set, and L is the length of the recommendation list. The recommendation accuracy in this paper is measured by the mean precision over all the users.
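
The precision of a single user can be computed directly from its recommendation list and the items it prefers in the test set, as in the short sketch below (illustrative only).

def precision(rec_list, test_likes):
    # rec_list: the L recommended items; test_likes: items liked by the user in the test set
    return len(set(rec_list) & set(test_likes)) / len(rec_list)

print(precision([3, 7, 9, 12], {7, 12, 20}))   # 2 hits out of 4 -> 0.5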

The coverage metric of an RS is the ratio of distinct items in the recommendation lists over all the available items. Generally, RSs with a low coverage may be of little value to users, as they do not provide enough choices for decision making. In this paper, the objective in (10) is used as the performance metric for the diversity of RSs.

The novelty of RSs reflects how well RSs can recommend unknown items to users. Generally, RSs with a large novelty will extend user’s interests. In this paper, the objective in (12) is used as a performance metric for this indicator.

4.3. Experimental Settings of Comparison Algorithms

The related parameters in comparison algorithms are listed in Table 3, and the length of recommendation list L is set to 10. To allow a fair comparison, the related parameters for each algorithm were set as suggested in the corresponding references [30, 33, 39, 46].

In Table 3, N is the population size; the crossover probability and the mutation probability are listed for each algorithm; the number of parents used for crossover in PMOEA (PMOEA-ProbS and PMOEA-CF) is given; n is the number of neighbors; and Tr is the number of independent trials. Moreover, in MOEA-EPG, K is the ratio of the most similar users to the target users within a cluster. These algorithms provide a comprehensive basis for validating the performance of MOEA-EPG.

4.4. Effectiveness of MOEA-EPG

To study the effectiveness of MOEA-EPG, the three-objective model is used, and the experimental results of MOEA-EPG on all the eight data sets are reported. For each data set, the nondominated solutions with the highest HV value [27] are plotted as the final results. In Figure 6, the final solutions of MOEA-EPG are plotted for all the eight data sets in the objective space. As observed from Figure 6, there exists a tradeoff among the three objectives, which forms a nondominated front. Each solution on the nondominated front is marked by a blue square. From these plots, we can observe that the obtained solutions are well converged and widely distributed.

To further show the distributions of the HV results, Figure 7 gives the boxplots of the HV values obtained by MOEA-EPG on the eight data sets in the accuracy-coverage space over 30 runs. In each box diagram, the central mark is the median, while the lower and upper edges of the box are the 25th and 75th percentiles, respectively. Obviously, the distribution of the HV values of MOEA-EPG is robust, which further confirms the effectiveness of our algorithm.

4.5. Comparisons with Other Competitors

In this subsection, our algorithm is compared with five competitors on the classical Movielens and Netflix data sets. As User-based-CF and MF obtain only a single solution in one run, they cannot be compared in terms of HV. In our experiments, the performance comparisons are conducted under different objective combinations, i.e., the accuracy-coverage, accuracy-novelty, and accuracy-coverage-novelty spaces.

Since the computational difficulties under different objective combinations vary, the parameter T is set to 3000 for the accuracy-coverage and accuracy-novelty spaces and to 6000 for the accuracy-coverage-novelty space, while the remaining parameters are set as listed in Table 3. All the mean HV values and the standard deviations are collected for performance comparison in Table 4, where the best result is highlighted in boldface. From Table 4, it can be observed that MOEA-EPG shows the best performance among all the MOEAs-based recommendation algorithms, especially in the accuracy-novelty and accuracy-coverage-novelty spaces.

To visually show the advantages of MOEA-EPG, Figures 8–10 give the final recommendation results with the best HV values in the accuracy-coverage, accuracy-novelty, and accuracy-coverage-novelty spaces, respectively. In Figure 8, it can be observed that, compared to User-based-CF and MF, MOEA-EPG is advantageous, as it finds better results in both accuracy and coverage. Moreover, for the data sets Movielens 2, Movielens 4, and Netflix 2, MOEA-EPG has clear advantages over MOEA-ProbS, PMOEA-ProbS, and PMOEA-CF. For the remaining data sets, MOEA-EPG is still competitive, as it produces the best results among all the competitors over most of the front; only a few solutions of MOEA-ProbS, PMOEA-ProbS, and PMOEA-CF achieve better coverage than MOEA-EPG, and this gain in coverage comes at a large loss of accuracy. For Movielens 1, Movielens 3, and Netflix 1, this loss of accuracy is significant. Figure 9 further confirms that the advantages of MOEA-EPG are particularly pronounced, as MOEA-EPG always outperforms the other algorithms in finding good recommendation solutions for all the data sets regarding both accuracy and novelty. In Figure 10, MOEA-EPG also performs much better than MOEA-ProbS, PMOEA-ProbS, and PMOEA-CF, which indicates that MOEA-EPG is able to produce superior solutions in terms of accuracy, coverage, and novelty.

4.6. Effectiveness of the Extreme Point Guided Method

In this section, the effectiveness of the extreme point guided method is studied. As introduced in Section 3.2.1, the three extreme points with respect to accuracy, coverage, and novelty are used to guide the evolutionary search. To further examine the rationale of the proposed method, MOEA-EPG is compared to its three variants, MOEA-EPG-I, MOEA-EPG-II, and MOEA-EPG-III, which remove the accuracy extreme point, the coverage extreme point, and the novelty extreme point from the original MOEA-EPG, respectively. Due to space limitations, only the Movielens data sets are used in this experiment.

All the mean HV values and the standard deviations over 30 runs are recorded in Tables 5 and 6 for the four test data sets (Movielens 1, Movielens 2, Movielens 3, and Movielens 4), where the best result is highlighted in boldface. Based on the experimental results, MOEA-EPG finds better solutions than MOEA-EPG-I and MOEA-EPG-II in the accuracy-coverage space, and it outperforms MOEA-EPG-III in the accuracy-novelty space. In particular, the accuracy extreme point and the novelty extreme point have a significant impact on the performance of the algorithm: they provide better search directions and thereby improve the efficiency of the evolutionary search. Therefore, the extreme point guided strategy is effective.

Figure 11 shows the final recommendation results with the best HV values of MOEA-EPG and its three variants (MOEA-EPG-I, MOEA-EPG-II, and MOEA-EPG-III) in the accuracy-coverage and accuracy-novelty spaces for all the data sets. Figure 11(a) illustrates that MOEA-EPG is superior to MOEA-EPG-I in terms of accuracy, showing that the accuracy of the algorithm is improved by introducing the accuracy extreme point. Figure 11(b) shows that MOEA-EPG is better than MOEA-EPG-II in terms of coverage. Figure 11(c) confirms that MOEA-EPG performs better than MOEA-EPG-III regarding novelty. Similar to the accuracy extreme point, the novelty extreme point also has a great impact on the performance of the algorithm. These results further validate that the extreme point guided method accelerates the convergence of our algorithm.

4.7. Impact of Parameters

In MOEA-EPG, the setting of the parameter K (i.e., the ratio of similar users in (9)) may affect the performance of our algorithm. To study its impact, MOEA-EPG was run with different K values (i.e., 0.3, 0.5, 0.7, and 0.9), while the other parameters were kept the same as listed in Table 3. In Table 7, all the mean HV results and the standard deviations over 30 runs are recorded for performance comparison, and the best result for each data set is highlighted in boldface.

Figure 12 further gives the final results with the best HV values on the data sets. From the comparison results, we observe that the setting of the parameter K affects the performance of MOEA-EPG. The HV values of MOEA-EPG under K=0.5 are the largest on most data sets, whereas the performance degrades when K takes other values. Therefore, K=0.5 is selected as the ratio of similar users in this paper.

4.8. Computational Complexity Analysis of MOEA-EPG

The computational complexity of MOEA-EPG can be deduced from the pseudocode in Algorithm 1. It is mainly determined by the evolutionary loop. Within this loop, the functions Objective_Func(.) and Selection(.) have a computational complexity of O(N·m^2·n), where N is the population size, m denotes the number of users, and n represents the number of items. According to [24], the functions Fast_Non_Dominated_Sort(.) and Update(.) have a computational complexity of O(M·N^2), where M is the number of objectives. Moreover, the functions Uniform_Crossover(.) and Mutation(.) have a computational complexity of O(N·m^2·n). Therefore, the overall computational complexity of MOEA-EPG is approximately O(T·N·m^2·n), since N·m^2·n generally dominates M·N^2, where T is the predefined maximal number of generations.

Table 8 further gives the computational complexities of all the compared recommendation algorithms. User-based-CF [39] and MF [46] have a computational complexity of O(m^2·n), while the complexity of MOEA-EPG, as derived above, is O(T·N·m^2·n). Since MOEA-ProbS and PMOEA adopt the same NSGA-II framework and differ only in their crossover operators, they share the same computational complexity as MOEA-EPG, i.e., O(T·N·m^2·n).

To show the actual running efficiency of all the MOEAs-based recommendation algorithms, Table 9 records their average execution times over 30 independent trials. From the results in Table 9, it can be seen that, although these algorithms share the same computational complexity, their actual running times differ because of the different crossover operators and extra strategies. For example, MOEA-ProbS is the fastest, as it applies the original NSGA-II [24] to the recommendation problem. Since MOEA-EPG has some extra steps, such as the calculation of the three extreme points and the selection of similar parents for crossover, its execution times are slightly longer than those of MOEA-ProbS on all the data sets. PMOEA-ProbS and PMOEA-CF consume the longest execution times, as they need to run the crossover operators multiple times.

5. Conclusions

In this paper, we presented a multiobjective personalized recommendation algorithm, MOEA-EPG, based on extreme point guided evolutionary computation. In this algorithm, the personalized recommendation problem is modeled as an MOP with three conflicting objectives, i.e., accuracy, diversity, and novelty. The accuracy of recommendations considers whether the predicted ratings are close to the true ratings of users, the diversity reflects the coverage of the recommendations, and the novelty evaluates the capability of RSs to recommend unknown items to users. In MOEA-EPG, three extreme points on accuracy, diversity, and novelty are incorporated to guide the evolutionary search toward the PF, and a novel crossover operator is designed to better satisfy the demands of users. The experimental results demonstrated the superior performance of MOEA-EPG over two state-of-the-art recommendation algorithms (i.e., User-based-CF [39] and MF [46]) and two competitive MOEAs for RSs (i.e., MOEA-ProbS [30] and PMOEA [33]) in terms of the accuracy, diversity, and novelty of recommendations. Moreover, the extreme point guided method was validated to accelerate the convergence toward the PF, and a sensitivity analysis on the parameters was conducted to confirm the robust performance of MOEA-EPG.

Our future work will focus on the performance metrics for recommendation algorithms and will further study other nature-inspired optimization algorithms (e.g., particle swarm optimization and ant colony optimization) for RSs.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grants 61876110, 61803269, 61872243, 61836005, and 61672358, the Joint Funds of the National Natural Science Foundation of China under Key Program Grant U1713212, the Natural Science Foundation of Guangdong Province under Grant 2017A030313338, Shenzhen Technology Plan under Grants JCYJ20170817102218122 and JCYJ20170302154032530, and CONACyT under Grant 221551.