Abstract

In general, the proximities to a certain diversity along the front and the Pareto front have the equal importance for solving multiobjective optimization problems (MOPs). However, most of the existing evolutionary algorithms give priority to the proximity over the diversity. To improve the diversity and decrease execution time of the nondominated sorting genetic algorithm II (NSGA-II), an improved algorithm is presented in this paper, which adopts a new vector ranking scheme to decrease the whole runtime and utilize Part and Select Algorithm (PSA) to maintain the diversity. In this algorithm, a more efficient implementation of nondominated sorting, namely, dominance degree approach for nondominated sorting (DDA-NS), is presented. Moreover, an improved diversity preservation mechanism is proposed to select a well-diversified set out of an arbitrary given set. By embedding PSA and DDA-NS into NSGA-II, denoted as DNSGA2-PSA, the whole runtime of the algorithm is decreased significantly and the exploitation of diversity is enhanced. The computational experiments show that the combination of both (DDA-NS, PSA) to NSGA-II is better than the isolated use cases, and DNSGA2-PSA still performs well in the high-dimensional cases.

1. Introduction

In the past 30 years, evolutionary multiobjective optimization (EMO) has been popular in research and application [110], and a lot of multiobjective evolutionary algorithms (MOEAs) have been presented [1125]. The nondominated sorting genetic algorithm (NSGA) being one of the first MOEA was introduced in [12]. Then, an improved version of NSGA, called NSGA-II [13], has been proposed. To improve the performance of NSGA-II, S. Salomon et al. [26, 27] introduced a diversity preservation mechanism that is based on partitioning algorithm for function selection. It is shown that this procedure significantly enhances the exploitation of diversity when embedded into NSGA-II [27]. Moreover, P. Mohapatra and S. Roy [28] proposed a new method which was called AP-NSGA-II. It employed the machine processed of NSGA-II but worked with a set of average-point-based techniques to maintain the diversity among solutions. The performance of the proposed AP-NSGA-II can maintain the diversity to some degree. Then, X. Y. Pan and J. Zhu [29] proposed an improved algorithm LDMNSGA-II. The algorithm adopted the technology of Latin hypercube sampling to ensure that the distribution of initial population could be uniform. And it used differential evolution operator to replace crossover operator of NSGA-II to enhance the local search ability and search accuracy. The results demonstrate that the proposed algorithm can achieve a good overall property on multiobjective optimization. Considering the execution time of NSGA-II, many scholars have made some research on the algorithm [3034]. It is worth noting that Y. R. Zhou et al. [35] introduced the dominance degree matrix for a vector set. And a fast method was designed for constructing the new data structure. By using the dominance degree matrix, it developed an efficient implementation of nondominated sorting called dominance degree approach for nondominated sorting (DDA-NS). Empirical results demonstrated that DDA-NS transcends other typical approach for nondominated sorting and DDA-NS works well for solving multiobjective problems. For these reasons, we borrow the idea of dominance degree matrix and PSA method to improve the performance of NSGA-II. So that it can combine the advantages of reducing runtime and improving diversity.

This paper is organized as follows. Section 1 provides an introduction; Section 2 introduces the DDA-NS method and PSA. Section 3 describes a straightforward approach for integrating DDA-NS and PSA into NSGA-II. The performance comparison is shown at Section 4. Finally, the paper ends with some conclusions in Section 5.

2. Basic Ideas and Concepts

Mathematically, a multiobjective problem (MOP) is described asHere is the decision vector of dimension , is the objective function, and is the decision space (DS). The image set, , is called the objective space (OS). Let be a decision vector. A decision vector is called a Pareto optimal solution if its objective vector is not dominated by any other vector in the objective space . All the Pareto optimal solutions constitute the Pareto optimal set. The Pareto optimal front is the set of images of the Pareto optimal set.

2.1. DDA-NSS

We first introduce some basic notions of DDA-NS which was introduced in [35].

2.1.1. Dominance Degree

Let , which is called dominance degree of to and represents the dominance between and , where is the cardinality of .

It is obvious to find the following:

  

  If , then , which means that will reach its maximum value at when

If , then or

2.1.2. Dominance Degree Matrix

Let be a objective vectors set, , and is a dimensional vector. Define the dominance degree matrix on the set for the dominance relation by

A faster method for calculating the dominance degree matrix follows the following two main steps:

Sort the objective vectors on every objective by using Quicksort [36].

Construct the dominance degree matrix by using the sorted results acquired in step .

Firstly, we construct the comparison matrix of a row vector and define the comparison matrix of by

One can see Algorithm 1 in literature [35] to calculate the matrix .

Secondly, sum the comparison matrices of each objective vector and then we can obtain the dominance degree matrix of a set of vectors. One can see Algorithm 2 in literature [35] for the details of this procedure.

Finally, a novel nondominated sorting method is provided in the following paragraph, in which the dominance degree matrix is utilized to assign the solutions to nondominated fronts in the population .

As we know, nondominated sorting approach assigns all solutions in initial population to nondominated fronts . The first front consists of the set of nondominated solutions in the initialized population . The second front is the set of nondominated solutions in the remaining set (i.e., removes all solutions assigned to from the population ) and repeats the procedure for the subsequent fronts.

Assume that denotes the dominance degree matrix of the population of size . In order to eliminate the influence of the same individuals, the corresponding elements of are set to 0. Let be a row vector which consists of the corresponding maximum elements from each column of . It can be verified that, for any , dominates if and only if equals . Then, the solutions corresponding to elements of whose subscripts are less than are nondominated solutions of . These nondominated solutions were classified to the first front . Then, delete the column and row vectors corresponding to from and denote the remaining matrix by . Then, continue the same procedure for matrix and obtain the second front . Repeat such a process until all solutions are classified to . Thus, a new nondominated sorting method is attained, which is provided in Algorithm 3 of literature [35].

The time complexity of the DDA-NS approach is . By using the DDA-NS approach, the complexity to compare the objective function values of nondominated sorting process in the NSGA-II can be reduced from to at iteration of the NSGA-II algorithm.

2.2. Part and Select Algorithm (PSA)

The Part and Select Algorithm (PSA) was put forward by S. Salomon et al. [27]. In this part we will make a brief introduction of the Part and Select Algorithm (PSA) which aims to select the well-spread points from a set of (contains candidate solutions). The procedure contains two main steps.

Step 1. Divide into subsets, so that the similar members can be grouped in the same subset.

Step 2. Each diverse subset is constructed through selecting one member from each generated subset.
To divide a set into subsets, PSA carry on divisions of one set into two subsets. In each dividing process, the set with the greatest dissimilarity among whole elements is divided. The same procedure is continued until the desired stopping criterion is met. The desired stopping criterion can be either a predefined number of subsets or a maximal dissimilarity among each of the subsets. The dissimilarity of the set is the measure .

Let ( objective vector for points ) and denote

Obviously, a larger indicates a greater dissimilarity among the members of . The pseudocode of PSA originates from Algorithm 1 in literature [27].

After the set has been divided into the subsets , choose the member closest to the center of the hyperrectangle circumscribing in the sense of Euclidean metric. If there exist more than one member closest to the center, chose one of them randomly. Let be the original set of members, from which a subset of members is to be selected. The complexity of this selecting approach is .

Proposition 1 (see [27]). Given a set , the overall complexity of choosing elements out of using PSA and the center point selection is .

3. Improved NSGA-II

NSGA-II is widely applied for solving multiobjective problems (MOPs) [2730]. However, its nondominated sorting approach usually has high computational overhead and the final Pareto approximate set is lack of diversity. For these considerations, this paper proposed a variant of NSGA-II algorithm. We denote this improved algorithm by DNSGA2-PSA which takes NSGA-II as the framework and retains the NSGA-II elitist-preserving approach, the crowding distance mechanism, and the binary tournament selection. After the population initialized, rank the population by DDA-NS and then calculate the individual’s crowded distance in each layer by PSA, so as to reduce the running time of the nondominated sorting and keep the initial solutions well-distributed. When the parent and offspring populations merged, PSA is used to prune population size instead of crowding comparison operator. Specific changes are as follows.

3.1. Modifying the Nondominated Sorting of NSGA-II

DDA-NS is integrated into the classical NSGA-II to replace fast nondominated sorting method. We assume that the current population of the size be sorted by DDA-NS. Firstly, we use Algorithm 2 in literature [35] to calculate the dominance degree matrix of the current population , donated as and then set it as input to Algorithm 3 in literature [35]. Further, the nondominated front is putted out.

3.2. Integration of PSA into NSGA-II

Because of the minimal requirements of PSA, the algorithm can be integrated into any MOEA and used as a selection mechanism and/or as a crowding assignment mechanism. So, the purpose is to utilize the high competency of PSA to select a diversified subset from an arbitrary set for enhancing diversity along the Pareto front. The difference between DNSGA2-PSA and the classical NSGA-II is that, instead of selecting members from the last front according to the crowded-comparison operator, selection is according to PSA. That is, assume that the combined current population of the size 2N is sorted to the nondominated fronts and and . Then the next parent population is constructed from the members of the sets and from members of the set . The only difference from the classical NSGA-II is that, instead of selecting members from the last front according to the crowded-comparison operator , selection is according to the PSA, as follows. The last front is partitioned to subsets according to Algorithm 1 in literature [27], and the central element of each subset is chosen to . Besides, the crowding assignment of NSGA-II, used for the binary tournament, is modified as well. Each set , is partitioned according to Algorithm 1 of literature [27] into subsets, and every member of is assigned a crowding measure equal to the number of members in its subset.

3.3. DNSGA2-PSA Algorithm

The DNSGA2-PSA is formed by integrating existing NSGA-II with DDA-NS and PSA. The procedure of the DNSGA2-PSA is depicted in Algorithm 1.

Initial population
Assign the solutions to fronts by DDA-NS //Algorithm 3 originates from ([35], Algorithm 1: DDA-NS)
Calculate the crowded distance in each layer by PSA // Algorithm 1  originates from
([27], Algorithm 1:Partitioning a set A into m subsets)
Parent population binary tournament selectin of
Offspring population genetic operators (simulated binary crossover and polynomial mutation) of
Recombination population
The next generation population select the recombination population by PSA // Algorithm 1 originates from
([27], Algorithm 1:Partitioning a set A into m subsets)

4. Computational Experiments and Performance Comparison

To demonstrate the scalability of the DDA-NS method and PSA, the experiments were conducted to compare the proposed DNSGA2-PSA with the NSGA2-PSA, along with in this part. Both algorithms are tested on the following benchmark functions and evaluated by the following performance criteria. All experiments were implemented on a system with the following specifications.

4.1. Benchmark Function

Example 2 (DTLZ1, [37]). where , , , and the real Pareto front is linear hyperplane .

Example 3 (DTLZ2, [37]). where , , and the real Pareto front is nonconvex and formed with .

Example 4 (DTLZ3, [37]). where , , and the real Pareto front (at ) is nonconvex.

Example 5 (DTLZ4, [37]). where , , and the real Pareto front is nonconvex and formed with .
The Pareto optimal front of DTLZ1 is the hyperplane of all points that satisfy the equation , where M is the number of objectives. The PF of DTLZ2-4 is the hypersphere of all points that satisfy the equation . DTLZ3 uses the same function for convergence as in DTLZ1, and therefore it can check the ability of a MOEA to find solutions close to the real PF. A very small section in the decision space is mapped to a very large section in the objective space of DTLZ4. This property makes it a good benchmark to test the ability of a MOEA to find a diverse set of solutions. Since the proposed DNSGA2-PSA aims to enhance the diversity of the approximated set, the results for the DTLZ4 are of major importance.

4.2. Performance Criteria

In the evaluation of multiobjective optimization algorithms, the convergence of the optimal solution set to the real Pareto optimal solution set and the uniform distribution of the optimal solution sets on the real Pareto solution set is considered. In this paper, four performance criteria are used to evaluate the convergence and the diversity of distribution of these algorithms.

4.2.1. GD

Generation distance (GD) [38] is a way to estimate how far the elements produced by our algorithm are from those in the true Pareto front of the problem. The measure isHere, is the number of nondominated vectors found by the algorithm and is the Euclidean metric between each of these vectors and the nearest member of the true Pareto front.

4.2.2. IGD

Inverted generational distance (IGD) [39] measure can indicate the overall quality of a solution set (i.e., its convergence to the Pareto front and diversity along the Pareto front). IGD is defined byHere is a distance between and in the objective space. This paper uses the Euclidean distance as .The IGD in (11) is the average distance from each reference point to its nearest solution in the solution set Q.

4.2.3. HV

Hypervolume (HV) [40] is also known as the measure or the Lebesgue measure [41, 42]. The hypervolume of a set measures the size of the dominated portion of the objective space by Pareto approximation set. HV is defined byHere is the nondominated solution set, is the decision vector, is the decision space, is the target vector, is the target space, and is the Lebesgue measure, while is the reference point.

4.2.4. Spacing

Spacing [43] is evenness metric. Assume that A denotes the approximate Pareto optimal solution set obtained by our algorithm, and then Spacing is defined byHere is the mean of , and is the number of objective functions. If the value of Spacing equals zero, it indicates that the obtained nondominated solution is evenness in the target space.

4.3. Discussion of the Results

In this section, the DNSGA2-PSA is compared with the , along with NSGA2-PSA. For a fair comparison, both algorithms start with the same initial population at every test and use the same genetic settings. In this paper, we employ a binary tournament selection operator, a polynomial mutation operator, and a real-parameter simulated binary crossover operator to construct the genetic operators. The mutation probability of and crossover probability of are used. For real-coded NSGA, the distribution indexes for crossover and mutation operators are set to be and , respectively. In the three algorithms, the maximum number of generations is set as 250; the population size is set as 100. The results of a typical approximated set can be seen for three objectives in Figure 1; the statistical results for two to five objectives are shown in Figures 2 and 3.

We discuss the effectiveness of DNSGA2-PSA in terms of the following aspects.

4.3.1. Computational Cost

Intuitively, as we can see from Table 1, the execution time of DNSGA2-PSA is less than NSGA2-PSA but more than for DTLZ1-4, which shows the superiority of the DDA-NS approach. Accordingly, the DDA-NS approach is demonstrated to outperform the original NSGA-II whose framework embedding the fast nondominated sorting approaches in terms of running time.

4.3.2. Convergence and Diversity

As we know, GD means the convergence of the algorithm; IGD provides information about both convergence and diversity of the approximated set. Smaller values of the GD and IGD are preferred. With respect to the DTLZ1-4, Tables 2 and 3 reflect that the GD and IGD values of DNSGA2-PSA are always better than that of . In both aspects of convergence and diversity of solutions, DNSGA2-PSA performs better than in these test problems. With regard to comprehensive performance, DNSGA2-PSA is better than .

Table 4 is the results of HV measure. A higher value of HV is associated with a good spread and convergence. Based on the results shown in Table 4, we can conclude that our algorithm (DNSGA2-PSA) obtains the best results, with the largest HV (on average). Also, we can say that the DNSGA2-PSA is relatively better than the algorithms.

As we know, Spacing provides information about diversity of the approximated set and smaller value of the Spacing is preferred. As can be seen from Table 5, the index of diversity is better than NSGA-IIDDA-NS but roughly equivalent as NSGA2-PSA. So, we can conclude that DNSGA2-PSA algorithm has a great advantage in promoting diversity.

For two to five objectives of DTLZ series, the boxplot of GD, IGD, HV, and Spacing are depicted in Figures 2 and 3, respectively. We can conclude that the IGD, HV, and Spacing values of DNSGA2-PSA are better than . However, as the number of objective functions increases, this advantage comes at the cost of a loss of proximity, as manifested in larger GD value of DNSGA2-PSA. Then the unresolved tradeoff between diversity and proximity emerges. The higher HV and smaller Spacing of DNSGA2-PSA indicate that the gain in diversity is more significant than the loss of proximity.

4.3.3. Overall Evaluation

Through the group of experiments, it has been shown that the proposed modified DNSGA2-PSA is clearly superior compared with the and NSGA2-PSA, especially generating relatively diverse nondominated set and reducing the running time sharply. This illustrates the effectiveness of the mechanisms adopted in our DDA-NS and PSA approach to maintain diversity and reduce runtime.

5. Conclusion

To reduce the execution time and enhance the exploitation of diversity of NSGA-II, this paper has discussed the combination of a new vector ranking scheme and diversity preserving selection with NSGA-II algorithm. The new vector ranking scheme produces a new data structure, namely, the dominance degree matrix. Using the dominance degree matrix, a novel and efficient execution of nondominated sorting to serve for ranking vector set (DDA-NS) is developed. Moreover, diversity preserving mechanism PSA (Part and Select Algorithm) is proposed. To demonstrate the potential benefit of the novel algorithm, DDA-NS and PSA are integrated into the well-known NSGA-II. According to experimental results, we can conclude the following.

The fast dominance degree matrix construction method can improve the computational complexity of vector ranking procedure efficiently.

The selection based on the partitioning algorithm can gain a front involving higher diversity (based on Spacing).

Appendix

Boxplot

See Figures 2 and 3.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

Dan Qu, Xianfeng Ding, and Hongmei Wang declare that they have no conflicts of interest.

Acknowledgments

At the point of finishing this paper, the authors would like to express their sincere thanks to all those who have lent them hands in the course of their writing this paper. First of all, they would like to take this opportunity to show their sincere gratitude to supervisor Mr. Xianfeng Ding who has given them so much useful advice on their writing and has tried her best to improve the paper. Secondly, they would like to express their gratitude to the classmates who offered them references and information on time. Without their help, it would have been much harder for them to finish this paper.