Research Article | Open Access
Alberto Pajares, Xavier Blasco, Juan M. Herrero, Gilberto Reynoso-Meza, "A Multiobjective Genetic Algorithm for the Localization of Optimal and Nearly Optimal Solutions Which Are Potentially Useful: nevMOGA", Complexity, vol. 2018, Article ID 1792420, 22 pages, 2018. https://doi.org/10.1155/2018/1792420
A Multiobjective Genetic Algorithm for the Localization of Optimal and Nearly Optimal Solutions Which Are Potentially Useful: nevMOGA
Traditionally, in a multiobjective optimization problem, the aim is to find the set of optimal solutions, the Pareto front, which provides the decision-maker with a better understanding of the problem. This results in a more knowledgeable decision. However, multimodal solutions and nearly optimal solutions are ignored, although their consideration may be useful for the decision-maker. In particular, there are some of these solutions which we consider specially interesting, namely, the ones that have distinct characteristics from those which dominate them (i.e., the solutions that are not dominated in their neighborhood). We call these solutions potentially useful solutions. In this work, a new genetic algorithm called nevMOGA is presented, which provides not only the optimal solutions but also the multimodal and nearly optimal solutions nondominated in their neighborhood. This means that nevMOGA is able to supply additional and potentially useful solutions for the decision-making stage. This is its main advantage. In order to assess its performance, nevMOGA is tested on two benchmarks and compared with two other optimization algorithms (random and exhaustive searches). Finally, as an example of application, nevMOGA is used in an engineering problem to optimally adjust the parameters of two PI controllers that operate a plant.
In industry, there are many situations where a design problem turns into an optimization problem. Moreover, it is usual that these problems have conflicting objectives, which generates a multiobjective optimization problem (MOP) ([1–3]). A MOP basically consists of three stages : its definition, the optimization problem (search), and the multicriteria decision-making (MCDM). In an a priori multiobjective approach , the stages of optimization and decision-making are carried out at the same time, which results in a single solution, since the desired preferences are defined beforehand. On the contrary, in an a posteriori approach , the search does not supply a single solution but a set of optimal solutions (Pareto optimal solutions). This procedure is more time-consuming, but it gives the designer information about different solutions and provides them with a better understanding of the problem, which will allow them to make a well-informed decision in the decision-making stage. On the other hand, an optimization problem may have multimodal solutions ([7–9]) or nearly optimal solutions ([10, 11]) that are useful for the designer. These alternative solutions, which have a similar or even the same performance as the optimal solutions, are ignored in a traditional multiobjective approach, although they offer interesting and useful information to the designer. In order not to lose this valuable information, there must be taken into account not only the optimal solutions but also the nearly optimal ones ().
Including the multimodal and nearly optimal solutions in the decision-making scenario increases the number of alternatives available to the designer. However, this poses two issues. On the one hand, the number of solutions will increase considerably and this may slow the algorithm to the point of becoming inoperative. On the other hand, having a huge amount of alternatives to choose makes the decision-making process more complicated. For this reason, it is important in most cases to reduce, in some way, the number of solutions. The way in which this reduction is performed will depend on the criterion or approach that the designer decides to adopt, which, in turn, depends on the application or the context. The existing algorithms usually perform this reduction by means of a discretization of the objective space. But this discretization is carried out without taking into account any consideration of the location of the solutions in the parameter space. The result of proceeding in this way is that nearly optimal solutions which have distinct characteristics could be neglected. Despite this drawback, this method is suitable for some applications.
One possible approach, however, and the one that we assume here, is the following: (1) finding a manageable number of solutions (so that the two issues mentioned above can be avoided) but, at the same time, (2) without neglecting the existing diversity in the characteristics of the nearly optimal solutions. We think that this approach is common in many applications and the existing algorithms do not serve in this case, precisely because they do not pay attention to the characteristics of the solutions. This is why, in this work, we have developed an algorithm to provide the designer with those solutions, among all the nearly optimal solutions, that fulfill that particular criterion, namely, the nondominated in their neighborhood. These solutions are the information that our designer finds more relevant. For example, a designer adopting this approach will not want to be given two nearly optimal solutions which have similar characteristics (neighboring solutions), if one of them is dominated by the other (i.e., worse for at least one of the objectives and not better for the rest), as he or she will logically choose the nondominated one. However, if these two solutions have significantly different characteristics (i.e., they are nonneighboring solutions), then both of them will be interesting for the designer. If the designer has these solutions available, they can analyze them a posteriori (for example, by including new indicators or by considering the physical sense of the solutions), in order to decide which one is the most suitable. From now on, we will call these solutions potentially useful solutions. This does not mean that the rest of nearly optimal solutions are useless, but simply that they are dispensable for a designer who has adopted the approach specified before. These potentially useful solutions are the ones that according to his or her criterion gives them the most relevant information, the information that they want to get.
An example where the specified approach could be the chosen one and, therefore, it is desirable to find the nearly optimal solutions nondominated in their neighborhood, is when one of the objectives cannot be included in the optimization process due to its high computational cost. In this case, it is possible to exclude that objective from the optimization process and to obtain the set of optimal solutions for the rest of objectives. When the optimization process has finished, the excluded objective can be evaluated (only at the optimal solutions previously obtained) and incorporated into the decision-making process. The problem here (if a classical MOP approach is followed) is that, since the excluded objective is ignored in the search, there will be interesting solutions (those with a good performance for the excluded objective and with similar performance for the rest of objectives) which will be missed. The new approach that we are proposing in this paper finds them. Another situation where it may also be interesting to examine nearly optimal solutions nondominated in their neighborhood happens when several objectives are aggregated, which is a way of defining a priori preferences. When a MOP has a lot of objectives, the decision-making stage becomes too complicated. That is why sometimes several objectives are aggregated in groups (for example, performance, cost, and robustness), in order to simplify the problem. However, this approach could miss valuable solutions for the designer. This loss of information can largely be avoided by finding the nearly optimal solutions nondominated in their neighborhood.
So, finding the nearly optimal solutions nondominated in their neighborhood solves the lack of information that typically appears in a classical multiobjective approach. Moreover, taking into account these solutions has two additional advantages: (1) it enables the designer to reassess the design objectives—if there are nearly optimal solutions which display certain characteristics that are absent in the optimal ones, this could mean a poor formulation of the objectives—and (2) it detects the existence of overparameterization (i.e., when the number of parameters is higher than needed, which could result in a man-made multimodality).
In this work, a new algorithm is presented, nevMOGA, which is aimed at finding the sets of optimal solutions and nearly optimal solutions nondominated in their neighborhood. There exist several algorithms in the literature that consider nearly optimal solutions as well ([13, 14]). However, they use these solutions simply as a means to compute an approximation of the Pareto front, whereas nevMOGA discriminates them and returns some of them (the nondominated in their neighborhood) as its outcome. nevMOGA is based on an algorithm called evMOGA, which is described in . This new algorithm includes additional parameters to define when a solution is considered to be a nearly optimal solution and the size of a neighborhood.
In this section, the concepts of multiobjective optimization problem and Pareto set are formally introduced, and the problem which we are dealing with is graphically explained (i.e., the problem of finding optimal and nearly optimal solutions nondominated in their neighborhood).
A multiobjective optimization problem (A maximization problem can be converted into a minimization one. For each of the objectives that have to be maximized, the transformation: can be applied.) can be stated as follows: where is defined as a decision vector in the domain and : is defined as the vector of objective functions . and are the lower and upper bounds of each component of .
Definition 1 (dominance ). A decision vector is dominated by any other decision vector if for all and for at least one , . This is denoted as
Definition 2 (Pareto set). The Pareto set (denoted by ) is the set of solutions in which are not dominated by another solution in :
Definition 3 (Pareto front). Given a set of Pareto optimal solutions , the Pareto front is defined as
Figure 1 shows an example to clarify what we mean by optimal solutions, nearly optimal solutions, and nearly optimal solutions nondominated in their neighborhood in a monoobjective (Figure 1(a)) and a multiobjective (Figure 1(b)) problem. Let us consider the monoobjective case. In theory, all the nearly optimal solutions (grey lines) could be interesting for the decision-maker. However, considering all these solutions makes the decision-making stage too difficult. Moreover, many of these solutions () will have very similar characteristics to the optimal one (), since they are close to it in the parameter space. This, in addition to the fact that they have a worse performance than the optimal one, leads us to consider them, under the criterion assumed, less relevant and in some cases dispensable. The rest of them () will have similar characteristics to , which is the best in and, therefore (always under the specified criterion), they could be dispensable. Consequently, the solutions which provide more relevant information (potentially useful solutions) are and , i.e., the optimal solution and the nearly optimal solution nondominated in its neighborhood. Likewise, in the multiobjective case, all the nearly optimal solutions (grey area) could be considered but, again, taking into account all of them would significantly complicate the decision-making stage while most of these solutions do not add much relevant information to the process. So, the sets (optimal solutions) and (nearly optimal solutions nondominated in their neighborhood) provide the most valuable information to the designer without unnecessarily complicating the decision-making process.
An example of the usefulness of finding the nearly optimal solutions nondominated in their neighborhood is when one of the objectives cannot be included in the optimization process due to its high computational cost. The usual way to cope with this difficulty consists of solving the MOP for the rest of objectives and then computing the excluded one only for the optimal solutions. The problem with that (if a classical MOP approach is followed) is that there will be nearly optimal solutions with an outstanding performance with respect to the excluded objective (and therefore worth considering) that will be ignored. On the other hand, if one is not willing to miss all those interesting solutions and decides to take into account all the nearly optimal solutions, then the time-consuming objective will be evaluated at a vast number of solutions, many of which are not even worth considering, as they will have similar characteristics to the optimal solutions (because they are neighboring solutions). In summary, approaching this kind of problems with a classical strategy leads to either neglecting potentially useful solutions or having to evaluate a time-expensive objective at many dispensable solutions, whose inclusion in the decision-making stage will, furthermore, unnecessarily complicate the decision. With our new approach, these problems can be overcome. Let us look at it with a concrete example.
Let us suppose a MOP with three objectives to minimize. Table 1 shows five possible solutions to the problem ( to ). In a classical MOP, all these solutions belong to the Pareto set (except either or , as they are multimodal solutions and, therefore, one of them would be discarded). Now, assume that the objective is computationally impracticable, i.e., its incorporation into the optimization process is impossible. For this reason, the search stage has to be solved for the first two objectives and then will be evaluated only at the solutions previously obtained. First, note that in a classical MOP, only the Pareto front is obtained, so either or (multimodal solutions) and would be discarded, although , , and seem to be good solutions, since they are nearly optimal solutions in different neighborhoods and thus worth considering.
With our new approach, , , and would also be found, because they are nondominated in their neighborhood; in other words, they have significantly different parameter values. would be discarded since it is dominated by and they are neighboring solutions, i.e., they are expected to have similar characteristic. has a better performance in than ; however, this improvement is insignificant (if the neighborhoods are well defined) as they are very similar solutions (neighboring solutions). Through this new procedure, from all the neighboring solutions, only those with the best performance over the objectives considered in the optimization process (in this example, and ) would be selected. The solution displays a better value for than and (which are in different neighborhoods) but worse values for and (this is why it is a nearly optimal solution) and, therefore, in a classical MOP, it would be discarded, although its analysis may be interesting for the decision-maker. Logically, would be ruled out since it is much worse for and , although it has a better value for . With the proposed approach, is only evaluated at , , and but not at ( is preferred instead) and (which is discarded, since it is not a nearly optimal solution) and, consequently, the computational burden is alleviated. Therefore, thanks to this new approach; it is possible to reduce the computational cost while obtaining the potentially useful solutions for the designer.
Another example that shows the usefulness of obtaining also the nearly optimal solutions nondominated in their neighborhood can be found when several objectives are aggregated, which is a common procedure when there are a great number of them. Let us suppose that a MOP originally has four objectives to minimize. Let be to (Table 2) the four solutions to this MOP. In a classical MOP, all these solutions would be selected to form the Pareto set.
If the designer wanted to reduce the number of objectives, a possibility would be to add and to obtain () and and to obtain (). But now, only or (multimodal solutions) would be the optimal solution (in a classical MOP). Let us suppose that the algorithm chooses and discards . In this case, and are discarded, although they are interesting solutions for the designer. However, if the algorithm keeps also the nearly optimal solutions nondominated in their neighborhood, and would not be excluded with the aggregated objectives. Finally, presents a poor performance for both and (clearly dominated) and, therefore, it does not seem to be a good solution.
In conclusion, when the objectives are aggregated, a classical MOP finds only one solution from the three that are interesting and the other two are missed. Therefore, finding the nearly optimal solutions nondominated in their neighborhood allows the designer to aggregate objectives (and, in this way, simplifying the decision-making stage) without losing potentially useful solution.
In summary, as seen in the examples, the multimodal solutions and the nearly optimal solutions which are significantly different from those which dominate them are solutions that can be potentially useful for the designer during the decision-making stage. This additional information is not provided to the decision-maker when these kinds of MOPs are approached in a classical manner.
3. Materials and Methods
In this section, we present in detail a novel algorithm, nevMOGA, which is the main contribution of this work, as well as the metric and benchmarks that have been used to demonstrate its correct functioning. Let us begin with the new set of interest and its discretization, which is performed by nevMOGA.
3.1. New Set of Interest
The nevMOGA algorithm is aimed at finding not only the optimal solutions but also the nearly optimal solutions nondominated in their neighborhood. In the following, both sets of solutions are defined.
Definition 4 (-dominance ). Define as the maximum acceptable performance degradation. A decision vector is -dominated by another decision vector if for all and for at least one , . This is denoted by .
Definition 5 (-Pareto set ). The -Pareto set (denoted by ) is the set of solutions in which are not -dominated by another solution in :
Definition 6 (neighborhood). Define as the maximum distance between neighboring solutions. Two decision vectors and are neighboring solutions () if for all .
Definition 7 (-dominance). A decision vector is -dominated by another decision vector if they are neighboring solutions (Definition 6) and . This is denoted by .
Definition 8 (-Pareto set). The -Pareto set (denoted by ) is the set of solutions of which are not -dominated by another solution in :
The set (optimal solutions and nearly optimal solutions nondominated in their neighborhood) is the new set of interest, i.e., what nevMOGA has to find. In order to clarify its nature, we will use a graphical example (see Figure 2). In this example, the set of interest is formed by the union of and . is a set of solutions nondominated in their neighborhood which belong to the set . is a set of solutions nondominated in their neighborhood which do not belong to the set . is not a nearly optimal solution, and, therefore, it will be discarded since it does not have the desirable performance (). is dominated (i.e., dominated by a neighboring solution) by and, as a result, this alternative provides less relevant information and, for this reason, is discarded. is dominated by nonneighboring solutions but not by neighboring solutions, so it belongs to , and the same can be said of any solution in . belongs to the optimal set . In summary, nevMOGA searches for the set , in our example, .
The set clearly depends on the values given to the new parameters (Definition 4) and (Definition 6). Given that in most problems, the objectives have a physical sense, it is possible to decide a priori how much performance we are willing to lose and to define consequently. Similarly, when the decision variables have a physical sense, the parameter can also be set intuitively. By setting , the designer establishes to what extent the two solutions are considered “similar”; in other words, setting defines the neighborhood.
If the decision variables do not have a physical sense and it is not evident how to choose , it is still possible to follow a simple procedure (see Figure 3) which will help us to do it if has been previously set. First, a reference solution is chosen. Second, in the objective space, a rectangle is defined with the center at and twice in width and in height. Then, starting from (in the search space), the value of each variable decision is increased and decreased independently, until the solution leaves the rectangle which was previously defined. Finally, each element of the vector neighborhood is set to the minimum distance between and the first solutions (one for each direction) which left the mentioned rectangle ( and in Figure 3). Through this procedure, we manage to quantify what excursion is needed in the search space in order to get significant changes in the objective space and, in this way, we are able to define when no other criterion is available.
3.2. Discretization of the New Set of Interest
Another issue is that may contain infinite solutions, so it is necessary to discretize it and to obtain a finite set of solutions () which are adequately distributed. In order to carry out this task, the concepts of box, _box, and their associated dominances have to be defined beforehand.
Definition 9 (box ). Let , with for all . Given a decision vector , is defined as the vector , where for , and where and are the maximum and minimum values of .
Definition 10 (box dominance ). Given two decision vectors and whose boxes are and , respectively, is said to box dominate , , if for all and for at least one , .
Definition 11 (-box dominance). Given the neighboring decision vectors and whose boxes are and , respectively, is said to -box dominate , , if .
The algorithm only keeps non--box-dominated solutions. A box cannot contain two neighboring solutions. When a neighboring solution is found in the same box, the nearest solution to the ideal corner (the lower left one) will be chosen (see Figure 4, and ) and the rest will be discarded. Figure 4 shows how the algorithm performs the discretization. is -box dominated (i.e., box dominated by a neighboring solution) by and, therefore, it is discarded. is box dominated but not -box dominated (since and are nonneighboring solutions), so it is an alternative, as it is a nearly optimal solution. , , and are in the same box. Two neighboring solutions cannot be in the same box, so either or has to be eliminated. In this case, is eliminated because it has a greater distance to the ideal corner than (). is a good alternative because it is a nearly optimal solution, it is not -box dominated and it does not belong to either the neighborhood of or .
The algorithm searches a discretization of the set of interest. This new discrete set is . The maximum number of solutions (in the worst case) of is (see 7). Similarly, and are the discrete sets of and , respectively.
3.3. Description of nevMOGA
The algorithm nevMOGA is based on the algorithm evMOGA . The main difference between them is that nevMOGA computes an additional population, namely, the set of nearly optimal solutions nondominated in their neighborhood (subfront()). Its main characteristics are the following: (i)It is an elitist algorithm with two archives where the individuals of and are stored. It has also an additional, auxiliary population formed by the new individuals which are created during the process(ii)It is a real-coded algorithm and uses crossover (extended intermediate crossover) and mutation (random Gaussian distribution) operators
The algorithm manages four populations (see Figure 5): (1): by means of the population , the search space is explored, with the aims of obtaining the solutions of and having diversity in the set of solutions. The number of individuals in this population is constant and equal to (2)Front is the archive where is stored, i.e., a discrete approximation of the Pareto front. The size of this population varies but is always less than or equal to a given maximum size which depends on the number of boxes previously defined by the user(3)Subfront is the archive where is stored, i.e., the nearly optimal solutions nondominated in their neighborhood. Its size is variable but bounded, depending on the number of boxes(4) is an auxiliary population where the new individuals generated by the algorithm in each iteration are stored. The number of individuals of this population is , which must be multiple of 4
Now that the archives front and subfront have been presented, it is possible, with the help of the prior definitions, to establish the conditions that a solution must fulfill in order to enter them. This is the aim of the following two additional definitions:
Definition 12 (inclusion of in front). Given a solution and the archive front, will be included in front if and only if where is the distance from to the ideal corner (lower left) of the box (see Figure 3, and ). Additionally, if is included in front, then all the solutions that fulfill the following condition will be eliminated from front:
Then, it will be determined whether any of these solutions which have been eliminated from front is included in subfront or not, by applying Definition 13. Furthermore, all the solutions that fulfill the following condition will be eliminated from subfront:
Definition 13 (inclusion of in subfront). Given a solution such that , will be included in subfront if and only if
Additionally, if is included in subfront, then all the solutions that fulfill the following condition will be eliminated from subfront:
In order to maintain the diversity of solutions in , it is necessary that the population be diverse too. For this, is permanently ordered by using the niche count, which is an indicator of how densely populated a solution is in (Definition 14). In this way, during the processes of selection and substitution, the solutions in the less populated areas have a higher probability of being chosen to generate new individuals, whereas those in the more populated areas have a higher probability of being substituted.
Definition 14 (niche count ). where is the sharing function, which gives a measure of how similar two elements in a population are and is defined by
The algorithm pseudocode is the following:
Lines 3, 6, 8, 10, 13, 14, and 15 of the previous pseudocode constitute the changes which have been added to evMOGA, the algorithm on which ours is based. Next, the most important steps in nevMOGA are explained: (i)Lines 7 and 12 (Algorithm 2). Here, it is determined whether a new individual is included in or not. For this, the individual has to fulfill Definition 12. If the new individual is finally included in front, then it is analyzed whether its inclusion eliminates other individuals from front (in which case it will be established whether they must be included in subfront) and from subfront(ii)Lines 8 and 13 (Algorithm 3). Here, it is determined whether a new individual which was not included in front has to be included in subfront or not. For this, the individual has to fulfill Definition 14. If the new individual is included in subfront, then it is analyzed whether its inclusion eliminates other individuals from subfront(iii)Line 10 (Algorithm 4). The population is created by using the following procedure: (1)Two individuals are chosen at random, one from front () (where any individual has the same probability of being chosen) and another from () (where each one has a different probability of being chosen). In effect, the solutions of are ordered according to their niche count (from least to greatest) and its individual is chosen according to an exponential distribution, which makes it more likely for a solution with a lower niche count to be chosen. This procedure favors the uniform distribution of the solutions(2)A random number is generated. If (probability of crossover/mutation), then a crossover will be performed (step 3); otherwise, a mutation will be performed (step 4)(3) and are crossed by means of an extended intermediate crossover operator, which produces two new individuals and : The parameter is a random value uniformly distributed which belongs to interval , and is a parameter which is adjusted by using an exponential decreasing function, as in simulated annealing : Figure 6 shows from to , where and .(4) and are mutated by using a random mutation with Gaussian distribution: where the variances and are expressed as a percentage of . These variances are calculated by a function which is similar to the one previously used for the parameter : (5) and are included in These five steps are repeated again from step one but now with a solution from subfront instead of one from front. The whole process is executed again and again until is full. In order to set the parameters which appear in the equations of the steps 3 and 4, the default values suggested by  for the original algorithm (evMOGA) are taken.(iv)Line 14 (Algorithm 5). In the updating of , it is determined whether a new individual from () substitutes any other existing individual in or not. The search for the individual to be substituted () starts from an individual chosen by an exponential distribution which is the inverse of the one used for the choice of in the creation of (line 10, Algorithm 4) and, therefore, now the more populated solutions will be more likely to be chosen. This is aimed at achieving a more uniform distribution of the solutions. During this updating, there may occur three distinct cases: (1)The new individual is dominated by some member of front. In this case, an individual dominated by will be searched to be substituted(2)The new individual is not dominated by any member of front, and there exists an individual dominated by which is dominated by some member of front. In this case, it will be substituted(3)The new individual is not dominated by any member of front, and there does not exist an individual dominated by which is not dominated by any member of front. In this case, the initial solution is chosen. If and are neighboring solutions, a solution dominated by will be searched to be substituted. If they are not, a solution dominated by will be searched to be substituted. In this last case, if that solution is not found, will be randomly substituted by a solution from the neighborhood of
3.4. Performance Metrics
Next, we present one metric which will serve to assess the performance of the different algorithms to be compared. This metric measures both the convergence and the diversity between two sets (the outcome set and the target set). The outcome set of each algorithm is the set of solutions that it returns, whereas the target set results from discretizing the search space with a fine grain. We use this metric to measure the performance of the algorithms in the objective space (convergence toward the front) and in the parameter space (diversity of solutions).
3.4.1. Averaged Distance Hausdorff
A single indicator is used to measure the convergence and diversity between two sets (averaged distance Hausdorff , ). The sets and have and solutions, respectively. The is calculated by where
This indicator is the average of the Hausdorff distance, and we use it here in order to measure the performance between an outcome set and a target set . This performance is measured both in the objective space and in the parameter space. In this way, it is possible to measure convergence toward the front and diversity of solutions. We use . It is desirable that this indicator has the smallest possible value. Ideally, when , then .
Our algorithm nevMOGA has been tested on two distinct benchmarks. In this section, we describe them.
3.5.1. Benchmark 1
The first benchmark considered in this work corresponds to an academic example  and is stated as follows: where subject to
Setting , , and , this MOP has one global Pareto set: and eight local Pareto sets:
3.5.2. Benchmark 2
The second benchmark is an adaptation of the modified Rastrigin . This benchmark has been modified in order to turn it into a MOP with one optimal set and nearly optimal solutions lying in different neighborhoods. This is its formulation: