Abstract

A multiobjective optimization problem which focuses on parallel machines scheduling is considered. This problem consists of scheduling independent jobs on identical parallel machines with release dates, due dates, and sequence-dependent setup times. The preemption of jobs is forbidden. The aim is to minimize two different objectives: makespan and total tardiness. The contribution of this paper is to propose first a new mathematical model for this specific problem. Then, since this problem is NP hard in the strong sense, two well-known approximated methods, NSGA-II and SPEA-II, are adopted to solve it. Experimental results show the advantages of NSGA-II for the studied problem. An exact method is then applied to be compared with NSGA-II algorithm in order to prove the efficiency of the former. Experimental results show the advantages of NSGA-II for the studied problem. Computational experiments show that on all the tested instances, our NSGA-II algorithm was able to get the optimal solutions.

1. Introduction

The parallel machines scheduling problem is a widely studied optimization problem. This problem considers several available identical machines to execute a set of jobs . Indeed, it can be described as a special hybrid flowshop scheduling problem which has only one stage. Every job is considered with a processing time , a release date , a due date , and a sequence-dependent setup time which means the transfer time between two adjacent jobs and (job is scheduled immediately after job ). Two different objectives are minimized at once in this work: makespan and total tardiness. According to the lawler scheduling classification, the studied problem is defined as [1].

The most studied criteria for scheduling problems is the makespan. It is the completion time of the job which is finished at last (maximum completion time of jobs). Cheng and Sin [2] have proved that the problem of minimizing the makespan on two identical parallel machines is NP hard. Burcker [3] has proved that if the number of machines is greater than two, then the problem is even strongly NP hard. In the literature, many methods are proposed to solve it. Gendreau et al. [4] have proposed a heuristic and a lower bound for the problem. Yalaoui and Chu [5] have proposed a heuristic for . An exact method is also developed to solve it. Gharbi and Haouari [6] have proposed a lower bound and use the branch-and-bound method to solve the problem, where is the delivery time. NΓ©ron et al. [7] have used two branching schemes for the parallel machines scheduling problem with release dates and tails. Zouba et al. [8] present heuristic algorithms to solve a parallel machines scheduling problem with a period-based changing mode operators to minimize the makespan. Fanjul-Peyro and Ruiz [9] have proposed the size-reduction heuristics for the unrelated parallel machines scheduling problem with the minimization of makespan.

The total tardiness is another concerned criterion which is the sum of tardiness of all jobs. The parallel machines scheduling problem to minimize the total tardiness is known as NP hard according to Koulamas [10]. Some exact methods are proposed to solve this problem. Azizoglu and Kirca [11] have proposed a branch-and-bound method to the parallel machines problem with total tardiness minimization. Yalaoui and Chu [12] have also developed a lower bound and a branch-and-bound to solve the same problem. In the work of Shim and Kim [13], they developed dominance properties and a lower bound for the same problem, and a branch-and-bound method is adopted to solve it. In 2008, Shim and Kim [14] have used a branch and bound algorithm for solving a parallel machines scheduling problem with a job splitting property. Nguyen et al. [15] proposed a new dynamic programming formulation for solving this problem. Moreover, many heuristics have been developed, such as of Pritsker et al. [16], Alidaee and Rosa [17], Baker and Bertrand [18], and Lamothe et al. [19].

Many approximated methods have been proposed to solve the multiobjective optimization problems (MOPs). In 1985, Schaffer [20] proposed an algorithm named VEGA (vector evaluated genetic algorithm) to solve the multiobjective optimization problem. It was the first method which is based on the simple genetic algorithm to solve MOPs. Kursame [21] has proposed the VOES method (Vector-Optimization Evolution Strategy) in 1990. Hajela and Lin [22] have proposed the HLGA (Hijela’s and Lin’s genetic algorithm) in 1993, but these three algorithms are non-Pareto approaches. In the multiobjective optimization problems, there is no single optimal solution, but a set of nondominated solutions. They are called the Pareto optimal solutions. Recently, the Pareto-based approaches are interested by the researchers. Multiple Objective Genetic Algorithms (MOGA) [23], Niched Pareto Genetic Algorithm (NPGA) [24], Nondominated Sorting Genetic Algorithm (NSGA-II) [25], Strength Pareto Evolutionary Algorithm (SPEA-II) [26], Pareto Archived Evolution Strategy (PAES) [27] and Pareto envelope-based selection algorithm (PESA) [28] are proposed for solving MOP and widely used for scheduling problems.

The parallel machines multiobjective scheduling problem is NP hard in the strong sense. Therefore, approximated methods are widely proposed for solving it. In 2003, Cochran et al. [29] have proposed a two-stage multipopulation genetic algorithm (MPGA) to minimize three different criteria: makespan, the total weighted tardiness, and the total weighted completion time. Chang et al. [30] have minimized the makespan and the total tardiness for parallel machines scheduling problem in 2005 by means of a two-phase subpopulation genetic algorithm (SPGA). The numerical results demonstrate that the SPGA is better than NSGA-II and MOGA. After that, they have proposed a modified version of SPGA by using a global archive and an adaptive strategy for solving the same problem [31] in 2006. In the works of Baesler et al. [32], a multiobjective optimization algorithm is appropriate to solve the problem. This algorithm is the combination of a genetic algorithm and a local search and is compared with the MOGA and the MOSA for the problem. Furthermore, Baesler et al. [33] have proposed MultiObjective Simulated Annealing with Random Trajectory Search (MOSARTS) for the same problem. Bouibede-Hocine et al. [34] used NSGA-II and SPEA for the problem. Dugardin et al. [35] have used the NSGA-II for solving a hybrid job shop and a parallel machines scheduling problems. Gao et al. [36] have presented a new parallel genetic algorithm for the multiobjective parallel machines scheduling problem which is subjected to a special process constraint. The aim is the minimization of the makespan and the earliness/tardiness penalty. Later, Gao [37] have proposed a novel artificial immune system for solving the same problem. Fang [38] proposed an improved weighted-based multiobjective genetic algorithm for the typical parallel machines scheduling problem. Lian [39] have proposed a united search particle swarm optimization algorithm for the parallel machines problem with different due dates. Chyu and Chang [40] have developed a pareto evolutionary algorithm for a biobjective unrelated parallel machines scheduling problem.

Our main contribution in this paper is to consider a special multiobjective parallel machines scheduling problem taking in consideration additional constraints compared to other previous works such as these of Baesler et al. [32, 33] or that of Bouibede et al. [34]. These additional constraints are the release date, the due date and the sequence-dependent setup times. At our knowledge, no other studies take in consideration all these constraints for multiobjective parallel machines scheduling problem, and thus we may consider that the problem studied in this paper is a specific scheduling problem.

Furthermore, as the problem is NP hard in the strong sense, two approximated methods: the non-dominated sorting genetic algorithm (NSGA-II) and Strength Pareto Evolutionary Algorithm (SPEA-II) are developed in this paper for the first time to solve the studied problem. An integer mathematical model is also proposed for modelling this problem, and we have proposed a special chromosome to encode it. An exact method is applied to demonstrate the efficiency of the proposed methods.

The rest of the paper is organized as follows: Section 2 describes the considered problem and the mathematical model. In Section 3, we develop the NSGA-II and the SPEA-II algorithms. Computational results are shown in Section 4. Our conclusion and perspectives are given in Section 5.

2. Problem Description and Mathematical Formulation

In the parallel machines scheduling problem, a set of independent jobs should be scheduled on identical machines without preemption. Each job has a processing time , a release date and a due date . The sequence-dependent setup times are also considered in this problem. It deals with the setup time when a machine switches the production from job to job (job precedes immediately job on the same machine). Without loss of generality, we have set . We assume that , and it means that if job is scheduled at the beginning on a machine, no setup time is required. All these job data are generated randomly based on a protocol test which is described in Section 4. Some assumption must be respected: each machine can execute only one job at once; each job can be processed only once.

Some notations are defined below::the number of jobs:the number of machines:the index of jobs :the index of machines :the order of job in the machine:the release date of job , :the due date of job , :the sequence-dependent setup times if job is the immediate successor of the job on the same machine.:the processing time of job , :the completion time of job , :the real tardiness of job , , :make span, :the number of jobs assigned to machine .

The problem can be formulated as follows: subject to

Equation (1) represents the objective function, and the goal of our work is to minimize the makespan and the total tardiness. Constraint (2) ensures that only one job can be scheduled at the th job position. Constraint (3) means that each job can be scheduled only once. Constraints (4)–(9) denote the data of jobs which are scheduled at the th job position of th machine position jobs, such as processing times, setup times, release dates, due dates, completion times, and tardiness times. Constraints (10) and (11) explain how to compute the makespan and the total tardiness. Constraint (12) is a decision variable: if job is scheduled on machine in position , then , otherwise 0. Constraint (13) shows that if job is the immediate successor of the job on the same machine, then ; otherwise, .

3. Resolution Methods

In this section, two approximated methods and an exact method are developed to solve our problem. Two well-known algorithms have been chosen: NSGA-II and SPEA-II. The experimental results of these algorithms are compared in the following section.

3.1. Encoding Strategy

A special encoding of the studied problem is presented here. It is adopted for our developed methods: NSGA-II and SPEA-II. A chromosome (see Table 1) with a () matrix is proposed to encode the problem. The elements in each column correspond to the job index, the number of the machine where the job is scheduled, and the position of the job on the machine. It is described hereafter

shows the index of jobs, ,

All feasible solutions of the initial population are randomly generated. A HOT (High priority Order Transition) sequence rule is adopted here. A random value is generated in a uniform distribution for each job, and all the jobs are ranked by the decreasing order of this value. Thereafter, the job with the highest value is served first on the machine which is earliest available.

For instance, Table 1 presents an example for which the Gantt chart is illustrated in Figure 1.

3.2. NSGA-II

The Non-dominated Sorting Genetic Algorithm (NSGA-II) is the second version of the NSGA algorithm. The latter has been widely used to solve multiobjective optimization problems. It was proposed by Deb et al. [25] in 2000 as a fast and efficient multiobjective genetic algorithm to find the non-dominated solutions based on the Pareto dominance relationship. In this algorithm, the procedure begins starting from the creation of an initial population. All the feasible solutions in this population are randomly generated. At each generation, selection, crossover, and mutation operations as well as ranking, the different solutions are realized. These four operations are the main concept of the NSGA-II.

The binary tournament technique is used in the operations of the parents selection. We chose the classic one-point crossover operator in this study, and an operator of reparation is necessary for the created children. In the mutation operator, two columns are randomly chosen and they are interchanged. In the operator of ranking, all the different solutions are assigned to several non-dominated fronts. A computation of the crowding distance has been done to sort the solutions in the same front.

Several parameters are to be set in this algorithm. The population size and the number of generations determine the computational duration. The crossover probability and the mutation probability decide the convergence and the diversity of the results. The parameters setting of NSGA-II is complex. In this work, several tests have been carried out to choose the best parameters of crossover and mutation probabilities. We have tested this problem with a crossover probability equal to and for the mutation probability. The final values of the parameters are chosen according to the measuring criteria which are presented in Section 4. We have set a great crossover probability equal to 0.9 and a small mutation probability equal to 0.1. The population size and the generation number are set to 100. For the comparison with another approximated method, a fixed computing time is considered as the stopping criteria. We have chosen 1 second, 2 seconds, and 5 seconds for the different problems (see Section 4).

The main process of NSGA-II is described in Algorithm 1 [25].

(1) Generate the initial population of size
(2) Evaluate these solutions
(3) Sort these solutions by non domination and crowding distance
(4) Creation of the offspring population with the operators of selection, crossover and mutation
(5) Evaluate all solutions
(6) Sort the solutions of two populations: and
(7) Choose the best solutions for the new population with the remaining steps (ranking into non
 dominated front, crowding distance)
(8) If the stopping criteria is satisfied then the algorithm is stopped, the obtained results are all the
 solutions in the first non dominated front; repeat steps (4) to (7) otherwise

3.3. SPEA-II

The second version of the SPEA (Strength Pareto Evolutionary Algorithm) is presented here, which has been introduced by Zitzler et al. [26]. A regular population and an archive mechanism (Pareto archive) are used in SPEA-II. During the generation, all the non-dominated solutions are copied to the Pareto archive. Compared with SPEA, the Pareto archive size is fixed in this algorithm. The Pareto archive is filled with the non-dominated solutions until the considered archive size. If there are not enough non-dominated solutions, the dominated solutions which have the best objective values are also considered. On the other side, if there are too many non-dominated solutions than the archive size, we only kept the best of them with a decreasing order of distance (see [26]).

We have used the same operations used in NSGA-II for the selection, crossover, and mutation. In the parameters settings of the algorithm, we have chosen the same crossover probability, mutation probability, and size of the population. We have set the archive size to 50 based on the tested results. The main concept of SPEA-II is described in Algorithm 2 [26].

(1) Generate an initial population and an empty Pareto archive , set to 0
(2) Evaluate all the solutions in and
(3) Find the non dominated solutions between and . If there are too many non dominated
 solutions for the archive, calculate the distance Οƒ of the solutions, we only kept the best solutions
 with a decreasing order of distance . In contrary, if there are not enough non dominated solutions in
 the archive, the best dominated solutions must be filled in the archive till this archive is full
(4) If the stopping criteria is satisfied, then stop the algorithm and export the non dominated
 solutions in the Pareto archive as the result
(5) The operators of selection, crossover and mutation are applied for the new population , and
 then return to step (2)

3.4. Exact Method

A full enumeration method is applied here. The aim is to enumerate all the feasible solutions and find the absolute Pareto optimal solutions among them. We have used the same ranking operation of NSGA-II to find the nondominated solutions. Because of the too long computational times, this method can be applied only the small instances. The results of the comparison between the exact method and the NSGA-II algorithm (which has been proved to be better than SPEA-II as shown in the next section) are shown in the following section.

4. Computational Results

4.1. Definition of the Protocol Test

For the computational experiments, a protocol test is developed in which the job data are randomly generated. The job data include the processing times , the release dates , the due dates and the sequence-dependent setup times . In a previous work [41], we have presented a protocol test based on the work of [12, 42]. For each job, the processing times are generated using a uniform distribution []. After the generation of the processing times, the setup times are set to , where is a coefficient randomly generated in [], and . The release dates are generated using a uniform distribution . The value of is considered from [43]. The due dates of jobs are in the interval [, ], where , and we have chosen and between 0.2 and 0.4. Since there are four parameters to set, and two values are proposed to each parameter, we have then 16 different configurations to test, and the index of each configuration is shown in Table 2. For the comparison of two approximated methods, three types of instances have been processed: the problem with 10 jobs and 3 machines, the problem with 20 jobs and 3 machines, and the problem with 50 jobs and 5 machines. The full enumeration method cannot be applied on large problems because of the too long execution times. Therefore, the comparison between the proposed methods and the full enumeration method is realized on the problems with 5, 6, 7, and 8 jobs.

4.2. Measuring Criteria

The comparison of two different Pareto fronts for solving the same MOP is very complicated, since each front is not a single scalar value, but a set of non-dominated solutions. An approximated method involves the following objectives [44]: the distance to the absolute Pareto optimal front is to be minimized, the diversity of the generated solutions is to be maximized and the spread of the obtained front is to be maximized.

Recently, several measuring criteria have been proposed for comparing two non-dominated fronts. Ranjithan et al. [45] have proposed a spread metric which determines the maximum range represented by the non-dominated solutions in the objective space and a coverage metric that characterizes the distribution of solutions. Kumar and Ranjithan [43] have proposed a factor that represents the degree of dominance of non-dominated solutions from two obtained fronts which are produced by two different algorithms. Van Veldhuizen [46] has proposed an error-ratio criterion and a generational distance (GD). The former one represents the proportion of nontrue Pareto solutions in the obtained front. It measures the general progress to the reference set. Zitzler and Thiele [44] have presented a Zitzler measure, a distance is presented by Riise [47]. The two latter measures were adopted in this work, because they do not require to compute the absolute Pareto optimal front (reference set). The number of nondominated solutions of the Pareto front is also considered in our work.

Let and be two Pareto fronts obtained by different methods. and are the number of nondominated solutions in fronts and , respectively. The distance is defined as , where is the distance between solution of and its orthogonal projection on . The value of is negative, when solution is below the front (or the extrapolated front). The distance shows if front is closer to the optimal solutions than front . The lower of the value we find, the better quality of front we will get.

The Zitzler measure has been presented by Zitzler and Thiele [44], where measures the ratio of solutions in which are dominated by at least one solution in . The equation is defined by , where and are the non-dominated solutions in fronts and respectively. When , it means that all of the solutions in are dominated by solutions in . However, when , it means that there are no solutions in that are dominated by at least one solution in . On the other side, measures the ratio of solutions in which are dominated by at least one solution in .

4.3. Results
4.3.1. Comparison of NSGA-II and SPEA-II

First, the experimental results of NSGA-II and SPEA-II are compared. Figure 2 shows an example where 20 jobs should be scheduled on 3 identical parallel machines. The objective values of all the obtained solutions are shown in Table 3. We can observe that the front obtained by NSGA-II dominates the other front obtained by SPEA-II. Indeed, in this example, we have , which means that there is no solution in the NSGA-II front that is dominated by at least a solution from SPEA-II. means that all the solutions of SPEA-II are dominated by at least one solution of NSGA-II front. Finally, we have ; is negative because the NSGA-II front dominates the SPEA-II front.

Tables 4, 5, and 6 show the comparison results of NSGA-II and SPEA-II to solve three different problems with 16 different configurations. The index of the configuration is shown in Table 2. For each configuration, we have tested 10 instances. The mean value of , the minimum value and the maximum value of , the mean values of and are presented. In this work, 480 instances are therefore tested.

The first line of Table 4 is explained. This problem is defined with 10 jobs and 3 machines. Index 1 means that the jobs data are randomly generated with , , , and (see Table 3). The next 7 elements show the comparison results. Since each configuration is tested for ten instances, the value of shows that the NSGA-II has obtained more non-dominated solutions than SPEA-II with respective mean values of 4 and 3.2. The mean value of the Riise distance indicates that the obtained front of NSGA-II dominates the SPEA-II front, since this value is negative. The minimum and maximum values of are both negatives, which means that the Riise distance is always negative during the ten tests. The results show that ; that is, there are no solutions provided by NSGA-II that are dominated by at least one solution of the SPEA-II. On the other side, means that all solutions provided by SPEA-II are dominated by at least one solution of NSGA-II.

In this work, we have chosen a fixed computing time as a stopping criterion for NSGA-II and SPEA-II. One second, two seconds, and five seconds are the execution times that are fixed for the problems with 10 jobs, 20 jobs, and 50 jobs, respectively. Tables 4, 5, and 6 show a comparison between NSGA-II and SPEA-II for the tested instances.

The values of are always negative, which means that the NSGA-II front always dominates the SPEA-II front. The absolute value of distance is not high for small-size problems. However, we can observe that the absolute values of the distance are higher for large-size problems. That means that for small problems, the two fronts of NSGA-II and SPEA-II are both very close to the absolute Pareto front (the efficiency of NSGA-II is demonstrated in the following section). So, the difference between the two fronts is not very obvious, and the number of non-dominated solutions in the obtained fronts are quiet similar. When solving complex problems, the advantage of NSGA-II is obvious. The NSGA-II front is under the SPEA-II front and the distance between the two fronts is larger than before. The NSGA-II has obtained more non-dominated solutions than SPEA-II. The value of is always less than . In fact, the values of are nearly equal to 0, while those of are nearly equal to 1. Based on all the tested instances, the comparison between the different numerical results show the advantages and the efficiency of NSGA-II.

4.3.2. Comparison with the Pareto Optimal Solutions

In this section, the absolute Pareto optimal solutions obtained by an exact method are compared to the solutions of the NSGA-II, which has been proved to be better than SPEA-II. This exact method consists of enumerating all the possible solutions of the problem. Since the full enumeration requires too much execution time, we cannot solve large-size problems with this exact method. In this work, we are able to solve the problem with a maximum number of 8 jobs. The results in Table 7 have shown that the NSGA-II has obtained the same solutions (, ) than the exact method for the tested instances. The main advantage of the NSGA-II is that it requires less computing time (CT NSGA-II) than the full enumeration (CT FE).

5. Conclusion

In this work, an identical parallel machines scheduling problem with release dates, due dates, and sequence-dependant setup times is considered. The goal is to minimize the makespan and the total tardiness. The contribution of this paper is to develop a new mathematical model and two metaheuristics which are the NSGA-II and SPEA-II. The computational results show the advantage of NSGA-II over the SPEA-II on the 480 tested problems based on the adopted measuring criteria. After that, the NSGA-II is compared to the full enumeration for small-size problems. We can observe that NSGA-II is able to obtain the absolute optimal solutions. The full enumeration method cannot solve problems with more than 8 jobs. Hence, the perspective is to develop another exact method to compare the results of the two methods with larger and more complex problems. On the other side, the setting of crossover and mutation probabilities is very difficult. Therefore, it would be interesting to develop a self-adopted GA which can set the probabilities during the generations.