#### Abstract

This paper focuses on the multi-objective optimization of the reentrant hybrid flowshop scheduling problem (RHFSP) with machines turning on and off control strategy. RHFSP exhibits significance in many industrial applications, but scheduling with both energy consumption consideration and reentrant concept is relatively unexplored at present. In this study, an improved Multi-Objective Multi-Verse Optimizer (IMOMVO) algorithm is proposed to optimize the RHFSP with objectives of makespan, maximum tardiness, and idle energy consumption. To solve the proposed model more effectively, a series of improved operations are carried out, including population initialization based on Latin hypercube sampling (LHS), individual position updating based on Lévy flight, and chaotic local search based on logical self-mapping. In addition, a right-shift procedure is used to adjust the start time of operations aiming to minimize the idle energy consumption without changing the makespan. Then, Taguchi method is utilized to study the influence of different parameter settings on the scheduling results of the IMOMVO algorithm. Finally, the performance of the proposed IMOMVO algorithm is evaluated by comparing it with MOMVO, MOPSO, MOALO, and NSGA-II on the same benchmark set. The results show that IMOMVO algorithm can solve the RHFSP with machines turning on and off control strategy effectively, and in terms of convergence and diversity of non-dominated solutions, IMOMVO is obviously superior to other algorithms. However, the distribution level of the five algorithms has little difference. Meanwhile, by turning on and off the machine properly, the useless energy consumption in the production process can be reduced effectively.

#### 1. Introduction

Hybrid flowshop scheduling problem usually involves several stages, each of which contains a certain number of parallel machines and each job passes through all the stages only once in sequence. However, in some special industries, a job needs to access some stages more than once, such as semiconductor wafer manufacturing and TFT-LCD (thin film transistor liquid crystal display) panel manufacturing. RHFSP has been proved to be NP-hard [1], making it difficult to be solved by traditional methods. Therefore, it is of great theoretical and practical value to carry out research on the efficient intelligent optimization algorithm for this problem.

Since Graves and Meal et al. [2] first studied the reentrant scheduling problem in 1983, great progress has been made. Although real-world RHFSP is multi-objective by nature, many researches focus on the RHFSP with single-objective. Bertel and Billaut [3] studied the RHFSP with parallel machines and proposed a genetic algorithm to minimize the weighted number of delayed jobs. Pearn and Chung et al. [4] investigated the integrated circuits final test scheduling problem with reentrant and proposed three network algorithms to minimize the total machine load. Choi and Kim [5] proposed several improved heuristic algorithms, such as NEH and SO, to minimize the maximum completion time and compared them with the SA algorithm. The experimental results showed that the heuristic algorithms can produce better solutions in the relatively short time. Kim and Lee [6] studied the RHFSP of uncorrelated parallel machines at each stage and proposed CDS and NEH heuristic algorithms to minimize the maximum completion time under certain delay constraints. Choi and Kim et al.[7] considered the two-stage RHFSP and minimized the completion time using the branch and bound algorithm under the constraint of maximum allowable due date. El-Khouly and El-Kilany et al. [8] proposed an optimization method for reentrant production workshop using Lagrangian decomposition with the objective of minimizing the total delay. Hekmatfar and Fatemi Ghomi et al. [9] proposed some heuristic algorithms and random key genetic algorithms (RKGA) with the objective of minimizing the maximum completion time and compared them with the hybrid genetic algorithm (HGA). Experimental results showed that the HGA was superior to other algorithms. Lin and Lee et al. [10] combined the analytical hierarchy process (AHP) and the genetic algorithm to deal with multi-objective RHFSP and applied them in a repair shop. The results showed that it is better than manual scheduling. Chen and Pan et al. [11] applied the hybrid tabu search algorithm to study RHFSP with the objectives of minimizing the maximum completion time and compared the mixed tabu search algorithm with integer programming techniques. Wu and Liu et al. [12] studied the reentrant scheduling problem considering learning effects with the objective of minimizing the maximum completion time.

In recent years, some scholars have studied the multi-objective RHFSP. Cho and Bae et al. [13] proposed a Pareto genetic algorithm to solve the biobjective RHFSP, which included the crossover operation based on Minkowski distance and local search strategy with the objectives of minimizing the maximum completion time and total delay time. Choi and Kim et al. [14] studied the real-time dynamic RHFSP with multiple optimization objectives, including maximum system output, minimum average running time, minimum average delay time, and minimum number of total lost jobs using a real-time scheduling mechanism based on decision tree which is applied to a real TFT-LCD panel production line at last. Ying and Lin et al. [15] proposed an efficient iterative Pareto greedy (IPG) algorithm based on the research in [13]. Shen and Wang et al. [16, 17] proposed an improved teaching and learning optimization algorithm and a Pareto discrete harmony search algorithm to solve the biobjective RHFSP.

At present, energy saving plays an increasingly significant role in manufacturing industries, especially energy-intensive industries. Optimizing production schedule helps to reduce unnecessary energy consumption. Luo and Du et al. [18] proposed a novel ant colony optimization algorithm for the hybrid flowshop scheduling problem considering energy consumption cost. Mansouri and Aktas et al. [19] solved the flowshop scheduling problem considering maximum completion time and total energy consumption by mixed integer multi-objective programming model and heuristic algorithm. Lei and Zheng et al. [20] proposed a hybrid frog leaping algorithm to solve the flexible flowshop scheduling problem considering energy consumption with the objective of minimizing the workload balance and total energy consumption. Wang et al. [21] used the NSGA-II algorithm to solve the problem of identical parallel machine scheduling with the objectives of reducing total energy consumption and makespan. Liu and Guo et al. [22] addressed a novel integrated green scheduling problem of flexible job shop and crane transportation to reduce the comprehensive energy consumption of the machining process and transportation process using a mixed integer programming model. Yildirim and Mouzon [23] proposed a mathematical model to minimize energy consumption and total completion time of a single machine, furthermore, introducing the dominance rules and a heuristic to increase the speed of the proposed multi-objective genetic algorithm. In [24], the single-machine scheduling problem with the objective of minimizing the energy cost under the change of energy price was studied; experiments show that energy consumption can be reduced by using the machine turning on and off control strategy.

In the current context of rising energy prices and increasingly stringent environmental concerns, it is particularly important to reduce energy cost and achieve green and sustainable development for the manufacturing industry. According to the latest data released by the State Energy Administration and China Telecom Federation, the energy consumption of manufacturing industry accounts for about one-third of the total energy consumption. However, both purchasing more energy-efficient equipment and building more energy-efficient production lines require huge financial investments. The benefits could not be easily enjoyed by most of manufacturing companies, especially those small and medium sized enterprises. In practice, it was observed that in an eight-hour shift, the bottleneck machines still stay idle 16% of the time on average. If the machines are turned off during the idle periods, 13% energy will be saved [25]. So it is significant for saving energy to turn off machines when they are idle for a certain amount of time. However, the research on energy consumption of RHFSP only focuses on the total energy consumption or idle energy consumption and seldom combines the scheduling problem with the machine turn on and off strategy. A scheduling model considering the machine turning on and off control strategy is proposed in this paper. It breaks the assumption that machines will not stop until all the jobs have been processed in RHFSP and reduces the useless energy consumption.

The remainder of this paper is organized as follows: the RHFSP is described with the makespan, max tardiness, and idle energy consumption objectives in Section 2. In Section 3, the proposed IMOMVO algorithm is introduced in detail, using an existing benchmark to present the procedure of the encoding and PS decoding. The influence of parameter settings on the IMOMVO algorithm is investigated and comparisons with other algorithms for simulated data are shown in Section 4. Finally, in Section 5, some conclusions and future work are discussed.

#### 2. Problem Description

The RHFSP addressed in this study can be described as follows [17, 26, 27]:* n* jobs need to be processed in* s* stages in sequence. There are identical parallel machines in the stage* i*, and each job can be processed on any machine in the corresponding stage. Due to the workflow requirement, a job may access some stages more than once. The objective is to determine the allocation of jobs to parallel machines at each stage, as well as the sequence of the jobs assigned to each machine in order to find the no-dominated Pareto set. Using the three-field notation introduced by Pinedo [28], the RHFSP is expressed as* FFC ** C*_{max},* T*_{max},* SEC*_{min}.

Additionally, the following assumptions are made: all the jobs and machines are ready at zero time; at any time, each machine can process at most one job, and each job can only be processed by at most one machine; all the jobs do not affect each other; the number of re-entrance of each job, the processing time of each operation, and the production route of each job through the shop are known in advance; the buffer capacity between any two consecutive processing stages is infinite; preemption is not allowed, and once the job is processed, it cannot be interrupted; the machine failure and the machine adjustment time are not considered.

##### 2.1. Symbol Definition

The mathematical symbols involved in the model and their meanings are as follows: : number of jobs : index for jobs, : number of stages : index for stages, : number of identical parallel machines in stage : index for machines in stage , : number of operations for job : index of operations for job : the operation of job : processing time of : a large number : set of operations that are processed in stage : starting time of : due date of job : completion time of job : energy consumption of machines per unit time at processing jobs time : energy consumption of machines per unit time at idle time : turning on and off time of machines : energy consumption of machines at the turning on and off time : the critical time to choose to shutdown machines : number of machines in all stages, : index of machines, ; index of machine : number of operations on machine : starting time of operation on machine : ending time of operation on machine = , if turns off the machine after an operation; 0, = , if* O*_{jk} is processed on the machine* l* in stage* i*; 0,

##### 2.2. Machine Turning On and Off Control Strategy

It is generally assumed in RHFSP that the machine stops until all the jobs have been processed. Inevitably, machines will be idle during the waiting time. Kordonowy [29] analyzed the energy consumption of machine tools. It is observed that only 19.2%-65.8% of the energy is consumed in machining time. Gutowski [30] found that most of the energy is consumed while the machine is idle in the Toyota Motor Corporation. Turning off the machine in idle time can save this part of energy consumption. However, it will take considerable time and energy to turn on and off machines. So we need to switch the machine at the right time. In general, energy consumption behaviors can be conceived as cyclic processes. Take turning shop as an example, the cyclic power graph is shown as Figure 1 [31].

From Figure 1, it can be seen that energy consumption cycle can be obviously simplified into two phase. One is the idle time which is waiting for tasks, and another is the processing time which is working on tasks. Energy consumption depends on the working state of machines. It is assumed that the energy consumption per unit time is constant when the machine is idle. When the machine is in shutdown state, no energy consumption will be generated. The machine state transition diagram for an energy consumption cycle is shown in Figure 2. The text in brackets on the arc represents the transition time and energy consumption required for the transition between different states. Machine turning on and off decision variables are used to judge the state of a machine after completion of machining. We need to determine after which operations the machines should be turned off.

First of all, if turns off the machine after an operation, the conversion time should be less than the interval between the start processing time of the next job and the completion time of it, that is to say, the constraint (1).

Secondly, the energy consumption of starting machine is greater than that of idling state. When choosing to shut down the machine, the interval between adjacent jobs should be greater than a critical time length, that is, the constraint (2) should be satisfied.

Supposing , the above two constraints can be combined into the following form (3):

##### 2.3. Mathematical Programming Model

Based on the existing literature [14, 15, 32], a mixed integer programming model for three-objective of is presented, denotes the makespan, denotes the maximum tardiness time, and denotes the idle energy consumption.

*Objective Functions*

In , because of the existence of product term , the model is a more complex nonlinear model, so it needs to be transformed into a linear model.

Plugging into formula (6), then the nonlinear term is removed and the model is transformed into a linear form as formula (8):

Subject to

Equations (4), (5), and (8) represent the three-objective functions of the optimization problem: makespan, maximum tardiness, and idle energy consumption (turning on and off energy consumption and idle energy consumption). Constraint (9) ensures that the starting time of operation is not earlier than the end time of operation . Constraint (10) ensures that each operation can be processed on only one machine in the required stage. Constraints (11), (12), and (13) ensure that each machine processes at most one operation at a time; Equation (14) is the nonnegative restriction for and constraints (15) and (16) define the maximum completion time target.

In a three-objective optimization problem, according to the definition of dominance [33], solution dominates (), if and only if . A solution is the Pareto optimal solution (or nondominant solution) if and only if there is no solution that can dominate it. For the above RHFSP, it aims to obtain the Pareto optimal solutions or the Pareto front.

#### 3. The Proposed IMOMVO Algorithm

Although many intelligent optimization algorithms have been used to solve the RHFSP, “No Free Lunch Theorems” [34] indicated that no algorithm can solve all the optimization problems. One algorithm is only targeted at certain specific problems and not necessarily effective for other problems.

The Multi-Verse Optimizer (MVO) [35] is a novel heuristic algorithm proposed by Professor Mirjalili in 2015, which divides the search process into two aspects: exploration and development. There is always high possibility of moving objects from a universe with high inflation rate to a universe with low inflation rate. In order to maintain the diversity of universes and perform exploitation, each universe has wormholes to transport its objects through space randomly. In 2016 Mirjalili et al. [36] proposed a Multi-objective Multi-verse Optimizer (MOMVO) algorithm. The search mechanism of MOMVO is very similar to MVO, but external archive is introduced to store the best non-dominated solutions so far. In order to further improve the quality of the solutions, the MOMVO algorithm was improved from three aspects: In many algorithms, the initial population is generated in a completely random manner. Since the number of populations is much smaller than the solution space, random solutions may be concentrated in a certain local region, which is not conducive to expanding the search space and converging towards the global optimal solution. This paper adopted the Latin hypercube sampling technology to initialize the population. Lévy flight was introduced into MOMVO. Both short-distance and long-distance movement are used to update the individual position, which can improve the global search ability of the algorithm. Selecting partial solutions from external archives and updating the individual positon using logical self-mapping can improve the local search ability of the algorithm.

##### 3.1. Coding and Decoding

MVO algorithm is mainly used to solve optimization problems of continuous functions, so it cannot be directly used to deal with discrete optimization problems. In this paper, we adopt random key coding. Each gene is represented by random numbers within . To get a feasible schedule, a reasonable decoding method is required to determine the processing sequence of all jobs at each stage and arrange suitable machines for the operations. Then the start time and completion time of all operations can be calculated. At last the three objectives can be obtained. In this study, we decode it according to PS method in reference [13], generate scheduling schemes according to various constraints, and ensure that the arbitrary arrangement of individuals is always a feasible scheduling. With the PS method, all jobs are allocated according to the job permutation. When it allocates a job, all its operations are assigned to the machines that can complete the operations as early as possible.

Take the RHFSP with 4 jobs and 3 stages as an example, in which the number of identical parallel machines at each stage is 3, 2, and 2, respectively. It is assumed that all the following parameters of the machines are the same, the working energy consumption per unit time* PW*=8, the idling energy consumption per unit time* PI*=2, the turning on and off cost of the machine* Setupcost*=10, the turning on and off time of the machine* Tsetup*=2. In addition, the due date of each job and the processing times are shown in Table 1.

If the processing time of one operation =0, it is indicated that the job is not processed in the stage of a certain pass. It can be seen from Table 1 that jobs 1, 2, and 4 are involved in two re-entrances, and job 3 is involved in one re-entrance. For job 2, its processing sequence is stage 1 → stage 2 → stage 2 → stage 3 → stage 2 → stage 3. Using the random key coding, the individual length is equal to the number of jobs, and each element is arbitrarily selected within as shown in Table 2. It is hypothesized that an individual position vector is [0.6555, 0.3922, 0.7431, 0.1712], then the corresponding job sequence is by ascending the elements of each vector, which means the processing order of the jobs is 4-2-1-3. The PS method was used to decode the job sequence into a feasible schedule. The Gantt charts are shown in Figure 3. Firstly, all the operations of job 4 are arranged on the machines that can process it at the earliest. Then, the operations of job 2 are arranged on the machines that can process them as early as possible. If the completion time of a certain operation of job 2 is smaller than the start time of the arranged operation of job 4 on a certain machine, the operation of job 2 will be arranged before it; otherwise, it will be arranged behind it. Next, following the same procedure as before, all the operations of jobs 1 and 3 are arranged on the machines. At last, it calculates the objective values with* C*_{max}=13,* T*_{max}= 3.4, and* SEC*_{min}=18.

**(a)**

**(b)**

**(c)**

**(d)**

##### 3.2. Right-Shift Procedure

In this study, we propose a right-shift procedure to further improve the quality of the solutions. On the premise of not changing the job sequence, the procedure can reduce the and minimize the number of idling without affecting the makespan. Since the adjustment of the latter job will affect the adjustable range of the former job, the order of adjustment should be from back to front. The specific steps are as follows.

Without considering the energy consumption cost of the machine, all the jobs will be processed as early as possible and then calculating the start and end time of each operation to generate a schedule sequence.

According to the schedule sequence, the start time of each operation on each machine is adjusted from back to front. The adjustment diagram is as shown in Figure 4, in which* A* and* B* are the jobs corresponding to two operations on the machine.* S*_{A} and* S*_{B} are the start time of the corresponding operation, is the start time of the next operation of job* A*,* E*_{A},* E*_{B,} and are the completion time of the corresponding operation, and* t*_{1},* t*_{2,} and* t*_{3} are the corresponding processing time. If* S*_{B} is not greater than ,* E*_{A}=*S*_{B}. Otherwise, it does not need to be adjusted.

Adjust the start time and completion time of all operations according to .

##### 3.3. Population Initialization Based on Latin Hypercube Sampling

When the optimal solution space cannot be predicted, the solution space feature of the initial population can maximize the information of all the individuals within a limited number. Therefore, the distribution of the initial population seriously affects the convergence performance of the algorithm. In this paper, the Latin hypercube sampling technique is used to initialize the population. Assuming that m samples are extracted in an n-dimensional vector space, the steps for Latin hypercube sampling are as follows.

Divide each dimension into* m* intervals that do not overlap so that each interval has the same probability.

Randomly extract a point in each interval of each dimension.

Randomly extract the points selected in from each dimension and form them into vectors.

In case of 2-dimensional vector and 100 samples, the distribution of the initial population constructed by completely random and Latin hypercube sampling is shown in Figure 5. The distribution of samples constructed by Latin hypercube sampling is more uniform.

##### 3.4. Updated Positions of Individuals Based on Lévy Flight

The Lévy distribution was a probability distribution model proposed by the famous French scientist Paul Pierre Lévy in the 1930s. Lévy flight is a random search method following the Lévy distribution. It usually moves in short distance and occasionally in long distance, so as to avoid the repeated movement in one place. The diagram is shown in Figure 6. Updating the individual position by Lévy flight can expand the search space, increase the diversity of the population, and avoid falling into local optimum in a certain range.

In this paper, the Mantegna algorithm was used to simulate the Lévy distribution. The specific principles are as follows [37]:

where and are the current position and updated position of the individual, *α* is the step scaling factor, is the point multiplication operation, the step size factor of the Lévy flight ranges within , and is a Lévy random path.

Generating random step size according to Lévy distribution, , where *μ* and v follow normal distribution, and , where

##### 3.5. Chaotic Local Search of Logical Self-Mapping

Introducing logical self-mapping into MOMVO algorithms, 20% of individuals in the external archives are randomly selected as elites. Then, chaotic optimization algorithm is used to search within the neighborhood of the elites, and the search space is gradually narrowed as the iteration progresses. If a better solution is detected, it will replace the solution in the external archives. If the search does not produce a better solution, it jumps to the next elite until the traversal is completed. In this paper, the logical self-mapping function is adopted to generate the chaotic sequences. The mathematical expressions are as follows:

Scale the elites* X*_{i} according to formula (21).

Substitute the scaled* D*-dimensional variables into logical self-mapping function to generate new* D*-dimensional variables. Then, transform the generated chaotic variables into the solution space* X*_{i} according to formula (22).

Calculate the function value, if , then and update* t*_{i} according to formula (20); repeat until the maximum number of iterations is reached.

##### 3.6. The Algorithm Flow of IMOMVO

MOMVO and MVO have similar search mechanism using white holes, black holes, and wormholes to improve the solutions. A leader selection mechanism is employed to select solutions from the archives and create tunnels between solutions. Specifically, the crowding distance between each solution in the archives is first selected, and the number of solutions in the neighborhood is counted as a measure of coverage or diversity in the approach. Then, to improve the distribution of solutions in the archives across all objectives, a roulette wheel from the less populated regions of the archives is applied to select solutions. The pseudocode of the IMOMVO algorithm is as shown in Algorithm 1.

Set the parameters of the IMOMVO algorithm. | |

Initialize the population using Latin hypercube sampling technology | |

initialize iter =1. | |

while | |

Update WEP and TDR | |

for | |

Boundary checking of the universes | |

Calculate the inflation rate (fitness) of universes | |

Calculate the start processing time of each operation according to PS decoding strategy | |

Right-shift procedure | |

Calculate the inflation rate (fitness) of universes | |

end for | |

Sort fitness values | |

Find the non-dominated solutions | |

Normalized the inflation rates | |

Add the non-dominated solutions to archive ANS | |

Delete the solutions with many neighbouring solutions from the ANS | |

Applying roulette wheel and = N_{i}/c | |

end if | |

Local search with Chaotic Local Search of Logical Self-Mapping method | |

Update the position of universes | |

Update the position of universes using Levy flight | |

Boundary checking of the universes | |

end while | |

Return the solutions in ANS |

#### 4. Computational Results

In order to validate the effectiveness of the IMOMVO algorithm, four multi-objective optimization algorithms, including MOMVO, MOPSO, MOALO, and NSGA-II, are selected for comparative study. The simulation environment is windows 7, Intel Core i7-4770 cpu@3.40GHz, 8G memory. The algorithm is programmed by MATLAB R2017a.

##### 4.1. Test Problems

The experiments were conducted on the benchmark set randomly generated by Cho et al. [13], which consists of 120 small-sized test problems and 120 large-sized test problems. For small-sized problems, the number of jobs is within , the number of stages is within , the number of re-entrance is within , the number of parallel machines per stage is within , and the processing time is within . For large-sized problems, the number of jobs is within , the number of stages is within , the number of re-entrance is within , the number of parallel machines per stage is within , and the processing time is within .

##### 4.2. Performance Measures

This paper selected three performance indicators: SP, GD, and IGD, in which SP and GD are described in literature [38, 39]. Since the true optimal Pareto front of the problem is unknown, the non-dominated solutions of the five algorithms are approximated as the optimal Pareto front.

SP is used to measure the distribution uniformity of the non-dominated solutions on the Pareto front. As we all know, the smaller the SP, the better the result. When SP=0, the non-dominated solutions on the Pareto front are evenly distributed. The SP is calculated as follows:where , , , and* N* is the number of non-dominated solutions.

GD is used to evaluate the approximation degree between the front obtained by the algorithm and the real Pareto front of the problem which is calculated as follows:where* N* is the number of non-dominated solutions in the Pareto front and is the Euclidean distance between the* i-th* solution and the nearest solution in the optimal Pareto front. A smaller GD corresponds to better convergence of the algorithm. The minimum value of GD is 0, indicating that all the non-dominated solutions in the Pareto front are included in the optimal Pareto front.

*IGD* is a comprehensive performance evaluation indicator which can evaluate the convergence performance of the algorithm and the diversity performance of non-dominated solutions by calculating the minimum distance between the point on each optimal Pareto front and the Pareto front obtained by some algorithm. The calculation formula is as shown in (25), where is the number of non-dominated solutions in the optimal Pareto front and* N* is the Pareto noninferior solution set obtained by the algorithm. Clearly, the smaller the* IGD, *the better the convergence and distribution quality.

##### 4.3. Parameter Setting

The performances of metaheuristics often depend on the parameter setting. The IMOMVO algorithm mainly involves three key parameters, population size* N*, step size factor of Lévy flight, and local detection speed* P* of MOMVO. Many instances selected from the benchmark set [13] are used for investigation, while there is little difference. In this paper, Taguchi method is used to study the effects of different parameters on the experimental results with Sproblem-04-20 as an example. The factor levels for each parameter are shown in Table 3. According to the number of factors as well as the number of factor levels, the orthogonal table L_{16} (3^{4}) is adopted. For each factor level combination in the orthogonal table, it runs 10 times independently. In addition, the IGD performance metrics is regarded as the response variable (RV). Clearly, the smaller the RV value, the better the combination. The experimental results are shown in Table 4. The average response value (ARV) and the importance order of each parameter are shown in Table 5. Figure 7 shows the level trend of each parameter.

From Table 5 and Figure 7, it can be seen that parameter* N* is the most significant one among the three parameters. Parameter ranks second and parameter* P* ranks the last. Regarding parameter* N*, a large value of* N* may lead to an insufficient evolution process, while a small value may cause insufficient exploration at each generation. As for parameter , a larger meant larger step size, which is beneficial to global search, but not to local search. According to the above investigation, the three key parameters of the IMOMVO algorithm are set to* N*=80, =1.5, and* P*=7.

##### 4.4. Results and Discussion

In this paper, six small-sized and six large-sized problems are randomly selected for the benchmark set. Each problem runs 10 times independently using the five algorithms, and each run gets a set of []. The average (Avg), the standard deviations (Std), and the minimum values (Min) for the six small-sized and six large-sized problems are reported in Tables 6 and 8. The optimal results of the performance indicators are shown in bold.

The average and standard deviation can only represent the problem-solving performance from the macroscopic perspective. It can be seen from the student’s t-tests whether there are significant differences between two algorithms. To show the statistical difference between the IMOMVO and other algorithms, the results are listed in Tables 7 and 9. As can be seen from Tables 7 and 9,* GD* and* IGD* of IMOMVO is better than those of the other algorithms on all the small-sized problems and most large-sized problems. Therefore, the proposed IMOMVO algorithm was significantly superior to other algorithms at 95% confidence level. However, regarding* SP*, IMOMVO was not significantly different from other algorithms. Therefore, the distribution levels of the five algorithms have little difference.

Taking Sproblem-04-02 as an example to analyze the scheduling results after adding the turning on and off control strategy. There are 16 jobs, 8 machines, 1 re-entrance, and 6 stages in the problem. Meanwhile, the number of identical parallel machines in each stage is 1, 2, 1, 2, 1, and 1. The Pareto front obtained by each of the five algorithms is shown in Figure 8. Taking a non-dominated solution 13—5—8—2—12—9—11—6—10—1—16—4—7—15—14—3 of IMOMVO algorithm as an example, its corresponding objective function values =153, = 110.7137, and =222. In Figure 9, Gantt charts of the non-dominated explored by IMOMVO with considering the machine turning on and off control strategy are illustrated. Figure 10 shows the Gantt charts of the non-dominated (, = 110.7137, and =276) without considering the machine turning on and off control strategy. Comparative results show that the operations while considering the machine turning on and off control strategy is relatively centralized, and the energy consumption is reduced by 54. For machine 3, after job 4 is finished, job 14 is processed directly without waiting. In Figure 7, the double arrow indicates that the machine needs to be shut down. The first, seventh, and eighth machines do not need to shut down, while all other machines need to be shut down during the processing. For machine 5, there are five idling, all of which need to be shut down except for the fourth one. The machine turning on and off control vectors with and without considering the machine turning on and off control strategy are shown separately in formula (26) and (27), among which 1 means shutdown the machine and 0 means idle after finishing one operation. In addition, comparisons show that the number of idling of all machines is 42 and 32, respectively. So the number of idling is reduced by 10; however, both of them have 15 shutdowns during the processing.

Taking six small-sized problems as examples, each problem runs 10 times independently, and the relative variation of the average makespan and idle energy consumption of each problem is shown in Figure 11. The negative sign indicates the relative reduction. It can be seen from the graph that RHFSP considering machines turning on and off control strategy greatly reduces the idle energy consumption on the premise of ensuring the production efficiency, which shows the effectiveness of the strategy.

To sum up, it can be seen from the experimental results that the RHFSP considering machines turning on and off control strategy reduced the total energy consumption of the machines effectively without sacrificing the production efficiency. The useless energy consumption in the production process can be reduced by reasonably setting the turning on and off of the machines. The proposed IMOMVO is more effective than other algorithms for solving the multi-objective RHFSP.

#### 5. Conclusions

In this paper, the RHFSP with the objectives of makespan, maximum tardiness and idle energy consumption was solved by the MOMVO algorithm. In order to establish a high-performance approach for this problem, the MOMVO algorithm was improved, including population initialization based on Latin hypercube sampling (LHS), individual position update based on Lévy flight, and chaotic local search operation based on logical self-mapping. The effectiveness of the improved operations was shown by numerical tests. Experimental results demonstrated the superiority of the proposed IMOMVO to the other algorithms. Specifically, the comparative results showed that IMOMVO was significantly better than other algorithms in terms of the convergence and diversity of the non-dominated solutions. Regarding the distribution of the non-dominated solutions, there was no significant difference in the five algorithms. At present, the research on the RHFSP is not profound enough. The RHFSP model considering the machine turning on and off control strategy in this paper can reduce the energy consumption of the machine effectively. It is mainly suitable for the scenarios in which the turning on and off is convenient and the turning on and off energy consumption is relatively low. In the future, the work could focus on the RHFSP considering time-of-use tariffs and the joint optimization considering maintenance.

#### Data Availability

The data used to support the findings of this study were supplied by Professor Cho HM. These datasets are cited at relevant places within the text as references [13]. Authors are not authorized to make them public.

#### Conflicts of Interest

The authors declare that they have no conflicts of interest.

#### Acknowledgments

The authors would like to thank Professor Cho HM for providing us the data sets. The research of this paper is made possible by the generous support from National Natural Science Foundation of China (71840003); Science and Technology Development Program of University of Shanghai for Science and Technology (2018KJFZ043); Ministry of Education “Cloud Number Integration Science and Education Innovation” Fund Project (2017A01109); and Henan Province Science and Technology Research Project (182102210113).