#### Abstract

This study proposes a genetic algorithm (GA) to solve the biobjective vehicle routing problem with time windows, simultaneously considering the total distance and the distance balance of the active vehicle fleet. A new complex chromosome is used to represent the active vehicle routes. The problem is solved using tournament selection, one-point crossover, and a migrating mutation operator. In experiments on Solomon’s benchmark problems, considering both the total distance and the distance balance, the distance balance is improved in all classes of problems. According to the experimental results, the suggested approach is sufficient and the average GA performance is good.

#### 1. Introduction

The vehicle routing problem (VRP) is one of the most attractive topics in operations research, logistics, and supply chain management. The VRP deals with minimizing the total cost of logistics systems. VRPs are well-known combinatorial optimization problems arising in transportation logistics that usually involve scheduling in constrained environments. In transportation management, services must be provided from a supply point (depot) to various geographically dispersed points (customers), with significant economic implications. Because of the VRP’s important applications, many researchers have developed solution approaches for these problems.

The vehicle routing problem with time windows (VRPTW) is a variant of the VRP that adds time window constraints to the model. In the VRPTW, a set of vehicles with limited capacity is to be routed from a central depot to a set of geographically dispersed customers with known demands and predefined time windows, so that the fleet size and the total traveling distance are minimized while the capacity and time window constraints are satisfied. Due to its inherent complexity and its usefulness in real life, the VRPTW continues to draw the attention of researchers and has become a well-known problem in network optimization, and many authors have developed solution approaches based on exact and heuristic methods.

Many exact optimization approaches have been used to solve the VRPTW, which is a well-known NP-hard problem [1]. An exact algorithm based on branch-and-cut techniques is presented in [2]. Because of the problem’s complexity, only small-scale models can be solved exactly [3], and such methods are inefficient in general [4]. Kohl’s work [5] is by far one of the most efficient exact methods for solving 100-customer instances. As a result, many researchers have investigated the VRPTW using heuristic and metaheuristic approaches.

In recent years, approximate approaches have been used for the VRPTW instead of exact methods, given the latter’s intolerably high computational cost. Various heuristic methods can be found in the literature [6, 10], including simulated annealing [7] and tabu search [8]. The genetic algorithm for the VRPTW [6, 9–11] may be the most widely used approach because of its efficiency. Thangiah [12] presents a hybrid of a genetic algorithm and local search optimization. The relative performance of genetic algorithms, tabu search, and simulated annealing is studied in [6, 10].

The literature above focuses on single objective formulations of the VRPTW. In recent years, multiobjective problems have attracted many researchers’ attention because they are closer to real environments. Some multiobjective VRPs are formulated as a single function using weight parameters that are determined only from experience. A Pareto-based approach is well suited to such problems, since managers can make their own decisions from the Pareto-optimal output [13]. Specialized genetic operators and a variable-length chromosome representation were used for the VRPTW and produced very good results on Solomon’s 56 benchmark problems [14].

Different objectives were classified in [15] according to the factor they relate to, that is, the tour, the resources, and the node activity. For the tour, minimizing the distance travelled (or time required) was the most common objective, while reducing the imbalance (or disparity) in the workload of vehicles was studied in [16]. Minimizing the number of vehicles is one of the objectives related to resources. Ghoseiri and Ghannadpour [17] studied the multiobjective problem of minimizing the number of vehicles and the travelling distance. However, in real life the vehicles are often employed by the company at a fixed cost. That means it is impossible to reduce this cost by reducing the number of vehicles, whereas the total travelling distance is an important economic variable related to fuel consumption [18]. Furthermore, the workload balance of vehicles is another important variable because of management requirements.

This paper studies a biobjective VRPTW that simultaneously minimizes the total traveling distance and the workload imbalance of vehicles. Generally, the workload imbalance includes the distance imbalance and the load imbalance. However, in some real-life settings, for example, fresh food delivery, the weight of the goods can be ignored because the orders are not heavy and have no influence on the workload cost. Accordingly, this paper considers two objectives: the total travelling distance and the distance imbalance of the active vehicle fleet. Section 2 describes the formulation of the VRPTW. Section 3 discusses the genetic algorithm used to solve this problem. The experiments and results are analyzed in Section 4. Finally, Section 5 provides the conclusions of this work.

#### 2. Model Formulation

The VRP was introduced by [19] and became one of the most widely analyzed NP-hard problems. The single objective VRPTW aims to determine which customers are visited by each vehicle and the route each vehicle follows to serve its assigned customers, such that the distance travelled by the vehicles is minimized and the capacity and time window constraints are satisfied. The VRPTW has been widely studied because it remains one of the most difficult problems in combinatorial optimization and has a considerable economic impact on the whole logistics system [11], especially due to the importance of supply chain operations [20]. Some VRPTW instances have been addressed with exact methods, such as Lagrangian relaxation-based methods, column generation, and dynamic programming. However, these exact methods often perform poorly on intermediate and large problems. For such cases, heuristic and metaheuristic methods have been proposed, and the results show that they obtain feasible solutions in acceptable times.

##### 2.1. Formulation for VRPTW

An undirected complete graph can be used to model the VRPTW. The vertices denote the depot and the customers, and the edges correspond to the links between them.

The VRPTW can be formulated as follows.

*Notation*:

- $e_i$ is the earliest time at which customer $i$ allows the service.
- $l_i$ is the latest time at which customer $i$ allows the service.
- $c_{ij}$ is the cost for travelling from node $i$ to node $j$. It is considered as the distance or time required for travelling from node $i$ to node $j$.
- $d_i$ is the demand at customer $i$.
- $K$ is the maximum number of vehicles that can be used.
- $N$ is the number of customers plus the depot. The depot is denoted with number 1, and the customers are denoted as $2, \ldots, N$.
- $Q$ is the loading capacity of each vehicle.
- $t_{ik}$ is the time at which vehicle $k$ starts to service customer $i$.
- $M$ is a given large value.
- $x_{ijk}$ is the decision variable. It is equal to 1 if vehicle $k$ travels from node $i$ to node $j$ and is equal to 0 otherwise.

subject to

Equation (1) is the objective function of the single objective problem. Equation (2) denotes that a vehicle must travel from one node to a different one. Equation (3) indicates that variable $x_{ijk}$ is equal to 1 if vehicle $k$ goes from node $i$ to node $j$ and is equal to 0 otherwise. Equation (4) states that a customer is serviced only once by exactly one vehicle. The constraint in (5) takes into account that the load of a given vehicle cannot exceed its capacity $Q$. Equation (6) specifies that there are up to $K$ routes going out of the delivery depot. Equation (7) guarantees that each vehicle departs from and returns to the depot. Equation (8) ensures that time windows are observed. Given a large value $M$, the inequality in (9) specifies that, if vehicle $k$ is travelling from customer $i$ to customer $j$, the vehicle cannot arrive at customer $j$ before $t_{ik} + c_{ij}$. The variable $t_{ik}$ corresponds to the time at which vehicle $k$ starts to service customer $i$. If vehicle $k$ does not service customer $i$, $t_{ik}$ is not calculated.
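
A formulation consistent with the verbal descriptions of (1)–(9) can be sketched as follows. This is a reconstruction under assumptions, not necessarily the exact equations of the model: the symbols are assumed to be $x_{ijk}$ (1 if vehicle $k$ travels from node $i$ to node $j$), $c_{ij}$ (travel cost), $d_j$ (demand), $t_{ik}$ (service start time), $e_i$ and $l_i$ (time window bounds), $K$ (maximum number of vehicles), $N$ (customers plus depot, with the depot as node 1), $Q$ (vehicle capacity), and $M$ (a large constant). The index ranges are also assumed, and service times are omitted because the description of (9) does not mention them.

```latex
\begin{align}
\min\ & \sum_{k=1}^{K}\sum_{i=1}^{N}\sum_{j=1}^{N} c_{ij}\,x_{ijk} \tag{1}\\
\text{s.t.}\ & x_{iik} = 0, \quad \forall i \in \{1,\dots,N\},\ \forall k \in \{1,\dots,K\} \tag{2}\\
& x_{ijk} \in \{0,1\}, \quad \forall i, j \in \{1,\dots,N\},\ \forall k \in \{1,\dots,K\} \tag{3}\\
& \sum_{k=1}^{K}\sum_{i=1}^{N} x_{ijk} = 1, \quad \forall j \in \{2,\dots,N\} \tag{4}\\
& \sum_{i=1}^{N}\sum_{j=2}^{N} d_{j}\,x_{ijk} \le Q, \quad \forall k \in \{1,\dots,K\} \tag{5}\\
& \sum_{k=1}^{K}\sum_{j=2}^{N} x_{1jk} \le K \tag{6}\\
& \sum_{j=2}^{N} x_{1jk} - \sum_{j=2}^{N} x_{j1k} = 0, \quad \forall k \in \{1,\dots,K\} \tag{7}\\
& e_i \le t_{ik} \le l_i, \quad \forall i \in \{2,\dots,N\},\ \forall k \in \{1,\dots,K\} \tag{8}\\
& t_{ik} + c_{ij} - M\,(1 - x_{ijk}) \le t_{jk}, \quad \forall i \in \{1,\dots,N\},\ \forall j \in \{2,\dots,N\},\ \forall k \in \{1,\dots,K\} \tag{9}
\end{align}
```

In (9), the term $M(1 - x_{ijk})$ vanishes when vehicle $k$ actually uses the arc from $i$ to $j$, so the precedence condition is enforced only on arcs that are travelled.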

##### 2.2. Multiobjective VRPTW with Distance Balance

The paper aims to solve the vehicle routing problem with hard time windows and route balance as a multiobjective problem, where both the total travelling distance and the route imbalance are minimized. Route balance has often been related to the following factors:

1. balancing the number of customers visited by each active vehicle,
2. balancing the quantity or weight of the goods delivered by each active vehicle, sometimes as balancing the load rate (BLR), defined in (10) from the exact load of a vehicle and the capacity of that vehicle,
3. balancing the time required for the route,
4. balancing the waiting time required for the route,
5. balancing the delayed time of the route,
6. balancing the distance of the route travelled by active vehicles.

In this paper, we consider the imbalance of the distances of the routes travelled, which is defined in (11), with (12) giving the mean of all route distances. To describe the balance more clearly, we use the balance factor in (13) to represent the degree of imbalance.

Thus, from (1) and (11), the new multiobjective problem is defined as follows.

Minimize the total travelling distance (1) and the distance imbalance (11) simultaneously, while the constraints described above ((2)–(9)) are satisfied.
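
As an illustration of how the two objectives could be evaluated for a candidate set of routes, a minimal Python sketch follows. The exact forms of (11)–(13) are not reproduced here, so the definitions used are assumptions: the imbalance is taken as the mean absolute deviation of the route distances from their mean, and the balance factor as that imbalance divided by the mean route distance. The names `route_distance`, `objectives`, `coords`, and `depot` are hypothetical.

```python
import math

def route_distance(route, coords, depot):
    """Length of the tour depot -> customers in order -> depot (Euclidean)."""
    stops = [depot, *route, depot]
    return sum(math.dist(coords[a], coords[b]) for a, b in zip(stops, stops[1:]))

def objectives(routes, coords, depot):
    """Return (total distance, imbalance, balance factor) over active routes."""
    # Only active vehicles (non-empty routes) are counted.
    dists = [route_distance(r, coords, depot) for r in routes if r]
    if not dists:
        raise ValueError("at least one active route is required")
    total = sum(dists)
    mean_dist = total / len(dists)
    # ASSUMPTION: imbalance = mean absolute deviation from the mean route distance.
    imbalance = sum(abs(d - mean_dist) for d in dists) / len(dists)
    # ASSUMPTION: balance factor = imbalance relative to the mean route distance.
    balance_factor = imbalance / mean_dist
    return total, imbalance, balance_factor
```

Under these assumed definitions, a perfectly balanced fleet has a balance factor of 0, and the value grows as route lengths diverge, which matches the role the balance factor plays as a second minimization objective.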

#### 3. Multiobjective Genetic Algorithm

Various heuristic and metaheuristic approaches have been proposed for solving the VRPTW. Compared with other heuristics [21–23], GA has been widely used to solve this problem because of its efficiency and flexibility. The three main GA operators, namely, selection, crossover, and mutation, can be configured in different ways, resulting in various GA combinations. Thangiah et al. [24] were the first to apply GA to the VRPTW, and during the past few years numerous studies have been devoted to developing GAs for it.

GA is an adaptive heuristic search method based on population genetics. It represents the solution space by genetically coding each feasible solution as a chromosome that defines an individual member of a population. While binary strings have been commonly used in the literature to code chromosomes, we adopt integer strings in the proposed GA, where each gene in a chromosome represents a customer (or a node).

In a single objective GA, a special fitness function is usually defined, but in GA applications to multiobjective problems (MOPs) the Pareto ranking scheme has often been used [25]. The Pareto ranking process ranks the solutions in order to find the nondominated ones. Each solution receives a rank value, based on its objective values, that shows its quality compared with the other solutions. Pareto ranking is easily incorporated into the fitness evaluation process of a genetic algorithm by replacing the raw fitness scores with Pareto ranks. These ranks stratify the population into preference categories: lower ranks are preferable, and the individuals of rank 1 are the best in the current population.

The idea of Pareto ranking is to preserve the independence of the individual objectives. This is done by treating the current candidate solutions as stratified sets, or ranks, of possible solutions. The individuals in each rank set represent solutions that are in some sense incomparable with one another. Pareto ranking only differentiates individuals that are clearly superior to others in all dimensions of the problem. This contrasts with a pure genetic algorithm’s attempt to assign a single fitness score to an MOP, perhaps as a weighted sum, which essentially recasts the MOP as a single objective problem. The difficulty is that the weighted sum introduces bias into both the search performance and the quality of the solutions obtained. For many MOPs, finding an effective weighting for the multiple dimensions is difficult and ad hoc and often results in unsatisfactory performance and solutions.
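
The ranking idea can be sketched in Python as follows. The exact ranking procedure is not specified in the text, so this sketch assumes one common scheme: the nondominated solutions receive rank 1 and are set aside, the nondominated solutions among the rest receive rank 2, and so on. Both objectives are assumed to be minimized, and the function names are hypothetical.

```python
def dominates(a, b):
    """True if a is no worse than b in every objective and strictly better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_ranks(objs):
    """Rank objective vectors by repeatedly peeling off the nondominated front."""
    ranks = [0] * len(objs)
    remaining = set(range(len(objs)))
    rank = 1
    while remaining:
        # A solution is in the current front if nothing still remaining dominates it.
        front = {i for i in remaining
                 if not any(dominates(objs[j], objs[i]) for j in remaining if j != i)}
        for i in front:
            ranks[i] = rank
        remaining -= front
        rank += 1
    return ranks
```

For instance, with objective pairs (total distance, imbalance), `pareto_ranks([(10, 5), (8, 7), (12, 6), (9, 9)])` places the first two solutions in rank 1 because neither dominates the other, while the last two fall into rank 2.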

##### 3.1. Chromosome Representation

This paper uses a complex two-part chromosome to represent a solution of the VRPTW. The chromosome is separated into two parts by a zero gene (highlighted in yellow). The first part, called the customer-part, is a chain of integers, each of which represents a customer. The customers in it are divided into several routes, each representing a sequence of deliveries that must be covered by one vehicle. The second part, called the vehicle-part, contains the vehicle information. The number of genes in the vehicle-part equals the number of routes in the customer-part, and the value of each gene is the length (number of customers) of its corresponding route. The sum of the values in the vehicle-part must therefore equal the number of customers. For example, Figure 1 shows a representation of a possible solution with 8 customers and 3 vehicles. There are 3 genes in the vehicle-part, which means that the 8 customers are divided into 3 routes. The 3 in the first gene of the vehicle-part means that route 1 services 3 customers, namely 1, 5, and 4. The 2 in the second gene means that route 2 services customers 2 and 8. The 3 in the third gene means that route 3 services customers 7, 6, and 3.
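
The two-part encoding described above can be decoded into routes with a few lines of Python. This is a minimal sketch, assuming the chromosome is stored as a flat list of integers and that customer identifiers are positive, so the single zero gene unambiguously marks the separator. The names `decode` and `encode` are hypothetical.

```python
def decode(chromosome):
    """Split [customer-part..., 0, vehicle-part...] into a list of routes."""
    sep = chromosome.index(0)  # the zero gene separates the two parts
    customers, lengths = chromosome[:sep], chromosome[sep + 1:]
    if sum(lengths) != len(customers):
        raise ValueError("route lengths must sum to the number of customers")
    routes, start = [], 0
    for n in lengths:
        routes.append(customers[start:start + n])
        start += n
    return routes

def encode(routes):
    """Inverse of decode: concatenate the routes, then a zero, then the route lengths."""
    customers = [c for route in routes for c in route]
    return customers + [0] + [len(route) for route in routes]
```

Applied to the example in the text, `decode([1, 5, 4, 2, 8, 7, 6, 3, 0, 3, 2, 3])` yields the three routes `[1, 5, 4]`, `[2, 8]`, and `[7, 6, 3]`.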

This design is different from the classical approach, in which the route information is mixed with the customer sequences in a single chromosome. Storing the route information and customer sequences separately can represent the solution more clearly and facilitate the implementation of the algorithm, but its effect on the computational efficiency would not be significant. Without loss of generality, we consider the following implementations of the three operators.

##### 3.2. Selection

Several selection operators are commonly used in the GA selection process. Roulette wheel selection (RWS) stochastically selects individuals from one generation to create the basis of the next generation. RWS gives the fittest individuals a greater chance of survival than weaker ones. This replicates nature, in that fitter individuals tend to have a better probability of survival and go forward to form the mating pool for the next generation. Weaker individuals are not without a chance: in nature such individuals may have genetic coding that proves useful for future generations. Unlike RWS, uniform selection (US) assigns the same probability to each chromosome of the population. The US operation proceeds at random and is easy to implement. However, it has been criticized for lacking the spirit of natural evolution compared with RWS. Tournament selection (TS) is the most commonly used operator besides RWS. The TS operator runs several “tournaments” among sets of chromosomes chosen at random, and the winner of each tournament, the chromosome with the largest fitness, is selected for crossover. The tournament size can be used to adjust the selection pressure: if the tournament size is larger, weaker individuals have a smaller chance of being selected. This process is repeated until the mating pool is full. Since some experiments indicate that the TS operator outperforms RWS and US, we choose TS as the selection operator. A possible explanation is that TS always selects the best individuals among those sampled for crossover, whereas RWS and US are probabilistic and hence some good individuals may be lost in the evolutionary process [26].
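
A minimal Python sketch of tournament selection is given below. It assumes that fitness is expressed as a Pareto rank, so the contestant with the lowest rank wins, and that contestants are sampled without replacement. The tournament size default of 2 and the function names are illustrative assumptions, not values taken from the experiments.

```python
import random

def tournament_select(population, ranks, size=2, rng=random):
    """Sample `size` distinct individuals and return the one with the lowest rank."""
    contestants = rng.sample(range(len(population)), size)
    winner = min(contestants, key=lambda i: ranks[i])
    return population[winner]

def build_mating_pool(population, ranks, pool_size, size=2, rng=random):
    """Repeat tournaments until the mating pool is full."""
    return [tournament_select(population, ranks, size, rng) for _ in range(pool_size)]
```

Raising `size` increases the selection pressure: with `size` equal to the population size, every tournament returns the best-ranked individual, while with `size=2` the worst individual can never win but mid-ranked ones still can.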

##### 3.3. Crossover

The one-point crossover operator involves randomly selecting one point that divides a parent into two parts. Each candidate point is selected with equal probability. For example, suppose the crossover point is randomly selected at the third gene of parent 1. The offspring inherits the left side from parent 1, and the other genes are inherited from parent 2. Another offspring is produced by exchanging the roles of the two parents. Figure 2 illustrates the process of one-point crossover [27]. Other crossover operators have also been proposed. The cycle crossover produces offspring through a cycle, which is a sequence of positions of the first parent. The partially mapped crossover operator produces the offspring by randomly selecting two crossover points [28]. The linear-order crossover also selects two crossover points from one parent and produces a new offspring with the other parent [29]. Some experiments illustrate that the one-point crossover is more efficient than the other three operators [26]. Thus this paper chooses the one-point crossover operator.
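
The one-point crossover on a customer sequence can be sketched in Python as follows. The sketch assumes that the operator acts on the customer-part only and that the genes inherited from parent 2 are the customers not yet present in the offspring, taken in the order in which they appear in parent 2, so that the offspring remains a valid permutation. How the vehicle-part is handled is not specified in the text and is left out here. The function name is hypothetical.

```python
import random

def one_point_crossover(p1, p2, cut=None, rng=random):
    """Child keeps p1[:cut]; the remaining customers follow in p2's order."""
    if cut is None:
        # Each interior cut point is chosen with equal probability.
        cut = rng.randint(1, len(p1) - 1)
    head = p1[:cut]
    used = set(head)
    tail = [gene for gene in p2 if gene not in used]
    return head + tail
```

With parents `[1, 5, 4, 2, 8, 7, 6, 3]` and `[3, 6, 7, 8, 2, 4, 5, 1]` and the cut after the third gene, the first offspring is `[1, 5, 4, 3, 6, 7, 8, 2]`. Calling the function with the parents swapped gives the second offspring, `[3, 6, 7, 1, 5, 4, 2, 8]`.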

##### 3.4. Mutation

Several mutation operators have also been proposed in the literature, some of them very complex. However, experiments with different mutation operators showed no significant difference in GA efficiency. A possible explanation is that the mutation rate is always very small, typically between 0.01 and 0.1. A migrating mutation is adopted to produce heterogeneous chromosomes in the pool and so avoid early convergence of the algorithm. This mutation method randomly selects a chromosome from the pool and then randomly chooses a customer from one of its routes. An attempt is then made to insert the selected customer into a new route. If the insertion produces a feasible route, the mutation succeeds. Otherwise, the process is repeated until a feasible solution is achieved. Figure 3 illustrates the migrating mutation process.
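
The migrating mutation can be sketched in Python as follows, operating on a decoded list of routes. Several details are assumptions rather than part of the description: the "new route" is taken to be another existing route of the same chromosome, the insertion position is random, the retry loop is bounded by a hypothetical `max_tries` parameter instead of running indefinitely, and a route left empty is dropped. The feasibility test is passed in as a caller-supplied function `is_feasible`.

```python
import random

def migrating_mutation(routes, is_feasible, max_tries=50, rng=random):
    """Move one random customer to a different route; retry until feasible."""
    if len(routes) < 2:
        return routes
    for _ in range(max_tries):
        src, dst = rng.sample(range(len(routes)), 2)  # two distinct routes
        if not routes[src]:
            continue
        new_routes = [list(r) for r in routes]  # work on a copy
        customer = new_routes[src].pop(rng.randrange(len(new_routes[src])))
        position = rng.randint(0, len(new_routes[dst]))
        new_routes[dst].insert(position, customer)
        if is_feasible(new_routes[dst]):
            # ASSUMPTION: a vehicle whose route became empty is no longer active.
            return [r for r in new_routes if r]
    return routes  # no feasible move found within max_tries
```

Only the destination route is re-checked, because removing a customer from the source route cannot violate the capacity constraint and, when travel times satisfy the triangle inequality as Euclidean distances do, cannot violate the time windows either.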


#### 4. Experimental Results and Comparisons

Solomon’s problems consist of 56 data sets, which have been extensively used for benchmarking different algorithms for the VRPTW in the literature over the years, since they represent different kinds of routing scenarios relatively well [14]. These problems differ in fleet size, vehicle capacity, traveling time of vehicles, and so on. The customers’ details include the customer index, the location in $x$ and $y$ coordinates, the demand, the ready time, the due date, and the service time required. All the test problems consist of 100 customers, which is generally adopted as the problem size for performance comparisons in the VRPTW. The traveling time between customers is equal to the corresponding Euclidean distance. The problem data are clustered into six classes, named C1, C2, R1, R2, RC1, and RC2, and the different categories indicate different customer distributions and different time window constraints. According to the customer location, problem category C has all customers clustered in groups, problem category R has all customers located randomly, and problem category RC has a mixture of random and clustered customers. In other words, customers are located closer to each other in problem category C than in R, and RC lies in between. That also means that category R is more difficult to solve. Moreover, the time windows of category 1 (C1, R1, and RC1) are narrower than those of category 2 (C2, R2, and RC2). Narrower time windows mean that some candidate solutions are more likely to become infeasible after a small change in the sequence of visited customers. Furthermore, for category 1, the time window is also narrow for the depot, which means that only a few customers can be served by one vehicle.
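
Given the customer fields listed above, a route can be checked against the capacity and hard time window constraints as in the following Python sketch. It assumes that a vehicle arriving before the ready time waits, that the due date bounds the start of service, that the depot's own time window bounds the departure and the return, and that travel time equals Euclidean distance as stated. The `Customer` record and the function name are hypothetical.

```python
import math
from dataclasses import dataclass

@dataclass
class Customer:
    x: float
    y: float
    demand: float
    ready: float    # earliest time service may start
    due: float      # latest time service may start
    service: float  # service duration

def route_is_feasible(route, customers, depot, capacity):
    """Check capacity and hard time windows for depot -> route -> depot."""
    if sum(customers[c].demand for c in route) > capacity:
        return False
    time, prev = customers[depot].ready, depot
    for c in [*route, depot]:
        a, b = customers[prev], customers[c]
        # Travel time equals the Euclidean distance between the two nodes.
        arrival = time + math.dist((a.x, a.y), (b.x, b.y))
        if arrival > b.due:
            return False
        # Wait if the vehicle is early, then serve the customer.
        time = max(arrival, b.ready) + b.service
        prev = c
    return True
```

A check of this kind is what makes small sequence changes risky under narrow time windows: swapping two adjacent customers can push an arrival past a due date even though the set of customers on the route is unchanged.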

This section describes computational experiments carried out to investigate the efficiency of the proposed GA. The algorithm is coded in Java and run on a PC with a 2.53 GHz CPU and 2000 MB of memory. The standard Solomon VRPTW benchmark problem instances are used as experimental data [30]. The computation is based on the following empirically chosen parameters: population size = 100, number of generations = 500, crossover rate = 0.9, and mutation rate = 0.2.

To illustrate the influence of including route balance in the model, Tables 1, 2, and 3 report the best solutions of the benchmark problems for the data groups C, R, and RC, respectively. Each table includes the following: in the first column, the benchmark problem instance according to [30]; in the second column, the best known solution for that problem in the literature; in the third column, the best solution found by the algorithm of this paper; in the fourth column, the percentage difference between the best known and the best found; in the third column from last, the balance rate of the single objective search; and in the last two columns, the new distance value and the balance rate of the biobjective algorithm of this paper.

From the single objective results in Tables 1 to 3, it is found that the balance rate of C1, R1, and RC1 is larger than that of C2, R2, and RC2 when the balance is not considered. In other words, category 1, with narrower time windows, has more room to improve the balance than category 2, with wider time windows. For example, the balance rate is 42.2% in C101 but 11.2% in C201.

After the distance balance is considered, the biobjective solutions show much improvement in the balance rate without much influence on the distance cost. For example, in C101 the balance rate decreases from 42.2% to 10.3%, while the distance increases only by about 10 (839 minus 828.94, roughly 1.2%). The instances with wide time windows and those with narrow time windows both achieve lower balance rates.

The comparative data show that the proposed algorithm reaches better route balance without significant deterioration of the VRPTW solution, in terms of the active vehicle fleet. In Table 4, the last column presents the degree of balance improvement.

#### 5. Conclusion

The problem discussed in this paper is of significant practical importance in cases where the balance of employees’ workload is a key motivation in vehicle routing. In some situations, the weight carried is not very important compared with the distances travelled by the active vehicle fleet when considering labor balance.

This paper proposed a genetic algorithm to solve the biobjective vehicle routing problem with time windows, simultaneously considering the total distance and the distance balance of the active vehicle fleet. We used a new complex chromosome to represent the active vehicle routes and chose tournament selection, one-point crossover, and a migrating mutation operator for the GA. The solution of the problem was obtained through iterative application of these operators. In experiments on Solomon’s benchmark problems, we found that the total distance obtained is close to the best known value in the literature, even though the algorithm was not designed for the single objective problem. With respect to distance balance, the single objective solutions of those instances are imbalanced and leave much room for improvement. The results show that the distance balance was improved in all classes of problems in the biobjective setting. According to these results, the suggested approach was sufficient and the average GA performance was adequate.

#### Acknowledgments

The authors thank the anonymous reviewers for their useful suggestions and comments. This work was supported by the National Natural Science Funds of China no. 51105157.