#### Abstract

In the reverse logistics recycling process, considering the diversity of recycling types, a variety of vehicle models are used to meet the reverse logistics recycling requirements. Therefore, this paper considers the fixed costs, transportation costs, carbon emission costs during driving, and time penalty costs in the reverse logistics process under multiple constraints such as multiple vehicle types and time windows in the recycling center. The recovery point is the recovery model, and an improved genetic algorithm is used to solve it. Drawing lessons from the idea of greed, the superiority of the initial population is improved; the entry matrix and the exit matrix of the iterative population are constructed, and the crossover operator is improved based on this, and the forward insertion method is introduced to design the hybrid crossover operation to speed up the population. At the same time, the improvement of the mutation operator is proposed to increase the diversity of the population. The experimental results show that the multivehicle mixed delivery mode can reduce costs more effectively than single-vehicle models, and the improved genetic algorithm has better convergence and stability.

#### 1. Introduction

In recent years, with the shortage of natural resources, how to maximize the use of existing resources has become a hot issue that countries around the world pay close attention to. The emergence of reverse logistics can relieve the pressure on natural resources to a certain extent and increase the availability of goods. Optimizing the recycling plan, improving the efficiency of product recycling, and reducing recycling costs are of great significance to improving recycling efficiency.

Reverse logistics refers to the reverse flow of scrapped products from customers to the starting point for disposal or remanufacturing purposes. The recycling center uses vehicles of different models to collect recyclable products from customers and recycle them to the recycling point. The recycling center is responsible for recycling products and preliminary processing of reusable products. For cost-saving reasons, recycling points can establish partnerships with recycling centers that are closer. However, due to the complexity of the logistics network and environmental impact, the transportation of recycled products has brought a heavy burden to the company. The local government encourages companies to carry out recycling operations through laws and advertisements to reduce resource consumption. The implementation of the reverse supply chain management mechanism can improve the efficiency of the recycling network, reduce the level of pollution caused by waste products, and promote the clean and environmentally friendly development of world cities. In practice, distance is an important factor affecting product recycling. By evaluating the location of the recycling center relative to the recycling point, it is found that the recycling operation is costly. Therefore, neglected waste constitutes a source of serious environmental problems and interferes with people’s daily life to a certain extent. Taking into account the particularity of the recycling business, optimizing the recycling process can improve the efficiency of the entire network while reducing costs and pollution.

#### 2. Literature Review

Regarding the path optimization of reverse logistics, many scholars at home and abroad have done research. For example, Chaabane et al. [1] established a mathematical model for the recycling problem of scrapped vehicles, combining the classic vehicle routing problem and the collection and delivery problem, as well as the constraints of multiple vehicle types and time windows. Foroutan et al. [2] aimed at minimizing operating costs and environmental costs, taking into account the return and lead time and delay costs, and established a mixed-integer nonlinear programming model for the routing and scheduling of multimodel green vehicles. Chen et al. [3] used an improved ant colony algorithm to solve the vehicle configuration and route optimization model that minimizes the recovery task. Mishra et al. [4] used the form of cooperative alliance to realize the limited means of vehicle route optimization in the reverse logistics network. The cooperative mechanism can help avoid the ineffective allocation of resources, reduce the operation of recycling logistics, and reduce long-distance transportation. Carlos et al. [5] considered the dangers in the process of recycling e-waste and carried out recycling, modeling the reverse logistics process of electrical and electronic equipment as a biobjective mixed-integer programming model under uncertainty. Chen et al. [6] proposed a nonlinear optimization model involving the number of second-hand products and reprocessed products. For these two types of requirements, analysis and insights are provided in the form of a complete strategy, which consists of different scenarios and can make the best decision under variable conditions. Numerical examples supplement insights into the model by performing sensitivity analysis on relevant model parameters. In order to minimize the total cost, Patrik and Klaudia [7] established a time window to recycle the heterogeneous fleet electric vehicle path model under consideration of vehicle load constraints; the vehicle type limits the weight of recyclable waste and the services allowed by customers’ time limit. Samantha et al. [8] used reverse logistics to determine the resources needed to reuse IOT for road infrastructure and modeled reverse logistics to transport the Internet of Things from the tailings dam to the processing plant and then to the road construction site. Cao et al. [9] studied Slovakia’s green and reverse logistics and found that the dominant position in Slovakia’s green and reverse logistics was achieved by large-scale production companies in the automobile industry in the west. Lechner and Reimann [10] established a multiangle MADM framework to provide systematic decision support for enterprises to choose the best 3PRLP. Saleh Moslehi et al. [11] evaluated the feasibility of a sustainable “reverse logistics” process in Brazil’s mobile phone manufacturing industry and conducted research on actual cases of returned mobile phones from one of Brazil’s largest mobile phone manufacturers. Wang et al. [12] designed a hybrid method based on CRITIC and EDAS methods and FFS to solve the needs of sustainable third-party reverse logistics providers.

There has also been some in-depth research on the path optimization problem of multivehicles with time constraints. For example, Ge and Yin [13] verified that cross-docking operations with multiple vehicle types can effectively improve the operating efficiency of supermarket chains and reduce logistics costs. Guo et al. [14] established a multimodel electric vehicle path optimization model considering the differentiated service cost under factors such as vehicle diversity, charging strategy, human-vehicle matching, and service time differentiation. Zhang et al. [15] proposed a new multiagent attention model to solve the multivehicle routing problem with soft time windows. Fan et al. [16] designed a genetic variable neighborhood algorithm by improving the adaptive search strategy for the vehicle routing optimization problem of multivehicles with fuzzy collection requirements. Yue et al. [17] considered transportation risks and transportation costs and constructed a multiobjective multivehicle path optimization model. Xu et al. [18] comprehensively considered factors such as multiple depots, multiple models, multiple goods, mismatched supply and demand among customers, and arbitrarily splitting of delivery and delivery requirements in order to optimize the multipoint production material allocation system and studied the routing problem of delivery vehicles. The research of Nicola et al. [19] included selecting suppliers and routing homogenous vehicle fleet to purchase different products from the selected suppliers to meet the minimization of transportation and procurement costs, define new branching rules, introduce new inequality families, and prove the new branch price-cutting method is very competitive. Appiah et al. [20] used a mixed-integer quadratic programming model to optimize the distribution of goods from 105 distributors in two factories in Ghana. Glize et al. [21] embedded a method for accurately solving its biobjective counterparts with ℇ constraints and proved that the modified method is stronger than the next set provided by the most advanced methods obtained from the literature.

It can be seen from the above literature that there have been many studies on reverse logistics, and many domestic and foreign scholars have conducted various research studies on reverse logistics. There have also been different aspects of research on multivehicle path optimization. However, there is no research on path optimization for multiple vehicles in the recycling process.

Therefore, according to the characteristics of recycled products and the impact of time constraints on the economic and social benefits of enterprises, multiple vehicles will be used for recycling and distribution, and a reverse logistics path optimization model under the time window will be constructed, which is more in line with actual needs. And because of the complexity of the path optimization problem, genetic algorithm is used to solve it.

#### 3. Problem Statement and Model Formulation

##### 3.1. Problem Description

A recycling center *D* has *M* types of vehicles, and the corresponding vehicle load capacity is ), and there are vehicles of various types. Different types of vehicles start from the recycling center to recycle several recycling points. It is assumed that the recycling volume of each recycling point does not exceed the load of the vehicle, and the total recovery of each subpath does not exceed the load of the vehicle. The vehicle returns to the recycling center after completing the task. The situation that meets the needs of recycling points and minimizes the total cost is solved, and the most suitable path is found through models and algorithms to minimize the cost of recycling centers; it is obvious from Figure 1.

##### 3.2. Basic Assumption

Based on the above problem description, the following assumptions are made:(1)There is only one recycling center, its inventory is large enough, and there are multiple recycling vehicles of different models.(2)Each model has a different rated load, fixed cost and driving cost.(3)The total amount of recycling at the recycling point does not exceed the vehicle capacity limit.(4)The driving speed of the vehicle is uniform.(5)The recycling task only involves the recycling of goods, that is, one-way flow.(6)The request for recycling vehicles must be issued from the recycling center and returned.(7)Recycling points are all serviced and only receive one service. The geographic location, recycling volume, and service time range of all recycling points are known and fixed.(8)Each vehicle has a daily working time limit.(9)Each recycling point can only issue one order, and recycling companies can accept multiple orders within their capabilities.(10)The point to be recycled has its own recycling volume and recycling time window.(11)Recycling vehicles should arrive at the collection point within the collection time window specified by the customer. If they arrive after the time window, overtime penalty costs must be paid; if they arrive before the time window, they must wait until the start time of the time window of the collection point.

##### 3.3. Symbol Description

: collection point collection, , where 0 is the recycling center : there are types of vehicles, : the available number of vehicle types*m*is , : the maximum load capacity of model : maximum daily working hours of model : fixed cost of model : unit transportation cost of model : the carbon cost of CO

_{2}emissions per unit distance travelled by model : CO

_{2}emission factor : distance from recovery point to recovery point : travel time from collection point to collection point : late penalty factor : recycling point

*i*provides the best recycling service time window for recycling companies : recycling point

*i*needs to recycle : current load capacity of vehicle

*k*at collection point

*i*: maximum daily working hours per vehicle : 0-1 variable; vehicle

*k*passes through recycling point

*j*from recycling company

*i*

#### 4. Model Building

##### 4.1. Analysis of Each Cost of Reverse Logistics

Taking into account various costs in the recycling process, including transportation costs and recycling costs, carbon emission costs, and time penalty costs, the problem studied in this article can be described as recycling centers from multiple recycling points.

The detailed steps of each part of the cost are as follows.

###### 4.1.1. Fixed Cost of Vehicles

The fixed cost of recycling vehicles is usually constant including vehicle depreciation, insurance, annual inspection fees, road and bridge fees, and driver’s wages and other costs related to delivery vehicles. The fixed vehicle cost is as follows:

###### 4.1.2. Variable Transportation Costs of Vehicles

The transportation variable cost of a distribution vehicle is usually the same as the mileage of the vehicle. Therefore, the transportation variable cost of a distribution vehicle can be expressed as

###### 4.1.3. Carbon Emission Cost

The cost of carbon emissions mainly refers to the cost of CO_{2} emissions produced by vehicles consuming fuel during the distribution process; carbon emissions = fuel consumption × CO_{2} emission coefficient [22]. Fuel consumption is not only related to transportation distance but also related to vehicle load. According to literature [22], a linear function related to the fuel consumption per unit distance *ρ* and the weight of the cargo vehicle is obtained, where is the maximum cargo capacity of the vehicle, the cargo capacity of the cargo vehicle is *X*, and the fuel consumption per unit distance is when fully loaded. The fuel consumption per unit distance at load time is .

Then, the fuel consumption per unit distance is as follows:

Therefore, the carbon emission cost in the reverse logistics process of multivehicle models can be expressed as follows:

###### 4.1.4. Time Penalty Cost

Since the collection point has requirements for the collection time, the collection vehicles are required to arrive within the prescribed time period, but due to many other reasons, the collection vehicles often cannot arrive on time, which will incur a certain penalty cost, so the penalty cost is :

The time window constraint allows the vehicle to arrive earlier than , so the waiting time is , but the arrival time is earlier than the time window, one needs to bear a certain penalty fee, is the penalty factor, and is a very large positive number.

In summary, the vehicle path optimization model for reverse logistics recycling considering multiple vehicle types is

Equation (8) means that each recycling point can only be passed by the recycling vehicle once at most; equation (9) is used to restrict the existence of recycling centers in multiple distribution routes; equation (10) is used to restrict entry and exit from a certain recycling point. The vehicles at the recycling point are the same vehicle; formula (11) is used to restrict the number of times each vehicle can be recycled to any recycling point not more than once; formula (12) indicates that the number of vehicles used in each type of recycling center should not exceed the available number of vehicles of this type; equation (13) indicates that the total recovery volume of all collection points in each route meets the vehicle capacity limit; equation (14) indicates that the running time of each vehicle cannot exceed the maximum transportation time; equation (15) is the decision variable constraints.

#### 5. Algorithm Design

In the traditional genetic algorithm, the effect of solving the vehicle routing problem of a single parking lot and single vehicle is better. However, in this paper, the two constraints of “multiple vehicle types” and “time window” need to be considered at the same time. The quality of the solution will be affected, and even unsolvable situation exists. The algorithm design used in this article refers to Chenet al.’s [23] research on multidepot and multichromosome genetic algorithm, and on this basis, the “time window” constraint is added, and each individual is set to contain 2 chromosomes-delivery route sequence and vehicle sequence, which means all available vehicles and their delivery routes, effectively avoiding invalid solutions. The following is a detailed introduction of the algorithm.

##### 5.1. Solution Encoding

This article uses natural number coding to encode such problems. The *N* recycling points are arranged to indicate the order in which the recycling points are passed by the serviced vehicles. It is necessary to insert multiple points representing the recycling companies to divide them into multiple ends to represent multiple routes for multiple vehicles. First, the number of collection points is represented by a natural number *j*, which represents the *j*th collection point, and 0 represents a recycling company. First, an ordered sequence of reclaim points is generated, and then *n*−1 0s are inserted in the sequence to divide the sequence into *n* segments; each segment represents a service line corresponding to a vehicle, where *n* represents the number of vehicles. For multivehicle problems, inserting 0 into the sequence cannot distinguish the correspondence relationship between the vehicles type and the lines, so consider replacing them with vehicle numbers. Recycling companies can use vehicle *k* to represent; let ‐k represent the number of the *k*th vehicle, ‘‐’ represents the number to distinguish it from the collection point, and ‘k’ represents the *k*th vehicle [11]. The original feasible solution can be expressed as , and the path it represents is as follows: Vehicle 1: 0-1-2-3-0 Vehicle 2: 0-4-5-0 Vehicle 3: 0-6-7-0

That is, the sequence from one vehicle number to the next vehicle number is the sequence of collection points that this numbered vehicle passes through.

##### 5.2. Population Initialization

The fitness of an individual determines the strengths and weaknesses of the individual and determines whether an individual can be retained in the iterative process. According to the individual’s delivery route sequence chromosome and vehicle sequence chromosome, the delivery task is arranged, and the fitness is calculated. The process is as follows:(1)For the individual in the population, select an available vehicle according to its vehicle sequence chromosome, and then select as many customer points as possible to be delivered in sequence in its delivery route sequence chromosome. The total demand for customer points should not exceed the rated load of the vehicle.(2)Judge whether all the customer points in the chromosome of the distribution route sequence have been distributed. If there are customer points to be distributed, return to (1); if not, go to (3).(3)After completing the task distribution, generate the distribution plan of the *i*-th individual in the population, and calculate its total cost according to formula (1).(4)Take the cost corresponding to the *i*-th individual, set , and is the fitness of the individual.

Example: suppose the delivery route sequence chromosome of the *t*-th individual in the population is [4, 5, 6, 3, 1, 2], and the corresponding customer point demand is [300, 200, 200, 400, 100, 300]; the vehicle sequence chromosome is [2, 3, 1, 4, 5], the corresponding model is [1, 2, 2, 3, 3], and the corresponding rated load of the model is [300, 500, 500, 400, 400]; according to the above task allocation method, the assigned tasks are as follows:①Install vehicles with a sequence number of 2 and model 1, and the service route is 0-4-0②For vehicles with a sequence number of 3 and model 2, the service route is 0-5-6-0③For vehicles with a sequence number of 1 and model 2, the service route is 0-3-1-0④For vehicles with a sequence number of 4 and model 3, the service route is 0-2-0⑤For other vehicles, distribution tasks are not assigned

Then, the total cost and the fitness degree are calculated according to the assigned task and formula (1).

##### 5.3. Crossover Operation

Since the variable-point crossing may cause the vehicle to be overloaded, resulting in an illegal solution, this paper adopts a fixed-point single-point crossing, and the crossing point is set after the first gene (vehicle gene). Individuals A and B are randomly selected according to the crossover probability , and the specific steps for performing crossover operations on them are as follows:(1)Randomly generate a natural number between 1 and *H*, representing the same chromosome number of the two individuals to be crossed (the number of individual A is set to *a*, the number of individual B is set to *b*, and *a* = *b*).(2)Take out the customer genes with the same two chromosomes *a* and *b*, and store them in the gene bank .(3)Exchange the customer genes of 2 chromosomes (keep the order of customer genes unchanged).(4)Compare the client genes of chromosomes other than chromosome *a* in individual A with the genes of one by one, and delete all the same genes. Similarly, perform similar operations on individual B.(5)Insert the genes in into any position after the first gene of any chromosome of individual A randomly one by one. After that, judge whether the vehicle is overloaded; if it is overloaded, randomly insert the next chromosome, and judge whether it is overloaded again, and so on, until all the genes in are allocated. Perform similar operations on individual B.

##### 5.4. Mutation Operation

Suppose the mutation probability , the first gene of each chromosome of an individual does not generate mutation, and for each other gene *i*, a floating-point number corresponding to gene *i* is randomly generated between 0 and 1. If , put the gene into the mutation gene bank *G*, and delete it from the chromosome at the same time; otherwise, this gene will not produce mutation. After that, follow the above step (5) to randomly insert the genes in *G* into the chromosomes one by one.

##### 5.5. Selection Operation

The traditional proportional selection method (the probability of selection is proportional to the fitness of the individual) has some bad properties: in the early stages of iterations, some super individuals with high fitness will dominate the selection process, resulting in excessive selection pressure; in the late stage, the fitness gap between individuals is small, resulting in too little selection pressure. The normalization technique proposed by CHENG and others for the problem of et al. minimization makes up for the above shortcomings, and its form is as follows:where is the fitness of the individual; is the maximum fitness of the population individual; is the minimum fitness of the population individual; is the average fixed cost of the vehicle; and is the fitness after transformation.

The advantage of converting into is that the survival pressure is always at a relatively moderate position, thereby preventing premature convergence. Then, use roulette to choose: first, calculate the total fitness ; second, calculate the relative probability of individual , ; then, calculate the cumulative probability of individual *i*, ; finally, randomly generate a floating-point number in the interval 0 ∼ 1; if , select the first individual; otherwise, choose to make hold the *i*-th individual.

##### 5.6. Elite Selection

The *S* + 1th individual is designed to be initially used to preserve the optimal individual in the initial population. After that, if the fitness of the *S* + 1th individual is due to all offspring individuals, it is used to replace the worst individual in the offspring; otherwise, it is replaced by the best individual in the offspring. On the one hand, elite selection can ensure that the genes of outstanding individuals are not lost in the process of evolutionary selection, thereby accelerating convergence; on the other hand, it ensures that the optimal individual in the offspring population will not be worse than the parent population, thus making the evolutionary curve monotonously decreasing.

#### 6. Example Simulation and Analysis

In order to verify the effectiveness of the algorithm, 19 recovery points and 1 recovery center are selected. The calculation example is compared with the traditional genetic algorithm on MATLAB 2018a to verify the effectiveness and feasibility of the proposed multichromosome genetic algorithm under the model. The running environment of the experiment is AMD Ryzen 53550H with Radeon Vega Mobile Gfx 2.10 GHz CPU 16 GB memory, Windows 10 64-bit operating system.

##### 6.1. Simulation

In the experiment, there are two different types of recovered vehicles, and the rated load of each type of recovered vehicles is 6 t and 8 t; the fixed cost (yuan) is 40 and 60; the unit transportation cost (yuan) is 5, 7, and 8; the constant speed of the vehicle (km/h) is 40; and the penalty cost (yuan/min) for late arrival is 4. See Table 1 for other parameters of recycling companies and recycling points, and the parameter values in the model are shown in Table 1.

##### 6.2. Path Optimization Results and Algorithm Comparison

According to the algorithm design and solving steps in Section 2, it is solved by an improved genetic algorithm. Improved GA solving parameters: the population size is 1000, the number of solutions is 500, the dimension *D* is 96, the mutation rate is 0.01, and the number of iterations is 500. The program was run 20 times, the running time was 61.1196 seconds, and the minimum objective function was 987.2. The optimal recovery path is shown in Table 2, and the path diagram is shown in Figure 2.

Comparing GA and the improved GA for 500 iterations, it can be seen from Figure 3 that, in the process of algorithm iteration, the gap between the objective function values obtained by IGA and GA is increasing. The stage is improved, and the minimum value of the IGA function in the iterative process is significantly reduced, indicating that the improved strategy for the honeybee stage in this article is effective. At the same time, as the number of algorithm iterations increases, it is obvious from Figure 3 that GA has reached the optimal value . It can be seen that‐ when the number of iterations reaches about 70 times, GA reaches the optimal value, and the objective function value of the IGA solution is still significantly reduced. This further shows that the search mechanism in this article can expand the search range of borrowing and improve the optimization speed of the solution and accuracy, and the improved genetic algorithm has higher quality, which verifies the effectiveness of the algorithm in solving the model.

##### 6.3. Comparative Analysis of Different Models

From Table 3, it can be seen that the total cost of recycling a single 6 t vehicle is yuan, the total cost of recycling a single 8 t vehicle is yuan, and the total cost of recycling 6 t and 8 t multiple models is yuan. It is concluded that the total transportation cost of recycling for multiple vehicles is significantly lower than the total cost of recycling for a single vehicle. From this, it is concluded that the use of multiple models can better reduce the total cost of recovery.

#### 7. Conclusions

This paper considers the multivehicle reverse logistics path under time constraints to conduct research, and the main conclusions are as follows:(1)According to the characteristics of recycled products and the requirements of low‐carbon green logistics, the four aspects of fixed costs, transportation costs, carbon emissions, and time penalty costs are comprehensively considered. A path optimization model is established in which the recycling center carries out recycling from multiple recycling points with the goal of minimizing the total cost of recycling.(2)Taking into account the shortcomings of early maturity of genetic algorithms and fast convergence in the later stages of evolution, multichromosome genetic algorithms are often used in parking lots. The IGA converges faster than the GA and has better solution results.(3)The simulation of the calculation example shows that the recycling of multiple-vehicle types is cheaper than recycling of a single-vehicle type, and the efficiency is changed.

In the actual reverse logistics operation environment, the complexity is far greater than the ideal state in the article. In the subsequent research of the article, the quality uncertainty problem can be considered together, which is closer to the actual reverse logistics environment.

#### Data Availability

No data were used to support this study.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.