Abstract

The location and capacity of express distribution centers and delivery point allocation are mixed-integer programming problems modeled as capacitated location and allocation problems (CLAPs), which may be constrained by the position and capacity of distribution centers and the assignment of delivery points. The solution representation significantly impacts the search efficiency when applying swarm-based algorithms to CLAPs. In a traditional encoding scheme, the solution is the direct representation of position, capacity, and assignment of the plan and the constraints are handled by punishment terms. However, the solutions that cannot satisfy the constraints are evaluated during the search process, which reduces the search efficiency. A general encoding scheme that uses a vector of uniform range elements is proposed to eliminate the effect of constraints. In this encoding scheme, the number of distribution centers is dynamically determined during the search process, and the capacity of distribution centers and the allocation of delivery points are determined by the random proportion and random key of the elements in the encoded solution vector. The proposed encoding scheme is verified on particle swarm optimization, differential evolution, artificial bee colony, and powerful differential evolution variant, and the performances are compared to those of the traditional encoding scheme. Numerical examples with up to 50 delivery points show that the proposed encoding scheme boosts the performance of all algorithms without altering any operator of the algorithm.

1. Introduction

Since 2014, the express delivery volume in China has ranked first in the world for six consecutive years [1]. Statistics show 107.7 billion pieces of delivery in China in 2021, with a 31% growth compared with 2020 [2].

The rapid growth of the Express Delivery Industry has brought fierce competition among the participants. The primary delivery companies actively improve service quality and delivery efficiency. For regional express delivery service suppliers, the location of the service center has a significant impact on efficiency and, hence, the overall operation cost.

Such a problem is typically modeled as a logistic distribution problem, which minimizes total distances between the distribution center (DC) and associated delivery points (DPs) under certain constraints. The problem is twofold: one considers the location-allocation problem (LAP) of the distribution centers and the other considers the vehicle routing problem (VRP) that starts at the distribution center and goes through each delivery point and back to the distribution center. The joint problem is the location and routing problem (LRP) [3]. Both LAP and VRP are NP-hard, and the combined LRP has attracted attention since the 1960s [4]. The problem structure of LRP is shown in Figure 1.

In LRP, the optimal route associated with a given distribution center may change with the demands of the delivery point or the policy of distribution centers. Therefore, the VRP may be left to the distribution centers to solve. Since the delivery points for each distribution center are not many, the local VRP can be solved easily. Therefore, the separation of LAP and VRP could significantly reduce the complexity of the problem.

The LAP may apply to a broader region, such as a country, state, or province. The options for locations are limited to a list of lower-level cities, and the options for allocation are subsets of delivery points. The solution of location usually uses binary representation because the number of options is fixed. The solution of allocation usually uses integers to denote the belonging of delivery points. Since the total number of distribution centers and delivery points is fixed, the size of a solution can be fixed as well. However, for a city-level LAP, the model and solution representation may vary because the available options for distribution centers can be many, and the setup cost and capacity of different options have a significant impact on the total cost.

On the other hand, the NP-hard nature of LAP makes heuristic algorithms favorable, among which swarm-based algorithms are representative [3, 5, 6]. The solution representation plays a vital role in applying the algorithm because an efficient encoding scheme may reduce the length of the solution string, smooth the solution space, or eliminate the constraints. For example, choosing distribution centers out of 100 candidate locations requires 100 binary bits to represent the solution, whereas a floating number representation of the coordinates needs only numbers. When considering the capacity of a distribution center, the binary representation quantizes the solution space and loses smoothness. Finally, the LAP constraints may result in many infeasible solutions, hence reducing the search efficiency.

This paper discusses the city-level express distribution center location problem and provides a new perspective for simplifying the LAP. Floating numbers denote the coordinates of distribution centers, and the delivery point assignment for each distribution center is determined by the decoded sequence of delivery points and the capacity of the distribution center. The total distance between the distribution center and delivery points, setup costs, and operational costs is used to evaluate the solution. Representative swarm-based algorithms, including particle swarm optimization (PSO), differential evolution (DE), artificial bee colony (ABC), and a powerful variant of DE, and LSHADE-cnEpSin [7], are compared under two solution encoding schemes.

2. Literature Review

The LAP tries to determine the location of distribution centers and the delivery points assigned to each distribution center simultaneously. LAP can take various forms in different scenarios. For example, distribution centers are sites for distributing medical services in public health emergencies, and delivery points are affected in literature [8]. Distribution centers are wastewater treatment plants in the wastewater treatment problem, and delivery points are processing units in literature [9]. Distribution centers are web servers in web services, and delivery points are user centers in literature [10]. For an express distribution center location problem, the operational cost involves the capacity and location of each distribution center, and the capacity is either determined or constrained by the assigned delivery points. Such a problem is called a capacitated location-allocation problem (CLAP) [11].

Much effort has been put into this issue. Pham et al. [12] applied a hybrid of the Fuzzy-Delphi-TOPSIS approach to identify the critical criteria for choosing the logistic distribution center. Yang et al. [13] considered the distances between manufactory and distribution centers and between the distribution center and customers and combined tabu search and genetic algorithm to select four distribution centers out of ten candidates. Karaoglan et al. [14] modeled the LRP with simultaneous pickup and delivery, which reflects the practice of beer distribution and empty bottle collection. The problem is then solved using an improved version of the simulated annealing (SA) algorithm.

The solution of CLAP may benefit from a geospatial information system (GIS) since the regional division and distance between two points can be more precise. Vafaeinejad et al. [15] developed a vector assignment ordered capacitated median problem (VAOCMP) model to describe the fire station location and allocation problem. In the VAOCMP model, the arrival time of the fire engine to demand points and the capacity of the fire facility are considered. The closeness of the fire facilities ranks the demand, and a facility will be filled up with closer demands. The problem is then solved by tabu search and simulated annealing (SA). Zheng et al. [16] proposed that the underground metro might be used as a complement to the urban logistics system. They utilize GIS to find the shortest path through all the most demanding points. The demanding points are allocated by the Voronoi diagram, which partitions a plane into polygons such that all the points inside a polygon are closest to one of the communities [17].

The demand for delivery points may be stochastic. Expert opinions can be introduced to build the distribution model of the customers with a lack of data. Zhou and Liu [18] used fuzzy numbers to model the customer demand, and the expected cost was used as the minimization goal. The expected cost is obtained by fuzzy simulation, and the model is solved by network simplex programming and genetic algorithms. The location of the demands may also be stochastic. Mousavi and Niaki [19] used the normal distribution to model demand location and fuzzy variables to model the amount of demand. Three cost functions were proposed: (1) minimization of the fuzzy expected cost, which is the integration of the credibility of fuzzy events; (2) the -cost minimization model, which minimizes the upper bound of transportation cost that has credibility greater than ; and (3) the credibility maximization model. The model is then solved by using fuzzy simulation and a genetic algorithm.

Noticing the redundancy of express terminal nodes that different express service suppliers establish in the same city, Meng et al. [20] proposed the express terminal nodes optimization integration problem (ETNOIP). The goal of ETNOIP is to establish the minimum number of shared express terminal nodes that could serve a given number of customer clusters. The capacity and scope of an express terminal are included in the cost as well. The model is then solved by SA with neighbor search and shows advantages over immune genetic algorithm (IGA) and CPLEX, an IBM optimization solver.

Many swarm-based algorithms and their variations have emerged in the last two decades. Many swarm-based algorithms have been applied to LAP as well. Xu et al. [21] used the wolf-pack algorithm to optimize the total distance. Bao et al. [6] applied particle swarm optimization (PSO) to a logistic vehicle routing problem. A supported vector machine was introduced to distinguish the state of a particle, and the state will determine whether or not a group of particles will be updated. Moonsri et al. [5] discussed the poultry logistics planning problem, which routes vehicles to each established depot. A new mutation formula is developed in the reinitialization phase of differential evolution (DE) to protect the local structure of the solution, and a location search of partial variables is used to enhance the exploitation ability. Guo and Zhang [3] considered the vehicle routing problem and location-allocation problem as a whole and applied a discrete artificial bee colony (ABC) to determine the choice of recycling centers, the vehicles that serve the recycling center, and the route of the vehicles.

Various models have been proposed in the past decades to describe different scenarios, and more algorithms have been developed to solve the proposed models. The NP-hard nature of CLAP and the constraints that come with it make algorithms unable to reach their full potential. Taking the express distribution center location-allocation problem as an example, we propose a general encoding method for swarm-based algorithms, which eliminates the capacity constraint in allocating delivery points and improves the efficiency of the compared algorithms.

3. Swarm-Based Algorithms for LAP

3.1. Particle Swarm Optimization (PSO)

PSO is the most representative swarm-based algorithm. The core mechanism is defined as follows:

The position of particles represents the candidate solutions. The symbol denotes the -th coordinate of a particle in the generation, which is updated by the velocity associated with each particle. Three parts determine the velocity of a particle in the next generation:(1)Current velocity is weighted by a linearly decreasing factor .(2)The difference between the current position and the global best position is scaled by the social learning factor and a uniform distribution random number .(3)The difference between the current position and the personal best position is scaled by the personal learning factor and another uniform distribution random number . The last two parts represent social learning and self-learning.

3.2. Differential Evolution (DE)

DE is another widely used swarm-based algorithm that uses other solutions in a swarm, instead of the global or personal best, to generate new solutions. Many mutation operators have been proposed in the literature. In this paper, we adopt the “DE/rand/1” strategy as follows:

The mutated solution is generated from three solutions that are randomly selected from the swarm and are different from the current solution and each other. The difference between the two is scaled by a factor and then added to the third solution. Note that only some bits of the selected solution are mutated by probability , allowing a subtle modification of the solution. Additionally, is a random natural number in the range , ensuring that at least one bit is mutated. The candidate solution is retained if the corresponding cost is better than the current one. Given the differential nature of the mutation operator (2), DE conducts large-scale exploration in the early stages and subtle exploitation in the later stages.

3.3. Artificial Bee Colony (ABC)

ABC produces new solutions in three ways that mimic the behaviors of three types of bees: employed, onlooker, and scout bees. The food sources represent the current best solutions. The employed and onlooker bees share the same mutation operator as follows:

The difference between the current solution and a randomly selected solution is scaled by a random number in [−1,1], and added to the current solution. This mechanism allows employed and onlooker bees to search for the alternative to the current solution. The behavior difference between employed and onlooker bees is that the employed bees make sure each food source is visited once in a cycle. In contrast, the food source is visited by an onlooker bee with a probability defined as follows:where is the population size and is the fitness of the food source . The onlooker bees are dispatched based on the roulette selection–the solutions with better cost have a higher chance of being visited. Each food source (solution) has a visiting limit. If there is no better solution found around the current solution after a certain number of visits, the food source is abandoned, and a scout bee is sent to generate a new random solution in the search space.

The mechanism of employed bees maintains the diversity of the swarm; onlooker bees exploit the neighbors for better solutions, and the food source visiting limit ensures that the algorithm will not be stuck on some solutions.

3.4. Ensemble Sinusoidal Parameter Adaptation Incorporated with LSHADE (LDES)

The parameter settings of DE partially depend on the problem. Therefore, research on the parameter settings of DE [22] and the adaptive parameters of DE [23] is proposed to tackle this problem. In the research stream of adaptive DE, Zhang and Sanderson [24] proposed a self-adaptive DE, JADE, which generalizes the “DE/current-to-best” mutation strategy to “DE/current-to--best” and controls the parameters in a self-adaptive manner. Tanabe and Fukunag [25] proposed a Success-History-Based Adaptive DE (SHADE). As an enhancement to JADE, SHADE utilized a history-based parameter adaptation scheme and ranked third in the real-parameter single objective optimization competition, CEC 2013. Tanabe and Fukunaga [26] later proposed the LSHADE algorithm, which extends the SHADE algorithm with the Linear Population Size Reduction (LPSR). The LPSR of LSHADE reduces the number of function evaluations in the exploitation stage of optimization and further enhances the performance. LSHADE wins the CEC 2014 competition.

Two years later, Awad et al. [7] proposed the LSHADE-EpSin algorithm, which incorporated the ensemble sinusoidal parameter adaptation and became the joint winner of the competition of CEC 2016. One year later, Awad et al. [27] proposed an improved algorithm, LSHADE-cnEpSin, to tackle the problems with high correlation between variables. LSHADE-cnEpSin became the second winner in the competition of CEC 2017. For brevity, we will denote LSHADE-cnEpSin as LDES in the rest of the paper.

4. Problem Formulation and Solution Representation

4.1. Capacitated Location-Allocation Problem

A complete planning scheme of distribution centers includes the number of distribution centers, the location and capacity of each distribution center, and the delivery points that are serviced by each distribution center.

The location of distribution centers affects the delivery mode and distance, hence the efficiency and service quality of the distribution centers. The factors that may affect the location selection of distribution centers may be classified into two classes: natural factors and social factors. Natural factors include the natural conditions, such as mountains and rivers, the conditions of the land, and the distribution of roads [11]. Social factors may include infrastructure, client demand distribution, suppliers, and policies.

The capacity of distribution centers determines whether the demand of the assigned delivery point can be served. The assignment of delivery points determines the transportation cost and the operational cost.

This paper focuses on the transportation, setup, and operational cost to simplify the model and highlight the main factors. To build up the objective functions for the cost, let us define the symbols as shown in Table 1.

4.1.1. Transportation Cost

The transportation costs may be affected by capital, fuel, lubricant, and operational costs. Sahin et al. [28] showed that the total cost of a unit of cargo in road transportation with trucks consists of 14% investment cost, 60% fuel cost, 17% operational cost and maintenance cost of the vehicle, and 9% external cost, which are positively related to the route length. Assuming the vehicles are fully loaded and the cost has a linear relationship with the route length, the total transportation cost can be formulated in the following equation:where is the city block distance as follows:

4.1.2. Setup Cost

The setup cost may vary depending on how the distribution center is set, such as renting a warehouse or building a new depot. Assuming that the average setup cost for any possible location is known, the total setup cost is simply as follows:where the operational cost for a given point can be determined in advance through investigation.

4.1.3. Operational Cost

The operational cost depends on how many demands a distribution center needs to meet, which is usually described by a cubic function of demand [29]. Therefore, a distribution center’s operational cost per unit demand is a quadratic function of demand. The total operational cost is formulated as follows:where are polynomial coefficients. The total demand that is assigned to the distribution center is determined by the decision variable . The cost per unit demand is then formulated as

The optimal capacity for a distribution center is the solution to the following equation:

To summarize, the total cost for an express CLAP iswhere , , and are the weight coefficients for transportation, setup, and operational cost, respectively.

4.1.4. Constraints

The boundary constraints for the decision variables are as follows:

The capacity of the distribution center must satisfy the total demand of all delivery points serviced by it:where is the total demand as described in Table 1. Each delivery point is serviced by one distribution center:

4.2. Encoding Scheme and Evaluation Criteria

The selected algorithms work on a set of floating number vectors (known as the population , where nPop is the population size). Therefore, the solutions of CLAP need to be encoded into floating number vectors before the algorithms can be applied. When a new vector is found by an algorithm, it must be interpreted (decoded) into the actual location and allocation plans before it can be evaluated. This section discusses the encoding/decoding schemes and the evaluation criteria of the plan.

4.2.1. Traditional Encoding Scheme

A complete location planning scheme of distribution centers can be represented by the decision variables described in Table 1. The location of a distribution center requires two numbers to denote the horizontal and vertical coordinates. Another quantity is required to denote the capacity. Furthermore, a number denoting the belonging of a delivery point is also required. Given that the length of the solution representation in the selected algorithms is fixed, the traditional encoding/decoding scheme can be as shown in Figure 2.

For a traditional encoding/decoding scheme, the length of a solution string is 3 , where is a positive integer and denotes that the -th delivery point belongs to the -th distribution center. The decision variables are obtained as follows:where are floating numbers. and () have the same range as and , respectively. The range of () is because the lower bound of the capacity cannot be determined in this encoding scheme. () and is the rounded-up function.

The traditional encoding scheme is straightforward, but the drawbacks are clear: (1) the number of distribution centers must be determined in advance, and (2) solution strings generated in the search process may violate the constraints. For the boundary constraints defined in Equations (12) and (13), the abovementioned algorithms will truncate the exceeded solutions. However, for the constraints in Equations (12) and (13), an infeasible solution means that the decoded plan cannot be executed.

For example, the delivery points assigned to a distribution center may have greater total demand than the capacity of the distribution center, or no delivery points may be assigned to a distribution center. Hence, a punishment term is defined as follows to suppress the infeasible solutions:

The modified cost function can be as follows:where is the weight of the punishment term. If no delivery point is assigned to a distribution center, the wasted resources are naturally a punishment for the cost. Such solutions will not compete with the solutions that take advantage of the available capacity.

4.2.2. Constraint-Solved Encoding Scheme

The abovementioned encoding scheme and cost function allow the existence of infeasible solutions, which decreases the search efficiency since there are invalid calculations for the infeasible solutions. This paper presents an improved encoding scheme that introduces random proportion and random key (RPK) as shown in Figure 3, which transfers the constraint problem into an unconstraint problem.

As shown in Figure 3, a floating number vector is used as the solution string, where . The number will be mapped into the number of distribution centers and will be mapped into the position of each distribution centers by

Note that if , the numbers will be omitted.

The numbers can be decoded into the capacity of each distribution center and the assignment of delivery points through random proportion and random key mapping.

The random key mapping determines a sequence of delivery points as follows. The partial vector is sorted in either ascending or descending order to obtain the sorting index . For example, denotes the order of in the sorted vector and denotes the order of in the sorted vector. The index represents the sequence of the delivery points.

The random proportion mapping uses a partial vector to determine the lower bound capacity of a distribution center:

Note that the numbers [] are omitted as well if .

The assignment of delivery points for the distribution center is determined as follows:where is a continuous partial sequence in , which makes the summation of demands from the set of delivery points just greater than the lower bound .

The actual capacity of the distribution center is determined by the summation of demand from the delivery point assigned to it:

The idea of RPK is to determine the number of distribution centers dynamically and make the capacity just enough for the assigned delivery points. The logical procedure of obtaining a feasible capacitated location-allocation plan from any is as follows:Step 1: obtain the number of distribution centers by Equation (19)Step 2: obtain the position of distribution centers by Equation (20)Step 3: sort and get the index Step 4: Step 5: for in doCalculate the lower bound capacity by (21)Starting from , find the first that satisfies Equation (22)The set of delivery points assigned to the distribution center is Obtain the actual capacity of the distribution center by (24)Step 6: ,

A decoding example is as follows. Assume that , , , , and , and demands for each delivery point are [100, 80, 110, 150, 60]. A vector with the length is [0.66, 0.1, 0.35, 0.5, 0.6, 0.7, 0.9, 0.4, 0.95, 0.25, 0.55, 0.6]. Then, , , , , , , and . The sorting index of the ascending order of [0.4, 0.95, 0.25, 0.55, 0.6] is [15]. The minimum set that satisfies equation (22) is ; therefore, , , and .

For an arbitrary , the RPK encoding scheme always produces a unique and feasible plan. The total capacity of the distribution centers equals the total demand, which maximizes resource utilization. Furthermore, the RPK encoding scheme allows algorithms to operate on a uniform vector , which facilitates the application of algorithms.

5. Evaluation Experiments

5.1. Experiment Settings

The evaluation experiments were conducted on a 10 km by 10 km square region of Zhenjiang, China [30]. The map of this region is divided into 10 10 grids, and the setup cost for each lattice is obtained via investigation. The map, grid, and setup cost matrix are shown in Figure 4.

The setup costs are scaled into five levels. Level 5 represents the most expensive setup cost. The central area (slightly above the middle) has the highest cost, and the suburbs have the lowest cost. A mountain is located slightly below the middle, and some waters are in the north, where a distribution center cannot be set up. We set the cost much higher than the maximum level cost (20 in this case) to prevent generating a distribution center located in these areas. While a finer grid may bring a plan closer to reality, a 10 10 grid is sufficient to show the algorithm’s mechanism and maintain the map’s readability.

The TES and RPK encoding schemes are applied to four algorithms: PSO, DE, ABC, and LDES. Since the compared algorithms use different population sizes, we use the number of cost function evaluations (FEs) instead of the number of generations to measure the performance. The algorithms will stop when the maximum FEs are reached. RPK works on a unified search space and transforms a solution into a location and allocation plan, where is the dimension of the search space, i.e., the number of variables. The algorithm parameters are shown in Table 2.

5.2. Comparison of TES and RPK Encoding Schemes

The comparison is conducted on the map in Figure 4. There are 20 delivery points on the map, each with different demands. The maximum number of delivery centers is set to 5. The location of delivery points and the associated demands are shown in Figure 5.

Each algorithm runs 30 times, and the optimum costs found by each algorithm are averaged over 30 runs. The results are shown in Table 3.

From Table 3, we observe that the RPK encoding scheme improves the performance of all compared algorithms. The average optimum cost was reduced by at least 11.38%. ABC shows the best average performance for both RPK and TES encoding schemes. The DE and LDES show comparable performance with ABC for the RPK encoding scheme, while the smaller standard deviations (0.05 and 0.09), respectively, indicate more stable performance.

The improvement has two sources: first, the RPK allows automatic selection of the number of distribution centers; since each distribution center has a setup cost, fewer distribution center reduces the total cost. However, fewer distribution centers increase the number of delivery points to be serviced, hence increasing the route length of travel through all delivery points and leading to greater transportation costs. On the other hand, TES adopts a fixed number of distribution centers (in this case, 5), and the setup cost is on an average greater than RPK. The RPK balances the number of distribution centers and the route length, which allows each distribution center to operate at a lower average cost.

Second, the capacity distribution mechanism in RPK ensures that the total capacity equals the total demand and no capacity is wasted, which produces a more efficient allocation plan.

The statistics and convergence curves of the four algorithms in the two encoding schemes are shown in Figures 6 and 7. The nonoverlapping notch of the boxes corresponding to different encoding schemes of the same algorithm shows that RPK reduces the median of the optimal cost by 5%.

Figure 7 shows that most optimization processes converge within 50000 function evaluations regardless of the algorithm or the encoding scheme. Except for ABC with RPK, the search of the other algorithms makes small progress throughout the entire process. This observation shows that the visiting limit mechanism keeps the ABC from being stuck in the local optima.

The best location and allocation plan found by ABC using RPK and TES encoding schemes are shown in Figure 8. A distribution center and associated delivery points are depicted in the markers with the same shape, where a solid marker denotes the distribution center and hollow markers denote delivery points. The size of each marker is proportional to the capacity/demand of DC/DP.

Both RPK and TES avoid the mountains and water areas. RPK allows the algorithm to automatically choose the number of distribution centers (in this case, 3). The delivery points are clustered around each distribution center to minimize transportation costs. The optimized number of distribution centers distributes the demands so that each distribution center may operate at a capacity with the lowest possible cost. For the TES, a fixed number of distribution centers prevents some distribution centers from operating at the lowest possible cost. Although the average transportation cost is reduced due to fewer delivery points serviced by each distribution center, the increased setup cost and operational cost make the allocation plan less economical.

The algorithm is smart enough under both encoding schemes to choose a location right on the edge of the lattice, which minimizes the setup cost and the transportation cost simultaneously. The detailed plan is shown in Table 4. The symbols , , and represent the transportation, setup, and operational costs of each distribution center, respectively.

The coordinates and show that all distribution centers are located on the edge of the lattices, where the setup cost is the lowest, and the distance to delivery points is minimized. The capacities of each distribution center with the RPK encoding scheme are the same as the total demands assigned to them. In contrast, the capacities of TES are slightly greater than the demands, which are a waste of resources and cause greater costs. Meanwhile, the capacities of RPK are closer to the optimal capacity (the optimal capacity can be obtained by solving (10), which is 20 in this case). The total transportation cost of RPK is greater than TES. However, lower setup costs and operational costs compensate for the overall cost.

5.3. Sensitivity of the Number of DCs and DPs

This section considers the impact of the number of distribution centers and delivery points. The number of delivery points is 10, 20, 30, 40, and 50. The maximum number of distribution centers is 2, 4, 6, 8, and 10. All four algorithms are tested. RPK and TES are also compared. The results are given in Table 5.

Improvement of RPK over TES can be observed in most cases (except for DE and LDES with #DP = 10 and #DC = 2). For #DC = 2 cases, the improvements of RPK with different numbers of delivery points are not significant because two distribution centers are not enough for any encoding scheme to distribute the demands into economic capacity. When #DC = 4, the average improvement is at least 5% (ABC with #DP = 30), and the highest improvement is 21.3% (PSO with #DP = 10). When #DC = 10, the average improvement is up to 56.05% (PSO with #DP = 10).

For #DP = 10, the averaged costs of RPK are steady with different numbers of distribution centers, whereas the averaged costs of TES keep rising with the increasing number of distribution centers. The reason behind this observation is that two distribution centers are sufficient for the optimal distribution of the total demand of 10 delivery points. Even if the maximum number of distribution centers varies, RPK automatically selects two distribution centers to distribute the demands and produce similar solutions. On the other hand, TES uses a fixed number of distribution centers. Therefore, the TES has a similar performance with RPK when #DC = 2 but deteriorated performance with the increasing number of distribution centers because additional distribution centers cause additional setup and operational costs.

The statistics of different combinations of algorithms, encoding schemes, and the number of distribution centers are shown in Figure 9.

The improvements of RPK over TES are shown in Figure 10. When the total demands rise with the number of delivery points, the maximum number of distribution centers that RPK shows a significant improvement (over 20%) rises as well. For example, when #DP = 10, the maximum number of distribution centers needs to be at least 4 for PSO to have an improvement of greater than 20%. When #DP = 50, the number rises to 10. Below a certain maximum number of distribution centers, neither RPK nor TES could find a better allocation plan. In contrast, above the threshold, the dynamic number of distribution centers in RPK shows excellent efficiency in solving the CLAP.

6. Conclusion

The solution representation of practical engineering problems may significantly affect the performance of swarm-based algorithms. Proper encoding of the solutions may bring three significant advantages:(1)The encoded solution could have uniform ranges, which is suitable for the algorithm adopting a “crossover” operator that may switch the position of the elements in a solution vector(2)The landscape of the solution space is altered to provide more “algorithm-friendly” information, such as gradients and continuity(3)Some constraints may be eliminated, which increases the rate of feasible solutions in the newly generated solutions, hence improving the search efficiency

We propose the random proportion and random key (RPK) encoding scheme to represent the location and allocation plan of an express CLAP. RPK brings three advantages over traditional encoding schemes:(1)RPK dynamically chooses the number of distribution centers in the search process. The solutions with different numbers of distribution centers coexist and evolve in the same swarm.(2)The allocation of delivery points is determined by the order of elements instead of the value of elements. Then, the candidate capacity is determined by the proportion of the total demand. This mechanism allows the delivery point assignment constraint and capacity/demand constraint to be satisfied simultaneously. There is no need to introduce a punishment term for violation of constraints.(3)RPK benefits all continuous optimization algorithms, swarm-based or not, by the means of transformation of the search landscape and elimination of constraints.

Data Availability

The experiment results data used to support the findings of this study have been deposited in the Science Data Bank repository (https://www.scidb.cn/s/ziQZba).

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was supported by the National Science Foundation of China (grant no. 51875270) and Philosophy and Social Science Research Project for the Universities of Jiangsu Province (grant no. 2019SJA1912).