Abstract

Energy supply is an important system that affects the overall efficiency of urban transportation. To improve the system operational efficiency and reduce costs, we formulate and solve a collaborative multidepot petrol station replenishment problem with multicompartments and time window assignment by establishing a mixed-integer linear programming model. The hybrid heuristic algorithm composed of genetic algorithm and particle swarm optimization is used as a solution, and then the Shapley value method is applied to analyze the profit allocation of each petrol depot under different coalitions. The optimal membership sequence of the cooperation is determined according to the strict monotone path. To analyze and verify the effectiveness of the proposed method, a regional petrol supply network in Chongqing city in China is investigated. Through cooperation between petrol depots in the supply network, the utilization of customer clustering, time window coordination, and distribution truck sharing can significantly reduce the total operation costs and improve the efficiency of urban transportation energy supply. This approach can provide theoretical support for relevant government departments and enterprises to make optimal decisions. The implementation of the joint distribution of energy can promote the sustainable development of urban transportation.

1. Introduction

The collaborative multidepot petrol station (PS) replenishment problem (PSRP) with multicompartments and time window assignment (CMPSRPMT) is a variant of the multidepot PSRP with time windows (MPSRPTW) [1], combining the latter with cooperation among petrol depots (PDs) and truck sharing (TS) of petrol distribution. CMPSRPMT incorporates multicompartment truck application and integrates a collaboration mechanism to design increasingly efficient petrol distribution networks, minimize aggregate operating cost, and reduce urban traffic burden. As of 2019, China has 260 million cars and 106,556 petrol stations (PSs). Responsible for the energy supply of an increasing number of cars, PSs are mainly located in urban areas and on main traffic routes. These stations make up a huge distribution network that produces considerable operating costs and negative effects on cities and the environment each day. Therefore, analyzing this distribution network and optimizing its operations are important. CMPSRPMT is a distribution network design composed of multiple petrol depots and stations. The collaboration among PDs in the distribution network and the time window assignments (TWAs) to PSs are designed to improve the operational efficiency and ensure that products are delivered to PSs on time.

Based on operational requirements, each petrol depot (PD) in the current distribution network is only responsible for specific PSs in a certain region. PDs are cut off from others, and sharing of distribution resources in a network is extremely difficult. The result is a temporal and spatial imbalance between the capacity of distribution trucks and the demand for customer service delivery, such as the idling and roundabout transport of distribution trucks. The inefficient distribution capacity and service quality of several PDs limit the efficiency of the entire distribution network. Moreover, customer demands for heterogeneous service time windows (TWS) require depots to increase their number of delivery trucks, which exert pressure on transportation facilities and depot budgets. Therefore, multidepot cooperative distribution must be based on TWS and distribution TS to effectively assign customer service. Through comprehensive resource sharing, multidepot cooperative distribution reduces costs and improves the service quality of the entire distribution network. Indeed, in China, large oil companies are already collaborating on the distribution of refined products [2, 3]. In terms of academic research, some scholars [4, 5] have also discussed the cooperation between different gas stations in petrol distribution.

In this study, multicompartment truck sharing, TWS, and a cooperation mechanism are integrated into the traditional MPSRPTW as CMPSRPMT. To optimize the CMPSRPMT and improve calculation accuracy, an optimal mathematical model is established to minimize the total operating cost. A hybrid heuristic algorithm that combines genetic algorithm (GA) with particle swarm optimization (PSO) algorithm is designed to achieve near-optimal solution. The Shapley value method is applied to allocate the benefits from cooperation. The membership sequence is analyzed according to the strict monotone path (SMP) principle. Therefore, it is conducive that the solution of the CMPSRPMT can improve the efficiency and flexibility of petrol distribution networks. For the CMPSRPMT, first of all, refined products have special distribution nature, that is, product diversification, and cannot be mixed; Secondly, multidepot cooperation, multicompartment vehicle use, and different time-window coordination mechanisms are comprehensively considered; it can propel the sustainable development of distribution theory and the entire intelligent urban transportation system.

The remaining parts of this study are organized as follows. In Section 2, relevant studies on petrol distribution optimization with the cooperation strategy and TWS are reviewed. In Section 3, a practical example of the CMPSRPMT is presented, and a mathematical model is established using notations and definitions to minimize total operating costs. InSection 4, a hybrid optimization methodology is introduced to solve the CMPSRPMT. In Section 5, a small-scale example and a case study in Chongqing city in China are conducted to verify the applicability of our proposed methodology. InSection 6, the conclusions and future directions are provided.

2. Literature Review

Past decades have scant academic research on the replenishment of PSs. Previous studies focus on optimizing the distribution of a single PD to multiple PSs and several variations of PSRP. For example, the multiperiod, time window, trip packing, and multidepots with TWS are separately considered for the PSRP [1, 68], which with its related problems receives increasing research focus given its practical importance. Popović et al. [9] developed a variable neighborhood search heuristic for solving a multiproduct, multiperiod inventory routing problem (IRP) in fuel delivery with multicompartment homogenous vehicles. Their method was proven superior to other optimization methods. Vidović et al. [10] proposed a mixed-integer programming model and a heuristic approach with and without fleet size costs to observe their impact on the multiproduct, multiperiod IRP in fuel delivery. Huang [11] employed the tabu search to solve an advanced capacitated location-routing problem in a distribution network with multiple pickup and delivery routes. Wang et al. [12] formulated an oil distribution model intra-area with distribution quantity and routing as decision variables to extend the multidepot half open vehicle-routing problem with TWS. They also used GA to solve this model. Carotenuto et al. [13] provided a mathematical formulation to the periodic PSRP and proposed different heuristic solutions. Wang et al. [14] proposed a mathematical model that considered petrol trucks returning to a depot multiple times and developed a heuristic algorithm according to a local branch-and-bound search with a tabu list and the Metropolis acceptance criterion to solve it.

The aforementioned works are some of the few that are directly related to PSRP modeling and research. However, the present study is closely related to the multicompartment truck transportation and TS to be considered in modeling. Derigs et al. [15] introduced a formal model, an integer program formulation, and a benchmark suit of 200 instances of a general vehicle-routing problem with compartments and presented a solver suite of heuristic components. He et al. [16] likewise investigated an approach that can be used in sharing internal trucks among multiple container terminals (SIMT) and proposed a novel strategy to resolve the SIMT problem for a specific large port with multiple adjacent container terminals. Lahyani et al. [17] optimized a rich multiproduct, multiperiod, and multicompartment vehicle routing of the olive oil collection process in Tunisia using a mathematical formulation, specifically an exact branch-and-cut algorithm. Coelho and Laporte [18] defined and compared four main categories of the multicompartment delivery problem (MCDP). They proposed formulations and models for certain MCDP cases and versions and then described a branch-and-cut algorithm that is applicable to all variants. Ostermeier and Hübner [19] identified vehicle-dependent costs within empirical data collection and used a large neighborhood search to solve an extended multicompartment vehicle-routing problem (MCVRP) for grocery distribution. Vahdani et al. [20] developed a biobjective optimization model and used two metaheuristic multiobjective algorithms to integrate the assignment of quay cranes in container terminals and internal TS assignment.

One part of the CMPSRPMT literature focuses on the problem of cooperation in the distribution networks. Cooperation among PDs largely influences the efficiency of their distribution networks. Qi et al. [21] proposed an alternative approach based on spatiotemporal partitioning to solve a large-scale VRP with time windows (VRPTW) using a genetic algorithm developed for K-medoid clustering of large-scale customers. Wang et al. [22] presented an improved PSO algorithm to minimize the total cost of the multiple centers’ joint distribution network when each distribution center is assigned to serve a series of distribution units. Wang et al. [23] used a mixed-integer linear programming model to minimize the total operating cost of nonempty coalitions of two-echelon heterogeneous cooperative logistics networks and the GA-PSO algorithm to solve customer clustering units’ reallocation problem. Wang et al. [24] established a linear mathematical model to minimize the total cost of the two-echelon logistics delivery and pickup network (2E-LDPN) and solved it by combining an improved PSO algorithm with the ant colony optimization algorithm to create hybrid metaheuristics. In the above studies, different heuristics are used to solve the problem in combination with various models.

Another part of the CMPSRPMT literature focuses on the TWA in distribution. Neves-Moreira et al. [25] presented a new formulation for the time window assignment vehicle-routing problem (TWAVRP) where TWS are defined for multiple product segments. The means of a fix-and-optimize-based math-heuristic method are used to solve this two-stage stochastic optimization problem. Martins et al. [26] extended research on MCVRPS by tackling a multiperiod setting with a product-oriented TWA and proposed an adaptive large neighborhood search as the solution. Wang et al. [27] established a biobjective programming model that optimizes the total operating routing cost and the total number of delivery vehicles for a collaborative multidepot VRP with TWA (CMDVRPTWA). They solved the model using a hybrid heuristic algorithm consisting of K-means clustering, Clarke–Wright (CW) saving algorithm, and extended nondominated sorting genetic algorithm-II (E-NSGAII).

Generally speaking, the mathematical model of vehicle assignment and routing problem is NP hard. In order to solve this kind of problem, it is very important to select and apply appropriate algorithms. Kuo et al. [28] proposed a hybrid PSO with GA (HPSOGA) for solving the capacitated VRP with fuzzy demand (CVRPFD). Azadeh et al. [29] presented a PSO algorithm synchronized with a local search heuristic and developed two other hybrid algorithms based on GA and ant colony optimization (ACO) algorithms to solve the crew scheduling problem. Zhou et al. [30] proposed two partheno-genetic algorithms (PGA) and adapted PSO and one state-of-the-art method to solve multiple traveling salesman problems (MTSPs). Tohidifard et al. [31] used GA and PSO to solve a new mathematical model for a multidepot VRP with TWS for home health care firms. Zhang et al. [32] presented a two-stage improved GA-PSO algorithm to solve the coal bed methane gathering network optimizing problem.

The aforementioned studies tackle numerous PSRP aspects but suffer from the following issues. (1) The replenishment network design procedure rarely considers the cooperation among PDs by regional partitioning. (2) Minimal attention is paid to distribution TS, multicompartment truck application, and transship transportation among the participants of a collaborative multidepot optimization network. (3) A single intelligent algorithm and heuristic approach are difficult to apply directly to a specific scale of CMPSRPMT with numerous PSs.

Combining observations in Table 1, the main contributions of this study lie in the following aspects: (1) proposing a cooperation mechanism based on the regional partitioning method and constructing a collaborative multidepot PS replenishment network with multicompartments and TWA; (2) establishing a mixed-integer linear programming model based on the minimum total operating cost and the number of multicompartment trucks for the CMPSRPMT; (3) designing a hybrid heuristic algorithm that combines GA and PSO algorithm to effectively address the optimization model; and (4) executing a real-world case study to assess the applicability of the proposed model and approach and compare the costs and benefits before and after the collaboration. In addition, this study lays a foundation and strategy for optimizing the cooperative energy supply network, which is conductive to improving the construction of the sustainable energy supply chain.

3. Problem Statement and Model Formulation

3.1. Problem Statement

CMPSRPMT integrates the problems of cooperative distribution, TS, and TWAs. Figure 1 illustrates a noncooperative refined products’ distribution network in which PDs operate independently and serve only their own customers. As such, long-haul deliveries are inevitable. A single PD also needs a large number of delivery petrol tankers to meet different demands and time window requirements of its customers. This case results in a substantial decrease in the distribution efficiency and an increase in distribution costs. Given the numerous cross-transportations, the distribution network becomes increasingly complicated and thus vulnerable to disruptions.

Figure 2 displays an optimized refined products’ distribution network with cooperation, TS, and TWAs. In this network, large tankers transfer refined products across depots. These petrol distribution trucks are responsible for refined products’ delivery from each depot to each PS. This cooperation leads to a more organized and ordered distribution system than the noncooperative distribution network. The required number of petrol distribution trucks and service scope of each PD are likewise reduced.

On the basis of the regional partitioning method, the transport time between petrol depots and stations is used to cluster corresponding PSs. Different customer groups are assigned to different PD services, and the demand of an individual PS is combined with the required time window. Different types of petrol tankers are assigned to distribution on the basis of the appropriate time window for each PD.

Five assumptions underline the corresponding mathematical model. (1) In a relatively short time period, each PS only generates one distribution order for refined products’ demand. The demand for each kind of refined products does not exceed the maximum loading capacity of a single compartment of the distribution truck. (2) Each customer (PS) can only be served once in a time period, but its demand can be fulfilled by multiple distribution trucks. Specifically, one PD can organize a distribution task, but multiple trucks can complete the distribution task. (3) The transportation speed of petrol transfer trucks and petrol distribution trucks is constant. (4) During a time period, distribution trucks may be dispatched repeatedly if time permits, regardless of the transfer time across PDs. (5) The loading and unloading service times of PDs and stations are related not to the refined products’ type but to the truck type and operational quantity.

3.2. Model Formulation

The proposed model is mathematically formulated as an optimization problem to minimize the total cost when each PD is assigned to serve a group of PSs with different trucks and TWS [23, 27]. Table 2 lists the related notations and definitions adopted in the CMPSRPMT optimization.

CMPSRPMT is formulated as a mixed-integer linear programming model to minimize the total cost. The cost function contains three components, namely, , , and , which are described as follows.

Equation (1) shows the formulation for , which denotes the total transportation cost of transferring refined products across PDs and the maintenance cost of transferring petrol trucks within a working period:

Equation (2) shows the formulation for , which denotes the total transportation cost, maintenance cost of all distribution trucks assigned to deliver different refined products from a depot to a PS during a working period, and the penalty cost for delivering trucks that arrive early or late:

Equation (3) shows the formulation for , which denotes the fixed and variable costs of petrol depots that belong to the collaborative petrol distribution network:

The optimization model of the CMPSRPMT is defined as follows:subject to

Equation (4) shows the objective function, which minimizes the total cost of the regional petrol distribution network. Constraint (5) ensures that the demand for one kind of refined products at any PS is less than or equal to the maximum capacity of a single compartment of the distribution truck. Constraint (6) guarantees that the demand for certain products in a PS can only be distributed by a certain compartment of a certain truck. Constraint (7) defines the loading service time of a certain delivery truck that distributes refined products to a certain PS. Constraint (8) defines the unloading service time of a distribution truck distributing refined products at a PS. Constraint (9) specifies the arrival time of a distribution truck to the PS to be served. Constraint (10) represents the time when a distribution truck returns to the departure depot after serving a PS. Constraint (11) indicates that a PD in the regional distribution network can either choose to join or not join the regional petrol distribution. Constraint (12) defines the total distribution capacity of different refined products of a PD in a time period. Constraint (13) represents the total demand quantity of refined product from all the PSs in a time period. Constraint (14) indicates the demand quantity of all kinds of refined products from all the PSs in a time period. Constraint (15) ensures that a distribution truck serves only one PS at a time. Constraints (16) and (17) represent the value range of variables. Constraint (18) defines the service cost allocated by a PD over a time period.

4. Solution Methodology

4.1. Hybrid Algorithm Solving Procedure

A GA-PSO hybrid algorithm is designed to address the petroleum distribution of the multidepot PS optimization model. GA is an evolutionary computing approach used to mimic the natural selection procedure and study combinatorial optimization problems [33]. PSO was proposed by Kennedy and Eberhart as one of the swarm intelligence stochastic evolutionary metaheuristic approaches [34]. Based on swarm theory, PSO is inspired by observations of the social behavior of bird flocking and fish schooling. This algorithm can be used to optimize local and global locations according to the particle position and velocity. Traditionally, GA and PSO have the merits of being population-based heuristics with the ability to evaluate the fitness of each possible solution. Through local and global search capabilities, GA and PSO are successfully applied to solve problems in engineering, management, and pure science domains. Given their individual abilities, GA and PSO are combined as a hybrid algorithm in this study to inherit the merits of both approaches and increase the probability of obtaining optimal solutions.

As a hybrid algorithm, GA-PSO is applied to meet the requirements of the multidepot PSRP optimization for more complex algorithms. Existing traditional heuristics are already proven efficient. However, the complexity of petrol distribution networks limits their capacity in finding near-optimal solutions. As such, properly integrating different methods into a hybrid solution approach such as GA-PSO can effectively improve the optimization results. Other hybrid heuristics exist but may display weaknesses compared with GA-PSO in terms of performance. For example, Chen et al. [35] addressed a task assignment problem by embedding a local search heuristic into the ACO algorithm to improve the solution search ability. Their proposed methodology had a satisfying global search capability but insufficient local search. In addition, some studies [3638] discussed the related variants of GA-PSO and its application in the centralized charging strategy of electric vehicles, multiqueue scheduling, and spatiotemporal task scheduling. And some research studies [39, 40] also applied more advanced particle swarm optimization variants in the neural network, burden distribution matrix, and other aspects. Thus, combining GA and PSO ensures good performances at local and optimum search levels, improves the optimization procedure, and reduces the number of necessary iterations. The relevant operations and processes of the proposed hybrid algorithm are illustrated in the subsequent sections.

Similar to existing research [41, 42], Figure 3 describes the procedure of the hybrid algorithm. The details of the main steps are as follows:Step 1: algorithm initialization: in the hybrid algorithm, the PD performs the refined products’ delivery service. An integer within is randomly selected and assigned to the gene location of each chromosome to check whether it meets the requirements. represents the number of PDs set for refined products’ distribution service. New chromosomes are generated until requirements are fulfilled. Formula (19) shows the calculation for the fitness function value . indicates the objective function of the kth chromosome, represents the unqualified chromosome number, and denotes the penalty weight for each unqualified chromosome (particle).Step 2: given the improved chromosome fitness function value, roulette wheel selection is then carried out. Crossover and mutation operations are executed based on the crossover probability pc and the mutation probability pm, respectively. The optimal solutions and new chromosomes are updated.Step 3: E chromosomes are selected as the initial particles in the population to handle the PSO operations as part of the hybrid algorithm. In addition, the individual optimal solution and the global optimal solution are calculated according to formulas (20)–(22). represents the allowable maximum velocity for the PS replenishment service. denotes the corresponding particle velocity. and are the acceleration coefficients used in PSO. represents a random fraction between 0 and 1. indicates the position of each particle (chromosome). and denote the individual best position of particle k at the tth iteration and global best position of particle k at the tth iteration, respectively. assures that each particle is an integer. denotes the initial inertia weight. denotes the inertia weight for , which indicates the maximum number of iterations. represents the iteration number. Formulas (20) and (21) are used to update the velocity and position for refined products’ replenishment services. Formula (22) is used to obtain the inertia weight .Step 4: repeat the loop steps indicated in Figure 3 until the maximum number of iterations is reached or the termination cases are satisfied.Step 5: if the number of iterations reaches its maximum, the loop process ends. The existing best known particle (chromosome) and fitness function value are selected as the optimal solution; otherwise, the above steps are repeated, starting from Step 2.Step 6: the optimal solution is calculated and selected from all available chromosomes and serves as the final result for the CMPSRPMT.

4.2. Shapley Value Model and MPS

In the optimized CMPSRPMT, each PS arranges for delivery of refined oil from the nearest PD. Refined products’ dispatching and distribution TS are implemented among PDs to reduce costs and increase profits. Alliance members agree to join the cooperation depending on the fairness of profit allocation. Participants are encouraged to join a coalition where the benefits are proportional to their contributions. Therefore, the leader or coordinator needs to implement an effective revenue delivery mechanism to ensure the group stability. The Shapley value model provides an efficient cost and profit allocation according to the participants’ marginal contributions in cooperative game theory [43]. This theory is widely used in the study of cooperative behavior of participants, including their profit and cost savings allocation [44].

4.2.1. Shapley Value Model

The Shapley value is a solution concept that provides a unique solution to the cost allocation problem. The computation formula in the following expresses the cost to be allocated to participant based on the assumption that the grand coalition is formed by accepting new members one at a time. The Shapley value is the average marginal cost of participants that entered in a completely random order. The cost allocated to participant is equal to

represents the number of participants in the considered coalition. The summation is that of overall coalitions that contain participant . The quantity, , is the amount by which the cost of coalition increases when participant decides to join. Such quantity is denoted by the marginal cost of participant with respect to that of coalition .

The Shapley value model is based on four fairness properties, namely, efficiency, symmetry, dummy property, and additivity [22, 44]. The calculated cost and profit allocation strategy satisfies these four properties which show several expected features from a practical perspective. The reasonability and stability of allocation can be insured on this basis.

4.2.2. MPS Strategy

The strictly monotonic path (MPS) strategy [44] is mainly applied to estimate the group stability according to the cost and profit allocation. Large numbers of cooperation possibilities generally provide a wide range of cost and profit allocation manners. A sequence can be adopted only if the cost reduction percentage of each participant increases whenever a newcomer joins. If is a sequence of the coalition, can denote the location of PD in sequence . indicates the cost reduction rate when PD joins the coalition as the member and can be computed as follows:

In formula (24), indicates that the participant decides not to join the cooperation. In the next section, the cost reduction percentages are used to explain the SMP, a sequence wherein the cost reduction percentages for all committed participants monotonically increase with each new member. The above procedure is applied in the case study.

5. Case Study

5.1. Data Source

A practical study in Chongqing city in China is considered to evaluate the effectiveness of the proposed CMPSRPMT optimization. As the central city of China, Chongqing is an ideal research subject for this study. Figure 4 shows the locations of five PDs (denoted as PD1, PD2, …, PD5) and 65 PSs (denoted as PS1, PS2, …, PS65). The PSs are marked as follows: circles for those served by PD1; rectangles for those served by PD2; triangles for those served by PD3; cruciform for those served by PD4; and hexagons for those served by PD5. PDs are marked with solid five-pointed stars. Table 3 shows the initial service assignment in the petrol distribution network. Given the assumed constant transportation speed, the transportation time across facilities in the petrol replenishment network can be used to represent the transportation distance between them for the convenience of calculation. Table 4 shows the transportation time across facilities.

In this study, three kinds of refined products are considered, namely, nos. 92, 95, and 98. Table 5 shows the related data based on the demand for different kinds of refined products and the corresponding service time window of each PS.

5.2. Parameter Setting and Optimization Results

To ensure normal operations, PDs need to bear the fixed and variable costs related to their distribution tasks. Several parameter settings in the hybrid algorithm and model are determined according to the associated previous studies [1, 24, 41, 42] and real conditions as follows:(1)Objective function parameters: three kinds of petrol distribution trucks are available. The set of petrol distribution trucks and their compartments are and , respectively. Thus, , , , , , , , , .(2)Algorithm parameters: according to Gao et al. [45], both PSO and GA are trained to get the best parameters in solving practical problems. Particles in PSO and the chromosomes in GA are trained by the dendritic neuron model (DNM), so as to get optimal parameter settings. is the popular size used to increase the diversity of initial chromosomes. expresses the maximum velocity for the refined products’ distribution service. and indicate the crossover and mutation probabilities, respectively. indicates chromosomes as the initial particles in the population. expresses the number of chromosomes for exchanges between GA and PSO. denotes the number of iterations used for replacement between GA and PSO. represents the numbers of PDs for the refined products’ distribution service. is the penalty weight for each unqualified chromosome or particle. is the maximum number of iterations. is the initial inertia weight, and is the inertia weight of the maximum evolution generation. Moreover, is the coefficient used for PSO speed calculation.

In this study, a working period can be divided into several time periods. GA-PSO algorithm is used to assign PSs to corresponding PDs and compute the total cost in one time period. In particular, relevant government departments or core enterprises in the industry are encouraged to offer an initial subsidy, set at 7% of the cost, to those who wish to join the cooperation. Table 6 shows the optimization results, where represents the cost difference before and after optimization. Table 6 shows the summary of the best PS allocation in the entire cooperation.

Table 3 shows the initial PS allocation, while Table 7 shows the cooperative network. Clearly, the number of assigned PSs varies from a PD to another. Although several depots observe an increase, others are allocated fewer stations. For example, PD4’s PSs decreased from 12 to 8, whereas PD5’s PSs increased from 14 to 19. As a consequence, the total time travelled by each PD’s trucks decreases, and savings increase. The sharing of distribution capacities and trucks allows PDs to reduce their number of distribution trucks.

For comparison purposes, we implement and test the proposed GA-PSO hybrid algorithm, PSO, HGA (hybrid genetic algorithm), and TS-ACO (tabu search and ant colony optimization hybrid algorithm) with same data. PSO simulates the social behavior of bird flocking and fish schooling [34]. HGA incorporates the elitism genetic operator and some improvement heuristic methods [46]. TS-ACO merges the ideas of ACO into a TS algorithm to achieve a balance between the diversification and the intensification [47]. Each algorithm is executed 20 times, and the best solution is selected as the optimal cost for convergence. The optimal total cost and the computational times can be used to measure the effectiveness of four algorithms shown in Table 8.

Compared with other three algorithms, the average optimization cost from our proposed algorithm is of better quality. In terms of computing time, in addition to PSO, the average computational time of our proposed algorithm also has significant advantages. Thus, we can conclude that GA-PSO has the following merits: (1) exchanging best-fit chromosomes and worst-fit particles between the GA and the PSO algorithm enhances the GA-PSO’s capability to obtain better solutions. For example, the GA-PSO algorithm has a higher probability to obtain the best solution. (2) The proposed algorithm combines GA and PSO algorithm’s local and global search capabilities. As illustrated, the average cost from the proposed hybrid algorithm is lower than that of PSO, HGA, and TS-ACO algorithms by 4.8%, 3.4%, and 2.4%, respectively. (3) In the 20 runs, PSO is computed faster than the other three algorithms. Although GA-PSO requires slightly more computation time, the low average cost still makes this algorithm desirable in a practical setting where a slightly longer computation time can be tolerated (e.g., in operation management). Therefore, we conclude that the proposed GA-PSO hybrid algorithm is superior to the three existing algorithms when optimizing multidepot refined products’ distribution networks.

Comparison between the cost changes and the use of distribution trucks before and after the cooperation enables the intuitive analysis of the benefits of the collaborative PS replenishment network. Figures 5 and 6 show the cost reduction and the variations before and after the formation of the cooperation. Compared with the previous numbers, operating costs significantly decreased after the formation of cooperation among PDs. Cost savings also increased with the high number of PDs joining the collaborative petrol distribution network. Whether for a single or multiple PDs, the number of required petrol distribution trucks decreased due to the cooperation and resource sharing among PDs. Table 9 illustrates the optimization of distribution trucks.

In the CMPSRPMT, TS is an important mechanism. A PD joining the cooperation can coordinate the sharing of petrol distribution trucks. Before cooperation, the PSs’ demands and the required service TWS for each PD must be considered. Then, the largest numbers of delivery trucks are needed to certify customer satisfaction. After cooperation, a chronological and overall time window plan can be made for all PDs. Trucks can be shared and reasonably arranged, which greatly decreases the required number of total petrol distribution trucks in the network. For example, Table 8 shows that truck usage decreases from to for {PD1 PD3 PD5}, which significantly reduced .

5.3. Model Comparison

Based on the above parameter settings, the cost and the number of distribution trucks of integrated MILP are calculated. Then, we compared them with the model calculation results adopted in this study. The comparison is shown in Table 10. According to the table, significant decline in the total cost and the number of distribution trucks can be observed by the model of this study. Because the model adopted in this study not only includes the MDVRP but also readjusts the customer groups served by each PD through the cooperation among PDs with the feedback mechanism, better results are obtained.

5.4. Stability of the Coalition

Many game theory-based approaches to allocating profits are available. In this study, minimum cost-remaining savings (MCRS), GQP, the improved Shapley value model, and the equal profit method (EPM) are adopted to conduct a comparative analysis that tests the stability of alliance [44]. We explore the grand coalition scenario where five PDs form a single coalition and evaluate the profit allocated to each of the five PDs (called “profit allocation scheme” throughout the paper) under the four game theory models. The core of the most stable profit allocation scheme can be calculated using formula 25. Based on our network where five petrol depots constitute an alliance, the core area forms a closed polygon. Each surface of the core area is compressed and narrowed into a single point, which is the core center. represents a member, and is a variable for controlling its range. The center can be expressed as follows:

According to the snowball theory, the proximity of a profit distribution scheme to the core center reflects the stability of the alliance with the given profit distribution scheme [48]. In particular, a profit allocation scheme that is near the core center indicates a robust coalition strategy. Following formula (25), the core center is calculated as (333, 281, 368, 206, 298). The profit allocation schemes under the four game theory models and their distances to the core center are summarized in Table 11.

Table 11 shows the distance between profit allocation schemes and the core center, with the most stable scheme at the center and the other schemes at the periphery. The profit allocation scheme calculated by the Shapley model is nearest to the core center and thus is superior to the other three models. The comparison result confirms that the Shapley model outperforms the other methods in terms of stability. The profit allocation schemes under the MCRS and the EPM are unstable given their long distance from the core center. Therefore, among the four profit allocation schemes, the profit allocation scheme obtained from the Shapley model will result in the highest probability of participants agreeing to cooperate.

5.5. Shapley Value Model Application and Coalition Sequence Selection

Upon the CMPSRPMT optimization, the benefits and cost savings should be reasonably allocated among each depot to ensure the long-term group stability. Then, the Shapley value model and the SMP method introduced above are applied to determine the benefit allocation and order analysis of membership.

5.5.1. Shapley Value Model Application

According to formula (23), Table 12 shows the cost saving allocation of each PD analyzed under different collaborations.

Table 12 shows the cost savings of each alliance and the profit allocation of each PD as computed using the Shapley value model. In the final grand alliance, PD1, PD2, PD3, PD4, and PD5 gain cost savings amounting to $327.10, $270.79, $376.82, $199.55, and $301.42, respectively. However, forming a grand alliance is difficult, requiring the communication and coordination of each participant and the fair and reasonable allocation of cost savings and benefits.

5.5.2. Sequential Alliance Selection

The analysis of cooperation sequences is extremely important for the benefit allocation strategy and the participants’ willingness to join. In other words, the order in which the participants join the cooperation affects the allocation of benefits and the satisfaction of SMP principles. Figure 7 shows the proportions of cost reduction during the formation of the CMPSRPMT computed according to formula (24).

Formula (23) is used to calculate the cost saving percentages and changes of different membership orders. The optimal sequence with PD1, PD2, PD3, PD4, and PD5 is determined. Table 13 shows the first member.

In the five sequences, the cost reduction percentage for each PD reflects the results of taking different PDs as first participants. The optimal cooperation sequence of coalition is , determined from the five sequences using the method analyzed in Section 4.2.2. Table 14 shows the specific cost reduction percentages, and Figure 8 shows their changes.

5.6. Implication

The design of collaborative PS replenishment networks on the basis of regional partitioning and rational resource sharing propels the sustainable development of refined products’ distribution and of the entire energy supply system. The CMPSRPMT optimization presents a reasonable division of refined products’ distribution service zones and reduces long-distance and cross-transportation phenomena. Thus, costs are minimized, and additional benefits are offered to each PD. Effective TWA coordination and shared strategies among trucks and PSs are important characteristics of the proposed network. These improvements significantly contribute to energy and social resources and cost savings for PD operators and transportation administrations.

Cooperation among logistics facilities recently plays an important role in the optimization of distribution processes. Incorporating further transportation resource sharing can allow for more cost savings. In addition, traffic administration policies that encourage joint distribution are also a sign of political will to achieve the sustainable development of administrated areas. As one of the main development factors, refined products’ distribution activities can be further organized with the coordination of TWAs and reduce the number of petrol distribution trucks. Therefore, encouraging the formation of a grand coalition is a relevant approach that can benefit not only PDs but the entire society.

6. Conclusions

This study presents an effective method to solve the optimization of the CMPSRPMT, which improves the cooperation level of PDs and the operational optimization of refined products’ distribution. The distribution and resource sharing are optimized through the regional petrol joint distribution formed by the cooperation among PDs. In the optimization process, we consider the mechanisms of PS clustering, time window coordination, and truck coutilization. The comparison of data before and after the cooperation shows that the total number of distribution trucks and the total operating costs are significantly reduced.

The optimization model considers customer clustering, TWA, and multicompartment distribution TS. As a result, the overall transportation distance and the number of trucks used are reduced. A regional petrol distribution network in Chongqing city in China is taken as an example to evaluate the effect of the proposed model and method in practice. We propose a hybrid heuristic approach based on GA and PSO and use the Shapley value method to allocate the benefits of PD in the cooperation. The optimal sequence of the PD joining the cooperation is found according to the SMP principle.

In summary, the study of petrol cooperative optimization is consistent with the reality. The proposed optimization method is better than that of existing research in this field. Based on the analysis, a summary of the conclusions is provided as follows. (1) Regional petrol distribution network through customer clustering, coordination of TWA, and sharing of distribution trucks can considerably shorten the distribution distance, reduce the required number of distribution trucks, and reduce the total operational costs of the network petrol distribution. (2) Optimizing the PSs served by each depot and using each other’s delivery trucks when the time window allows can highly relieve the traffic stress of refined products’ supply in urban areas, reduce the negative effect of the transport energy supply system, and contribute to the sustainable development of urban transportation. (3) The benefit allocation of each PD and the determination of the optimal entry sequence ensure the formation of a more cooperative distribution network and strengthen the group stability.

The results of this study point toward interesting research directions in the future. The following points can be considered. (1) This study only considers the cooperation between the PD and the PS in the process of distribution operation. Thus, the cooperation can be extended to a wider range of the transport energy supply chain. (2) This study assumes that the transport speed of petrol distribution trucks is constant, which is consistent with most of the existing joint distribution literature. Future research can consider real-time urban traffic speed analysis to achieve more realistic results. (3) In the future, the dynamic CMPSRPMT can be built by considering the product inventory of PDs and PSs.

Data Availability

The transport time, demand quantity, and service time window data used to support the findings of this study are available from the corresponding companies and administrative departments.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by the National Nature Science Foundation of China (Grant nos. 71471024 and 71871035), the Humanity and Social Science Foundation of Ministry of Education of China (Grant nos. 18YJC630189 and 17YJA630079), and Social Science Planning Project of Chongqing of China (Grant no. 2019YBGL049).