Abstract

This paper presents a closed-loop location-inventory-routing problem model considering both quality defect returns and nondefect returns in e-commerce supply chain system. The objective is to minimize the total cost produced in both forward and reverse logistics networks. We propose a combined optimization algorithm named hybrid ant colony optimization algorithm (HACO) to address this model that is an NP-hard problem. Our experimental results show that the proposed HACO is considerably efficient and effective in solving this model.

1. Introduction

According to eMarketer, worldwide business-to-consumer (B2C) e-commerce sales reached $1.471 trillion in 2014, increasing by nearly 20% over 2013 [1]. Customers have grown accustomed to return unwanted products back to the store for any reasons. It is reported that the proportion of customer returns online range from 18% to 74% of original orders under e-commerce environment [2, 3]; thus enterprises take various measures to prevent the appearance of quality defects. However, quality defect is inevitable. So, it is necessary for us to take into account both quality defect returns and nondefect returns; we call it mixed quality defect returns (MQDR), when considering the closed-loop supply chain as a support system in e-commerce environment.

As a classic discrete dynamics problem, the customer service level is determined by three important decisions: facility location decision, inventory decision, and transportation decision [4]. Obviously, facility location, inventory control, and transportation optimization are highly related. For example, delivery in small lots and high frequency leads to reducing the in-inventory cost but increases the additional transportation cost. In addition, facility location decision needs to consider inventory decision and distribution decision. Perl and Sirisoponsilp [5] discuss the interdependence between the three key elements. Ballou and Masters [6] provide a schematic representation of the interrelationships among facility location, inventory control, and transportation optimization.

In the literature, many papers studied the integration and coordination of any two of the above three decisions: location-inventory problem (LIP), location-routing problem (LRP), and inventory-routing problem (IRP). For reviews on LIP, readers can refer to Erlebacher and Meller [7], Daskin et al. [8, 9], and Liao et al. [10]. For LRP, please refer to Balakrishnan et al. [11], Min et al. [12], and Nagy and Salhi [13]. Refer to Chan et al. [14], Kleywegt et al. [15], and Adelman [16] for IRP.

There are few researches about the integration optimization of location-inventory-routing problem (LIRP). Some researchers attempt to carry out research on LIRP [17]. Liu and Lee [18] firstly studied this interesting problem; they proposed a two-phase heuristic method to solve the multidepot location-routing problem (MDLRP) considering inventory optimization. In order to avoid the local optimal solution, Liu and Lin [19] designed a global optimizing heuristic method to find the solutions for LIRP. Shen and Qi [20] presented an algorithm based on Lagrangian relaxation to minimize the inventory and routing costs in strategic location models. They focused on the layout phase and used continuous approximation to get the approximate optimal routing cost, but the vehicle routing was not optimized in their models. Javid and Azad [21] presented a novel LIRP model and proposed heuristic method containing two stages: constructive stage and improvement stage. Ahmadi-Javid and Seddighi [22] presented a mixed-integer programming model and a three-phase heuristic to solve the LIRP with multisource distribution logistics network. Guerrero et al. [23] researched the LIRP with deterministic demand and provided the hybrid algorithm to solve the problem. Zhang et al. [24] proposed a hybrid metaheuristic solution to LIRP considering multiple depots and geographically dispersed customers. Nekooghadirli et al. [25] presented a novel biobjective model of LIRP model considering a multiperiod and multiproduct system. Based on Lagrangian relaxation and a column generation technique, Guerrero et al. [26] developed a relax-and-price heuristic to solve ILRP; they proposed two dependent constraint sets with an exponential nature: Lagrangian relaxation and a column generation technique.

However, little research has been conducted on the LIRP considering returns. Li et al. [27] presented the HGSAA algorithm to solve a LIRP model considering returns under e-supply chain environment. To be more consistent with reality, Liu et al. [28] introduced a stochastic demand into LIRP considering returns in e-commerce and proposed a PPGASA algorithm as the solving approach.

The above two researches mainly focus on the returns without quality defect but did not consider the MQDR. In this paper, we propose a model of closed-loop LIRP with MQDR. To the best of our knowledge, it is the first time to introduce the MQDR into LIRP in e-commerce. An effective hybrid algorithm named hybrid ant colony optimization (HACO) is provided to solve this model. Results of numerical instances indicate that HACO outperforms ant colony optimization (ACO) on optimal solution, iterations, and computing stability.

The remainder of this paper is organized as follows. Section 2 presents the mathematical model of LIRP with MQDR. Section 3 proposes the solution approach named HACO. Section 4 analyses the parameters of HACO and shows the results of different experiments. Section 5 gives the conclusion and future research directions.

2. Model Formulation

As we all know, customers’ return in e-commerce is higher than traditional commerce. Because of personal dissatisfaction, or a mistaken purchase of the wrong product, some of the returns are without quality defects. These returns can reenter into the market after a simple repackaging process without being recovered [29]. While the other returns result from quality defects, which need to be sent back to the plant and be recovered.

In order to meet the needs of MQDR, the merchandise center (MC) is necessary to deliver normal merchandises to the demand points (DPs) of downstream and collect the returned merchandises. MC integrates the functions of distribution center and recycling center and provides quality inspection and repackaging services. Meanwhile the returned merchandises are collected to MCs. Returned merchandises without quality defects become resalable normal items after repackaging treatment at MCs. The plant will recover the returns with quality defects and bring them to the market again.

The operation mode of the system is shown in Figure 1. The closed-loop supply chain in this paper consists of one plant, multiple MCs, multiple DPs, and a single type of product with continuous inventory policy under the e-commerce environment.

The goal of this study is to decide the quantity and location of MCs and arrange the vehicle routes and determine the ordering times on each route. To minimize the total cost of logistics operations, this problem involves the following three decisions: () location decisions: obtain the optimal number of MCs and their locations; () inventory management: determine the ordering times on each route; () routing optimization: arrange the vehicles to delivery merchandises and collect returns.

To benefit from the risk of MQDR, we take assumptions ()–() from Li et al. [27] into consideration: since the single-product system is researched in this paper, assumption () is necessary; in the capacitated vehicle routing problem, assumption () should be satisfied [30]; assumption () eliminates the indeterminacy from the different type of vehicle; assumption () means that each DP is well served by the only vehicle route [31]; assumption () ensures that each route will return to the same MC after traversing; assumption () follows the early published papers considering uncapacitated MCs [32]; assumption () takes MCs as the distribution center and recycling center; assumption () is a simplification of the reality [33].

The returned merchandises without quality defect are processed and repackaged at MCs, while others will be shipped back to the plant for reprocessing after a predetermined quantity at the MCs. Assume that the demand at each retailer is known and let be the set of candidate MCs. Let be the set of DPs and let be the number of DPs. Let be set of vehicles from the MCs to DPs. Let . Let the following notation denote the decisions of the firm:: ordering times of on routing .: =1, if node is served by on routing from node , or 0 otherwise.: =1, if node is assigned to on routing , or 0 otherwise.: =1, if is selected as an MC location, or 0 otherwise.: auxiliary variable avoiding the subtour constraints in route .

According to the aforementioned assumptions, the inventory levels depend on both demand and the quantity of MQDR. So, during each replenishment cycle, the holding cost of MCs is , where is annual inventory holding per unit merchandises, is mean (daily) demand for , and and are quantity of merchandises without and with quality defect returned by per day.

In order to exactly describe the logistic distribution costs. Let be the transportation costs per unit product from plan to . Let be the delivering cost per unit distance. Let be the distance from node to node . And let be the working days per year. The total transportation costs from plant to DPs through MCs can be expressed, respectively, as and .

So the cost of forward distribution iswhere denote the fixed cost of dispatching vehicles per time at , denote the fixed (annual) administrative and construction cost of , and denote the ordering cost per unit product from plant to .

We let to be the returning cost per unit of merchandise from DPs to MCs, so the total reverse transportation costs from DPs back to MCs are

The cost of deal with mixed quality defects is

We adopt as the inspecting cost per unit for the returned product and as the repackaging cost of unit returned merchandise without quality problem at MCs.

In summary, the model is formulated as follows:

It is easy to find that the objective function (4) is convex in . We can simplify the objective function by solving . Consequently, the optimization solution of can be obtained by taking the derivative of the function with respect to ; the result is as follows:

The optimization problem (4) given a known can now be written as

The objective function (6) is to minimize the total cost; (7) ensure the selected MC is not empty; (8) ensure each DP is traversed by a unique vehicle which belongs to a certain MC; (9) ensure the amount of each delivery from MC on each route must be within vehicle capacity; (10) ensure that each route has only one vehicle; (11) ensure each DP must be followed by exactly one note; (12) ensure every DP node of the system will be serviced before it gives services to the others; (13) ensure the subtour is eliminated. Equation (14) ensures that each DP is assigned to an MC when there exists a route that starts from the MC passing through the DP. Equations (15)–(17) ensure the nonnegativity and integrality of decision variables.

3. Solution Approach

Like the VRP, the closed-loop LIRP is also an NP-hard problem, since it includes the VRP and is more complex than VRP. Generally speaking, there does not exist a complete, efficient, and accurate analytic algorithm to address NP-hard problems; ant colony optimization (ACO) has been proved very successful and widely applied to solve the static and dynamic problems as an EC algorithm [34]. However, ACO does not distinguish ant behavior results. The pheromone concentration will distribute in every direction for the iteration. Therefore, this leads to low searching efficiency. The algorithm may get caught in local optimization if we do not take preventive measures. On the other hand, ABC provides an effective institution to find the global optimal solution from the trapping of local optimal solution [35]. So, in this study, we present a hybrid ant colony optimization algorithm based on the combination of ACO and ABC to solve the above LIRP model.

3.1. Initialize Solution

Since the natural number is an efficient coding method for these problems, the sequence of solutions is composed of candidate MCs () and DPs, which are indicated by (). The candidate solution of our proposed model will be described well by those natural number sequences. As an example, Figure 1 fully interpreted the perceptions of our method, which refers to the individual feasible solution: .

In the HACO, the moving strategy of the ant in node is depending on the pseudorandom proportional rule. The rule indicates that the ant has both exploiting and exploration ability, which means the ant is guided by the pheromone trails as well as the heuristic information. In this case, the ant has a higher degree of exploring unknown knowledge. The connected function of the pheromone values and the heuristic values is shown aswhere is the density of pheromone remaining on the edge , is the inverse value of distance between node and node , and are user-defined parameters for corresponding pheromone concentration and heuristic information, and is the remaining nodes to be visited by ant .

3.2. The ABC Phase

In order to improve the performance of global searching of our algorithm, the paper applied the scout bee searching phase into the ACO. Scout bees are free bees used for finding a new better solution from the neighbor known solution. As soon as a scout bee finds a new solution, she turns into an employed bee. If there is no improvement in the quality of solution, the bee will abandon that source and continue to search for another new solution.

The searching function of scout bees is as

To meet the requirements for coding sequence type, we described two operations to complete scouts searching process, namely, random array reverse (RAR) and random swap (RS).

Step 1. Set the initial number of scout bee and probability .

Step 2. Generate two positions randomly named and , for each .

Step 3. Get a random probability ; if turn to Step ; otherwise, turn to Step .

Step 4. Reverse the array between positions and as a new solution.

Step 5. Swap the position of and as a new solution.

Step 6. Calculate the cost of new solution.

Step 7. Keep the best solution to the next iteration and return to Step .

The pseudocodes of ABC are shown in Pseudocode 1.

Procedure: ABC
Input: the initial sequence, the number of scout bee and
Output: the better sequence
Begin
Take and
while
  
  take
 if
   Reverse the array between position and
 else
   Swap the position of and
Output: the better sequence
End
3.3. Global Pheromone Trail Update

The global pheromone updating rule is triggered at the end of iteration to reward tours that are in line with the objective of impedance minimization. This strategy is applied to reinforce the pheromone density on the sets of edges belonging to the inspect tour and to increase the likelihood that this tour will also be selected by other ant agents. The rule of global pheromone updating is given bywhere

is a constant initial pheromone. is the cost of the best of all the tours produced by all agents from the beginning of the iteration. is the pheromone evaporation coefficient. To improve the pheromone trail quality, a part of the worst result is removed.

3.4. Local Pheromone Trail Update

In addition to the global pheromone trail updating rule, the selected ants will update the local pheromone trail in the process of passing an . It is opposite to the normal pheromone trail updating rule that increases the pheromone density while ants cross over arcs. The purpose of using the local pheromone trail update rule is to prevent stagnation behavior because the arc becomes less desirable for the following ants. The rule of local pheromone updating is as follows:where is a constant at the beginning of pheromone trails and is a user-defined coefficient that lowers the pheromone density of arcs traversed by the intelligent ants.

3.5. Algorithm Flow

Step 1. Get the formulas for solving .

Step 2. Set the initial parameters for the model: set of candidate MCs , set of DPs , set of vehicles , inspecting cost , ordering cost , transportation costs , daily demand , dispatching vehicles cost , fixed (annual) administrative and construction cost , vehicle capacity , holding cost , and returning cost .

Step 3. Parameter setting for HACO is as follows: ants number , evolution terminate iteration , pheromone concentration impact factor , heuristic information pheromones impact factor , evaporation rate of the pheromone , constant initial pheromone , and mutation probability array reverse .

Step 4. Using unit matrix , calculate the probability

Step 5. Ant solutions generation module: each ant will generate a feasible solution after traversing the DPs.

Step 6. Best ant solution module: after calculating each ant’s solution, select the best solution which is known as the iteration best to compare with the global best. Keep the next best solution as the next global best.

Step 7. Scout bee module: random selection probability ; if , turn to random array reverse operation. Otherwise, turn to random swap operation.

Step 8. Pheromone updating module: update the information pheromones as follows:

Step 9. Termination module: if the parent optimal solution and offspring optimal solution are equal during continuous generations, stop the algorithm. Otherwise, return to Step after increments.

Step 10. Output.

The pseudocodes of HACO are shown in Pseudocode 2.

Procedure: HACO for LIRP
Input: coordinates of nodes, demands and returns of DPs, MC parameters, vehicle capacity, HACO parameters
Output: the best solution (include routes, MCs locations, order times and order size)
Begin
Take
while
  for 1 to
     Foraging Behavior of Ants
     
    Calculate individual total cost Tcost()
  end
    Tcost_best =
  for 1 to Neighbor range
   Scout bee searching the neighbor range
   
   
   make
   if
    ;
   else
    exchange()
   end
    if Tcost() < Tcost_best
    Tcost_best = Tcost()
    end
  end
    for 1 to
     if Tcost() > Tcost_ave
      for 1 to
     
      end
     end
    end
end
Output: the best solution
End

4. Computational Experiments and Results Analysis

In this section, numerical simulations are given to illustrate the performance of HACO compared with the traditional ACO. Both algorithms in this paper are compiled by Matlab R2014a and run on a computer with 8 GB main memory and 3.6 GHZ CPU. All instances come from the LRP database in University of Aveiro [36].

4.1. Parameters Discussion

Parameter values selection is crucial to the efficiency of algorithms. An example named Gaskell 67-22 × 5 from the database, which contains the nodes coordinate and the DPs demand, is used to determine the optimal parameter. Gaskell 67 is the instance’s name and 22 × 5 means 5 candidate MCs for 22 DPs. The inventory holding cost , the vehicle capacity , transportation costs , returning cost , working days , and the delivering cost per unit distance . The other parameters of the instance are as follows: ; ; ; ; ; and .

The parameters of algorithm are initialized as follows: ant’s number , evolution terminate iteration , pheromone concentration impact factor , heuristic information pheromones impact factor , evaporation rate of the pheromone , constant , mutation probability array reverse .

We run the program 50 times on the same computer. The performance of ACO and HACO varies with the different values of the parameters, which are shown in Tables 16. In these tables, the symbol C.V. means the coefficient of variation.

Tables 16 represent the parameters’ effect on the objective function values. The data was normalized through two dimensions, that is, cost and iterations, and three indicators, that is, mean, standard deviation, and coefficient of variation. Actually, in order to find the minimal cost, we usually take the parameter values, where the cost is lower and more stable. From the discussion, we found that HACO reaches the best performance when , , , , , and , while ACO reaches the best performance when , , , , and .

4.2. Computational Experiment

To get a reliable conclusion, we run another 50 times on the same computer with the best parameter values in Gaskell 67-22 × 5. One of the best solutions of objective function in the 50 experiments of HACO is 30.2 million CNY. Table 7 shows the solution. MCs were established at MC1, MC2, and MC5 with five vehicles distribution routes. Figure 2 shows topological structure of the closed-loop supply chain.

Figure 3 shows their trend of optimal objective function values along with iterations. The fluctuation curves of optimal objective function value are varied by different algorithm, which are shown in Figures 4(a) and 4(b).

As shown in Figure 3, the cost and iterations of HACO are lower than ACO; and in Figure 4, the range and mean value of the minimum cost of HACO are also lower than ACO, which both imply that HACO is more efficient than ACO in solving the LIRP.

4.3. Extended Experiments

In this section, a series of instances are given to show that HACO is more efficient and stable than classical software and ACO. In order to ensure the demands of DPs are not more than the vehicle capacity, we need to enumerate some instances. In this paper, the daily demands are set as 1/10 of corresponding demands of the database.

As we know, Lingo is a representative classical optimization software tool. Thus we used Lingo 11.0 to solve the problem by using a small-sized instance named Peal 183-12 × 2 and two medium-sized instance named Gaskell 67-22 × 5 and Gaskell 67-36 × 5; the results are shown in Table 8.

Each instance was run 50 times by HACO and ACO with their optimized parameters values, respectively; the results are shown in Tables 9 and 10.

4.4. Result Analysis

According to Table 8, we found that () for the small-sized instance, HACO can obtain better result than Lingo within less time and () for medium-sized instances, Lingo cannot get the global optimization within 1 hour, while HACO can solve the problem in a short time.

Observe, from Tables 9 and 10, that HACO is more efficient than ACO for the following reasons. () The cost of HACO is significantly lower than ACO (); () the difference of the number of iteration between HACO and ACO is not significant (); () HACO is more stable than ACO as the coefficient of variation (C.V.) is lower. To sum up, our algorithm reduces the cost with the same number of iterations compared with ACO.

By improving pheromone updates and bee colony searching, we improve the solution quality of the algorithm and make it useful as a guide for the ant searching process. Observed from the results of numerical simulations, HACO can get better result with a fewer number of iterations. Hence, comparing with ACO, HACO is adopted as a better approach in solving this LIRP with MQDR.

5. Conclusion and Future Research

With the development of e-commerce, customers’ return keeps a high rate with MQDR, which can be reentered into markets after being repackaged or recovered. In this research, we built a closed-loop LIRP model considering both quality defect returns and nondefect returns; we call it MQDR in this paper. We perform an extensive computational study and observe the following interesting results.(1)Considering MQDR are computationally beneficial for the formulation presented, the MQDR and closed-loop pattern with returns are features of the proposed problem in e-commerce, which is never considered in previous work.(2)Since the evolutionary computation algorithm has been proved successfully in tackling NP-hard problem, a hybrid algorithm is proposed by combining ACO algorithm and ABC algorithm to solve the LIRP. HACO integrated the scout bee searching phase into the ACO to improve the global searching ability.(3)The performance of HACO is evaluated by using the instances in the LRP database, and HACO outperforms ACO on convergence, optimal solution, and computing stability. This numerical study shows the efficiency and effectiveness of the solution method.

However, developing other elements for the LIRP will lead to further research directions. And analyzing the model under the dynamic demand of customs and a time-varying demand can be a valuable subject. The design of experiments and verification by discrete dynamics simulation should be established. Fruit fly optimization algorithm (FOA) as one of the best EC algorithms has attracted the attention of various researchers [37]. It is important to apply these models and algorithms to the operation and management of enterprises to improve the decision-making efficiency of e-commerce logistics system.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (nos. 71171093,71471073, and 71571082) and self-determined research funds of CCNU from the colleges’ basic research and operation of MOE (nos. CCNU14Z02016 and CCNU15A02046).