Abstract

Due to the uncertainty and complexity of multilinks and multifactors in urban express logistics system, it is very difficult to analyze the risk factors and the correlation among them for urban public security. In this paper, a method combining domain knowledge and data learning is proposed to construct Bayesian network, which can effectively deal with this problem. Based on the literature review and the investigation of transportation companies, this paper summarizes the risk factors to public safety caused by pick up, warehouse storage, transport, and the end distribution in the process of urban express logistics, which are divided into 5 dimensions: management, weather, human, transportation tools and facilities, and goods, including 11 risk factors. In this paper, Interpretative Structural Model is used to construct the initial hierarchical model to describe the complex relationship between factors, and then causal mapping method is used to improve the initial model to transform the structure into the final Bayesian network model. Finally, the sensitivity of one node to other nodes is analyzed based on the incident data. The results show that Bayesian network is effective in improving urban express logistics operation ability and avoiding public safety risks and has a strong generalization ability, which is simple and easy in practical application.

1. Introduction

Taniguchi et al. [1] defined urban logistics as “in the market economy, considering the urban traffic environment, traffic congestion, and energy consumption, while private enterprises to achieve the overall optimal logistics and transport activities process.” Based on the national conditions of China, this paper gives the definition of urban logistics: urban logistics is an activity that takes improving the competitiveness of a city as the core, realizes the optimization of urban logistics and transportation activities through the application of advanced information technology, and tries to reduce the negative impact of logistics activities on urban traffic congestion, traffic environment, and energy consumption. The operation process of urban express logistics involves many links, such as pick up, warehouse storage, transport, and the end distribution, during which the business status is diverse and the risk factors are numerous, which can easily threaten the public security of the city. The risk of urban public security refers to the force majeure and the possibility of objective existence that threaten the basic values, norms, and interests of urban public domain [2]. The control of urban public security risks should prevent risks and potential harm caused by public security incidents from the source of risk factors.

The risks of urban logistics to public security are mainly manifested as vehicle collision and explosion, warehouse fire and explosion, contraband transportation, burglary, and assault by courier. Frequent in recent years, the Chinese express logistics accidents caused a great deal of personnel and property losses. However, there is no unified system of the research on the risk factors affecting the public security of urban express logistics. At present, most of the research studies focus on the single risk of a certain logistics link, such as the hidden danger in the process of transportation [3, 4] and fire risk in logistics storage [5] or the internal risk analysis of logistics enterprises, such as the risk cost of logistics enterprises [6], logistics outsourcing risk [7], and project logistics risk [8]. There is no systematic study on the risks and hidden dangers of the whole logistics link to public security. The analysis of risk factors is the primary part of accident prevention because it can provide operational information for logistics-related enterprises and management departments. They can focus on the potential risks and take preventive measures.

At present, risk-based research is generally concentrated in the fields of automobile collision, coal mine, transportation of dangerous goods, and aviation. Commonly used risk research methods include BP neural network [9], fuzzy comprehensive evaluation method [10], analytic hierarchy process (AHP) [11, 12], and Bayesian network (BN) [13, 14], focusing on the evaluation of risk, but lack of risk identification and control research. Other traditional risk research methods are fault tree analysis (FTA) [15], probability theory [16], Swiss cheese model, human factor analysis, and classification system (HFACS) [17]. However, the various influencing factors in these methods are independent. In reality, these factors are often related to each other.

Based on the above problems, this paper adopts the method of literature review and the survey of transportation companies to divide the factors that affect the risk of urban logistics into five dimensions: management, weather, human, transportation tools and facilities, and goods, including 11 influencing factors. The impact factors of urban logistics on public safety risk are analyzed by using Interpretative Structural Model (ISM), and the initial network model is established. Then, the initial model is transformed into BN model by using causal mapping method. The sensitivity of specific nodes to other nodes is obtained, and the relationship between risk factors of urban logistics is identified. It helps managers to enhance the scientific nature of risk management decisions, provides a new basis for improving the operational stability of urban logistics, helps to reduce the occurrence of urban public security incidents, and alleviates the potential loss of risks to society, economy, and environment.

The rest of this paper is organized as follows: Section 2 presents the historical research on risk identification and ISM. Section 3 introduces the basic theory including BN and ISM. Section 4 constructs a Bayesian network model and analyzes the sensitivity of each node. Section 5 discusses the advantages and future development of this study. Section 6 provides our conclusions.

2. Literature Review

Most of the previous research studies that analyzed the factors responsible for urban logistics accidents used a statistical method. The researchers primarily limited to the collection, analysis, and interpretation of data from accident reports or accident databases [13]. Ren [5] analyzed in a large number of fire accidents of logistics warehouses reveal four factors that affect the fire risk: warehouse building, commodity, management, and environment. Zhao et al. [13] collected 94 cases of dangerous goods transportation accidents and adopted the expectation maximization algorithm to derive three main risk factors affecting hazardous materials transportation: human factors, transportation vehicles and facilities, and packaging and loading of dangerous goods. Chen and Liu [18] analyzed the data of logistics road transportation accidents and concluded that the main risk factors include personnel factors, vehicle factors, and road factors. They used the Delphi method to make qualitative analysis of the potential accident, effectively helping the security management personnel of logistics enterprises to recognize the causes and potential problems of truck accidents and take precautions in advance.

Although statistical methods can be used to obtain the accident influencing factors, it cannot explain the connection between the different factors and the important role of key factors in accidents. In recent years, BN has been widely used in economic analysis [19], biological genetic [20], medical diagnosis [21], mechanical engineering [22], civil engineering [23], transportation [24], computer science [25], mining accidents [26], and other fields. BN is also applicable to the field of logistics. Aiming at the limitation that complex logistics service supply chain system is difficult to carry out reliability analysis in the face of a lot of uncertain fuzzy information, Cai and Liu [27] proposed a reliability analysis method of polymorphic system combining Bayesian network and fuzzy set theory. The reliability analysis efficiency of logistics system is improved and theory and data support for logistics enterprises are provided to improve the weak links. Yan and Suo [28] identified and classified the risks of logistics financial business, took the failure of enterprises as the root node, constructed a Bayesian network to measure the risk level of enterprises, and found the most critical risk factors by calculating several important indicators. Zhu and Yang [29] constructed the early warning index system of agricultural products logistics by using Bayesian network, designed the early warning model, and realized the early warning of agricultural products cold chain logistics.

As far as we know, there are few research papers on the application of BN in logistics risk factor analysis, such as Li et al. [30] based on factor analysis to determine the effect of emergency logistics risk factors. The factors are divided into human factor, equipment factor, materials factor, environment factor, and supervision factor. And the Bayesian network of emergency logistics risk is established. The prior probability and the posterior probability can be used to find the weak links of logistics and realize the rapid positioning of logistics risk. Huang and Qian [31] analyzed the main faults in the process of cold chain transportation using Bayesian network, summarized the influence of different factors on the faults, and provided an effective method for improving the fault analysis and prevention in the process of cold chain transportation.

However, these literature studies only apply BN model to logistics risk and do not discuss how to establish BN model of logistics for public security. This paper proposes a method to construct BN by means of ISM and causal mapping method and conducts sensitivity analysis on risk factors of urban logistics for public security. The relationship among the factors and the important influencing factors are obtained, and the corresponding methods for accident prevention are put forward.

3. Methodology

3.1. Bayesian Network

Bayesian network is a network combining probabilistic inference and graph. It encodes the probabilistic relationship between the variables of interest. BN can provide a rational and coherent theory under various conditions of uncertainties (e.g., uncertainty in parameters and models or uncertainty in domain knowledge) and complexity that are described by subjective belief or probability [32]. BN has several advantages for data analysis:(1)It is easy to handle the absence of certain data items.(2)It can be modeled in terms of causality, so it can be used to gain an understanding of the problem domain and predict the outcome of the intervention.(3)Since the model has both causal and probabilistic semantics, it is an ideal representation of combining prior knowledge and data (usually in the form of a causal relationship). BN is particularly useful for modeling uncertainty.

Based on the advantages of BN and its wide application in the field of risk assessment, this paper proposes a hybrid method combining domain knowledge and data learning to construct a BN and realize the integration of multiple factors and quantification of uncertainty in the network model to assess the public security risks of urban logistics.

BN is represented by a series of variable nodes and a directed arc representing causality. The conditional probability table (CPTs) is used to determine the quantitative relationship between variables. BN can be represented as , where represents the network, represents the graph, and represents the joint probability of the network. is a graph, which can be further represented by , where represents the set of nodes and represents the directed edge of causal relationship between variables. In addition, the calculation method of joint probability equation on variables is shown in equation (1). The joint probability P is obtained by multiplying the respective local conditional probability distributions:

The construction of BN usually adopts three methods: data-based method, knowledge-based method, and the combination of the two methods. The data-based method uses the conditional independence semantics of BN to generalize the model from the data. The knowledge-based approach utilizes the causal knowledge of domain experts to construct BN. Knowledge-based approaches are particularly useful when domain knowledge is critical and data availability is scarce. The method modeling combining domain knowledge and data is the most commonly used method at present, which not only avoids the dependence on expert domain knowledge, but also avoids the invalid learning when given a small or noisy dataset.

3.2. Interpretative Structural Modeling

Interpretative Structural Modeling (ISM) is developed by Warfield [33] in 1973 when analyzing complex structural problems in socioeconomic systems. The basic idea is to extract the elements of the problem through various creative techniques and use directed graph, matrix, and computer technology to process the information of the elements and their mutual relations. The model is intuitive and instructive and is widely used to recognize and deal with complex system problems. Singh et al. [34] implemented knowledge management in the engineering industry and found the interdependence among the variables. Hu et al. [32] evaluated the seismic liquefaction potential by combining interpretation structure model and BN method and achieved good prediction results. Ravi and Shankar [35] adopted the ISM method to analyze the obstacles in reverse logistics activities and extract the key influencing factors.

The advantages of ISM include the following [25]:(1)Systematic integration of expert opinions and domain knowledge(2)Providing enough opportunities for decision modification(3)For a system with factors between 10 and 15, the amount of calculation is small, which is convenient for practical application

The basic working principle and steps of the method are as follows:Step 1: determine the factors of the system and relationship table. According to the field knowledge and field research, we can summarize the factors related to the problem. Establish the relationship between various factors according to relevant knowledge in the field.Step 2: construct a structural self-interaction matrix (SSIM) and calculate the reachability matrix M. From the determination of the direct relationship between every two factors, SSIM of each factor is constructed. At this time, the matrix belongs to Boolean matrix and follows the logic operation rules.For example, there is a system containing factors, . If factor has no influence on factor , then in the intersection position between the row and the column is represented as “0”; if there is a direct influence, it is represented as “1.” The structural self-interaction matrix is expressed as follows:By adding the binary relation matrix and the identity matrix , the connection matrix can be obtained, and then matrix can be calculated through Boolean operation. The reachability matrix represents whether there is a connection path from one element to another.Step 3: hierarchical decomposition. The reachability matrix is divided into different hierarchical structures. The reachable set represents the factors set reachable by factor in the reachable matrix and is defined in the following formula:The antecedent set represents the factors set of reachable factors in the reachable matrix and is defined as follows:Furthermore, an intersection set C is defined, i.e., , as follows:According to the condition of , hierarchical extraction is carried out. Each round, one layer of elements is extracted. These elements are separated from other elements, and no iteration is carried out until the level of all elements is determined.Step 4: draw a directed graph. According to the reachability matrix and the obtained hierarchical relationship, a multilevel hierarchical directed graph is drawn.Step 5: evaluate and modify. Evaluate the points in the model that are inconsistent with the concept and make necessary corrections.

4. Building Bayesian Network Model

4.1. Building the Initial Model
4.1.1. Defining Variables and Structures

There are many risk factors caused by urban logistics. Our research mainly focuses on the impact of urban logistics on public security, so we focus on the risk factors that are likely to cause hidden dangers to public security. According to literature review [18, 30, 36], survey of transportation companies and combine the actual cases collected, follows the principle of (1) the main factors, (2) the factors available for most logistics accident data, and (3) the factors that are easy to obtain and identify, human factor (), transportation tools and facilities factor (), and goods factor () are identified as the direct factors of the accidents, and management factor () and weather factor () are identified as indirect factors. Generally, the research on logistics risk focuses on four aspects: people, machinery, environment, and management. We refine the indicators of these four aspects and creatively incorporate the factor of goods into the study, considering the possible impact of goods categories and improper storage on public security.

Human factor includes professional skill (), physical condition (), safety awareness (), and personnel quality (). Transportation tools and facilities factor includes transport vehicle (), maintenance and inspection (), and facility and instrument (). Goods factor includes goods category () and storage issues (). Tables 1 and 2 provide detailed description and value set of 11 factors. Direct factors have binary (yes/no, normal/abnormal) value sets, while indirect factors may have more values in the value set.

We have collected 96 urban express logistics accidents for public safety in China from 2016 to 2019 [3739], and we can determine each value of 11 parameters. Table 3 summarizes the accident data.

For example, example 1 in Table 3 represents an accident that happened in Beijing on June 1, 2017, due to a complaint about the wrong operation of a courier. Survey found that the transport company management problems (), the weather is sunny (), the courier professional skill level is not high (), the courier is in normal health (), the safety awareness of the courier is normal (), the courier’s quality is low (), normal delivery vehicles (), the maintenance and inspection of the vehicle are normal (), the equipment and maintain are normal (), normal delivery goods (), and storage requirements are normal ().

4.1.2. ISM Model Construction

In this paper, ISM is first used to construct an initial hierarchical model, and then causal mapping is adopted to improve the initial model and transform the structure into BN model. The research framework is shown in Figure 1.Step 1: determine the risk factors and their relationship of urban logistics for public safety. Based on the expert knowledge and accident reports, the relationship between the factors involved in the urban express logistics accidents is determined. Any pair of nodes have a relational value set {, , , O}. indicates that directly leads to , indicates the interaction between and , O indicates that the two factors are irrelevant. The details are shown in Table 4.Step 2: construct the structural self-interaction matrix (SSIM) and calculate the reachability matrix (M). SSIM is constructed by determining the direct relationship between the two factors in Step 1:By means of 4 steps of Boolean operation, can be calculated. . Table 5 shows the accessible set, the antecedent set, and their intersection set table, and the structural relationship among the factors is established. The factors of accessible set include the elements of matrix whose corresponding row value is 1, including the factors themselves and the factors that may affect. By contrast, the antecedent set consists of the elements of matrix corresponding to the column median of 1, including the factors themselves and the factors that may affect them.Firstly, the hierarchy is extracted according to the condition . The factors that satisfy the condition are the top layer in the hierarchy. Then the top level factors in the subsequent analysis are removed and the same process is repeated to identify the next level factors. The grading process continues until the level of each factor is found:Step 3: ISM model divides the risk factors of urban express logistics into different hierarchical structures, and the results are as follows:Table 6 shows the structural hierarchy of risk factors, among which management factor and weather factor are first identified as level 5. The first level is the goods factor, including the goods category factor and storage issues factor. The main factors including human factor and transportation tools and facilities factor are in the middle level.Step 4: according to the hierarchical relationship obtained in Step 3, a multilevel hierarchical directed graph is drawn, and the initial model of BN is established, as shown in Figure 2.ISM method is used for risk identification, the hierarchical structure is established, and the initial directed graph of BN is further developed. The network diagram describes the horizontal interdependence among factors and the vertical influence of factors at different levels.

4.2. Improving the Bayesian Network Structure

The initial network diagram is complex and shows circular links that are not allowed in Bayesian network. So we changed the initial network diagram to be BN compatible. The modification of the initial model is based on a causal mapping method proposed by Nadkarni and Shenoy [40]. When modifying network graph, this method generally adopts two principles. The first principle is to reduce the complexity of the network presentation by removing redundant links. This requires the distinction between direct and indirect relationships, and only direct relationships remain. For example, although management factor () is related to safety awareness factor (), experts believe that the direct relationship between the two factors is weak. directly affects physical condition () and personnel quality (), both of which subsequently affect . Therefore, the link to was removed from the network.

The second principle is to eliminate any cyclic relationships to satisfy the noncyclic graphic structure of BN. For example, transport vehicle (), maintenance and inspection (), and facility and instrument () are interrelated, forming a circular connection. will affect the performance of the vehicle and then cause latent risks, which is directly related to . However, when the availability of vehicle is not a problem, has no significant influence on , which is the same with . Therefore, the relationship between them can be eliminated. Finally, a directed graph suitable for BN is shown in Figure 3.

The perfect network diagram using causal mapping method specifies the interdependence between the same-level or cross-level factors, which makes the overall structure clear and easy to understand, deals with the relations among factors more effectively, and ensures a deeper understanding of the problem.

4.3. Parameter Learning of Bayesian Network

When the BN structure is established, the values of some variables can be obtained from the case database, and then the parameters of the model (i.e. conditional probability) can be estimated with these cases. This is called parameter estimation or parameter learning. Maximum likelihood estimation algorithm (MLE) is a universal method to find a set of parameters of maximum likelihood , provided that a complete dataset is available. Record the sample set as .

The joint probability density function is called the likelihood function of relative to and is defined as follows:

If is the value can make the likelihood function largest in the parameter space, is the maximum likelihood estimator of . In order to facilitate analysis, the logarithmic similarity is defined as follows:

We use Netica32 software and combined the sample data in Table 3 to complete the parameter learning process. Table 7 shows a CPT example associated with the child factor (safety awareness) and its corresponding parent factors (physical condition) and (personnel quality). Figure 4 shows the established BN model. Assume that (maintenance and inspection) is taken as the evidence variable and the state value is set to 1, indicating that the evidence variable is known state. The result is shown in Figure 5.

At this time, the abnormal probability of (vehicle state) changes from 12.6% to 62.1%, the abnormal probability of (facilities and instrument) changes from 4.0% to 20.0%, and the abnormal probability of (storage issues) changes from 11.4% to 16.4%. It can be seen that the risk probability of other factors will increase if the maintenance and supervision work is not done properly.

4.4. Sensitivity Analysis

One of the main functions of BN is to analyze the evidence sensitivity of query nodes. Sensitivity analysis can get the sensitivity of query node probability to the probability change of evidence node. In probability system, entropy is a common index to express the degree of uncertainty. The following equation represents the entropy of the distribution of the variable X:

Variance is also an indicator of the degree of uncertainty, as shown in the following equation:where is the mean, that is, .

Table 8 shows the sensitivity analysis of other evidence nodes with as an example. Details of other nodes are shown in Table 9. The second column represents the mutual information values, the third column represents the percentage of entropy decreased, and the fourth column is variance.

As can be seen from Table 8, the nodes that have the most influence on are their parents and children. The factors that are most likely to have the greatest influence on the reliability of are listed in the front.

5. Discussion

This paper presents a new risk factor analysis method based on the combination of domain knowledge and data. Nearly 4 years of China’s urban logistics express for public safety accident data are modeled, and the sensitivity to risk factors is evaluated. The advantages and development of this study are further discussed as follows:(1)As a method of probabilistic reasoning, BN model has some specific advantages over other probabilistic models such as artificial neural network (ANN) and support vector machine (SVM). BN can fuse multiple information sources and infer uncertainty. In the BN model, domain knowledge can be encoded as a prior distribution, which means that it is independent of any sample data. This feature makes it easy to combine domain knowledge with sample data. ANN, on the other hand, usually needs a lot of parameters to learn for a long time and cannot observe its learning process. In addition, BN can integrate different variable types into a model for processing, such as continuous, discrete, quantitative, and qualitative variables, while ANN and SVM models cannot process. More importantly for BN, the model can be updated according to the new given data. Even incomplete data can be processed to improve the previous BN model. These are not done by other models such as ANN and SVM.(2)Comprehensive multisource information for decision-making. It is a complex multiattribute decision-making problem to analyze the influencing factors of urban express logistics on public security risk. There are many kinds of information, such as investigation report, detection sensor, and expert opinion. How to integrate objective and subjective data from multiple sources to analyze the risk factors of the system has become an urgent problem. This study combines expert knowledge and accident data and established a comprehensive analysis framework of risk factors of urban express logistics on public safety (Figure 1), including four work links, five risk categories, and 11 influencing factors. According to the results of this study, among all risk factors of urban logistics accident, management error and human factor have the highest prior probability. This result means that most accidents are caused by these factors, which is consistent with previous research results [18, 30]. It can be considered that management () and human factor () are the key factors to reduce public safety accidents. Enterprises should further strengthen safety supervision, reduce the employee safety violations, strengthen the safety management of employees, provide safety training courses for employees, strict assessment system, and enhance safety awareness.(3)The output of sensitivity analysis shows that the low safety awareness of employees () and management factor () are the main reasons of accidents caused by the failure of transportation vehicles. This is consistent with previous research results [41, 42]. Accidents may occur when employees neglect the safety inspection of vehicles and fail to do regular maintenance of vehicles, and the enterprise is poorly supervised. A lack of safety knowledge may be one reason why employees ignore safety issues. In order to reduce these errors, the company’s management must pay attention to these factors, strengthen supervision, clarify safety management policies, implement safety training plan, improve the staff’s attention to details, and reduce mistakes.(4)Although this study proposes a modeling framework using the artificial intelligence methods, the BN model developed is only a prototype, and further studies are needed to improve it. For example, some of the risk factors identified in urban express logistics represent general operational process concepts. In order to be more realistic, some factors must be refined to manage specific express logistics risk factors. In addition, in order to improve the integrity of BN model, more domain experts should be introduced.

6. Conclusion

In this paper, by combining Interpretative Structural Model and causal mapping method, Bayesian network is constructed to analyze the risk factors of urban express logistics on public security, including risk factor identification, relationship analysis between factors, prior probability and likelihood calculation, reasoning, and interpretation. With the help of domain experts, ISM and causal mapping methods can be effectively applied to the BN construction of urban express logistics, extending the application of BN model in the risk factor analysis of urban logistics. The flexibility of this approach allows the integration of multiple types of information sources to quantify the relationship between risk factors, and the model can be constantly updated based on new information. This study considers more important factors (11 factors) of urban logistics accidents from five aspects of management, weather, human, transportation means, and goods and concludes the interrelation and relative importance of risk factors, revealing that human factor and management factor are important direct factors of accidents. Through the sensitivity analysis, it is concluded that the low safety awareness of employees and the poor management of enterprises bring potential risks to vehicles, so as to increase the possibility of accidents. These research results combined with the actual situation of urban logistics in China have certain practical significance and provide guidance for logistics managers to take necessary measures to reduce accidents.

Data Availability

The data used to support the results of this study are provided in Table 3.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The research was supported by grants from the National Key R&D Program of China (2018YFB1601600).