#### Abstract

Based on an overall consideration of factors affecting road safety evaluations, the Bayesian network theory based on probability risk analysis was applied to the causation analysis of road accidents. By taking Adelaide Central Business District (CBD) in South Australia as a case, the Bayesian network structure was established by integrating K2 algorithm with experts’ knowledge, and Expectation-Maximization algorithm that could process missing data was adopted to conduct the parameter learning in Netica, thereby establishing the Bayesian network model for the causation analysis of road accidents. Then Netica was used to carry out posterior probability reasoning, the most probable explanation, and inferential analysis. The results showed that the Bayesian network model could effectively explore the complex logical relation in road accidents and express the uncertain relation among related variables. The model not only can quantitatively predict the probability of an accident in certain road traffic condition but also can find the key reasons and the most unfavorable state combination which leads to the occurrence of an accident. The results of the study can provide theoretical support for urban road management authorities to thoroughly analyse the induction factors of road accidents and then establish basis in improving the safety performance of the urban road traffic system.

#### 1. Introduction

With the expansion of urban development and the surging of vehicle ownership, urban travel becomes vulnerable to three “chronic diseases,” which are congestion, accident, and pollution. Among the above three, accident has been recognised as the most negative aspect, in particular in and around Central Business District (CBD). According to* Global Plan for the Decade of Action for Road Safety 2011–2020* developed by the UN Road Safety Collaboration in 2011, nearly 1.3 million people die as a result of road traffic collisions per annum, which means more than 3,000 fatalities per day. And 20 to 50 million more people sustained nonfatal injuries from collisions, and these injuries were an important cause of disability worldwide. The case in Australia is also at an alarming level; there were around 25 deaths and 700 serious injuries per week, and cost to tax payers was more than 32 billion dollars a year [1]. Unless immediate and effective action is taken, road traffic injuries are predicted to become the fifth leading cause of death in the world. Therefore, the analysis and evaluation on the influencing factors on traffic accident, estimation of the potential safety hazards, and selection of appropriate measures in advance, so as to reduce the frequency and severity of traffic accidents, are important research topics in road safety engineering.

Previous studies showed that there are many reasons behind road accidents; these causes may be coherent to each other, and, for instance, poor road alignments and unexpected vehicle compositions or behaviours may result in the confusion of road users, which may lead to traffic accidents. However, many official records of road accidents indicated that most of the crashes are only pointed to single causes, especially human errors. For example, according to a crash causation survey released by the US National Highway Traffic Safety Administration (NHTSA) in 2015 [2, 3], drivers are to be criticised for 94% of crash cases. Apparently, the 94% of such accidents are also related to other causes from common experience, such as road alignment [4–6], traffic sign [7–9], and weather condition [10–12]. Therefore, the existing road accident statistics cannot fully reveal the causes, and traffic engineers and road infrastructure designers are provided with limited information for the accident mechanism and the formulation of improvement plans. It is of great importance to take full advantage of the traffic accident statistics and mine potential information so as to provide a basis for the analysis of accident mechanism and the improvement of road safety.

Bayesian network is one of the effective methods in the field of artificial intelligence to express uncertainty analysis and probability reasoning of a system. It can exploit the dependence relationships based on local conditions in a model to conduct bidirectional uncertainty investigation for prediction, classification, and diagnostic analyses. At present, there are some software platforms available for the construction of a Bayesian network, such as Bayes Net Toolbox (BNT), BayesBuilder, and JavaBayes, of which the MATLAB-based BNT developed by Murphy [13] is extensively used. This toolbox provides a lot of underlying basic function libraries for Bayesian network learning, but it does not integrate the basic functions for Bayesian network learning into a system. Moreover, BNT does not have Graphical User Interface (GUI), which is not user-friendly, nor can it be well generalized. Netica is a Bayesian network learning software developed by Norsys Software Corporation in Canada, which has been extensively applied in uncertainty management such as business, engineering, medicine, and ecology [14–16] due to its powerful functions, friendly GUI, reliable computation, and good performance. In this paper, a model is formulated using Bayesian network for road accident studies, and then a Bayesian network learning process, posterior probability reasoning, most probable explanation, and inferential analysis are conducted by using Netica.

This paper is organized as follows: Section 2 reviews the related literature on causation analysis of road accidents; Section 3 describes the construction of a Bayesian network model; Section 4 presents a case study on Bayesian network model application for Adelaide Central Business District (CBD) in South Australia; the findings of this study are summarized in Section 5.

#### 2. Literature Review

The use of causation theory for road accident analysis aims to extract the accident mechanisms and accident models from a large number of typical accidents so as to provide theoretical basis for the qualitative and quantitative analyses, the predication and prevention of accidents, and the improvement of safety management. Scholars across the world have done some researches in road accident causation analysis and various data sources, variables, sample sizes, and analytical models, such as aggregated models which include Frequency Analysis [17–19] and Test [20, 21].

In terms of disaggregated models, as the frequency of road accidents is in a form of nonnegative, discrete, and abnormal distribution and based on experience the frequency of accidents follows Poisson distribution, the Poisson regression model can be applied to analyse the influence of each risk factor on the frequency of accidents [22]. The negative binomial distribution regression is based on Poisson distribution, but its specification error follows Gamma distribution. The negative binomial regression model has been extensively applied in traffic safety analysis model [23–27]. However, the assumption that the mean value of Poisson distribution is equal to the variance is often inconsistent with realities. And in the analysis of longitudinal data samples, the adoptions of Poisson regression model and negative binomial regression model are likely to generate biased estimate and even incorrect results. When the explained variables only take a limited number of multiple discrete values, the established regression model is a discrete choice model, in which Logit model is the earliest discrete choice model and is one of the widely used models [28–31]. For an applicable statistical model, research object is required to be in independent distribution; while the safety data has a complex spatial distribution, the accuracy and robustness of safety level estimation will be greatly affected if the spatial feature is neglected.

Through the review of the existing literature, it has been discovered that past researches on the causation analysis of traffic accidents are gradually evolving from the descriptive simple analysis based on aggregated models to the multivariable complex modeling analysis based on disaggregated models. However, the deficiencies of the existing studies are the following: the influencing factors on accidents are not fully considered; most are based on specific, isolated, superficial single-factor analysis, considering only the main influence factors. These studies revealed the inherent rules of the occurrence of accidents in one aspect or case but ignored the multidimensionality of accident relationships and their correlations, so that the complex logical relationship between causes, accident occurrence, and accident consequence was not reflected. Therefore, research methods and analysis technologies are not generally applicable. Although some scholars used Decision Tree [26, 32, 33], Bayesian network [34–36], and other complex systems to research the correlation between accident causes, the theoretical systems and related supporting technologies have not been systematically established.

#### 3. Construction of Bayesian Network Model

##### 3.1. Basic Principles of Bayesian Network

Bayesian network, also referred to as belief network, is considered as one of the most effective theoretical models in the fields of uncertainty knowledge representation and reasoning. It is a directed acyclic network topology consisting of node set and directed edge, and each node denotes one variable state, while directed edge denotes the dependence between variables. The correlation intension or confidence coefficient between variables is described by using Conditional Probability Table (CPT). Prediction, diagnosis, classification, and other tasks can be achieved by using learning and statistical inference functions of Bayes theorem. Bayesian network uses probability to denote the uncertainty of all forms and uses the probabilistic rules to achieve learning and reasoning process. It has the following relationship:A set of variables of Bayesian network consists of the following components [37] is a network structure which denotes the conditional independent assertion in variable set , is a set of local probability distributions associated with each variable, denotes the variable node, and denotes the father node of in .

and define the joint probability distribution of . is a directed acyclic graph (DAG), and each node in corresponds to a variable in (Figure 1). The default arc between nodes of denotes conditional independence.

Use to denote the local probability distribution in (1), namely, the product term ; then the binary group denotes the joint probability distribution .

The construction of a Bayesian network mainly involves the following steps:(1)Structure learning: determine the factor variables (nodes) related to the study object, and then determine the dependent or independent relationship between the nodes so as to construct a directed acyclic network structure(2)Parameter learning: based on the given Bayesian network structure, learn the Conditional Probability Table (CPT) at each node of the Bayesian network model

##### 3.2. Structure Learning

As the network structure and data set can be used to determine the parameters, structure learning is the basis of Bayesian network learning, and the effective structure learning is the key to constructing the optimal network structure.

The construction of Bayesian network structure includes the following three points:(1)Based on expert experience and prior knowledge, determine the variable nodes of Bayesian network so as to determine the structure of Bayesian network.(2)Through the learning of sample data, automatically acquire the Bayesian network structure by using machine learning algorithm.(3)Based on expert experience and machine learning of data, acquire the Bayesian network structure by using data fusion method.

As the third point combines the advantages of expert experience and machine learning and avoids the disadvantage of using one method to determine the Bayesian network structure only, in this paper, the third method to determine the Bayesian network structure for the causation analysis of road accidents will be used. Common machine learning methods include K2 algorithm, MCMC algorithm, and hill-climbing algorithm. K2 algorithm is based on the scoring function and hill-climbing algorithm, which lies in the basic principle: from an empty network, according to the predefined order of nodes, select the node with the most posterior probability as the father node of this node, sequentially traverse all nodes, and gradually add the optimal father node to each variable. K2 algorithm uses posterior probabilities as the scoring function, which is described as follows:where is a set of variables. is the network structure. are the numbers of nodes in the graph. are configurations (states) of the parents of the th node. are mutual exclusive states of the th node. are instances of the th node being in the th state when its parents are in their th configuration, and . are the hyperparameters of the Dirichlet distribution and correspond to the a priori probability distribution of taking on its th state while its parents are in their th configuration. . The gamma function satisfies and .

K2 algorithm uses a variable order and a positive integer to limit the search space, which seeks the optimal model that meets the following two conditions: (1) the number of father nodes of any variable in should not be greater than and (2) is a topological order of . However, as K2 algorithm adopts greedy search strategy, which may easily fall into the local optimal solution and cannot guarantee that the network acquired is the optimal network, the knowledge and experience of experts need to be integrated so as to acquire the optimal network structure. In this paper, the combination of expert experience and K2 algorithm will perform the Bayesian network structure learning for the causation analysis of road accidents.

##### 3.3. Parameter Learning

After determining the topological structure of Bayesian network, the parameter learning of Bayesian network can be performed. In the process of collecting road accidents information, missing data often occurs due to various reasons, for instance, recording instruments malfunction and confusion of respondents in answering questions. Most of statistical models cannot directly analyse the data with missing values, and in the case of any missing values, the record with missing values is generally eliminated directly to ensure that the statistical model can be properly fitted. If the missing values are less, this will not greatly affect the results if the record with missing values is directly eliminated. However, if the multivariate analysis is performed, more variables will be studied, which means that more records will be eliminated; it may cause a loss of information, reduce the power of test, and cause some bias to research results [38].

The Expectation-Maximization (EM) algorithm is an asymptotic deterministic estimation method for the unknown parameter with missing data. It can be used to perform maximum likelihood estimation on the parameters from incomplete data set, which is a practical learning algorithm [39]. EM algorithm can be widely used to deal with incomplete data, such as missing data and censored data. EM algorithm mainly involves two steps: Expectation Step (-Step) and Maximization Step (-Step). The algorithm is described as follows.

*(**1) Initialize *. Set accuracy and correction value of estimated value .*(**2) *-*Step*. Calculate the expected sufficient statistic of missing value .

The probability distribution of iswhereThe sufficient statistic is where is the probability distribution of under the condition that and are known, is the joint distribution of and , is the th variable, is the count of all possible joint instantiations between and denoted by and , respectively.

*(**3) *-*Step*. Calculate the new maximum likelihood (ML) or maximum a posteriori (MAP) values of in the given condition .

In Expectation-Maximization, we have the following: ML: MAP: where is the Dirichlet parameter that can be obtained through the iteration process of -Step and -Step.

-Step is used to calculate the expected sufficient statistic of , and -Step is used to conduct new estimation of learning parameter by using the statistic obtained in -Step. In this paper, the Bayesian network parameter learning of road accidents is performed by using EM algorithm in Netica.

#### 4. Case Studies

##### 4.1. Study Area and Data Source

Adelaide Central Business District (CBD) in South Australia is selected as the study case, as it attracts 22% of metropolitan Adelaide’s work trips [40] and has the first and the second most dangerous accident concentration areas which are North Terrace and West Terrace in the CBD [41].

The crash data of South Australia from 2006 to 2008 were provided by the Department of Planning, Transport and Infrastructure (DPTI), and ArcGIS 10.5 software was used to locate the precise crash sites, as shown in Figure 2.

##### 4.2. Variable Selection and Data Preprocessing

By using ArcGIS 10.5, 1558 and 756 data sets of road accidents in Adelaide CBD from 2006-2007 and 2008 are obtained, respectively. The statistical data from 2006 to 2007 will be used for the construction of Bayesian network model and calibration, and the statistical data in 2008 will be used for the model validation process.

Previous studies [33, 42–45] provided some in-depth insights to guide the variable selection, discretization, and classification in this research. As a result, fourteen variables are selected from the data sets as having “significant influence,” that is, “crash type,” “driver’s apparent error,” “road geometry,” “vehicle type,” and others, as shown in Table 1. However, according to the statistical result in Table 1, it can be seen that the percentage of “inattention” reaches up to 39.84%, which is the biggest contributing factor in “driver’s apparent error” category. In our daily routine, “inattention” is explained as “failure to give attention, or negligence.” Generally, such usage is quite convenient for record purposes; however, from the perspective of psychology and physiology, the usage is not clear and definite. There are lots of other reasons that may be behind traffic accidents, such as human factors (driver’s physical and mental state, knowledge and skill, and the operational approach), objective factors (vehicles, roads, and road facilities), and safety management. If all above factors are simply summarized as “inattention,” then the causes of traffic accidents are to be extremely simplified. And the prevention measures will be hardly developed. Therefore, in this research, the factor of “inattention” will be excluded, and all variables used for modeling are shown in Table 2.

Bayesian network can be used to process continuous variables and discrete variables. As the classification result of traffic accident variables obviously has the discrete characteristic, discrete variables are adopted for Bayesian network analysis of road accidents. Before structure learning, discretization processing has to be conducted for road accident variable. The discretization values and value descriptions of processed variables are shown in Table 2.

##### 4.3. Structure Learning

In this paper, the method combining K2 algorithm and experts’ knowledge is used to formulate the Bayesian network structure. Based on K2 algorithm, FullBNT-1.0.7 is utilized to conduct structure learning via MATLAB. Through repeated selection and sequencing of variables by experts, the Bayesian network structure is finally developed, as shown in Figure 3. The network is composed by 14 nodes and several lines. The 14 nodes refer to 14 variables, and lines between these nodes indicate the relationships among the variables.

It can be seen from Figure 3 that some road accident variables have demonstrated clear hierarchical relations of affecting others and being affected by others. Road accidents result from the interaction of variables from “traffic participant, vehicle, road, and environment,” which is fully reflected by the Bayesian network structure as well. For instance, “vehicle movement” is affected by “road geometry” and “driver’s apparent error,” but it can also affect “total units involved” at the same time. The actual situation of road accidents can be fully embodied by the interactional hierarchical relationship of variables in Bayesian network.

##### 4.4. Parameter Learning

Once a Bayesian network structure is formed, parameter learning can be carried out. The Bayesian network structure can be created in Netica, and then parameter learning can be conducted, thus obtaining the conditional probability distribution of nodes. Finally, the Bayesian network model for the road accident causation analysis can be determined, as shown in Figure 4.

##### 4.5. Model Calibration and Validation

To validate the parameter learning accuracy and prediction accuracy of the Bayesian network model, sensitivity analysis is used to identify the sensitive factors with a significant impact on the target node from a number of uncertain factors, and then the target node is set as the evidence variable to conduct model fitting and prediction with these sensitive factors.

###### 4.5.1. Sensitivity Analysis

In Bayesian network, the sensitivity analysis refers to the analysis of the influence and influence degrees of multiple causes (node states) on result (target node). Based on sensitivity analysis, the elementary events with relatively greater contribution to the probabilities of the consequential events can be determined to facilitate the reduction of probabilities of these elementary events by taking effective measures, so that the probabilities of the consequential events will be reduced.

The sensitivity analysis function of Netica can be used to identify which factors have more important safety management values in analyzing traffic accidents. In Netica, select the target node and then analyse the impact degrees of other nodes on the target node in a descending order. Taking the node “crash type” as an example, make sensitivity analysis of it and the result is as shown in Table 3.

The mutual information refers to the direct or indirect information flow rate and measures the degree of dependence between nodes. In other words, the mutual information between two nodes can indicate if the two nodes are dependent on each other and if so how close their relationship is [46]. As shown in Table 3, it can be seen that the mutual info (=0.57217) of node “driver’s apparent error” is the largest, which means that it has the strongest impact on “crash type,” followed by “vehicle movement” and “road geometry” which have mutual info = 0.33948 and 0.04044, respectively.

###### 4.5.2. Model Fitting and Prediction

Based on the sensitivity analysis result of “crash type,” the posterior probabilities of “driver’s apparent error,” “vehicle movement,” and “road geometry” obtained from the Bayesian network are compared with the actual calculations from 2006 to 2007 and from 2008, respectively. Due to the large amount of data, “rear end” from the parameter learning results of “crash type” is used for exemplificative explanation.

Figures 5, 6, and 7 show the posterior and actual probability distributions of “driver’s apparent error,” “vehicle movement,” and “road geometry,” respectively, when the evidence variable is “rear end.” Compared with the actual computational results from 2006 to 2007, the maximum mean absolute error (MAE) of Bayesian network model is 5.58%. Similarly, compared with the actual computational results of 2008, the maximum MAE is 6.14%, which suggests that the Bayesian network model has both high fitting accuracy and high prediction accuracy. Therefore, it is feasible to use the Bayesian network model to conduct result prediction and inferential analysis of each variable of road accidents accordingly.

##### 4.6. Bayesian Network Model Application

###### 4.6.1. Posterior Probability Reasoning

Bayesian network model can be used to perform probability reasoning, including posterior probability calculations. Precisely, it aims to calculate the posterior probabilities of some targeted nodes, control the influence degrees of determined specific nodes on the nodes of interest, predict the possibility of accident occurrence, and analyse the major accident sources under the condition that states of specific nodes are determined. In brief, the posterior probabilities in inferring the result from cause and inferring the cause from result are referred to as accident prediction and causal inference, respectively.

*(1) Accident Prediction. *Figure 8 is the accident prediction on the assumption that a driver “disobeys traffic lights” when driving in Adelaide CBD. Input the evidence variables (“traffic signals” and “disobey traffic lights”) emerging from this circumstance into the Bayesian network, so it becomes a problem to solve the posterior probabilities of other nodes, with the known status of some evidence variables.

In Netica, set both the statuses of “traffic signals” and “disobey traffic lights” as 100%; that is, the statuses of the evidence variables are determined. Then update the probabilities of the whole network; the probability change of relevant nodes, namely, the probability change of “crash type” and other nodes, can be observed. In this case, the probability of “right angle” in “crash type” is found to increase from the initial 16.6% to 58.0%. This suggests that if the driver “disobeys traffic lights,” the probability of “right angle” will significantly increase.

As shown in Figure 9, in addition to “disobey traffic lights,” assume that the driving time is at night; namely, the status of “night” in “light condition” is set as 100%. After automatically updating the probabilities of the whole network, the probability of “right angle” is found to further increase from 58.0% to 59.4%, which means that the probability of “right-angle” traffic accident is higher. Go one step further and assume that it is also a rainy night (namely, set the status of “raining” in “weather condition” as 100%). According to Figure 10, once again, it can be found that the probability of “right angle” further increases from 59.4% to 63.3%. This suggests that “driver’s apparent error,” “light condition,” and “weather condition” will all affect the probability of “right angle” to various degrees. Therefore, it can be found that the status change of evidence node variables will affect the probabilities of query nodes, which is consistent with the engineering practice.

*(2) Causal Inference. *Another important application of the Bayesian network is fault diagnosis of the system. The bidirectional reasoning technology of the Bayesian network can calculate not only the probability of a system failure under combined fault conditions but also the posterior probabilities of various components under the system fault condition and easily find out the most likely combination that caused system failure, thereby making the computational analysis more intuitive and flexible.

Conduct causal inference by taking the “side swipe” in “crash type” as an example. In this case, the evidence variable is “side swipe,” so its status probability is 100%. As shown in Figure 11, after inputting the evidence, the probability of “change lanes to endanger” in “driver’s apparent error” increases greatly from 15.2% to 46.7% through the automatic updating function of Netica. And the probability of “swerving” in “vehicle movement” also increases from 15.5% to 44.9%, which reaches the maximum probability. This suggests that, in the absence of other evidences, the most probable cause to “side swipe” is “swerving” (vehicle) caused by “change lanes to endanger” (driver).

###### 4.6.2. Most Probable Explanation

Bayesian network model can be used to make the most probable explanations, precisely, from sets of multiple causes (node states) which are likely to lead to a conclusion; use Netica to identify the set that is most likely to lead to the result, and this set with the maximum likelihood will be the most probable explanation.

In the example of “side swipe” as illustrated in Figure 12, use “Most Probable Explanation” function in Netica to find out the most probable cause set. As shown in Figure 12, the most probable explanation cause (node state) set of “side swipe” is [cross road, change lanes to endanger, swerving, daylight, not raining, dry, traffic signals]. It explicitly shows that most probable explanation and causal inference are highly consistent when the evidence variable is “side swipe,” and the set is also consistent with the engineering practice.

##### 4.7. Inferential Analysis of Accidents Based on “Serious Injuries” and “Total Estimated Damage”

The application of the Bayesian network model in Netica to solve the posterior probability reasoning problem, maximum posterior hypothesis problem, and most probable explanation problem highlighted the inferential capability of the Bayesian network model. To further analyse the factors contributing to traffic accidents, especially serious traffic accidents, the Bayesian network model was used to calculate the probabilities of “serious injuries” and “total estimated damage over 10,000 AUD” under the influence of “driver’s apparent error,” “road geometry,” “weather condition,” “light condition,” and “crash type,” respectively. The results are shown in Table 4.

###### 4.7.1. Driver’s Apparent Error

The results presented in Table 4 indicate that “driving under the influence” (DUI) will most likely cause serious injuries and heavy property damage, as DUI is more easily to lead to dangerous behaviours including speeding, not wearing a safety belt, and reckless or erratic driving. According to the inference results, DUI is most likely to cause traffic accidents on cross roads with the inference probability of 49.0%. As for the crash types, the inference probabilities of rear-end and head-on crashes are the two highest (resp., 15.6% and 15.4%). In previous studies, among all traffic accidents caused by DUI, unrestrained occupants were 4.70 times more likely to die or 4.66 times more likely to be injured than restrained occupants [47]. Besides, drunk drivers show weak control ability of vehicles, and the higher ethanol content in their blood, the higher probability that they will have illegal actions mentioned previously [48], which are more likely to cause serious traffic accidents with heavy casualties and property damage.

###### 4.7.2. Road Geometry

Intersections are an important part of road system and are potentially the most dangerous locations in a network as well. Previous studies have shown that intersections, especially cross roads, have higher crash rates and greater crash severity, particularly in urban areas [49–51]. As shown in Table 4, the most dangerous “road geometry” in Adelaide CBD also is “cross road,” which means that the probabilities of causing serious injuries and heavy property damage at “cross roads” are both the biggest. According to the accident records in Adelaide CBD, among serious traffic accidents caused at cross roads, there are two main crash types, “right angle” and “right turn,” respectively, accounting for 44.35% and 42.61%, while “fail to stand” and “disobey traffic lights” become the two main reasons of traffic accidents caused at cross roads, respectively, accounting for 38.26% and 34.78%.

###### 4.7.3. Weather Condition

Rainfall will not only decrease the effectiveness of drivers’ visual search [12] but also lower the friction coefficient of roads, which makes roads slippery, increases braking distance greatly, and thus results in the possibility of traffic accidents. A study for Melbourne, Australia, by Keay and Simmonds [10] found that rainfall was the strongest factor that correlated to weather parameter and it had the greatest impact in winter and spring. Keay and Simmonds [52] also found a contributing parameter, which is the lagged effect of rain. Symons and Perry [53] found that wet roads or raining is increasing the probability of traffic accidents which can reach up to 70 percent. Similarly, Qiu and Nixon [11] found that rain can increase the crash rate by 71% and the injury rate by 49%. This coincides with the results found in this research, which indicate that rainfall is associated with traffic accidents that had serious injuries and heavy property damage.

###### 4.7.4. Light Condition (Urban Heat Island)

Traditional bituminous pavement can absorb and store large amounts of heat during the day and continuously output the heat to the external environment at night, which will result in an increase in external environment temperature and lead to urban heat island (UHI). UHI might have some impacts on road durability and safety, such as accelerating bituminous pavement aging and exacerbating road high-temperature rutting may lead to road accidents. With the continuous expansion of city size, the comprehensive phenomenon of such microclimatic variation will become increasingly obvious [54]. As one of the major Australian cities, Adelaide is also affected by UHI [55]. And as waste heat from vehicles and temperature regulation of buildings is an important determinant of UHI magnitudes, Adelaide CBD, which has the largest density of traffic network and the largest number of buildings, has become the center of UHI. As shown in Table 4, the number of road accidents that occurred at night demonstrates a larger proportion of 55.56% than by daylight. Under the influence of UHI, Jusuf et al. [56] found that, at nighttime, commercial area has the highest ambient temperature among the four land use types (commercial, residential, industrial, and airport). Similarly, Parker [57] also found that the urban heat island is strongest at night in high-rise city centers. Therefore, the correlation analysis between road accidents and nighttime UHI is a potential research direction.

###### 4.7.5. Crash Type

Head-on crashes are among the most severe collision types and are of great concern to road safety authorities [58]. For instance, according to an annual report presented by NHTSA in 2015, head-on crashes occupied only 2.3% of total crashes; however, they accounted for 9.6% of fatal crashes. As shown in Table 4, no matter “serious injuries” or “property damage over 10,000 AUD,” the possibility of “head-on” crashes always takes the first place and is largely higher than other types of traffic accidents. As for the road geometry, the head-on crashes are most likely to happen on cross roads with the inference probability of 39.6%, while the inference probability of “disobey traffic lights” is the biggest in drivers’ apparent errors which cause head-on crashes. These results agree with Bham et al. [59] who found that head-on collisions were at a higher risk for severe injuries compared with other collision types. And Rizzi et al. [60] concluded that 31% and 21% of crashes at intersections could have been avoided entirely or influenced by anti-lock braking system (ABS); however, the head-on crashes were the only crash type for which ABS seemed to be ineffective.

#### 5. Conclusion and Future Work

As road accidents are unexpected, random, complex, and latent, it is necessary to conduct investigations on accident mechanism and accurately identify the exact causes. The Bayesian network combining the probability theory with graph theory not only has a rigorous mathematical consistency but also has the structure chart that can intuitively identify problems. Therefore, it is one of the most powerful and effective tools to deal with uncertainties.

The occurrence of road accident results from the interactions of “traffic participant, vehicle, road, and environment,” and there is a potential hierarchical relation (impacting and impacted) among the variables. In Bayesian network, the directed acyclic graph is a visual expression form that is closer to the characteristics of thought and reasoning mode of human. In the study, the Bayesian network structure for the road accident causation analysis was achieved by using K2 algorithm and experts’ knowledge which combines the advantages of machine learning and experts’ knowledge. The structure learning result of the Bayesian network fully reflects the hierarchical relations among the accident related variables and allows for better prediction and analysis of the characteristics of road accidents.

In this study, the Bayesian network model for the road accident causation analyses was established by using Netica, Bayesian network-based software with friendly GUI. The Expectation-Maximization algorithm that can deal with missing data was adopted to process the parameter learning, and then the calibration and validation, posterior probability reasoning, most probable explanation, and inferential analysis were carried out after the construction of the Bayesian network model. The results showed that the Bayesian network model is feasible and effective for road accident causation analyses; in particular the use of posterior probability of the Bayesian network can not only more precisely and quickly find the key causes for traffic accidents but also identify the most likely cause (state) combination. The result can be used as an important theoretical basis in developing road traffic management strategies so as to improve road traffic safety.

Follow-up studies will consider the rationality of the Bayesian model and other factors that may lead to traffic accidents and establish a more accurate and comprehensive model. As the Bayesian model is a probabilistic model, more comprehensive and extensive basic data are needed to enhance its reliability, in which some data can be obtained only by carrying out experiments, despite the ability of the Bayesian mode to make up for missing data. Moreover, as the influencing road accident factors in reality are more than the factors used in this study and as Netica is also suitable for the establishment of a larger and more complex accident analysis model, the model can be expanded to a more sophisticated model that can consider more factors.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.