Abstract

Extreme value of vehicle load plays an important role in bridge design and risk assessment. Peaks over threshold (POT) is a method commonly used in extreme load estimation. The selection of thresholds for the POT method is extremely crucial, but the selected optimal threshold varies under different test criteria. Therefore, a method to select the suitable threshold is developed based on multiple criteria decision analysis (MCDA) in the paper. In MCDA, Chi-Square () test, Kolmogorov–Smirnov (K–S) test, and Root Mean Square Error (RMSE) in probability distribution functions (PDF) are employed as the test criteria and the weight of these criteria is calculated using the entropy method. Finally, vehicle loads obtained from simulation and field measurement are adopted to validate the effectiveness and feasibility of the proposed method. The results indicate that the proposed MCDA is a useful and complementary tool for threshold selection in the extreme value analysis.

1. Introduction

Vehicle load is one of the most significant factors for bridge design, safety assessment, and fatigue analysis [15]. Overloaded heavy vehicles is the primary reason for the deterioration of structural components and the degradation of the bridge’s overall state [68]. The condition of road traffic has gained a notable increase during the service period due to the rapid economic growth. Many bridge collapse accidents have resulted from overloading vehicles. In addition, the number of bridges worldwide stepping into their ageing stage is growing, which raises the risk of structural safety for users. It is extremely expensive to rehabilitate or maintain the degraded bridge due to the project itself, traffic disruption, and resulting delays. Large amounts of costs associated with rehabilitation and maintenance can be saved by proving that the bridge is safe through accurate risk assessment or proposing an appropriate repair scheme [9]. Therefore, it is necessary to have a good estimation of vehicle loads, and this typically involves using Extreme Value Theory (EVT).

Collecting the actual load time history is time consuming and costly. Generally, the extreme loads among the design life of the bridge cannot be measured through limited period of data. The EVT can solve the problem through the estimation of the extreme vehicle loads from a relative short period to a long period. The EVT is used to quantify the stochastic behavior of a process at unusually large or small levels. Most of the published works about the EVT applied to vehicle loads are on the basis of the block maximum (BM) and POT methods [10]. The BM method divides the observation period into nonoverlapping periods of equal size. Then the largest value in each period is extracted to generate maximum sample, which can be fitted by generalized extreme value (GEV) distribution. Using the BM method to extreme value estimation requires a large quantity of data, and thus it is not appropriate to use this method when the available data is insufficient. An alternative to the BM method is peak over threshold (POT) method. The POT method extracts the peak values exceeding a threshold, and the extracted data can be fitted by generalized Pareto distribution (GPD).

A well-known problem for the POT method is to select a suitable threshold [11]. Threshold selection is the trickiest part of the POT method because only a narrow range of thresholds is applicable. On the one hand, a considerable high threshold is required to guarantee that GPD parameters are stable. On the other hand, the threshold level cannot be too high so that sufficient data can be included [12]. The literature suggests that the most used threshold selection methods are based on judgement [13]. One way is to use graphical diagnostic plots, but interpreting these plots is subjective and rather challenging. The other way is to select a fixed quantile threshold corresponding to a high nonexceedance probability. The drawback of this method is that once the threshold has been determined, it is treated as fixed and the associated subjectivity and uncertainty are ignored in subsequent inferences.

In contrast to the above methods, the numerical approaches are relatively objective for threshold election, which are conducted based on goodness-of-fit tests across a range of different thresholds. There are many popular criteria for the goodness-of-fit tests to measure the distance between the cumulative distribution function of the reference distribution and the empirical distribution of the sample, such as Root Mean Square Error (RMSE) test, Kolmogorov–Smirnov (K–S) test, Pearson’s Chi-Square () test, and Anderson–Darling (A–D) test. It is observed that the selected threshold under one test criterion may not be the optimal threshold for other criteria. The performance of extreme value estimation is greatly affected when the extracted sample changes due to the test criterion variation. To select an optimal and stable threshold for the POT method, it is meaningful to do some research to take multiple criteria into consideration. The multiple criteria decision analysis (MCDA) approach works when practical problems are characterized by several conflicting criteria [14]. An entropy method is employed to assign the weight value of each criterion for the MCDA, and then the MCDA can integrate results obtained from goodness-of-fit under different criteria into a comprehensive one, which makes the selection more scientific and objective [15].

In this paper, the MCDA is proposed to select a threshold for the POT method in the extreme value analysis of vehicle loads. Section 2 presents the theory of the POT method, the mathematical methodologies of MLE, and the framework of extreme loads estimation. Section 3 presents details of the MCDA approach involving test criteria and the maximum entropy theory. Sections 4 and 5 present a practical application of theories illustrated in Sections 2 and 3 on simulated and field monitored vehicle loads. Finally, Section 6 presents some conclusions of this paper.

2. Theory of Extreme Value Analysis

2.1. The POT Method

Figure 1 shows the POT method of how to extract extreme values from the observations, where μ is the threshold. The distribution of the entire observations can be treated as a combination of the body part and the tail part. Under the POT framework, our focus is the observations exceeding the threshold, not the entire observations. When the values of the observations exceed the threshold, these observations are called exceedances (). Supposing is a sequence of independent and identically distributed random variables belonging to a continuous distribution function, that is, . The exceedances above the threshold μ are modelled by , and the distribution function of is expressed as follows:When μ is sufficiently large, the distribution function of approximately obeys the GPD, whose cumulative distribution function is defined inwhere ξ is the shape parameter of the distribution, σ is the scale parameter of the distribution, and θ is the vector . Often, the threshold μ is supposed to be known prior to parameters estimation and the GPD is specified only by parameters σ and ξ. The GPD is divided into three types depending on the value of shape parameter ξ: is short-tailed when ; is medium-tailed when ξ = 0; and is heavy-tailed when ξ > 0. The estimation of extreme vehicle loads is calculated through return level analysis. In the GPD model, the return value of the -year return period can be obtained as follows:where is the number of observations and is the number of observations exceeding the threshold.

2.2. MLE Method

Calculating the parameters of the GPD is another decisive issue in the extreme value analysis. The maximum likelihood estimation (MLE) method can balance the observed data and possible prediction [16]. When the distribution is known, the MLE method is commonly used to calculate the unknown parameters of the distribution with advantages of asymptotic normality, invariance, and congruency. The MLE method is based on the likelihood function as follows:where is the PDF of th observed variable and is the number of the extracted samples. In practice, the logarithmic form is more convenient to perform. The logarithmic form of the likelihood function takesWhen the extracted sample is obtained, the likelihood function becomes the function of the variable . The maximum value of occurs when and the variable is determined.Substituting (5) into (6), (7) can be obtained. The parameters of the GPD are obtained by solving

2.3. Flowchart of Extreme Loads Estimation

Figure 2 illustrates the flowchart of the extreme vehicle loads estimation. As illustrated in Figure 2, once a value is set as candidate threshold, the corresponding GPD fitting can be carried out. The accuracy of the GPD fitting is checked by goodness-of-fit tests under multiple criteria, and then the optimal threshold is determined based on MCDA.

The key point of the flowchart is MCDA, whose process contains four main stages including scope determination of candidate thresholds, criteria selection, criteria weight determination, and threshold examination. According to the character of the load distribution, candidate thresholds are set in ascending order, where is the relatively low quantile of the load data value and is the 99% quantile of the load data value. Three popular test criteria are adopted in the MCDA, and the weight of each criterion is determined based on the maximum entropy theory. The maximum entropy method is an objective weight-determining method because the criterion that has the greatest difference will take the largest weight in decision-making [15, 17]. To validate the reliability of the proposed method of the threshold selection, the graphical quantile-quantile (Q-Q) plot is performed. Finally, the estimation of extreme vehicle loads is conducted through return level analysis.

3. Methodology of the Threshold Selection

3.1. Test of Goodness-of-Fit

There are nearly 40 test criteria in the statistical literature, and the principles of these criteria are different. The most popular methods are Chi-Square test (), Kolmogorov–Smirnov (K–S) test, Anderson–Darling (A–D) test, Root Mean Square Error (RMSE) test, and correlation coefficient test. To examine the fitting effect of the GPD in the MCDA, three commonly used test criteria, the K–S test, test, and RMSE test, are adopted. The details of these tests are shown as follows:The test value measures the greatest deviation between the theoretical CDF and the empirical CDF; the test value computes the deviation between the observed frequency and the expected frequency ; and the test value calculates the root mean deviation between the theoretical PDF and the empirical PDF. Among the three tests, the smaller the test values , , and , the better the effect of the fitting distribution obtained.

3.2. Entropy Method

For each candidate threshold, a test value will be calculated under the corresponding test criterion. For a series of candidates under multiple test criteria, there will be many test values. Each of them has its role in the evaluation of fitting accuracy. To find a suitable threshold under multiple criteria, these criteria should be considered simultaneously. Considering the different principles of the test criteria, the greater the weight value is, the more important the test criterion will be. The weight values of the criteria should be settled objectively, and therefore the maximum entropy method is used.

Given that the tests are achieved by a range of candidate thresholds, each threshold belongs to a criterion. The number of candidate thresholds is , the number of test criteria is , and represents the performance value of the th candidate threshold to the th criterion from the decision matrix where the vectors () represent the criteria and the alternatives () represent the candidate thresholds. The vector represents the importance and relative weights of the chosen criteria, which can be obtained from the decision matrix. To show the weight values of different criteria more directly, the decision matrix is transformed toThen the normalized decision matrix is obtained from (10) as follows:As a useful tool to describe the relative weights of the criteria, the th entropy value is defined by and obtained from the following formulation:Note that there is an inverse relationship between and . To ensure (13), is substituted with and normalized for direct illustrationwhere is the objective weight value of the th criterion. The most reasonable expression of normalized weight value is defined byThe normalized weight value of different criteria can be obtained fromFinally, the comprehensive evaluation indicator can be calculated considering multiple test criteria by (16). When the GPD fitting is performed, the smaller the comprehensive evaluation indicator, the better the fitting effect of goodness-of-fit obtained

4. Simulation Study

To validate the application and feasibility of the proposed MCDA method, a simulation study is carried out. The advantage of the simulation is to produce a required mixture distribution where the tail part obeys the GDP distribution and the threshold is known. Referring to the probabilistic vehicle load models (Yang et al. 2014), the proportions and statistical parameters of different kinds of vehicles are shown in Table 1. The body part of the mixture distribution in the simulation includes two-axle vehicle loads of light vehicles (Type I), heavy vehicles (Type II), and trucks (Type III). The two-axle vehicle loads obey the log-normal distributions as expressed in (17), where is the random variable of the vehicle’s gross weight and and are the mean value and the standard deviation of logarithmic values of vehicle loads. The tail part of the mixture distribution in the simulation includes multiple-axle vehicles (Type IV). The multiple-axle vehicle loads obey the GPD, and the parameters are set as ξ = 0.001, σ = 6, and μ = 17.5. According Table 1, simulated vehicle loads are generated, and their mixture distribution is shown in Figure 3. The two-axle vehicle loads in the body part range from 0 to 17.5 t and the multiple-axle vehicle loads in the tail part are above the threshold 17.5 t, and the threshold of the mixture distribution is 17.5 tA total of 201 candidate thresholds are set from the interval , and the GPD of each threshold is fitted to the extracted samples of extreme values, respectively. The goodness-of-fit tests are performed based on single criterion and multiple criteria, and the results are shown in Figure 4. It can be seen from Figure 4(a) that the statistics can reach extremum rather than minimum when the candidate threshold is 17.5 t. reaches its minimum when the threshold is 20.3 t, reaches its minimum when the threshold is 21.6 t, and reaches its minimum when the threshold is 16.1 t. Based on the maximum entropy method, the objective weight values of the statistics , , and are calculated: . Then the comprehensive indicator is calculated. It can be concluded from Figure 4(b) that the comprehensive evaluation indicator reaches the minimum value when the threshold is 17.5 t and that the value is the same with the preset threshold of the mixture distribution.

5. Case Study

5.1. Fit the Distribution Using Traditional Approach

Figure 5 shows the real image of the Dalian Northern Suspension Bridge. The bridge, opened to public in May 1987, is the first three-span suspension bridge in China with simply supported stiffening trusses and earth anchorage system. The bridge has two pedestrian lanes and two vehicular lanes that can carry heavy traffic, which played a significant role in the coastal highway of Dalian. Restricted to the road width, multiple-axle wheeled vehicles are not allowed to pass through the bridge. During the rehabilitation in April 2016, a sophisticated online SHM system was designed and equipped in the suspension bridge. The SHM system incorporates a WIM system, which is located at the north side of the bridge to measure the traffic load. The measured vehicle loads of the suspension bridge from June 2016 to June 2017 are employed to illustrate the proposed method of threshold selection for the POT framework in the extreme vehicle load analysis.

Figure 6 shows the time history of the actual vehicle load and its statistical distribution. The vehicle load of the Dalian Northern Suspension Bridge has been continuously observed since the installation of the SHM system, and a representative example of the measured load history is shown in Figure 6(a). The entire range and tail region of the statistical distribution of vehicle loads are illustrated in Figures 6(b)6(d). The theoretical fitting to the empirical data uses log-normal distribution, which is recommend by Unified Standard for Reliability Design of Highway Engineering Structures (GB/T 50283-1999). It can be seen from Figures 6(c) and 6(d) that the fitting effect is not ideal as the theoretical distribution deviates considerably from the empirical distribution, especially in the tail region.

5.2. Threshold Selection Using MCDA

The measured vehicle loads range from 0.14 t to 32 t, a total of 281 candidate thresholds from 2 t to 30 t are set as the intervals, and the GPD fitting under each candidate threshold is conducted after the POT process, respectively. Figure 7 shows the statistics of , , , and their comprehensive evaluation indicator. It can be seen from Figure 7(a) that the selected thresholds are not consistent under different test criteria and the curve trends for different test criteria are quite different. reaches its minimum when the threshold is 3.6 t; reaches its minimum when the threshold is 22.6 t; and reaches its minimum when the threshold is 10.9 t. Based on the maximum entropy method, the objective weight values of the statistics , , and are calculated: , and then the comprehensive evaluation indicator is obtained as shown in Figure 7(b). It can be concluded from Figure 7(b) that the comprehensive evaluation indicator reaches the minimum value when the threshold is 7.4 t. The statistics of , , and reach their extremum when the threshold is 7.4 t, and this value is selected as the threshold for extreme load estimation.

5.3. GPD Fitting and Extreme Load Estimation

After selecting a reasonable threshold, the monitored vehicle loads are processed through the POT framework and the extracted extreme values are fitted to the GPD. Figure 8 shows the GPD fitting of extracted extreme vehicle loads above the selected threshold and its graphical Q-Q plot. It can be seen from Figure 8(a) that the fitting accuracy of extreme vehicle loads has been improved considerably compared with traditional approach. When the fitting effect is satisfactory, most empirical observation values excess threshold are scattered near the line in the Q-Q plot. It can be seen from Figure 8(b) that most empirical observations can scatter near the fitting line and there is only a small difference between fitted values and empirical values in the Q-Q plot.

To estimate the extreme vehicle loads and validate the fitting effect of GPD, return level analysis is carried out. Figure 9 shows the return level analysis of the vehicle loads under the thresholds μ = 7.4 t and μ = 10.9 t. It can be from Figure 9 that the empirical observations over the selected threshold are located within 95% confidence interval of the estimated levels. When the return period is 100 months, the return values are 30.7 t and 33.7 t under the threshold μ = 7.4 t and μ = 10.9 t, which indicates that the estimated values of extreme load are in a more reasonable range because heavy multiple-axle wheeled vehicles are restricted. The results convince us that the GPD fitting effect is relatively satisfactory and the selected threshold is reasonable.

6. Conclusions

When estimating extreme values of vehicle loads through the POT method, threshold selection is the key point because the chosen threshold is sensitive to the extracted extreme loads sample and parameter estimation. To model a reliable extreme distribution by reducing the fitting error of the GPD and evaluate the selected threshold, a comprehensive evaluation approach based on MCDA is proposed in this paper. It is a new trial considering multiple criteria in threshold selection, and the weight values are calculated by using the maximum entropy theory. The optimal threshold is determined through a range of candidate thresholds automatically and fitting results are analyzed quantitatively.

The simulated data and field measurement of vehicle loads were adopted to validate the practicability and superiority of the proposed method, respectively. The results illustrate that the MCDA method proposed in this paper reduces subjectivity during threshold selection, which can be flexibly used in various situations. The proposed MCDA can help select a suitable threshold for the POT framework, and it exhibits a better estimation of extreme vehicle loads through return level analysis with the selected threshold. In practice, the proposed MCDA is a useful tool in the extreme value analysis that complements the existing methods of thresholds selection.

Conflicts of Interest

There are no conflicts of interest related to this paper.

Acknowledgments

This presented research was financially supported by the National Natural Science Foundation of China (Grant no. 51778108).