Threshold Selection for POT Framework in the Extreme Vehicle Loads Analysis Based on Multiple Criteria

Wu, Guangrun; Qiu, Wenliang

doi:https://doi.org/10.1155/2018/4654659

Shock and Vibration

On this page

Abstract Introduction Conclusions Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2018 | Article ID 4654659 | https://doi.org/10.1155/2018/4654659

Threshold Selection for POT Framework in the Extreme Vehicle Loads Analysis Based on Multiple Criteria

Guangrun Wu¹and Wenliang Qiu¹

Academic Editor: Roberto Palma

Received10 Nov 2017

Revised22 Jan 2018

Accepted05 Mar 2018

Published11 Apr 2018

Abstract

Extreme value of vehicle load plays an important role in bridge design and risk assessment. Peaks over threshold (POT) is a method commonly used in extreme load estimation. The selection of thresholds for the POT method is extremely crucial, but the selected optimal threshold varies under different test criteria. Therefore, a method to select the suitable threshold is developed based on multiple criteria decision analysis (MCDA) in the paper. In MCDA, Chi-Square () test, Kolmogorov–Smirnov (K–S) test, and Root Mean Square Error (RMSE) in probability distribution functions (PDF) are employed as the test criteria and the weight of these criteria is calculated using the entropy method. Finally, vehicle loads obtained from simulation and field measurement are adopted to validate the effectiveness and feasibility of the proposed method. The results indicate that the proposed MCDA is a useful and complementary tool for threshold selection in the extreme value analysis.

1. Introduction

Vehicle load is one of the most significant factors for bridge design, safety assessment, and fatigue analysis [1–5]. Overloaded heavy vehicles is the primary reason for the deterioration of structural components and the degradation of the bridge’s overall state [6–8]. The condition of road traffic has gained a notable increase during the service period due to the rapid economic growth. Many bridge collapse accidents have resulted from overloading vehicles. In addition, the number of bridges worldwide stepping into their ageing stage is growing, which raises the risk of structural safety for users. It is extremely expensive to rehabilitate or maintain the degraded bridge due to the project itself, traffic disruption, and resulting delays. Large amounts of costs associated with rehabilitation and maintenance can be saved by proving that the bridge is safe through accurate risk assessment or proposing an appropriate repair scheme [9]. Therefore, it is necessary to have a good estimation of vehicle loads, and this typically involves using Extreme Value Theory (EVT).

Collecting the actual load time history is time consuming and costly. Generally, the extreme loads among the design life of the bridge cannot be measured through limited period of data. The EVT can solve the problem through the estimation of the extreme vehicle loads from a relative short period to a long period. The EVT is used to quantify the stochastic behavior of a process at unusually large or small levels. Most of the published works about the EVT applied to vehicle loads are on the basis of the block maximum (BM) and POT methods [10]. The BM method divides the observation period into nonoverlapping periods of equal size. Then the largest value in each period is extracted to generate maximum sample, which can be fitted by generalized extreme value (GEV) distribution. Using the BM method to extreme value estimation requires a large quantity of data, and thus it is not appropriate to use this method when the available data is insufficient. An alternative to the BM method is peak over threshold (POT) method. The POT method extracts the peak values exceeding a threshold, and the extracted data can be fitted by generalized Pareto distribution (GPD).

A well-known problem for the POT method is to select a suitable threshold [11]. Threshold selection is the trickiest part of the POT method because only a narrow range of thresholds is applicable. On the one hand, a considerable high threshold is required to guarantee that GPD parameters are stable. On the other hand, the threshold level cannot be too high so that sufficient data can be included [12]. The literature suggests that the most used threshold selection methods are based on judgement [13]. One way is to use graphical diagnostic plots, but interpreting these plots is subjective and rather challenging. The other way is to select a fixed quantile threshold corresponding to a high nonexceedance probability. The drawback of this method is that once the threshold has been determined, it is treated as fixed and the associated subjectivity and uncertainty are ignored in subsequent inferences.

In contrast to the above methods, the numerical approaches are relatively objective for threshold election, which are conducted based on goodness-of-fit tests across a range of different thresholds. There are many popular criteria for the goodness-of-fit tests to measure the distance between the cumulative distribution function of the reference distribution and the empirical distribution of the sample, such as Root Mean Square Error (RMSE) test, Kolmogorov–Smirnov (K–S) test, Pearson’s Chi-Square () test, and Anderson–Darling (A–D) test. It is observed that the selected threshold under one test criterion may not be the optimal threshold for other criteria. The performance of extreme value estimation is greatly affected when the extracted sample changes due to the test criterion variation. To select an optimal and stable threshold for the POT method, it is meaningful to do some research to take multiple criteria into consideration. The multiple criteria decision analysis (MCDA) approach works when practical problems are characterized by several conflicting criteria [14]. An entropy method is employed to assign the weight value of each criterion for the MCDA, and then the MCDA can integrate results obtained from goodness-of-fit under different criteria into a comprehensive one, which makes the selection more scientific and objective [15].

In this paper, the MCDA is proposed to select a threshold for the POT method in the extreme value analysis of vehicle loads. Section 2 presents the theory of the POT method, the mathematical methodologies of MLE, and the framework of extreme loads estimation. Section 3 presents details of the MCDA approach involving test criteria and the maximum entropy theory. Sections 4 and 5 present a practical application of theories illustrated in Sections 2 and 3 on simulated and field monitored vehicle loads. Finally, Section 6 presents some conclusions of this paper.

2. Theory of Extreme Value Analysis

2.1. The POT Method

Figure 1 shows the POT method of how to extract extreme values from the observations, where μ is the threshold. The distribution of the entire observations can be treated as a combination of the body part and the tail part. Under the POT framework, our focus is the observations exceeding the threshold, not the entire observations. When the values of the observations exceed the threshold, these observations are called exceedances (). Supposing is a sequence of independent and identically distributed random variables belonging to a continuous distribution function, that is, . The exceedances above the threshold μ are modelled by , and the distribution function of is expressed as follows:When μ is sufficiently large, the distribution function of approximately obeys the GPD, whose cumulative distribution function is defined inwhere ξ is the shape parameter of the distribution, σ is the scale parameter of the distribution, and θ is the vector . Often, the threshold μ is supposed to be known prior to parameters estimation and the GPD is specified only by parameters σ and ξ. The GPD is divided into three types depending on the value of shape parameter ξ: is short-tailed when ; is medium-tailed when ξ = 0; and is heavy-tailed when ξ > 0. The estimation of extreme vehicle loads is calculated through return level analysis. In the GPD model, the return value of the -year return period can be obtained as follows:where is the number of observations and is the number of observations exceeding the threshold.

2.2. MLE Method

Calculating the parameters of the GPD is another decisive issue in the extreme value analysis. The maximum likelihood estimation (MLE) method can balance the observed data and possible prediction [16]. When the distribution is known, the MLE method is commonly used to calculate the unknown parameters of the distribution with advantages of asymptotic normality, invariance, and congruency. The MLE method is based on the likelihood function as follows:where is the PDF of th observed variable and is the number of the extracted samples. In practice, the logarithmic form is more convenient to perform. The logarithmic form of the likelihood function takesWhen the extracted sample is obtained, the likelihood function becomes the function of the variable . The maximum value of occurs when and the variable is determined.Substituting (5) into (6), (7) can be obtained. The parameters of the GPD are obtained by solving

2.3. Flowchart of Extreme Loads Estimation

Figure 2 illustrates the flowchart of the extreme vehicle loads estimation. As illustrated in Figure 2, once a value is set as candidate threshold, the corresponding GPD fitting can be carried out. The accuracy of the GPD fitting is checked by goodness-of-fit tests under multiple criteria, and then the optimal threshold is determined based on MCDA.

The key point of the flowchart is MCDA, whose process contains four main stages including scope determination of candidate thresholds, criteria selection, criteria weight determination, and threshold examination. According to the character of the load distribution, candidate thresholds are set in ascending order, where is the relatively low quantile of the load data value and is the 99% quantile of the load data value. Three popular test criteria are adopted in the MCDA, and the weight of each criterion is determined based on the maximum entropy theory. The maximum entropy method is an objective weight-determining method because the criterion that has the greatest difference will take the largest weight in decision-making [15, 17]. To validate the reliability of the proposed method of the threshold selection, the graphical quantile-quantile (Q-Q) plot is performed. Finally, the estimation of extreme vehicle loads is conducted through return level analysis.

3. Methodology of the Threshold Selection

3.1. Test of Goodness-of-Fit

There are nearly 40 test criteria in the statistical literature, and the principles of these criteria are different. The most popular methods are Chi-Square test (), Kolmogorov–Smirnov (K–S) test, Anderson–Darling (A–D) test, Root Mean Square Error (RMSE) test, and correlation coefficient test. To examine the fitting effect of the GPD in the MCDA, three commonly used test criteria, the K–S test, test, and RMSE test, are adopted. The details of these tests are shown as follows:The test value measures the greatest deviation between the theoretical CDF and the empirical CDF; the test value computes the deviation between the observed frequency and the expected frequency ; and the test value calculates the root mean deviation between the theoretical PDF and the empirical PDF. Among the three tests, the smaller the test values , , and , the better the effect of the fitting distribution obtained.

3.2. Entropy Method

For each candidate threshold, a test value will be calculated under the corresponding test criterion. For a series of candidates under multiple test criteria, there will be many test values. Each of them has its role in the evaluation of fitting accuracy. To find a suitable threshold under multiple criteria, these criteria should be considered simultaneously. Considering the different principles of the test criteria, the greater the weight value is, the more important the test criterion will be. The weight values of the criteria should be settled objectively, and therefore the maximum entropy method is used.

Given that the tests are achieved by a range of candidate thresholds, each threshold belongs to a criterion. The number of candidate thresholds is , the number of test criteria is , and represents the performance value of the th candidate threshold to the th criterion from the decision matrix where the vectors () represent the criteria and the alternatives () represent the candidate thresholds. The vector represents the importance and relative weights of the chosen criteria, which can be obtained from the decision matrix. To show the weight values of different criteria more directly, the decision matrix is transformed toThen the normalized decision matrix is obtained from (10) as follows:As a useful tool to describe the relative weights of the criteria, the th entropy value is defined by and obtained from the following formulation:Note that there is an inverse relationship between and . To ensure (13), is substituted with and normalized for direct illustrationwhere is the objective weight value of the th criterion. The most reasonable expression of normalized weight value is defined byThe normalized weight value of different criteria can be obtained fromFinally, the comprehensive evaluation indicator can be calculated considering multiple test criteria by (16). When the GPD fitting is performed, the smaller the comprehensive evaluation indicator, the better the fitting effect of goodness-of-fit obtained

4. Simulation Study

To validate the application and feasibility of the proposed MCDA method, a simulation study is carried out. The advantage of the simulation is to produce a required mixture distribution where the tail part obeys the GDP distribution and the threshold is known. Referring to the probabilistic vehicle load models (Yang et al. 2014), the proportions and statistical parameters of different kinds of vehicles are shown in Table 1. The body part of the mixture distribution in the simulation includes two-axle vehicle loads of light vehicles (Type I), heavy vehicles (Type II), and trucks (Type III). The two-axle vehicle loads obey the log-normal distributions as expressed in (17), where is the random variable of the vehicle’s gross weight and and are the mean value and the standard deviation of logarithmic values of vehicle loads. The tail part of the mixture distribution in the simulation includes multiple-axle vehicles (Type IV). The multiple-axle vehicle loads obey the GPD, and the parameters are set as ξ = 0.001, σ = 6, and μ = 17.5. According Table 1, simulated vehicle loads are generated, and their mixture distribution is shown in Figure 3. The two-axle vehicle loads in the body part range from 0 to 17.5 t and the multiple-axle vehicle loads in the tail part are above the threshold 17.5 t, and the threshold of the mixture distribution is 17.5 tA total of 201 candidate thresholds are set from the interval , and the GPD of each threshold is fitted to the extracted samples of extreme values, respectively. The goodness-of-fit tests are performed based on single criterion and multiple criteria, and the results are shown in Figure 4. It can be seen from Figure 4(a) that the statistics can reach extremum rather than minimum when the candidate threshold is 17.5 t. reaches its minimum when the threshold is 20.3 t, reaches its minimum when the threshold is 21.6 t, and reaches its minimum when the threshold is 16.1 t. Based on the maximum entropy method, the objective weight values of the statistics , , and are calculated: . Then the comprehensive indicator is calculated. It can be concluded from Figure 4(b) that the comprehensive evaluation indicator reaches the minimum value when the threshold is 17.5 t and that the value is the same with the preset threshold of the mixture distribution.

(a)

(b)

(a)

(b)

5. Case Study

5.1. Fit the Distribution Using Traditional Approach

Figure 5 shows the real image of the Dalian Northern Suspension Bridge. The bridge, opened to public in May 1987, is the first three-span suspension bridge in China with simply supported stiffening trusses and earth anchorage system. The bridge has two pedestrian lanes and two vehicular lanes that can carry heavy traffic, which played a significant role in the coastal highway of Dalian. Restricted to the road width, multiple-axle wheeled vehicles are not allowed to pass through the bridge. During the rehabilitation in April 2016, a sophisticated online SHM system was designed and equipped in the suspension bridge. The SHM system incorporates a WIM system, which is located at the north side of the bridge to measure the traffic load. The measured vehicle loads of the suspension bridge from June 2016 to June 2017 are employed to illustrate the proposed method of threshold selection for the POT framework in the extreme vehicle load analysis.

Figure 6 shows the time history of the actual vehicle load and its statistical distribution. The vehicle load of the Dalian Northern Suspension Bridge has been continuously observed since the installation of the SHM system, and a representative example of the measured load history is shown in Figure 6(a). The entire range and tail region of the statistical distribution of vehicle loads are illustrated in Figures 6(b)–6(d). The theoretical fitting to the empirical data uses log-normal distribution, which is recommend by Unified Standard for Reliability Design of Highway Engineering Structures (GB/T 50283-1999). It can be seen from Figures 6(c) and 6(d) that the fitting effect is not ideal as the theoretical distribution deviates considerably from the empirical distribution, especially in the tail region.

(a)

(b)

(c)

(d)

5.2. Threshold Selection Using MCDA

The measured vehicle loads range from 0.14 t to 32 t, a total of 281 candidate thresholds from 2 t to 30 t are set as the intervals, and the GPD fitting under each candidate threshold is conducted after the POT process, respectively. Figure 7 shows the statistics of , , , and their comprehensive evaluation indicator. It can be seen from Figure 7(a) that the selected thresholds are not consistent under different test criteria and the curve trends for different test criteria are quite different. reaches its minimum when the threshold is 3.6 t; reaches its minimum when the threshold is 22.6 t; and reaches its minimum when the threshold is 10.9 t. Based on the maximum entropy method, the objective weight values of the statistics , , and are calculated: , and then the comprehensive evaluation indicator is obtained as shown in Figure 7(b). It can be concluded from Figure 7(b) that the comprehensive evaluation indicator reaches the minimum value when the threshold is 7.4 t. The statistics of , , and reach their extremum when the threshold is 7.4 t, and this value is selected as the threshold for extreme load estimation.

(a)

(b)

5.3. GPD Fitting and Extreme Load Estimation

After selecting a reasonable threshold, the monitored vehicle loads are processed through the POT framework and the extracted extreme values are fitted to the GPD. Figure 8 shows the GPD fitting of extracted extreme vehicle loads above the selected threshold and its graphical Q-Q plot. It can be seen from Figure 8(a) that the fitting accuracy of extreme vehicle loads has been improved considerably compared with traditional approach. When the fitting effect is satisfactory, most empirical observation values excess threshold are scattered near the line in the Q-Q plot. It can be seen from Figure 8(b) that most empirical observations can scatter near the fitting line and there is only a small difference between fitted values and empirical values in the Q-Q plot.

(a)

(b)

To estimate the extreme vehicle loads and validate the fitting effect of GPD, return level analysis is carried out. Figure 9 shows the return level analysis of the vehicle loads under the thresholds μ = 7.4 t and μ = 10.9 t. It can be from Figure 9 that the empirical observations over the selected threshold are located within 95% confidence interval of the estimated levels. When the return period is 100 months, the return values are 30.7 t and 33.7 t under the threshold μ = 7.4 t and μ = 10.9 t, which indicates that the estimated values of extreme load are in a more reasonable range because heavy multiple-axle wheeled vehicles are restricted. The results convince us that the GPD fitting effect is relatively satisfactory and the selected threshold is reasonable.

(a)

(b)

6. Conclusions

When estimating extreme values of vehicle loads through the POT method, threshold selection is the key point because the chosen threshold is sensitive to the extracted extreme loads sample and parameter estimation. To model a reliable extreme distribution by reducing the fitting error of the GPD and evaluate the selected threshold, a comprehensive evaluation approach based on MCDA is proposed in this paper. It is a new trial considering multiple criteria in threshold selection, and the weight values are calculated by using the maximum entropy theory. The optimal threshold is determined through a range of candidate thresholds automatically and fitting results are analyzed quantitatively.

The simulated data and field measurement of vehicle loads were adopted to validate the practicability and superiority of the proposed method, respectively. The results illustrate that the MCDA method proposed in this paper reduces subjectivity during threshold selection, which can be flexibly used in various situations. The proposed MCDA can help select a suitable threshold for the POT framework, and it exhibits a better estimation of extreme vehicle loads through return level analysis with the selected threshold. In practice, the proposed MCDA is a useful tool in the extreme value analysis that complements the existing methods of thresholds selection.

Conflicts of Interest

There are no conflicts of interest related to this paper.

Acknowledgments

This presented research was financially supported by the National Natural Science Foundation of China (Grant no. 51778108).

References

A. Getachew and E. J. Obrien, “Simplified site-specific traffic load models for bridge assessment,” Structure and Infrastructure Engineering, vol. 3, no. 4, pp. 303–311, 2007.
View at: Publisher Site | Google Scholar
D. M. Frangopol, A. Strauss, and S. Kim, “Bridge reliability assessment based on monitoring,” Journal of Bridge Engineering, vol. 13, no. 3, pp. 258–270, 2008.
View at: Publisher Site | Google Scholar
S. G. M. Neves, A. F. M. Azevedo, and R. Calçada, “A direct method for analyzing the vertical vehicle-structure interaction,” Engineering Structures, vol. 34, pp. 414–420, 2012.
View at: Publisher Site | Google Scholar
D. Sun, X. Wang, B. Chen, and B. Sun, “Probability model of hangzhou bay bridge vehicle loads using weigh-in-motion data,” Shock and Vibration, vol. 2015, Article ID 576083, 10 pages, 2015.
View at: Publisher Site | Google Scholar
M. Soriano, J. R. Casas, and M. Ghosn, “Simplified probabilistic model for maximum traffic load from weigh-in-motion data,” Structure and Infrastructure Engineering, vol. 13, pp. 454–467, 2017.
View at: Publisher Site | Google Scholar
G. Fu, L. Liu, and M. D. Bowman, “Multiple presence factor for truck load on highway bridges,” Journal of Bridge Engineering, vol. 18, no. 3, pp. 240–249, 2011.
View at: Publisher Site | Google Scholar
W. Qiu, M. Jiang, and C. Huang, “Parametric study on responses of a self-anchored suspension bridge to sudden breakage of a hanger,” The Scientific World Journal, vol. 2014, Article ID 512120, 10 pages, 2014.
View at: Publisher Site | Google Scholar
X. Shi, Z. Zhou, and X. Ruan, “Failure analysis of a girder bridge collapse under eccentric heavy vehicles,” Journal of Bridge Engineering, vol. 21, Article ID 05016009, 2016.
View at: Google Scholar
M. Liu and D. M. Frangopol, “Optimizing bridge network maintenance management under uncertainty with conflicting criteria: Life-cycle maintenance, failure, and user costs,” Journal of Structural Engineering, vol. 132, no. 11, Article ID 016611QST, pp. 1835–1845, 2006.
View at: Publisher Site | Google Scholar
P. Thompson, Y. Cai, D. Reeve, and J. Stander, “Automated threshold selection methods for extreme wave analysis,” Coastal Engineering Journal, vol. 56, no. 10, pp. 1013–1021, 2009.
View at: Publisher Site | Google Scholar
A. Tancredi, C. Anderson, and A. O'Hagan, “Accounting for threshold uncertainty in extreme value estimation,” Extremes, vol. 9, no. 2, pp. 87–106, 2006.
View at: Publisher Site | Google Scholar | MathSciNet
C. Scarrott and A. MacDonald, “A review of extreme value threshold estimation and uncertainty quantification,” REVSTAT–Statistical Journal, vol. 10, no. 1, pp. 33–60, 2012.
View at: Google Scholar | MathSciNet
C. Caballero-Megido, J. Hillier, D. Wyncoll, L. Bosher, and B. Gouldby, “Technical note: Comparison of methods for threshold selection for extreme sea levels,” Journal of Flood Risk Management, 2017.
View at: Publisher Site | Google Scholar
H. Çalışkan, B. Kurşuncu, C. Kurbanoğlu, and Ş. Y. Güven, “Material selection for the tool holder working under hard milling conditions using different multi criteria decision making methods,” Materials & Design, vol. 45, pp. 473–479, 2013.
View at: Publisher Site | Google Scholar
J.-J. Wang, Y.-Y. Jing, C.-F. Zhang, and J.-H. Zhao, “Review on multi-criteria decision analysis aid in sustainable energy decision-making,” Renewable & Sustainable Energy Reviews, vol. 13, no. 9, pp. 2263–2278, 2009.
View at: Publisher Site | Google Scholar
J. R. Hosking and J. R. Wallis, “Parameter and quantile estimation for the generalized pareto distribution,” Technometrics, vol. 29, no. 3, pp. 339–349, 1987.
View at: Publisher Site | Google Scholar
J. Wang, N. Wang, Z. Wang, Y. Zhang, and L. Liu, “Determination of the minimum sample size for the transmission load of a wheel loader based on multi-criteria decision-making technology,” Journal of Terramechanics, vol. 49, no. 3-4, pp. 147–160, 2012.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2018 Guangrun Wu and Wenliang Qiu. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2632

Downloads

945

Citations