#### Abstract

The quality of in situ data is key to calculating the resistance factor of bored piles. However, accurate data are difficult to assemble because of the various uncertainties in engineering practice. This paper combines the Bayesian method with mathematical statistics to propose an estimation method for updating in situ data. A testing database of bored piles (33 tests in noncohesive soils and 53 tests in cohesive soils) is compiled. The model factor of bored piles is quantified as the ratio of the measured capacity to the calculated capacity. The proposed method classifies the compiled data into three categories: “good data,” “general data,” and “bad data.” The “bad data” are discarded because they degrade the calculation, and Bayesian theory is used to update the model factor statistics. Three methods are used to calculate the reliability index and resistance factor of bored piles, and the results show that both are sensitive to the quality of the data. Finally, resistance factor values are proposed based on the load and resistance factor design (LRFD) bridge design specification, which can serve as a reference for revising relevant specifications. The proposed method can also be used to update other geotechnical data.

#### 1. Introduction

Bored piles, especially large-diameter piles, are commonly employed to support high-rise buildings and bridges in China and other countries because of their ability to sustain large loads [1–5]. The safety of bored pile foundations is therefore of great importance. Because of various uncertainties, the design parameters are random variables. However, the allowable stress design (ASD) philosophy describes design parameters as constants, which does not reflect this variability. To overcome this deficiency, the load and resistance factor design (LRFD) method is mandated by the American Association of State Highway and Transportation Officials [6]. Therefore, resistance factor calculation for bored piles is of engineering significance.

The resistance factor is calculated with reliability analysis methods based on pile load test data [7, 8]. Sufficiently accurate in situ data are necessary to calculate the resistance factor of bored piles. However, it is difficult to collect accurate in situ data for calculating the reliability index and resistance factor because of various uncertainties, for example, parameter uncertainty, calculation model uncertainty, random testing error, and systematic error. A large number of investigations have been carried out to calibrate the resistance factor of driven piles [7–16], and significant progress has been made. However, few investigations of the resistance factor of bored piles have been reported.

Parameter uncertainty and model uncertainty are two main difficulties for pile foundation designers. Numerous investigations of parameter uncertainty show that it includes random measurement error, systematic error, statistical uncertainty, and so on. Model uncertainty is mainly caused by simplifications in the calculation model. The European design specification for geotechnical engineering (EN 1997-1) clearly suggests that a model correction factor should be incorporated when the pile capacity is calculated using a simplified model [17]. However, this specification does not specify values for the correction factor and only suggests that different countries should adopt different values. Jones et al. [18], Kulhawy and Trautmann [19], Lacasse and Nadim [20], Meyerhof [21], and Phoon and Kulhawy [22, 23] study model uncertainty based on large sets of in situ pile capacity data, which shows that sufficiently accurate data are necessary to solve the model uncertainty problem. However, such data are difficult to obtain because of various uncertainties. In addition, the quality of collected data is uneven, and some of the data are regarded as “bad data” or “data outliers.” Data optimization therefore needs to be incorporated into the calculation of the resistance factor of bored piles.

This paper puts forward a Bayesian estimation method to update the in situ data of bored piles, and three reliability index calculation methods are used to calculate the reliability index of pile capacity from the processed data. The American LRFD Bridge Design Specification is then used to estimate the resistance factor of bored piles.

#### 2. Bayesian Optimization Method

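The Bayesian principle is a tool for updating a probability distribution using new information. Assuming that the prior distribution of a random variable (*X*) is $f'(X)$, its posterior distribution can be written as [24]

$$f''(X) = K\,L(X)\,f'(X) \tag{1}$$

where $f''(X)$ is the posterior distribution of *X*; *K* is a normalization constant; and *L*(*X*) is the likelihood function. The normal distribution and the log-normal distribution are frequently employed to fit the probability distribution of pile capacity [25].

Assume that *n* values of *X* are collected from engineering practice, denoted $x_1, x_2, \ldots, x_n$. The mean ($\mu$) and standard deviation ($\sigma$) are

$$\mu = \frac{1}{n}\sum_{i=1}^{n} x_i, \qquad \sigma = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}\left(x_i-\mu\right)^2} \tag{2}$$

Assume that $\mu$ and $\sigma$ are taken as the mean and standard deviation of the likelihood function, and let $\mu'$ and $\sigma'$ be the mean and standard deviation of the prior distribution. If *X* obeys a normal distribution, the posterior mean ($\mu''$) and variance ($\sigma''^2$) are

$$\mu'' = \frac{\mu'\sigma^2 + \mu\,\sigma'^2}{\sigma'^2 + \sigma^2} \tag{3}$$

$$\sigma''^2 = \frac{\sigma'^2\,\sigma^2}{\sigma'^2 + \sigma^2} \tag{4}$$

If *X* obeys a log-normal distribution, it can be transformed into a normal random variable by taking the natural logarithm, ln *X*. The mean ($\mu_{\ln X}$) and standard deviation ($\sigma_{\ln X}$) of ln *X* are

$$\mu_{\ln X} = \ln \mu_X - \frac{1}{2}\sigma_{\ln X}^2 \tag{5}$$

$$\sigma_{\ln X} = \sqrt{\ln\left(1 + \frac{\sigma_X^2}{\mu_X^2}\right)} \tag{6}$$

where $\mu_X$ and $\sigma_X$ are the mean and standard deviation of *X*.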
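The updating step can be sketched in Python as follows. This is a minimal illustration that assumes the standard conjugate result for a normal prior combined with a normal likelihood, and the usual moment conversion between a log-normal variable and its logarithm. The function names and the numerical inputs are hypothetical and are not taken from the paper.

```python
import math


def normal_posterior(prior_mean, prior_std, like_mean, like_std):
    """Combine a normal prior with a normal likelihood (variance weighting)."""
    prior_var = prior_std ** 2
    like_var = like_std ** 2
    post_mean = (prior_mean * like_var + like_mean * prior_var) / (prior_var + like_var)
    post_std = math.sqrt(prior_var * like_var / (prior_var + like_var))
    return post_mean, post_std


def to_log_space(mean, cov):
    """Mean and standard deviation of ln X for a log-normal X with given mean and COV."""
    sigma_ln = math.sqrt(math.log(1.0 + cov ** 2))
    mu_ln = math.log(mean) - 0.5 * sigma_ln ** 2
    return mu_ln, sigma_ln


def from_log_space(mu_ln, sigma_ln):
    """Inverse of to_log_space: recover the mean and COV of X."""
    mean = math.exp(mu_ln + 0.5 * sigma_ln ** 2)
    cov = math.sqrt(math.exp(sigma_ln ** 2) - 1.0)
    return mean, cov


def lognormal_posterior(prior_mean, prior_cov, like_mean, like_cov):
    """Update log-normal statistics by applying the normal update to ln X."""
    mu_p, sig_p = to_log_space(prior_mean, prior_cov)
    mu_l, sig_l = to_log_space(like_mean, like_cov)
    return from_log_space(*normal_posterior(mu_p, sig_p, mu_l, sig_l))


# Hypothetical statistics, for illustration only (not values from the paper).
mean_u, cov_u = lognormal_posterior(prior_mean=1.10, prior_cov=0.35,
                                    like_mean=1.00, like_cov=0.15)
print(f"updated mean = {mean_u:.3f}, updated COV = {cov_u:.3f}")
```

Under these assumptions the updated standard deviation of ln *X* is always smaller than that of either input, so the updated COV is smaller than both the prior COV and the likelihood COV.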

The model factor is frequently represented as the ratio of the measured capacity to the calculated capacity [1, 7, 8]:

$$\lambda = \frac{Q_m}{Q_p} \tag{7}$$

where *λ* is the model factor of pile capacity; *Q*_{m} is the measured pile capacity; and *Q*_{p} is the calculated pile capacity. Numerous investigations show that the model factor is a random variable that obeys a log-normal distribution [1, 2, 7, 10, 13, 14].

To improve the accuracy of data collected from engineering practice, this paper employs the biased factor of *λ*, which is shown in the following equation [26]:

$$\varepsilon_i = \frac{\left|\lambda_i - \lambda_R\right|}{\lambda_R} \tag{8}$$

where *λ*_{i} is the *i*th model factor; $\varepsilon_i$ is the *i*th biased factor of *λ*; and *λ*_{R} is the mean of *λ*.

Based on equation (8), the data are classified as follows [26]:

(1) If the biased factor is less than 0.25, the data are defined as “good data” because they are close to the true values.

(2) If the biased factor lies between 0.25 and 0.50, the data are defined as “general data.”

(3) If the biased factor exceeds 0.50, the data are defined as “bad data.” “Bad data” are identified as extreme values and should be discarded.
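The classification rule can be sketched as below. This is an illustrative sketch only: it assumes the biased factor is the absolute deviation of each model factor from the sample mean, divided by that mean, and it uses the 0.25 and 0.50 limits quoted in Section 3. The handling of values exactly on a limit, the function name, and the sample values are assumptions.

```python
def classify_model_factors(model_factors, good_limit=0.25, bad_limit=0.50):
    """Split model factors into good / general / bad groups by their biased factor."""
    mean = sum(model_factors) / len(model_factors)
    groups = {"good": [], "general": [], "bad": []}
    for lam in model_factors:
        bias = abs(lam - mean) / mean  # assumed form of the biased factor
        if bias < good_limit:
            groups["good"].append(lam)
        elif bias < bad_limit:
            groups["general"].append(lam)
        else:
            groups["bad"].append(lam)
    return groups


# Hypothetical model factors (measured / calculated capacity), not data from the paper.
sample = [1.0, 1.1, 0.9, 1.4, 0.6, 2.0]
groups = classify_model_factors(sample)
for name, values in groups.items():
    print(name, values)
```

In this sketch the mean is computed once from all supplied values; whether the mean should be recomputed after the “bad data” are removed is a design choice the text does not settle.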

“Good data” are considered to be more reliable and should be treated as the prior information in estimation of the population statistics. However, the sample size of the “good data” is not sufficient to represent the whole population. This paper employs the Bayesian updating technique to evaluate the probability characteristics of the resistance bias factor for bored piles. The “general data” are treated as prior information, and the “good data” are treated as likelihood information. The updated model factor statistics can then be obtained using equations (3)–(6).

#### 3. Resistance Factor Estimation

This paper summarizes various bored pile capacity data shown in Tables 1 and 2 [1]. The data are divided into two groups, which are the data in noncohesive soil (D-NC) and the data in cohesive soil (D-C).

Dithinde et al. [1] use load-displacement curves (shown in Figure 1) to improve the quality of the collected data. The curve of Case Number 25 lies far from the other curves, so this case needs to be discarded. In addition, Dithinde et al. [1] employ the box plot method and detect that Case Number 24 and Case Number 26 are outliers. Therefore, Case Numbers 24, 25, and 26 should be discarded. Figure 2 shows the scatter diagram of the remaining data, which indicates that no data deviate markedly from the rest. However, this does not mean that the remaining data are absolutely reliable. This paper uses the proposed method to update the remaining data.

The classification results are shown in Tables 3 and 4. In noncohesive soil and cohesive soil, 21 and 38 data points, respectively, have a bias factor less than 0.25. The in situ uncertainties have little influence on these data, which are classified as “good data.” A further 8 and 12 data points, respectively, have a bias factor larger than 0.25 but less than 0.50 and are considered “general data.” However, the bias factors of Case Number 15 in noncohesive soil and Case Numbers 37, 68, and 69 in cohesive soil are 0.5187, 0.5271, 0.7315, and 0.5203, respectively. These data are classified as “bad data.” They can compromise engineering safety and should be discarded.

The log-normal distribution is used as the distribution of the model factor, and the model factor statistics are presented in terms of the mean and coefficient of variation (COV). Based on equations (5) and (6), the updated model factor statistics are obtained in Table 5 for reliability analysis and resistance factor calculation. The coefficient of variation is smallest for the updated data and largest for the “general data.” The updated model factors are therefore reliable enough to calculate the reliability index and resistance factor of bored piles.

According to reliability theory, the limit state equation of bored pile capacity is [16]

$$g = R - Q_D - Q_L = 0 \tag{9}$$

where *R* is the vertical pile capacity (kN); $Q_D$ is the dead load (kN); and $Q_L$ is the live load (kN). Three methods are employed to calculate the reliability index.

##### 3.1. First-Order Reliability Method

If the three parameters in equation (9) obey log-normal distributions, the reliability index can be written in closed form as equation (10) [6]. Its inputs are FOS, the factor of safety according to the allowable stress design method; the partial factors of dead load and live load; and the coefficients of variation of dead load and live load. Table 5 gives the means and coefficients of variation for “updating data,” “good data,” and “general data,” and the specifications give the load statistics, so the reliability index can be calculated using equation (10).
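As a hedged illustration of this step, the sketch below implements the closed-form first-order expression for log-normal resistance and loads that is widely used in LRFD calibration studies. It is an assumption that this is the same expression as equation (10). The sketch also treats the load “partial factors” as load bias factors (mean over nominal value), and the dead-to-live load ratio `qd_ql` and all numerical inputs are hypothetical.

```python
import math


def fosm_reliability_index(lam_r, cov_r, fos, qd_ql,
                           lam_qd, lam_ql, cov_qd, cov_ql):
    """First-order reliability index for log-normal resistance and loads.

    lam_r, cov_r   : mean and COV of the model (resistance bias) factor
    fos            : factor of safety from allowable stress design
    qd_ql          : dead-to-live load ratio (assumed input)
    lam_qd, lam_ql : load bias factors (assumed meaning of "partial factors")
    cov_qd, cov_ql : COVs of dead and live load
    """
    load_term = 1.0 + cov_qd ** 2 + cov_ql ** 2
    resist_term = 1.0 + cov_r ** 2
    mean_ratio = lam_r * fos * (qd_ql + 1.0) / (lam_qd * qd_ql + lam_ql)
    numerator = math.log(mean_ratio * math.sqrt(load_term / resist_term))
    denominator = math.sqrt(math.log(resist_term * load_term))
    return numerator / denominator


# Hypothetical inputs, for illustration only (not values from the paper).
beta = fosm_reliability_index(lam_r=1.0, cov_r=0.3, fos=2.5, qd_ql=2.0,
                              lam_qd=1.05, lam_ql=1.15,
                              cov_qd=0.10, cov_ql=0.20)
print(f"beta = {beta:.2f}")
```

The function takes every statistic as an explicit argument so that the same call can be repeated for the “updating data,” “good data,” and “general data” statistics.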

##### 3.2. Design Point Method

The limit-state function is linearized at a point on the failure surface (the design point); the performance function is given by equation (11) [6].

All the parameters in equation (11) have the same meanings as in equation (10). The calculation is carried out in MATLAB.

##### 3.3. Monte Carlo Simulation Method

The Monte Carlo simulation method is an accurate way to calculate the reliability index and is used here to check the accuracy of the other calculation methods. Its performance function is given by equation (12).

The calculation is carried out in MATLAB with 10 million simulations.
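A minimal Monte Carlo sketch of this calculation is given below in Python rather than MATLAB. It assumes the performance function is the capacity minus the dead and live loads, with all three variables log-normal and independent, and with nominal values normalised by the nominal live load. The function names, the dead-to-live load ratio `qd_ql` (which must be positive), the reduced sample size, and the inputs are hypothetical choices for illustration.

```python
import math
import random
from statistics import NormalDist


def log_params(mean, cov):
    """Parameters of ln X for a log-normal X with the given mean and COV."""
    sigma = math.sqrt(math.log(1.0 + cov ** 2))
    return math.log(mean) - 0.5 * sigma ** 2, sigma


def mc_reliability_index(lam_r, cov_r, fos, qd_ql, lam_qd, lam_ql,
                         cov_qd, cov_ql, n_samples=200_000, seed=0):
    """Estimate the reliability index from the sampled failure probability."""
    rng = random.Random(seed)
    # Nominal values are normalised by the nominal live load (taken as 1).
    r_nominal = fos * (qd_ql + 1.0)
    mu_r, sig_r = log_params(lam_r * r_nominal, cov_r)
    mu_d, sig_d = log_params(lam_qd * qd_ql, cov_qd)
    mu_l, sig_l = log_params(lam_ql, cov_ql)
    failures = 0
    for _ in range(n_samples):
        g = (rng.lognormvariate(mu_r, sig_r)
             - rng.lognormvariate(mu_d, sig_d)
             - rng.lognormvariate(mu_l, sig_l))
        if g < 0.0:
            failures += 1
    if failures == 0:
        raise ValueError("no failures sampled; increase n_samples")
    return -NormalDist().inv_cdf(failures / n_samples)


# Hypothetical inputs, for illustration only (not values from the paper).
beta_mc = mc_reliability_index(lam_r=1.0, cov_r=0.3, fos=2.0, qd_ql=2.0,
                               lam_qd=1.05, lam_ql=1.15,
                               cov_qd=0.10, cov_ql=0.20, n_samples=100_000)
print(f"beta (Monte Carlo) = {beta_mc:.2f}")
```

The sample size here is far smaller than the 10 million simulations reported in the text, so the estimate is noisier; higher target reliability indices need more samples because failures become rare.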

The values of , , , and can be obtained according to LRFD for Bridge Design Specification. 3.69 is selected as the value of [27].

Figures 3 and 4 show the calculated reliability indices. The reliability indices for “good data” and “general data” differ by more than 1.0, which is caused by the difference in data quality. However, the reliability index of “good data” is close to that of “updating data.” In addition, the reliability index is sensitive to soil type: it is larger in cohesive soil than in noncohesive soil.

[Figure 3: Calculation results of reliability index, panels (a)–(c).]

[Figure 4: Calculation results of reliability index, panels (a)–(c).]

The design formula of the load and resistance factor design method is [6]

$$\phi R_n \ge \sum \gamma_i Q_i \tag{13}$$

where *R*_{n} is the standard value of resistance (kN); *Q*_{i} is the standard value of load (kN); *ϕ* is the resistance factor; and *γ*_{i} is the load factor.

Reliability analysis is the basis of resistance factor calculation. The load and resistance factor design method proposes equation (14), based on the first-order reliability method [6], in which the target reliability index of piles is an input. Two of its inputs are set to 1.75 and 1.08 according to the LRFD Bridge Design Specification. The resistance factor then follows from equation (14).
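For illustration, the sketch below evaluates the closed-form first-order resistance factor commonly used in LRFD calibration for log-normal resistance and loads. It is an assumption that this matches equation (14). The load factors 1.25 and 1.75 are the familiar Strength I dead and live load factors, while the load bias factors, COVs, dead-to-live load ratio, and model factor statistics are hypothetical inputs.

```python
import math


def fosm_resistance_factor(lam_r, cov_r, beta_t, qd_ql,
                           gamma_d, gamma_l, lam_qd, lam_ql,
                           cov_qd, cov_ql):
    """First-order resistance factor for a target reliability index beta_t."""
    load_term = 1.0 + cov_qd ** 2 + cov_ql ** 2
    resist_term = 1.0 + cov_r ** 2
    numerator = (lam_r * (gamma_d * qd_ql + gamma_l)
                 * math.sqrt(load_term / resist_term))
    denominator = ((lam_qd * qd_ql + lam_ql)
                   * math.exp(beta_t * math.sqrt(math.log(resist_term * load_term))))
    return numerator / denominator


# Hypothetical statistics; the target indices 2.0, 2.5, and 3.0 follow the text.
for beta_t in (2.0, 2.5, 3.0):
    phi = fosm_resistance_factor(lam_r=1.0, cov_r=0.3, beta_t=beta_t, qd_ql=2.0,
                                 gamma_d=1.25, gamma_l=1.75,
                                 lam_qd=1.05, lam_ql=1.15,
                                 cov_qd=0.10, cov_ql=0.20)
    print(f"beta_T = {beta_t:.1f}: phi = {phi:.2f}")
```

With the other inputs fixed, the resistance factor in this expression decreases as the target reliability index or the model factor COV increases, which is why the data quality affects the result.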

If the reliability index is calculated using equation (11), the resistance factor follows from the limit state equation (15).

If the reliability index is calculated using equation (12), the resistance factor follows from the limit state equation (16).

Target reliability indices of 2.0, 2.5, and 3.0 are selected. Based on equations (14)–(16), the calculated resistance factors are shown in Table 6.

The quality of data has a marked effect on the resistance factor of bored piles. The design point method and the Monte Carlo simulation method are both satisfactorily accurate and can be taken as the benchmark for verifying the accuracy of the proposed method. The results of these two methods are larger than the results of the first-order reliability method, and the differences are 6.9% and 18.3%, respectively. Meanwhile, the difference between the two methods is close to 0. The results based on “good data” and “updating data” are more accurate than those based on “general data” and “all data.”

In summary, Table 7 shows the recommended values of the resistance factors. However, the reliability theory of pile foundations is not yet mature enough to be applied directly in engineering practice. The recommended values are proposed only according to the calculation results and the American LRFD Bridge Design Specification. Their application in engineering practice needs further study.

#### 4. Conclusions

From this study, the following conclusions are drawn:

(1) The proposed method, which combines probability theory and the Bayesian method, can not only classify the in situ data but also overcome the deficiency caused by the small sample of accurate data.

(2) Data classification has a significant effect on the reliability index and resistance factor. The results based on “good data” and “updating data” are larger than the results based on “general data” and “all data.” Meanwhile, the difference between the results for “good data” and “updating data” is close to 0. Therefore, “good data” and “updating data” can be used as the basis of resistance factor calculation.

(3) The reliability index and resistance factor are sensitive to the type of soil, and the calculation results in cohesive soil are larger than the results in noncohesive soil.

(4) The recommended values are proposed only according to the calculation results and the American LRFD Bridge Design Specification. Their application in engineering practice needs further study. However, the proposed method can be used to update other geotechnical data.

#### Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

#### Acknowledgments

The authors express their gratitude to the National Natural Science Foundation of China (no. 51978247) and Key Science and Technology Projects of Henan Province (no. 202102310242).