Abstract

The modeling of cracks and identification of dam behavior changes are difficult issues in dam health monitoring research. In this paper, a time-varying identification model for crack monitoring data is built using support vector regression (SVR) and the Bayesian evidence framework (BEF). First, the SVR method is adopted for better modeling of the nonlinear relationship between the crack opening displacement (COD) and its influencing factors. Second, the BEF approach is applied to determine the optimal SVR modeling parameters, including the penalty coefficient, the loss coefficient, and the width coefficient of the radial kernel function, under the principle that the prediction errors between the monitored and the model forecasted values are as small as possible. Then, considering the predicted COD, the historical maximum COD, and the time-dependent component, forewarning criteria are proposed for identifying the time-varying behavior of cracks and the degree of abnormality of dam health. Finally, an example of modeling and forewarning analysis is presented using two monitoring subsequences from a real structural crack in the Chencun concrete arch-gravity dam. The findings indicate that the proposed time-varying model can provide predicted results that are more accurately nonlinearity fitted and is suitable for use in evaluating the behavior of cracks in dams.

1. Introduction

Cracks are the most common structural threats to concrete dams. According to an investigative report issued by the International Committee on Large Dams, a total of 243 dams have collapsed as a result of cracking problems [1]. As an example, in the case of the Koelnbrein arch dam in Austria, diagonal cracks emerged at the heel of the dam, causing the grouting curtain to break. Under the enormous water pressure at the crack surfaces, these cracks expanded throughout the foundation and heel of the dam, which led to the full head of uplift pressure being applied over the entire surface of the base [2]. Therefore, the dam could no longer operate safely. In another example, soon after the pouring of the Phase II concrete for the Chencun dam in China, a crack of 300 m in length and more than 5 m in depth, which stretched across dam sections #5 to #28, emerged at the interface between the concrete layers poured in Phases I and II. Serious crack problems have also occurred during the operation of the Dworshak solid gravity dam in America, the Sayano-Shushenskaya arch-gravity dam in Russia, and the Xiaowan arch dam in China [35]. Cracks are destructive. Severe cracks can weaken the strength and rigidity of a dam, destroy its integrity and antipermeability, accelerate the corrosion and carbonation of the concrete, and endanger the safe operation of the dam.

The research on structural health monitoring of cracks is mainly classified into two categories [6, 7]. The first one focuses on the nondestructive detection and characterization of cracks, including location, size, depth, and other information. Numerous sensing techniques are proposed and applied, such as strain gauges [8], optical fiber sensor [9], piezoceramic sensor [10], carbon nanotube thread [11], and other vibration-based methods [12, 13]. The other one focuses on the assessment and prediction of crack propagation, as well as structural behavior, based on quantification monitoring data analysis. However, these data are still difficult to be obtained accounting for the limitations of sensing techniques, heterogeneity of concrete material, and bulkiness of dam concrete [7]. Luckily, there are still some real-time cracking series being obtained by later buried crack gauges in dam engineering, such as Chencun and Xiaowan dam [3].

To date, methods of propagation analysis and abnormality diagnosis for concrete crack behavior have mainly relied on fracture mechanics methods and applied mathematical methods. Currently, criteria such as the stress intensity factor criterion [1416], the strain energy density factor criterion [17], the maximum energy release rate criterion [18], the mixed energy-stress criterion [19], and the dual-K fracture criterion [20] are frequently adopted to determine whether unstable propagation is occurring. But the variables on these criteria are difficult to monitor accurately and inconvenient to apply in practical engineering. The other approach [19, 21] is based on the consideration that the crack will lose its stability rapidly when the crack tip opening displacement (CTOD) reaches its critical value (CTODC). Several mathematical models based on the CTOD have been established based on the relationship between the CTOD and the crack aperture. However, CTOD theory is also difficult to use in the diagnosis of crack propagation behavior, mainly because the CTOD is extremely difficult to monitor. Moreover, the indicator, CTODC, must be measured in fracture tests using large-sized specimens, which is difficult to achieve in typical laboratories [22]. Therefore, these methods are subject to limitations in the diagnosis of structural behavior in dam engineering.

For research on dam health monitoring, especially with regard to major cracks, Wu et al. [2325] considered the contributions of water pressure, temperature, and structural aging to establish a statistical model for crack monitoring data. In this model, the water pressure component is constructed based on the relationship between water level and COD derived from dam engineering theory; the structural aging component is derived based on the creep theory of concrete; and the temperature component is obtained by using the results of temperature field simulation or fitting a semiexperienced periodic function. Li and Gu [35, 26] suggested that the degree of abnormality of a crack in a dam can be diagnosed based on a time-dependent analysis. A semiparametric statistical model and a nonparametric statistical model [4, 26] were established by considering the relationship between the degree of abnormality of dam crack behavior and its statistical point of change to improve the fitting and explanation of crack monitoring data from concrete dams. Li et al. [3] analyzed the dynamic properties of crack monitoring series and proposed a dynamic structural diagnosis method for crack abnormality based on a fuzzy cross-correlation factor exponent. Li et al. [5] proposed a fluctuation-based method of regression coefficients for crack monitoring series, in which each abnormal event is detected from a cumulative sum of regression model residuals. These models play an important role in COD monitoring. However, the fitting and prediction capabilities of the models need to be improved accounting for the nonlinearity of crack monitoring series, and feasible crack monitoring criteria need to be proposed for more effective application in dam engineering. In pursuit of nonlinear modeling and prediction methods, Panizzo and Petaccia [27] developed predictive models based on random forests (RFs), boosted regression trees (BRTs), neural networks (NNs), support vector machines (SVMs), and multivariate adaptive regression splines (MARSs), and a comparison of the prediction accuracy of these models revealed that they showed poorer performance on average than did a conventional statistical model. Therefore, further research is needed to construct nonlinear modeling approaches for crack monitoring that are able to better fit and predict crack opening behavior and evaluate anomalous characteristics of crack propagation and structural health.

A technique based on the use of SVMs for regression, namely, support vector regression (SVR), is an advanced statistical method that has recently been used in the modeling of nonlinear systems [2832]. In SVR, a nonlinear transformation is employed to map monitoring data into a higher-dimensional feature space, and the nonlinear relationships between the monitored values of dam behavior quantities and their influencing factors are linearized by searching for an optimized fitting function. With the introduction of the insensitive error loss function as proposed by Vapnik, SVR has emerged as one of the best methods of regression because of its improved robustness and generalization performance [33]. Several researchers have employed SVR in modeling methods for structural identification to verify its potential capabilities in this respect. Ranković et al. [34] established a nonlinear autoregressive SVR model that yields accurate results for the prediction of the tangential displacement of a concrete dam. Lee et al. [35] successfully predicted the strength of concrete based on its mix proportions using the SVR technique and NNs, as indicated by a comparison against experimental results, and concluded that the SVR method can be used to predict the compressive strength of concrete with higher estimation accuracy and within a shorter computation time compared with the NN method. Su et al. [36] developed a time-varying identification model for dam behavior before and after structural reinforcement based on the SVR method, and this model was found to yield more accurate fitted and forecasted results compared with classical statistical models. In these studies, one key factor in establishing a useful model is the selection of suitable SVR parameters, such as the penalty coefficient, the loss coefficient, and the width coefficient of the kernel function. There is an urgent need for the development of an appropriate algorithm for parameter optimization.

Based on the brief review presented above, it is clear that the modeling of crack monitoring data is still an important problem in research on concrete dams. In the present study, SVR is applied to establish a parametric statistical model for the safety monitoring of cracks in concrete dams, with the intent of improving the extraction of the nonlinear effects of the various influencing factors on COD. The BEF method is then introduced to optimize the model parameters. Finally, forewarning criteria based on the predicted components of the time-varying features of the crack monitoring model are proposed for the real-time diagnosis of abnormal dam behavior.

2. Time-Varying Forewarning Criteria for the Crack Monitoring of Concrete Dams

Monitoring equipment such as crack measurement meters and optical fibers provides a large amount of COD data for concrete dams. The statistical model for crack monitoring data that is considered in this paper has been confirmed to be effective for analyzing the crack evolution process. However, there is still a need for suitable forewarning criteria to be proposed to characterize the relationship between the crack propagation process and abnormal structural behavior in dam engineering applications. In this section, the prediction of the components of the statistical model and their time-varying features are introduced to establish these criteria.

2.1. Prediction Using the Statistical Crack Monitoring Model

The factors that affect the monitored crack quantities in a concrete dam can be attributed to external loads (such as water pressure and temperature) and time-varying effects. Therefore, the monitoring data for dam cracks can be decomposed into contributions from different influencing factors as follows:

In the formula above, is a measured value in the crack opening monitoring sequence; and are the crack opening components associated with the action of water pressure and temperature, respectively; and is the time-varying component, which characterizes the other effects on crack behavior that are induced by creep or structural changes in the concrete.

Once the factors influencing the COD have been decomposed, a regression model and a prediction model for the monitored crack displacement values can be established based on the measured sequence, and then the estimated values for each component, namely, , , and , can be calculated. The statistical prediction model for crack monitoring data is as follows:

In the formula above, is the predicted value of the COD at time ; , , and are the estimated water pressure, temperature, and time-varying components, respectively; is the residual standard deviation of the model; and is the variance.

2.2. Forewarning Criteria for Dam Behavior

According to the prediction model presented above, the predicted value of the COD at a future time is calculated to be , and the measured value of the COD at that time is denoted by . If it is assumed that the residual standard deviation of the model satisfies a normal distribution, then, according to the mathematical theory of probability and statistics, the relationship between the distance from the predicted value to the measured value and the residual standard deviation of the model is as follows: the probability that falls into the interval is 95.5% and the probability that falls into is 99.7%. That is to say, both and are events that will occur with low probability. However, the time-dependent component of the COD reflects the irreversible changes to the concrete caused by creep and damage and is consequently an important index for characterizing the dam safety trend. Accordingly, by comparing between the measured values and the values predicted by the model and their eigenvalues and analyzing the trend of the development of the time-dependent component, time-varying forewarning criteria for the COD of a concrete dam can be established. Thus, a measured time sequence of data can be used to determine whether the structural state of the dam is changing. The forewarning criteria are as follows:

Ifthen the safety state of the concrete dam structure is normal.

Ifthen the safety state of the concrete dam structure is abnormal.

Ifthen the safety state of the concrete dam structure is significantly abnormal.

In the formulas above, and are the predicted and measured values, respectively, of the monitored COD; is the historical maximum value of the COD in the measured monitoring sequence; and is the time-dependent component extracted from the prediction model.

3. Crack Prediction Model Based on B-SVR

The stepwise regression method is used in the traditional statistical modeling process. However, problems with modeling accuracy often arise because of the nonlinear characteristics of crack monitoring data. To address this problem, a nonlinear modeling approach, namely, the SVR method, is adopted here to establish a bridge between the monitoring data and the components of the statistical model. The Bayesian evidence framework is then applied to determine the optimal SVR modeling parameters for achieving the best possible accuracy of the crack monitoring model.

3.1. Modeling of Crack Monitoring Data Based on SVR

Considering the nonlinear relationships between the effects on the monitored crack quantities and their influencing factors, let be the input sequence of the set of crack-influencing factors and let be the input sequence of monitored crack quantities. Then, the independent identically distributed sample set () of the crack monitoring sequence is subject to a certain distribution ,,, in the function space . Based on the method of support vector regression (SVR) [28], the basic idea used to solve the above nonlinear prediction problem in a multidimensional function space is as follows: the set of monitoring data is mapped from the input space to a high-dimensional feature space via a nonlinear transformation , and the hyperplane function () that yields the optimal classification of the linearized regression problem is found to fit the relationships between the influencing factors and the effects on the input crack quantities . Finally, the optimization problem for reproducing the monitored quantities based on the SVR model is established as follows:

In the formulas above, represents the risk function of the optimization problem; is the vector of the weights of all influencing factors; is a penalty parameter, which describes the tradeoff between the empirical risk and the model complexity; is a slack variable such that describes the deviation of the monitored crack quantity data with respect to the ideal conditions of the classification; is a constant bias term; and is the function used to map the input space to the high-dimensional feature space. The basic principle of the modeling of monitored crack quantities based on SVR to solve the corresponding nonlinear problem is illustrated in Figure 1.

In accordance with the optimization theory [37], the risk expectation minimization problem for the monitoring data model presented in formula (6) is transformed into a dual problem. The Lagrange multiplier method is used to seek the optimal solution at the saddle point of the above function, and the Wolfe duality technique [38] is applied to solve the dual planning problem with the following inequality constraints:where is the vector of the Lagrange multipliers and and are nonlinear transformation functions.

The solution to the nonlinear problem above is obtained by mapping the low-dimensional input samples to the high-dimensional feature space by introducing a kernel function to transform the nonlinear support vector regression problem into a linear problem. Finally, the hyperplane that represents the optimal solution to the problem is constructed. According to functional theory, the inner product function () that satisfies Mercer’s condition () is the kernel function that is sought. Commonly used kernel functions include a polynomial kernel function, a sigmoid kernel function, and the radial basis function. Research shows that the radial basis function kernel can be used to extract fewer support vectors with the same classification accuracy. Therefore, the radial basis function kernel () can be selected when modeling the measured crack monitoring data for a concrete dam. However, the value of must be determined according to the specifics of the modeling problem.

To enhance the robustness and sparsity of the algorithm, the nonsensitive loss function () is defined as follows:where is the loss coefficient.

For the optimal regression problem shown in formula (6), by adopting the nonsensitive loss function as a constraint condition and using the Lagrange multiplier method to construct the function to be optimized, the dual optimization problem can be obtained as follows:where is the Lagrange multiplier corresponding to , such that . The corresponding sampled data point is defined as a support vector only when .

The SVR model and the estimated bias parameter of the crack monitoring sequence are expressed by formula (10) and formula (11), respectively, after the above quadratic optimization problem is solved. In the modeling of COD values, the vector of the inputs to the model, , should be defined to consist of the crack-influencing factors, namely, the water pressure, the temperature, and the time-varying factor. The output of the model can be defined as the monitored quantity () of the COD. A structural representation of the modeling process using this support vector regression method is shown in Figure 2.where is the number of support vectors and is the number of standard support vectors. denotes the set of standard support vectors.

3.2. Deduction of Model Parameters Based on the Bayesian Evidence Framework

The penalty coefficient (), the loss coefficient (), and the width coefficient of the radial kernel function () need to be determined before the SVR modeling process for an observed series of concrete dam crack opening data. There are two traditional optimization approaches that are used to choose the best model parameters. The first method is to select the model parameters based on the user’s experience, which is likely to result in prediction distortion due to subjective factors. The other method is cross validation, in which the samples are divided into a training set, a validation set, and a test set; the SVR model is trained using the training set for each , and the reliability of the model is verified using the validation set. Once the constructed model has been confirmed to be reliable, the model parameters that yield, on average, the minimum mean squared errors of prediction for the above SVR model can be adopted. To reduce the effort required in the cross-validation process, the optimal solution for the model parameters is deduced via the following steps by maximizing the posterior probability of the parameter distribution based on the Bayesian evidence framework. The steps of obtaining the optimal SVR parameters are as follows.

Step 1. First, and are deduced. Suppose that the training set for model is , ; then, the posterior probability of the parameters can be expressed as in (12), according to the Bayes criterion [39, 40]:where is the generalized constant that is related the deduced parameters and , is the joint prior probability of and , and is the likelihood function.
If is independent of parameter , then the likelihood function can be transformed into the following:In (13), let the parameter be independent of ; thus, . Let and obey a Gaussian distribution and a uniform distribution, respectively; then, (12) satisfies the following relation:Therefore, the optimal parameters corresponding to the maximum a posteriori estimation, denoted by and , can be obtained by minimizing the negative logarithm of the risk function given in formula (6).

Step 2. Second, and are deduced. The Bayesian evidence framework is introduced to deduce the parameters and from the training set . and are assumed to be independent; . Thus, it can be seen thatThe optimal solution for can be obtained by solving the following minimization problem, adopting the same assumption as in formula (14):where denote the nonzero eigenvalues and is the number of nonzero eigenvalues.

Step 3. Finally, is deduced. For a certain submodel , is assumed to obey a uniform distribution, whereas and obey the Gaussian distributions and , respectively. From the Bayes rules, it can be deduced thatAccording to [41], the standard deviations of the Gaussian distributions can be approximated as follows: and . The optimal kernel function coefficients of the SVR model can be obtained from the following formula:where is obtained in the parameter estimation of the Gaussian distributions; is the number of support vectors; and and are the optimal penalty parameter and loss parameter, respectively, obtained in the last step.

In conclusion, the steps of analyzing monitoring data for early warning of crack opening in a concrete dam using the B-SVR model can be summarized as follows. First, the prediction model expressed by formulas (10) and (11) is established using support-vector-based modeling based on measured monitoring data consisting of a series of COD. Then, the optimal parameters of the model are deduced using formulas (14), (16), and (18). Finally, the trend of crack variation can be predicted using the established model with the optimal parameters, and an early warning analysis of the variation in the structural behavior of the dam can be conducted by comparing the prediction results with the forewarning criteria expressed in formulas (3)–(5).

4. Example Engineering Application

4.1. Engineering Background

The application case considered here involves the Chencun concrete arch-gravity dam, with crest elevation of 126.3 m and a maximum dam height of 76.3 m, which is located in China. The dam consists of a total of 28 dam sections from the left bank to the right bank. The dam was built in two phases: construction was begun in 1958, halted in 1962, restarted in 1968, and essentially completed in 1972. When construction was shut down in 1962, most of the upstream dam sections had been poured to a height of 105.50 m; the corresponding zone is referred to as Phase I concrete. When construction was restarted after an interval of 6 years, the remaining Phase I concrete in the river bed sections was also poured to a height of 105.50 m before October 1970. Subsequently, the Phase II concrete in the downstream dam sections was poured very rapidly in the year 1971. The pouring temperature was up to 29°C, and the temperature of the cooling water in the preburied pipeline also reached 28–30°C because of the high summer temperatures. Some dam sections were also erected without water cooling. Therefore, the dam concrete was subjected to a relatively minor cooling effect. Moreover, the top part of the Phase II concrete was poured on top of the lower Phase I and II concrete within a very short interval of time after the lower Phase II concrete had risen up to the level of the Phase I sections. For these reasons, under the cold wave that arrived in the winter of 1971, a horizontal crack emerged as a result of the stress concentration at approximately the height of the joint between the Phase I and II concrete layers. The length of the crack is greater than 300 meters; it lies near the top of the Phase I concrete and extends from section #5 to section #28 of the dam, as seen in Figures 3 and 4.

4.2. Crack Monitoring Data and Modeling Factors

In this paper, a total of 1146 monitoring samples of COD, which were measured using a joint meter buried in dam section #18 from Jan-1-1973 to Dec-18-2006, are used to demonstrate the effectiveness of the proposed time-varying model for dam safety prediction. The process curve corresponding to this time sequence is presented in Figure 5.

To verify the effectiveness of the proposed method, two subsequences of the COD data were selected for modeling and prediction. The modeling periods and forecast analysis periods are summarized in Table 1.

For the modeling process, the kernel function used for support vector regression was chosen to be the Gaussian radial basis function: . The factors influencing COD were decomposed in accordance with previous dam engineering and mechanical experience, allowing the monitoring model of COD to be summarized as follows:where is the COD as measured by the joint meter; is the water level in the reservoir; is the temperature change between the first recorded date and the monitoring date; is the cumulative number of days from the first recorded date to the monitoring date; is the aging factor, which is generally taken to be ; is a coefficient that is generally assigned a value of 3 for a gravity dam and a value of 4 or 5 for an arch dam; , , , , , , and are the regression model parameters; and , , , and are the numbers of influencing factors.

These influencing factors were taken as the input vectors for the modeling of the COD. Afterward, time-varying identification and dam safety analysis were conducted based on formulas (3)–(5).

4.3. Analysis of Optimal Modeling Parameters

SVR dam safety monitoring modeling was performed within the Bayesian framework using the data of subsequence 1. The first 245 monitoring data points (for the modeling period of 1997–2005) were used to infer the best model parameters, resulting in values of , , and . These model parameters were held fixed for the prediction of 28 subsequent displacement events in the year 2006. The average deviation between the model-predicted displacement and the monitoring results was 0.044 mm.

To verify the validity of the abovementioned optimal parameters, a series of SVR models were established using the same data samples (subsequence 1). In the modeling process, the set of values for the kernel parameter, , was preliminarily established to be based on the previous modeling experience. Then, 75 SVR regression models were established using the same 245 data samples (subsequence 1) with a loss factor set of and a penalty factor set of . The prediction deviations of each model are shown in Figure 6. The following conclusions can be drawn from Figure 6. () The average prediction deviations range from 0.049 mm to 0.331 mm for all 75 tests; the smallest of these values is still slightly larger than that for the B-SVR model (0.044 mm). () The optimal value of the SVR parameter obtained using the Bayesian framework, , is consistent with the observation that, for fixed values of and , the deviations decrease as increases from 0.2 to 0.5 and subsequently increase as further increases to 1.0. () A similar phenomenon can be observed if the value of is fixed. In general, the best model parameters obtained by the Bayesian framework are acceptable in comparison with the results of the 75 modeling tests.

4.4. Time-Varying Prediction Analysis Based on the B-SVR Model

To validate the effectiveness of the proposed forewarning criteria, COD modeling and prediction were conducted based on the optimal parameters using the two subsequences represented in Table 1. The monitored values, fitted values, and residual process curves for the two subsequences are shown in Figures 7 and 8. The residual standard deviations for the monitored and model-fitted values are 0.08 mm and 0.09 mm, respectively. Then, predictions were generated for the two subsequences. The corresponding monitored values, predicted values, and residual process curves are shown in Figures 9 and 10.

It can be seen from Figures 7 and 8 that a high fitting degree was obtained for both subsequences using the B-SVR model. These findings demonstrate that the established model accurately reflects the variation in the COD and can be used for forewarning analyses of dam behavior.

It can be seen from Figure 9 and the forewarning criteria given in (3)–(5) that () the deviation between the monitored values of the COD and the predicted values output by the model for subsequence 1 satisfies the equation ; () all monitored values for subsequence 1 are smaller than the maximum COD throughout the entire history; that is, ; () the rate of change of the aging component remained essentially equal to zero, . In general, these three conditions imply that the monitored process of COD (subsequence 1) is consistent with the normal pattern of dam behavior, and the time-varying crack process remained safe during the analyzed period.

However, a different result is revealed by the forewarning analysis of subsequence 2 (Figure 10): () a serious deviation appears between most of the monitored values of the COD and the predicted values during the prediction period for subsequence 2 (i.e., ) and () at some time points the time-dependent component of the COD behavior satisfies the equations and . According to the forewarning criteria expressed in (3)–(5), obvious abnormality is evident in the variation of the crack opening displacement. At this time, an urgent dam safety alarm should be sent out, and a further comprehensive inspection of the overall safety conditions of the dam should be conducted immediately.

5. Conclusions

An accurate determination of the nonlinear relationships between the COD and its influencing factors is useful for predicting and diagnosing COD variations for a forewarning analysis of dam health. Based on a monitored COD time series, the SVR method is adopted to describe its nonlinearly varying behavior, and the BEF method is applied to determine the best SVR modeling parameters. With the establishment of a suitable mathematical model, the identification of crack behavior and the degree of abnormality of dam health can be achieved in accordance with the proposed forewarning criteria considering the predicted and monitored time-varying COD behavior.

In the case considered as an example here, the optimal modeling parameters were obtained and shown to be acceptable according to the results of 75 SVR modeling tests with a range of specified parameters. These optimal parameters were then used for the COD modeling of two monitoring subsequences from an actual crack in the Chencun concrete dam. According to the proposed forewarning criteria, both normal and abnormal behaviors were identified. Therefore, the time-varying identification model presented here is suitable for use in evaluating the behavior of cracks in dams.

Competing Interests

The authors declare that there are no competing interests regarding the publication of this paper.

Acknowledgments

The authors would like to acknowledge the research funds provided by the National Natural Science Foundation of China (Grants nos. 41323001, 51479054, 51579086, 51379068, 51579083, 51279052, 51579085, and 51609074), the Jiangsu Natural Science Foundation (Grants nos. BK20140039, BK20160872), a project funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions (Grant no. YS11001), the National Key Research Program (Grant no. 2016YFC0401601), and the Key Laboratory of Earth-Rock Dam Failure Mechanism and Safety Control Techniques, Ministry of Water Resources (Grant no. YK914002).