Abstract

Machine learning has become increasingly popular for modeling dam behavior due to its ability to capture complex relationships between input parameters and dam behavior responses. However, the use of sophisticated machine learning methods for monitoring dam behaviors and making decisions is often hindered by model uncertainty and a lack of interpretability. This paper introduces a novel model for dam health monitoring, focused on monitoring radial displacement and seepage, using optimized sparse Bayesian learning and sensitivity analysis. The model hyperparameters are optimized using an intelligent optimization method integrating the multi-population Rao algorithm and blocked cross-validation, while sensitivity analysis is employed to calculate the relative importance of input variables for a better understanding of the dam’s state. The effectiveness of the proposed model is verified by using long-term monitoring data of a prototype concrete arch dam. The results confirm that the proposed model provides satisfactory performance on both the point predictions and the interval predictions for dam structural behaviors while obtaining effective explainability.

1. Introduction

Concrete dams play a crucial role in flood control, power generation, water supply, and irrigation and are thus significant assets in the social and economic domains. Over their service lifetime, these structures are exposed to diverse operational and environmental loads, including rare or extreme events such as earthquakes, prolonged droughts, and severe flooding. Moreover, the overall performance of concrete dams may gradually decline over time due to a range of factors, including hydraulic erosion, age-related deterioration, and other factors. The displacement and seepage are important indicators that can intuitively reflect the structural performance of a concrete dam. Through analysis of historical monitoring data, it is possible to develop a data-driven model that can effectively monitor the structural response of a dam and thereby provide early warning of any anomalies.

Hydrostatic-season-time (HST) and hydrostatic-temperature-time (HTT) are common statistical models for monitoring dam structural responses [1]. These models utilize measured reservoir water height, seasonal parameters (or temperature data), and time-related parameters as input, with dam displacement as the output. To monitor dam uplift or seepage behaviors, rainfall parameters are incorporated as additional model inputs [2].

The advent of artificial intelligence has spurred increasing interest in the use of machine learning techniques for dam health monitoring [25]. Various ML methods have been utilized for data-driven modeling of dam structural behavior, such as feed-forward neural networks [69], extreme learning machines [1012], recurrent neural network (RNN) [1315], support vector regression (SVR) [1619], Gaussian process regression [2022], and decision trees-based ensemble models [2325]. Besides, some novel data-driven methods or models have been proposed for dam health monitoring, including switching Kalman filter [26], dynamic time warping [27], panel data model [28], cloud model [29], correlated multi-target stacking [30], and spatiotemporal association mining [31]. Recently, the concept of automated machine learning (AutoML) has been also applied in dam response prediction. AutoML allows the user to search among a range of machine learning methods with optimized hyperparameters and identifies the best one [32].

Based on the literature reviewed above, it is evident that most existing methods in the field of dam health monitoring provided deterministic values (i.e., point prediction) for dam structural behaviors, without adequately accounting for the uncertainties associated with these results. Uncertainty quantification (UQ) plays a key role in monitoring and decision making during the dam operation period [3335]. The uncertainties in dam engineering are mainly divided into two categories, i.e., aleatoric and epistemic uncertainty, where the former is referred to as the data noise, and the latter is associated with the model uncertainty (i.e., the uncertainties of model input, structure, and parameters) [36]. Probabilistic predictions are generally considered more valuable and reliable than point predictions, as they provide information about the predicted values in the form of prediction intervals. These intervals capture the uncertainty associated with the predictions and can be particularly useful for decision making and risk management under a UQ framework.

Another significant concern in the field of probabilistic prediction is model explainability. While existing probabilistic prediction methods [20, 37] often provide accurate results, they can be considered black-box machine learning models, lacking the ability to provide practitioners with information about the inner workings or features of the model. Although some data mining methods have been used in the monitoring of dam structural behaviors to determine the relative importance of each input [23, 38, 39], their applicability to probabilistic machine learning methods can be limited.

The objective of this study is to establish an explainable probabilistic prediction model that is capable of providing reliable and efficient prediction intervals of concrete dam behaviors, as well as visualization of input influencing variables, thereby offering a tool for risk-based decision making in dam health monitoring. To fulfill these objectives, an optimized sparse Bayesian learning (SBL) approach is proposed for generating high-quality prediction intervals, with the model hyperparameters determined via the integration of the Rao algorithm with blocked cross-validation. To address the issue of model explainability, a new sensitivity analysis technique is employed to display the influencing features of model inputs. In summary, the main novelties and contributions of this paper are as follows:(i)Developing a Bayesian-based probabilistic model to make interval predictions of concrete dam structural behaviors.(ii)Presenting an efficient optimization framework for selecting model hyperparameters adaptively.(iii)Utilizing novel sensitivity analysis techniques to render the dam health monitoring model explainable.(iv)Verifying the performance of the health monitoring model by comparing it with other popular ML-based models.

The remainder of the paper is summarized as follows. Section 2 provides the relevant theory and the implementation procedure of the proposed monitoring model. The case study and detailed performance evaluation are demonstrated in Section 3. Finally, the concluding remarks and future work are presented in Section 4.

2. Methodology

The sparse Bayesian learning, strategy of hyperparameter optimization, and sensitivity analysis-based data mining approach are the important components of the proposed dam health monitoring model. The basic theories of algorithms and the implementation procedure are introduced in the following subsections, and the diagram of the model framework is shown in Figure 1.

2.1. Sparse Bayesian Learning

Sparse Bayesian learning (SBL) is a nonparametric and probabilistic machine learning method. SBL shares a similar form to SVM, but its operation speed is faster and provides more flexibility for the choice of the basis function [40]. Moreover, SBL provides the probabilistic output by estimating the probabilistic density of the weights of kernel functions, which can quantify the uncertainty of the prediction results.

Given the input dataset , the quantitative relationship between and computed by SBL is as follows:where is the matrix of the influencing variables of the dam structural behaviors, is the measured displacement or seepage, denotes the weight vector, and is the error term. Considering the predictive uncertainty, the model output follows the normal distribution:

The output of SBL is represented by the linearly weighted sum of kernel functions, which is shown below:where denotes the weight value and denotes the kernel function.

For the input vectors, the likelihood of output is expressed as follows:where represents target vector, represents weight vector, is the design matrix, and .

It is noted that the maximum likelihood estimation of the unknown parameters and above could lead to overfitting. To overcome this problem, effective practice is imposing a set of explicit priors over these unknown parameters to constrain the variation, as shown below:where is a vector containing hyperparameters that implies how far from zero each weight is allowed to deviate.

Based on Bayesian posterior inference, the posterior distribution over is obtained as follows:where the posterior covariance is with and the mean vector is .

For the uniform hyperpriors over and , the learning of SBL is transformed to maximizing as follows:

Values of and are obtained iteratively by using the following formulas:where denotes the th posterior mean weight that can be defined by and represents the th diagonal element of the posterior weight covariance.

With the convergence of the iterative process, the most probable (MP) values of and at termination are denoted as and . For the trained SBL model, the predictive distribution given the new inputs is as follows:

In equation 9, considering both terms in the integrand is Gaussian distribution, the predictive mean and variance are computed bywhere denotes the mean value of predictive distribution while the uncertainty of predictive distribution is quantified by variance . Once the significance level is determined, the prediction interval under the confidence level is obtained:where and denote the lower bound and upper bound, respectively. is determined by the significance level . For example, the 95% prediction interval of dam structural behavior is , while the 99% prediction interval is .

Various kernel functions can be adopted in SBL modeling [40], such as Gaussian, linear, polynomial, and sigmoid kernel functions. In this paper, the widely used Gaussian kernel is adopted in SBL, as follows:where denotes the kernel width, which is the hyperparameter of SBL. It is noted that a small value of the kernel width may lead to a low convergence speed and overfitting problem. Conversely, a large value of kernel width may cause underfitting [41]. Therefore, it is necessary to determine the suitable values of to improve the model performance. The technical details of hyperparameters optimization are introduced in the next subsection.

2.2. Tuning Hyperparameters Using MP-Rao Algorithm and Cross-Validation

It can be inferred from equation (3) that the hyperparameters of SBL have influential effects on the prediction performance. To prevent overfitting and obtain sensible hyperparameters with less human intervention, the modified Rao algorithm called the multi-population Rao algorithm (MP-Rao) and blocked cross-validation [42] are integrated. MP-Rao is a parallel and metaphor-less optimization algorithm, which contains the common controlling parameters (i.e., population size and the number of iterations) and is improved based on the Rao-3 algorithm [43]. The main principle of the Rao algorithm and its variant lies in that the obtained solutions get away from the worst solution and move closer to the best one. Once the target function of optimizing model hyperparameters is given, the target values are dynamically updated based on the differences between the best candidate, the existing one, and the worst solution as follows:where is the value of the th variable for the th candidate during the th iteration, and are the random numbers within the range of , represents the value of the th variable for the best candidate, and represents the value of the th variable for the worst candidate. The term indicates the stochastic interaction among the population, where the current solution is compared with a randomly selected solution . If the target function value of th solution is smaller than that of th solution, then “” becomes . Otherwise, “” becomes .

Compared to the original Rao algorithm, the MP-Rao algorithm splits the entire population into subpopulations, and the solution in each subpopulation is independently updated by equation (13) in parallel. Therefore, the algorithm is expected to reach an optimum solution.

Moreover, to balance the model fitting and predictive performance, the weighted mean squared error of prediction in cross-validation () is built as the target function:where is the number of samples, is the number of folds in blocked cross-validation ( is determined as 5 in this research), denotes the predicted value of the inner validation samples, denotes the measured value of the inner validation samples, denotes the simulated value of the inner training samples, and denotes the measured value of the inner training samples. and are controlling weight coefficients (, , ). The values of and are set as 0.3 and 0.7, respectively [11].

After the optimization of hyperparameters, SBL utilizes the obtained optimal parameters to perform the prediction task on the new testing set. The procedure of the adaptive hyperparameter tuning is summarized in Algorithm 1.

Inputs: The training dataset and search range of hyper-parameters .
Outputs: The optimal hyperparameter
Initialize: Population size , sub-population size .
  () Compute the target function value using equation (14).
  Identify the best and worst solutions, respectively.
  Update the solutions by equation (13), and compute the new target function value .
  
   Accept the new solution, .
  
   Reject the new solution.
  
   The termination condition is achieved
   Output the search results in th sub-population.
  
   Return to the key step ().
  
 Find the best solutions among sub-populations.
Find the best solutions from populations, as well as .
2.3. Evaluation Metrics for Model Performance

Both point prediction metrics and interval prediction error metrics are adopted for evaluating the performance of the proposed probabilistic model. The formulas for the error metrics are introduced in the following subsections.

2.3.1. Point Prediction

Correlation coefficient (), normalized root mean squared error (NRMSE), and mean absolute error (MAE) are selected as point prediction metrics, which have been defined in the literature [3, 11]. While for , the larger value is preferable (i.e., closer to 1.0), for NRMSE and MAE, the smaller values indicate the better performance of the model. It is worth noting that NRMSE (shown in equation (15)) is a normalized indicator based on the RMSE, which can be used to compare the model prediction performance on different response variables.where denotes the th measured value and is the sample number.

2.3.2. Interval Prediction

To assess the effectiveness of the prediction intervals obtained from probabilistic prediction, the prediction interval coverage probability (PICP), normalized mean prediction interval width (NMPIW), and coverage width-based criterion (CWC) are chosen [44].

Prediction interval coverage probability denotes the percentage of data points lying within the yielded intervals:where denotes the measured value, denotes the total amount of the target variables, and and are the upper and lower values of the th prediction interval, respectively. Theoretically, PICP should be close to the confidence level [44]. However, this hardly happens due to the influence of outliers in testing samples, model uncertainty, and imperfect model training (i.e., overfitting or underfitting).

NMPIW is another important metric that measures the average width of the interval, as follows:where denotes the range of the target variable. If the width of the achieved interval is too small, the corresponding PICP would be lower, and more target variables would lie out of the yielded prediction interval. Conversely, the larger interval width would make the prediction interval invalid as the abnormal variations of the dam behavior are not easily detected.

It is recognized that a reliable monitoring model should provide high-quality prediction intervals, achieving high coverage of the measured value under the scenario of a narrow-width interval. CWC is a comprehensive metric for evaluating the prediction intervals in terms of the coverage probability and interval width:where denotes the nominal confidence level associated with prediction intervals. denotes the penalty parameter, which should be large enough, and is assumed as 5 in this study. is a boolean variable. If PICP is less than the nominal confidence level, CWC would be large regardless of the width of prediction intervals. On the contrary, if PICP is equal to or greater than the nominal confidence level, then , and NMPIW would be the controlling parameter. As a result, the reliability of the obtained prediction intervals increases as the CWC decreases.

2.4. Interpretation of Dam Behaviors Using Sensitivity Analysis

The primary shared trait between SBL and other black-box ML models is their lack of transparency, meaning that the inner workings of the model are difficult to comprehend. Inspired by the pioneering work of Cortez and Embrechts [45], the sensitivity analysis (SA) method is introduced to mine the hidden information from the dam monitoring data. The SA is commonly considered a post hoc data mining algorithm [46]. Its primary principle involves computing relative importance scores for the managed inputs to clarify the inner workings of a model [47]. These scores quantify the sensitivity of each feature on the model outputs. A comparison of the scores among different groups of input variables unveils the importance granted by the model to each of such input variables when producing its outputs.

The detailed procedure of the SA is summarized in Algorithm 2, where and represent the number of influencing variables and the sample number, respectively, denotes the number of subgroups (each subgroup contains at least one influencing variable, ), denotes the generated meta-inputs by holding all the variables at their mean values except the th influencing variable (), and denotes the corresponding fitted results.

Inputs: The training dataset .
Outputs: Relative importance score .
Initialize: Divide into subgroups, .
 Generate the th meta-inputs.
 Compute the th outputs: .
 Compute the th sensitivity factors .
 Compute the relative importance score:

In sensitivity analysis, a larger RIS indicates a greater influence of the load effects on dam structural behavior. In Algorithm 2, the variance and average absolute error (AAE) are adopted to build sensitivity factors ( and ), as follows:where denotes the measured value, denotes the th fitted value (posterior mean value), denotes the median value of the target variable , denotes the mean value of the target variable , and is the sample number.

2.5. The Procedure of the SA-OSBL Model for Dam Health Monitoring

In this section, an explainable probabilistic model for monitoring concrete dam structural behaviors based on SA-OSBL is proposed. The detailed steps are described below, and the flowchart of the model implementation procedure is shown in Figure 2.

2.5.1. Step 1: Generating the Model Input and Output

Generate the model inputs of the dam structural behaviors using hydrostatic, temperature, aging, and rainfall-related variables based on in situ monitoring data of the dam (see equations (20) and (21)).

2.5.2. Step 2: Splitting the Data

The dataset is divided into the training set and testing set, where the former is utilized for the construction of the OSBL-based probabilistic model, and the latter is utilized for the validation of the model’s effectiveness. Before model training, the samples are standardized in the range of [0, 1] using the min-max normalization approach.

2.5.3. Step 3: Optimizing and Training the Model

The kernel parameters of the model are estimated via the framework combining the MP-Rao algorithm and blocked cross-validation (5-fold). Then, the SBL model is trained using the optimized hyperparameters based on the training dataset.

2.5.4. Step 4: Validating the Performance of the Simulation

The training and testing samples set are fed into the OSBL model to obtain the fitted and predicted results, respectively. Then, evaluate the simulation performance of the monitoring model comprehensively based on different error metrics and interval prediction metrics.

2.5.5. Step 5: Interpreting the Model

The relative importance scores for different groups of influencing variables are computed via the sensitivity analysis based on the historical monitoring data, to offer the interpretation of the dam’s structural behaviors.

3. Case Study

In this paper, a double curvature arch dam located in the south of France is chosen as a case study. The dam is owned by EDF and was finished in 1960. The arch dam consists of 13 blocks, and the maximum dam height above the foundation is about 45 m with the crest length being 166 m. To monitor the dam service status, the dam is equipped with a comprehensive monitoring system and instruments. Figure 3 presents the layout of the dam pendulum monitoring instruments.

3.1. Data Collection and Preparation

The radial dam displacement and seepage of the arch dam are taken as modeling targets. For the radial dam displacement, the measurements of pendulums on the central block are selected, where CB2 denotes the radial displacement between the altitudes 236 m (dam crest) and 196 m (dam toe), while CB3 denotes the radial displacement in the foundation between the altitudes 195 m and 161 m. It is noted that the positive value of the radial displacement indicates the movement of the monitoring point to the downstream direction. For the seepage, the flow rate is measured using a weir located in the gallery at the downstream dam toe. The measured seepage is the total amount of water originated from different locations including the seepage from surrounding rock, potential leakages in concrete cracks, and moisture transport in concrete. The dam behavior data are measured and collected regularly, and the time series of the long-term monitoring data are illustrated in Figure 4.

The corresponding ambient data include the water level, temperature, and daily rainfall (see Figure 5). Since the dam is located on the top of a glacial threshold, the reservoir water height is zero once the water level is lower than 196 m. The air temperature is not measured at the location of the dam. Therefore, the ambient temperature is provided by interpolation based on the measured temperature near the dam region, where the interpolation takes into account the altitude of the dam and is computed on a mesh of 1 km2. Daily rainfall precipitation is collected from a rain gauge located about 5 km from the dam. The time series of the in situ monitoring data covers a period of thirteen years (i.e., from January 2000 to December 2012). The data from the first ten years (i.e., from January 2000 to December 2009) are used for model training and calibration, while the remaining data (from January 2010 to December 2012) are used for testing. The detailed statistical information of the measured data is summarized in Table 1.

It is widely accepted that the structural behaviors of the concrete dam are mainly influenced by hydrostatic load, temperature load, rainfall effect, and aging effect [3]. For the hydrostatic load, the reservoir water height-related parameters or the segmented moving averages of the reservoir water height [24] are selected as factors. The calculation of temperature effects depends on the layouts of the thermometers; considering that the ambient temperature is continuous and available, the segmented moving average of ambient temperature [48] is selected to characterize the temperature effect for displacement and seepage. Likewise, the segmented moving average of daily rainfall is selected to characterize the rainfall effect of seepage [39]. The aging effect reflects the irreversible changing characteristics of dam behaviors, which can be quantitatively represented by the linear, logarithmic, exponential, or hyperbolic functions regarding the time variable [20].

Consequently, a total of fourteen factors are selected to build the input variables for displacement modeling, which are comprised of four hydrostatic load factors, six temperature factors, and four time factors (aging factors), as follows:where is the reservoir water height, denotes the measured air temperature of the monitoring day, denotes the moving average air temperature from to days before the monitoring day, represents the cumulative day numbers from the initial day to the monitoring day, and .

As for seepage, the lag effect of external load is considered. Therefore, six hydrostatic load factors, six temperature factors, and six rainfall factors are selected to build the input variables, as shown in equation (21). Based on the engineering judgment, the time factors are discarded as there is no significant trend change in seepage time series.where denotes the reservoir water height of the monitoring day, denotes the measured daily rainfall of the monitoring day, and and denote the moving average values of reservoir water height and daily rainfall from to days before the monitoring day, respectively. The meanings of the rest symbols are the same as those in equation (20).

3.2. Hyperparameter Optimization and Model Training

The kernel width is automatically determined using the proposed optimization framework, in which the search range of kernel width is . As for initial parameters in the MP-Rao optimization algorithm, the population size and subpopulation size are set to 40 and 20, respectively. The classical particle swarm optimization (PSO) and the Rao-1 algorithm (Rao) are adopted for performance comparison, where the population size of PSO and Rao is set to 20. As for initial parameters in PSO, and are set to 1.4 and 1.6, respectively. For all the algorithms, the maximum number of iterations is 40.

Figure 6 illustrates the convergence curves of error function using different optimization algorithms. It is obvious that MP-Rao performs best in terms of convergence rate and searching capability, indicating that the proposed MP-Rao algorithm-based optimization framework for hyperparameter tuning is reliable and effective. Based on the optimization results, the hyperparameters of OSBL used for modeling CB2, CB3, and seepage are 2.46, 2.50, and 2.59, respectively.

3.3. Performance Comparison on Point Prediction

The fitting and prediction results using OSBL for CB2, CB3, and seepage are shown in Figure 7, which provides an intuitive reflection on the goodness of fit via time series plots and scatter plots. The corresponding error metrics(r, NRMSE, and MAE) are also summarized in Table 2. Based on the results, it is clear that the fitting and prediction results of radial displacement are better than those of seepage, and the prediction deviations are larger than fitting deviations.

In this research, three kernel-based regression algorithms, namely, KELM, RBFNN, and GPR, are chosen as benchmark models to compare the point prediction performance with that of OSBL. In KELM, the Gaussian kernel is selected as the kernel function, and the regularization parameter along with kernel width are two hyperparameters to be optimized [11]. Gaussian kernel is also applied in the RBFNN model, the number of hidden neurons is the same as the number, and the value is the hyperparameter that needs to be optimized [48]. In GPR, different covariance functions are tested and the squared exponential covariance function is finally adopted in the model. The kernel width and noise term are the main parameters of GPR, which can be estimated by solving the maximum likelihood estimation problem [20, 21].

The long-term prediction results of the radial displacement and seepage using different models are shown in Figure 8. It can be observed that all models perform well on the displacement prediction task, and the predicted values are close to the measured values.

Table 3 summarizes the error metrics, with the best values highlighted in boldface. As seen, the OSBL obtains the smallest NRMSE and MAE values in the three tasks, outperforming the other three models in terms of prediction accuracy.

3.4. Performance Comparison on Interval Prediction

In this research, OSBL is compared with the GPR and multiple linear regression (MLR) in terms of the quality of the achieved prediction intervals. GPR is regarded as the probabilistic ML model that can provide interval prediction directly by the prediction mean and prediction variance. Specific methods for prediction interval generation using GPR can be found in [20, 49]. For the MLR model, the internal coefficients are achieved through the least square method, and the prediction interval is achieved through the confidence interval method [50], where the fitting residuals should be normally distributed.

In the application of the established interval prediction-based health monitoring model, the confidence level is set to 99% (i.e., ). The PICP, NMPIW, and CWC metrics of the prediction intervals are summarized in Table 4 with the best results highlighted in boldface. It is observed that the PICP values of OSBL in the case of CB2 and CB3 are almost close to the preset confidence level. Besides, the NMPIW values of the OSBL model are smaller, suggesting that the predicted intervals are narrower in each scenario. Based on the CWC results, it is concluded that OSBL is the best model. It is also found that the GPR model performs worst in the case of CB3 while the MLR model performs the worst in the cases of CB2 and seepage. Notably, the prediction interval of the MLR model in the case of seepage is invalid because the prediction intervals are too wide with the NMPIW value even larger than 1.0.

Figures 9 and 10 illustrate the obtained prediction intervals for displacement for the OSBL and GPR models, respectively. As seen, most of the displacement points are within the prediction interval except for a few abnormal values. The obtained prediction intervals for seepage are illustrated in Figure 11. Combined with results listed in Table 4, it can be seen that nearly 30% of the seepage points are not within the prediction interval achieved by GPR, which is likely caused by the uncertainties of data noise and model discrepancy. Compared with GPR, the reliability of OSBL is much improved as the PICP value of OSBL is 0.122 larger than that of GPR. It is observed that the peak values (i.e., regarded as potential anomalies) are not within the prediction intervals, indicating that the models can be applied for early warning of dam seepage. In summary, the effectiveness and reliability of the prediction intervals achieved by OSBL are confirmed, and OSBL outperforms GPR due to its smaller CWC values.

3.5. Model Interpretation

Figure 12 shows the RIS of three groups of input variables using two sensitivity measures, with the relative importance of hydrostatic load, temperature, time, and rainfall effects visualized by the bar plot. A larger RIS indicates a greater influence of the load effects on dam structural behavior. However, it is important to note that the RIS cannot quantify the absolute contribution of each group of variables to displacement or seepage.

The RIS results using two sensitivity measures are very similar, revealing that the hydrostatic load has the greatest impact on displacement variation near the dam crest (CB2), followed by the effect of temperature, while the time effect (aging effect) is insignificant. Additionally, the effect of temperature on the displacement variation at the dam toe (CB3) is notably smaller than that on CB2. It is also observed that seepage is more sensitive to the hydrostatic load compared to the rainfall effect, suggesting that changes in reservoir water height dominate seepage variation.

From Figures 4 and 5, it is found that the radial displacements and reservoir water height have positive correlations, and there are no significant time-varying characteristics in radial displacements. According to the dam engineering mechanics knowledge, temperature-induced deformation at the dam foundation is smaller than that at the dam crest due to the smaller variations in temperature at the dam foundation compared to the dam crest. Similarly, aging-induced deformation at the dam foundation is commonly larger than that at the dam crest due to the foundation creep and restraint effects of the dam abutment. As for seepage, the reservoir is the main source of the leakage and it is reasonable that the reservoir water height is the most influencing factor of the seepage. Therefore, the interpretation results in Figure 12 are generally consistent with engineering experience and knowledge, confirming the effectiveness of the proposed sensitivity analysis.

4. Conclusions and Future Work

In this paper, we proposed an explainable probabilistic model for monitoring concrete dam displacement and seepage, which was validated using long-term monitoring data from an arch dam. The model provided the satisfactory performance of simulation and anomaly detection for dam behaviors, and the conclusions are summarized as follows:(1)The proposed OSBL model is superior to KELM, RBFNN, and GPR models in terms of goodness of fit and interval prediction results. Meanwhile, OSBL provided better interval prediction results compared to GPR and MLR models.(2)The OSBL model effectively quantified uncertainty through dynamic prediction intervals, where the widths of the prediction intervals varied at different periods. The upper and lower limits of the intervals served as real-time monitoring indices for dam structural behaviors.(3)A novel sensitivity analysis-based data mining approach was integrated with the proposed OSBL to enhance the model’s explainability. The interpretation results, presented as relative influence scores for different groups of input variables, revealed that reservoir water height had the greatest influence on dam displacement and seepage, while the impact of rainfall was negligible.

The proposed model is proved to be a competitive tool for dam health monitoring. For future work, the following remarks can be considered:(1)Develop the multi-output SBL model to monitor multi-point displacement or seepage of the dam simultaneously.(2)Incorporate interval metrics into the model training to improve performance and produce more reliable prediction intervals.(3)Combine the OSBL model with numerical methods (e.g., finite element modeling) to establish the data-mechanical hybrid-driven model for dam health monitoring.

Data Availability

The monitoring data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Chaoning Lin was responsible for conceptualization, validation, formal analysis, investigation, data curation, original draft preparation, visualization, and funding acquisition. Siyu Chen was responsible for conceptualization, methodology, software, investigation, formal analysis, data curation, review and editing, visualization, and funding acquisition. Mohammad Amin Hariri-Ardebili was responsible for conceptualization and review and editing. Tongchun Li was responsible for review and editing, resources, and funding acquisition.

Acknowledgments

This study was supported by the National Key R&D Program of China (No. 2022YFC3005403), the Fundamental Research Fund for Nanjing Hydraulic Research Institute (No. Y723003), the Open Research Fund of Key Laboratory of Reservoir and Dam Safety Ministry of Water Resources (No. YK323007), and the Excellent Postdoctoral Program of Jiangsu Province.