Abstract

The geographically and temporally weighted regression (GTWR) model is a dynamic model which considers the spatiotemporal correlation and the spatiotemporal nonstationarity. Taking into account these advantages, we proposed a spatiotemporal deformation modelling method based on GTWR. In order to further improve the modelling accuracy and efficiency and considering the application characteristics of deformation modelling, the inverse window transformation method is used to search the optimal fitting window width and furthermore the local linear estimation method is used in the fitting coefficient function. Moreover, a comprehensive model for the statistical tests method is proposed in GTWR. The results of a dam deformation modelling application show that the GTWR model can establish a unified spatiotemporal model which can represent the whole deformation trend of the dam and furthermore can predict the deformation of any point in time and space, with stronger flexibility and applicability. Finally, the GTWR model improves the overall temporal prediction accuracy by 43.6% compared to the single-point time-weighted regression (TWR) model.

1. Introduction

Regression analysis is a commonly used deformation modelling method. It can establish a functional model between deformation and influence factors and then carry out physical interpretation and prediction of deformation. Specific regression modelling methods include multiple linear regression [1], stepwise regression [2], principal component regression, and independent component regression (ICR) models [3, 4], but all of them are for single-point models without considering the spatial correlation and the spatiotemporal nonstationarity of the deformation monitoring data. In order to take into account the temporal nonstationarity, Lu et al. [5] used the time-weighted regression (TWR) method to model dam deformation. The TWR model assumes that the linear regression coefficient is a function of time to adapt the temporal nonstationarity, and the prediction accuracy of the model is higher than that of the linear regression model. However, TWR does not consider the spatial correlation between the monitoring points and still belongs to the single-point model. When there are many monitoring points, using the single-point model will cause model redundancy and it is difficult to analyze the overall deformation of the deformable body since spatial correlation is not considered. Li et al. [6] and Dai et al. [7] applied the space-time autoregressive (STAR) model and the spatiotemporal independent component regression (STICR) model to build a deformation model. Both consider the spatiotemporal correlation and can establish an overall spatiotemporal deformation model for all points. However, both do not consider the spatiotemporal nonstationarity, which will reduce the prediction accuracy of the models. In order to take both the spatiotemporal correlation and the spatiotemporal nonstationarity of the deformation data into consideration, a spatiotemporal deformation modelling method based on the GTWR model was proposed and applied for dam deformation modelling. Furthermore, to avoid the boundary effect of the model regression coefficient fitting in GTWR, the local linear estimation method is used in the fitting coefficient function. The statistical test is very important in GTWR modelling to ensure the reliability of the model. However, most of the current statistical test methods in GTWR modelling only focus on the spatiotemporal nonstationarity test of the model as a whole and ignore the spatiotemporal nonstationarity test of the regression coefficients. In this paper, we combine the existing statistical test theory of the GTWR model, including the regression coefficients test of the GTWR model proposed by Xiao et al. [8] and then establish a five-step test method to conduct a comprehensive statistical test on the GTWR model.

2. Geographically and Temporally Weighted Regression

Huang et al. [9] introduced a time factor based on the geographically weighted regression (GWR) model [10, 11], and it is extended to the GTWR model [9, 12, 13]. The GTWR model assumes that the linear regression coefficient is a function of time and the coordinate parameters as follows:

In equation (1), , is a random error, u and represent spatial two-dimensional coordinates, and t is defined as time coordinates. represents the coordinates of the j-th space-time point, and is the function of the k-th regression coefficient. represents the dependent variable corresponding to the k-th regression coefficient of the j-th space-time point. In order to more clearly represent the sampling information of the monitoring points in different spatial positions at different times, the space-time point variation parameter j in equation (1) is separately expressed as a spatial position parameter and a time parameter, and then the GTWR model can be expressed as equation (2). In equation (2), i represents a spatial position variation parameter, h represents a time variation parameter, where , is a regression coefficient, and are the observation values at time h of the i-th monitoring point. Let , that is, the model containing the spatiotemporal intercept term . The random error is independent of each other and satisfies and .

2.1. Local Linear Estimation Method of Regression Coefficient Function

Huang et al. [9] used the local constant estimation method to fit the GTWR model, but it has a boundary effect. Because the local linear estimation can avoid the boundary effect [14], it is used to establish the GTWR model in this paper.

It is assumed that each regression coefficient function in equation (2) has a continuous partial derivative with respect to space-time three-dimensional coordinates . is any point in the study area, where m and p are the spatial position and time parameters, respectively. Each of the regression coefficient functions can be approximated according to the Taylor series as follows:

In equation (3), denote the values of partial derivatives of k for , and at point , respectively. According to local linear fitting of the variable coefficient regression model, least squares function can be constructed as follows:where is the weight of point to point . Let the partial derivative of this function for equal to 0, then the estimated value of can be obtained aswhere are shown in equation (6). The nondiagonal elements of the weight are irrelevant because they are 0:

Let to calculate the estimated values of all regression coefficients as follows:then the fit value at each point can be calculated using the following equation:where in , and all the fitted values are showed as the following equation :where is an order matrix and is also called a pseudohat matrix [14]:

The Gaussian kernel function is used to define the space-time weights of the regression coefficient fitting [9], and its expression is shown as the following equation:where , respectively, represent the optimal space window width parameters and the optimal time window width parameters. As the GTWR model defines the space-time distance, the definition of time and space weights will introduce two window width parameters of time and space [7, 15] which should be determined in modelling. We generalized the crossvalidation (CV) method to determine the optimal window width parameters in time and space [9, 14, 16]. The CV expression is shown as the following equation:

For , the fitting value of the ih-th point is obtained by removing the observation information of the ih-th point under the given space-time window width and , and the minimum value of CV is the best parameter of the space-time window width. If the smoothness of coefficient function varies greatly, the variable window width method should be used to improve the fitting accuracy [17, 18].

2.2. Statistical Tests of GTWR

Statistical test is an important issue for GTWR. However, there is no comprehensive and complete model test method for GTWR. In this paper, a complete set of GTWR model test methods is proposed based on the existing research results [8, 19, 20], including spatiotemporal global nonstationarity test, temporal global nonstationarity test, spatial global nonstationarity test, regression coefficient function temporal nonstationarity test, and regression coefficient function spatial nonstationarity test.(1)Spatiotemporal global nonstationarity test:The original hypothesis is a multiple linear regression, that is, the regression model has a spatiotemporal stationarity. It is tested by constructing M statistic, and then the value of the test is calculated according to the exact or approximate calculation formula of the quadratic distribution of the normal variables [14, 21]. The expression of M is shown as follows:where .(2)Temporal global nonstationarity test:The original hypothesis is the GWR model, i.e., the hypothesis model has temporal stationarity and spatial nonstationarity. Based on the theory of Brunsdon’s construction of test statistic using the sum of squared residuals [22], the statistic is constructed and its expression is shown as follows:where and , respectively, represent the pseudohat matrix of the GWR model and the GTWR model.(3)Spatial global nonstationarity test:The original hypothesis is a TWR model, which assumes that the model has temporal nonstationarity and spatial stationarity and constructs the statistics as the temporal global nonstationarity test. The expression is the same as equation (16), and is a pseudohat matrix of the TWR model.(4)Temporal nonstationarity test of regression coefficient function:After completing the global nonstationarity test, it is necessary to test the spatiotemporal nonstationarity of each coefficient function.The original hypothesis assumes that the k-th regression coefficient function has time-stationarity. First, define and let , then isThen, the statistical is constructed for testing (20), and the expression of is as follows:In equation (20), is a block diagonal matrix, and its subblock matrix is , is a T-order square matrix in which all elements are 1 and is the T-order identity matrix. Then, the value of the test is determined according to the exact or approximate calculation formula of the quadratic distribution of the normal variable [21].(5)Spatial nonstationarity test of regression coefficient function:

The original hypothesis assumes that the k-th regression coefficient function has spatial stationarity. The expression of the statistic is the same as the temporal nonstationarity test of the regression coefficient function. At this time, the in equation (20) is and the test is completed by constructing the statistic .

Given a significant level of , the above five hypothesis tests were tested separately. If some regression coefficients in the above tests are spatiotemporal stationary, a mixed geographically and temporally weighted regression model needs to be established [23].

2.3. Spatial and Temporal Prediction

Since the regression coefficients in the GTWR model are all implicit function relations, it is difficult to get the display expression of the model. When the model is applied, the regression coefficient value of the predicted point should be calculated according to the coefficient function [24]. The calculation steps are as follows: First, the space-time weight matrix and the matrix of the predicted point are calculated according to equation (6). Then, the regression coefficient of the predicted point is obtained by equation (7). When predicting, the predicted value of the predicted point is obtained by substituting the calculated regression coefficient into equation (22). In equation equation (22), is the independent variable observation vector of the point to be predicted :

3. Spatiotemporal Deformation Model of Dam Based on GTWR

3.1. Data Source

The experimental data are from the monitoring system of the Wuqiangxi Dam. The Wuqiangxi Dam is located above the Yuan River in the Yuanling County of Hunan Province. The dam is a concrete gravity dam with a maximum dam height of 85.83 m. The normal water level is 108 m, with a total storage capacity of 4.29 × 109 m3. The dam is equipped with automatic monitoring systems such as lead lines, inverted plumbs, static level, seepage monitoring, buoyancy monitoring, and water level measurements. This paper refers to some points on one of the lead lines on the Wuqiangxi Dam (EX2-10∼EX2-21). The distribution of the measurement points on the dam is shown in Figure 1; the points EX2-10 and EX2-16 do not participate in modelling and are used as two spatial prediction points, and then a total of 10 points are involved in the modelling. The coordinates of points EX2-10 to EX2-21 are shown in Table 1. The data collection time was 2005-04-27∼2008-01-29, and the sampling interval was 1 day. A total of 1000 days of data were involved in the modelling. The sampling value of 2008-01-22∼2008-01-29 was used as the prediction comparison data.

3.2. Model Establishment

Figure 2 is a flow chart of the dam GTWR deformation modelling, and then the GTWR model of the selected modelling point is built according to the flow chart.

In order to compare the GTWR model, GTWR and TWR modelling were done using the same data.

3.2.1. Data Preprocessing

The horizontal displacement observation of the monitoring points uses 3 times the error method for gross error elimination. Lagrange interpolation is used to interpolate the missing item data.

3.2.2. GTWR and TWR Model Fitting

The dependent variable is the displacement of monitoring sites, and the independent variables are water level, rainfall, and temperature. To model the selected 1000 days sampling data, the optimum window width parameters of TWR and GTWR models are first determined according to the CV criterion. The experimental analysis shows that the smooth parameter values of the model are all greater than 1, and the best window width is needed in a large range, which is not only time-consuming, but also difficult to determine the search interval of . In order to improve the search efficiency, the inverse number transformation method is used to search the optimal window width, that is, to make , then search as a pseudowindow width parameter, and the range of is between 0 and 1, which greatly improves the search efficiency. Since the GTWR model is an overall model of multiple monitoring points, there is only one pair of optimal spatiotemporal window width parameters. The CV distribution and the optimal window width value are shown in Figure 3. The TWR is a single-point model, and the optimum window width parameter for each monitoring point must be determined separately. The distribution of its CV value and the optimum window width for each point are shown in Figure 4.

In Figures 3 and 4, B-h represents the optimal window width, and the position of the optimal window width in the CV distribution is indicated by the red ∗ symbol.

The optimal window width parameters of each model were used for model fitting, and the fitting precision and residual difference of GTWR and TWR models at each modelling point were obtained, respectively, as shown in Table 2 and Figure 5. Table 3 shows the comparison of overall fitting precision of the model. In Tables 2 and 3, the root mean square error (RMS) of the GTWR model is smaller than the TWR model and the fitting correlation coefficient (R2) is larger. It can be concluded that the GTWR fitting result is evidently better than that of the TWR model.

3.2.3. GTWR Model Statistical Test

Given a significant level of , the five-step hypothesis test is performed on the GTWR model according to the contents of Section 2.2. Table 4 shows the spatiotemporal global nonstationarity statistical test, the temporal global nonstationarity statistical test, and the spatial global nonstationarity statistical test. Table 5 shows the temporal nonstationarity statistical test and the spatial nonstationarity statistical test of the regression coefficients. From these tables, it can be concluded that the value of each test of the model is less than 0.05, which indicates that both the model and the regression coefficient function are spatiotemporal nonstationarity, statistically demonstrating that the spatiotemporal observation sequence of the dam is suitable for establishing the GTWR model.

3.2.4. Comparison of Temporal Prediction Accuracy between GTWR and TWR Models

The temporal prediction was carried out for the 10 points involved in the modelling, and the displacement of monitoring points was predicted 8 days later. It was shown that the RMS of the overall temporal prediction residuals of GTWR and TWR models was 0.44 mm and 0.81 mm, and the GTWR model improves the overall temporal prediction accuracy by 43.6% compared with the TWR model in Table 6. The displacement predicted values, predicted residuals, and the RMS of the predicted residuals of the GTWR and TWR models at each modelling point are shown in Figures 6 and 7 and Table 6, respectively. It can be seen from Figure 6 that the predicted value of GTWR is closer to the true value than that of the TWR model. As can be seen from Table 6, except for the points EX2-11 and EX2-17, the GTWR model at all points is smaller than the predicted residual RMS of the TWR model and the temporal prediction accuracy of the GTWR model of these modelling points is improved by 27.6%∼68.3 compared with the TWR model. The prediction accuracy of the GTWR model at EX2-11 and EX2-17 points is lower than that of the TWR model, which may be caused by the end effect of the spatiotemporal model. It can also be seen from the prediction residual diagram that the amplitude of the predicted residual fluctuation around 0 of the GTWR model is smaller than that of the TWR model, and it can also be concluded that the temporal prediction accuracy of the GTWR model is higher than that of the TWR model.

3.2.5. Temporal and Spatial Prediction of GTWR Model for Unknown Points in Space

The GTWR model considers the spatiotemporal correlation, and not only it can make the temporal prediction, but also can do the spatial prediction. However, the TWR model only considers the temporal correlation and cannot carry on the spatial prediction. We select the points EX2-10 and EX2-16 which are nonparticipating modelling points for spatial prediction. First, spatial prediction is performed on these two points that are not involved in modelling, and the prediction result is shown in Figure 8. The point EX2-16 is located in the middle of the modelling point group, and the point EX2-10 is located at the edge of the modelling point group, so the EX2-16 is more spatially correlated than the point EX2-10. From Figure 9, the point EX2-10 has a small RMS and small residual amplitude, which indicates that the point EX2-16 has higher spatial prediction accuracy than EX2-10. Then, the temporal prediction of these two unknown points is carried out, and the displacement of the measuring points is predicted for the next 8 days. The prediction result is shown in Figure 9 and the accuracy statistics in Table 7. From the precision statistics in Tables 6 and 7, it can be found that the temporal prediction accuracy of the unknown points is comparable to that of the known modelling points, which shows that the spatial correlation of these measuring points is relatively strong. Compared with the TWR model, the GTWR model not only has higher temporal prediction accuracy, but also can take into account the spatial prediction for unknown points.

4. Conclusion

The GTWR model considers the spatiotemporal nonstationarity and the spatiotemporal correlation of deformation monitoring data. The spatial correlation between the modelling points, all of them constitute a whole model, can reflect the overall deformation trend and deformation law of the deformation body. This model can predict any point of the deformation body in time and space, avoiding the redundancy of the model and having higher prediction accuracy in deformation modelling. These features have important significance in both deformation modelling and forecasting. When the GTWR model is built, it needs to be tested comprehensively. If the test is not passed, the mixed geographically and temporally weighted regression model can be considered. The next step is to establish the spatiotemporal deformation model by using the mixed geographically and temporally weighted regression as shown in this paper.

Data Availability

The mat data used to support the findings of this study are included within the supplementary information files.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (grant no. 41674011) and the Fundamental Research Funds for the Central Universities of Central South University (grant no. 2019zzts661).

Supplementary Materials

The experimental data in this paper include two files: GTWR_DISPLACEMENT_DATA.mat and GTWR_INFLUENCE_DATA.mat. The file format is Matlab’s .mat file, where GTWR_DISPLACEMENT_DATA.mat is the monitoring point displacement file and GTWR_INFLUENCE_DATA.mat is the influencing factor file. (Supplementary Materials)