Abstract

The relationships between geological features and rockmass behaviors under complex geological environments were investigated based on multiple intelligence classifiers. Random forest, support vector machine, bayes' classifier, fisher's classifier, logistic regression, and neural networks were used to establish models for evaluating the rockmass stability of slope. Samples of both circular failure mechanism and wedge failure mechanism were considered to establish and calibrate the comprehensive models. The classification performances of different modeling approaches were analyzed and compared by receiver operating characteristic (ROC) curves systematically. Results show that the proposed random forest model has the highest accuracy for evaluating slope stability of circular failure mechanism, while the support vector Machine model has the highest accuracy for evaluating slope stability of wedge failure mechanism. It is demonstrated that the established random forest and the support vector machine models are effective and efficient approaches to evaluate the rockmass stability of slope.

1. Introduction

Since their introduction, researches into the areas of machine learning and their applications continue to captivate scientists and engineers from a variety of disciplines. This growing interest among researchers is stemming from the fact that these learning machines have an excellent performance in the issues of pattern recognition and the modeling of nonlinear relationships of multivariate dynamic systems [1]. The widely used and representative classification methods include random forest, support vector machine, baye’s classifier, fisher’s classifier, logistic regression, and neural networks.

Comprehensive classification of slope rockmasses, an important activity during exploration, design, and construction for underground openings, is restrained by our limitations in defining the complex geological environments and modeling the relationships between geological features and rockmass behaviors [2, 3]. The published literatures reporting on learning machines provided effective and efficient approaches to establish the nonlinear relationships between geological features and rockmass behaviors [4, 5].

Some recent publications on various geotechnical engineering topics using different methods are given as follows: remote sensing and GIS based landslide susceptibility assessment [6]; landslide susceptibility mapping [79]; early warning landslide susceptibility model using geographic information system (GIS) [10]; regional prediction of landslide hazard [11]; predicting of rockburst classification [12, 13]; predicting destructive effect of masonry structure under blasting vibration of open-pit mine [14]; prediction of seismic liquefaction of sand soil [15]; classification of rocks surrounding in tunnel [16, 17]; classification of top coal cavability of the steep seam [18]; comprehensive evaluation for seismic stability of slopes [19]; prediction rock mechanical behaviors [4]; predicting landslide deformation [20, 21]; predicting of P-wave velocity and anisotropic property of rock [22]; estimating rock properties using sound levels produced during drilling [23]; automated tunnel rock classification using rock engineering systems [24]; estimation of the rock mass deformation modulus using a rock classification system [25]; predicting blast disaster in open pit blasting operation [26, 27]; evaluation of penetration rate of tunnel boring machine in hard rock condition [28]; comparative study of cognitive systems for ground vibration measurements [29]; prediction of longitudinal wave velocity [30]; optimization of tunnel construction [31]; prediction of the rock mass diggability index [32]; prediction of rock properties from sound levels produced during drilling [5]; modeling mine gas gushing forecasting on virtual environment [33]; rainfall reliability evaluation for stability of municipal solid waste landfills on slope [34]; determination of reservoir induced earthquake [35]; seismic event identification [36]; and prediction of elastic modulus of jointed rock mass [37].

However, researchers study the applications of support vector machine, bayes’ classifier, fisher’s classifier, logistic regression, and neural networks for evaluating rockmass stability, but few focus on applications of the advanced random forest method in the area, especially that few focus on the overall comparison of performances of different classifiers.

This paper investigated the validity of utilizing different learning machines in the physical problem of slope stability prediction. Random forest, support vector machine, bayes’ classifier, fisher’s classifier, logistic regression, and neural networks were used to establish comprehensive models for evaluating rockmass stability of slope, and the classification performances of different modeling approaches are analyzed and compared using ROC curves.

2. Data, Models, and Results

2.1. Data

The main scope of this work is to implement the random forest, support vector machine, bayes’ classifier, fisher’s classifier, logistic regression, and neural networks in the problem of slope stability estimation. In order to forecast the status of stability ( ) in the case of rock or soil slopes, the factors that influence have to be determined. The input layer data consist of six input parameters in the case of circular failure and eight input parameters in the case of wedge failure. The output layer is composed of a single output parameter, the status of stability ( ). In this work, the status of stability is considered as a function approximation problem taking values in the range of , instead of the discrete values 0 and 1, with 1 indicating stable and 0 indicating failed.

The datasets used in this paper were collected from the publication by Sakellariou and Ferentinou [1]. The first dataset consists of 46 case studies of slopes analyzed for circular critical failure mechanism. Of them, 23 cases are dry (13 failed and 10 stable) and 23 cases are wet (16 failed and 7 stable). The second dataset consists of 22 case studies of rock slopes analyzed for wedge failure mechanism. All cases are dry (10 failed and 12 stable). The original data covering the 46 case studies are presented in Table 1, while the original data covering the 22 case studies are presented in Table 2. In the tables, is the safety factor. The parameters that have been selected are related to the geotechnical properties and the geometry of each slope. More specifically, the parameters used for circular failure (Figure 1(a)) were unit weight ( ), cohesion ( ), angle of internal friction ( ), slope angle ( ), height ( ), and pore water pressure ( ). In the case of wedge failure (Figure 1(b)), the corresponding input parameters were unit weight ( ), cohesions ( ) and ( ), angles of internal friction ( ) and ( ), angle of the line of intersection of the two joint sets ( ), slope angle ( ), and height ( ), where and refer to the two joint sets.

2.2. Random Forest Models

The random forest [38] is an ensemble approach that can also be thought of as a form of the nearest neighbor predictor. Random forests are an ensemble learning method for classification (and regression) that operates by constructing a multitude of decision trees at training time and outputting the class that is the model of the classes output by individual trees. The algorithm for inducing a random forest was developed by Breiman [38] and Adele Cutler, and “Random Forests” is their trademark.

The principle of random forests (RFs) is the aggregation of a large ensemble of decision trees [38]. During training, each individual tree in the ensemble is fitted by sampling the training data with replacement (bootstrap) and growing the tree to full depth on the training sample. The optimal data split at each tree node is determined by randomly choosing of the available input variables and selecting the one which splits the node best.

This implementation is based on the original Fortran code authored by Breiman, the inventor of RFs. We considered different parameter configurations for the values of = 300, 500, and 1000 (number of trees to build) and nodesize = 2 (minimal size of the terminal nodes of the tree). The results for circular failure mechanism and wedge failure mechanism are listed in Tables 1 and 2.

2.3. Support Vector Machine Models

The extensive applications literature on text categorization, image recognition, rock mechanics and other fields shows the excellent empirical performance of support vector machine (SVM) in many more domains [4, 39]. The underlying idea of SVM classifiers is to calculate a maximal margin hyperplane separating two classes of the data.

To learn nonlinearly separable functions, the data are implicitly mapped to a higher-dimensional space by means of a kernel function, where a separating hyperplane is found. New samples are classified according to the side of the hyperplane they belong to [22]. Many extensions of the basic SVM algorithm can handle multicategory data. The “one-versus-rest” SVM works better for multiclass microarray data [1, 6], so this method was adopted for the analysis of multicategory datasets in the present study. In summary, this approach involves building a separate SVM model to classify each class against the rest and then predict the class of a new sample using the SVM model with the strongest vote.

We used SVM implementation in the DPS software with RBF kernel. The type of support vector machine is C-SVC, the kernal function is RBF, and value is 1. The results for circular failure mechanism and wedge failure mechanism are listed in Tables 1 and 2, respectively.

2.4. Bayes’ Classification Models

The aim of the naive bayesian classifier (NBC), as with other classifiers, is to assign an object to one of discrete sets of categories based on its observable attributes . NBCs are used in a variety of applications, including document classification, medical diagnosis [40], systems performance management, probability classification of rockburst [41], and other fields. Domingos and Pazzani [42] proved optimality of the NBC under certain conditions even when the conditional independence assumption is violated.

This probability calculation is straightforward; conditioning on the observed attributes, we want to find the probability that belongs to each category, that is, .

Applying Bayes’ Theorem [39], this is rewritten as Under the mutual conditional independence assumption [39], this reduces to for each category . Since the denominator will be the same for all categories, we only need to calculate the numerator for each category , choosing and assigning to category . Then, the probability that an event belongs to category is computed by (2).

In the present study, unit weight ( ), cohesion ( ), angle of internal friction ( ), slope angle ( ), height ( ), and pore water pressure ( ) for circular failure mechanism were expressed as , , , , , and . The implementation was through the SPSS software with actual sizes of samples as a priori probabilities, and the discriminant function was given as follows: For the case of wedge failure mechanism, unit weight ( ), cohesions ( ) and ( ), angles of internal friction ( ) and ( ), angle of the line of intersection of the two joint sets ( ), slope angle ( ), and height ( ) are expressed as , , , , , , , and . Bayes’ functions are given as follows: According to the above established models, the results of circular failure mechanism and wedge failure mechanism cases were obtained and listed in Tables 1 and 2, respectively.

2.5. Fisher’s Classification Models

Fisher’s discriminant analysis is a classification method that projects high-dimensional data onto a line and performs classification in this one-dimensional space, which is widely used to determine which variable discriminates between two or more classes and to derive a classification model for predicting the group membership of new observations with high accuracy [1416, 43, 44]. In the present work, the Fisher discriminant analysis was used to establish discriminator for discriminating between failed and stable statuses of slope.

Based on the Fisher discriminant theory, the score of Fisher discriminator can be calculated by where is the coefficient of the Fisher discriminator. And the Fisher scores of the center for failed and stable statusescan be calculated as and , respectively. Then, the threshold can be obtained by . Every case has a set of values of , and corresponding to a Fisher score, if the Fisher score is greater than the threshold, the slope belongs to stable the slope, otherwise to the failed slope.

The indicators unit weight ( ), cohesion ( ), angle of internal friction ( ), slope angle ( ), height ( ), and pore water pressure ( ) for circular failure mechanism are also expressed as , and . The calculation was executed through the SPSS software. The discriminant function is The and are 0.874 and −1.509, respectively.

For the case of wedge failure mechanism, as in Section 2.4, unit weight ( ), cohesions ( ) and ( ), angles of internal friction ( ) and ( ), angle of the line of intersection of the two joint sets ( ), slope angle ( ), and height ( ) are also expressed as , , , , , , , and . The Fisher discriminant function is given as follows, and the and are −2.743 and 2.134, respectively, as According to the above established Fisher models, the results of circular failure mechanism and wedge failure mechanism cases were also obtained and listed in Tables 1 and 2, respectively.

2.6. Logistic Regression Models

Logistic regression (LR) is a statistical modeling technique in which the probability of a category is related to a set of explanatory variables. An explanation of logistic regression begins with an explanation of the logistic function, which always takes values between zero and one. The logistic model is defined by the following equations: where is a measure of the contribution of the explanatory variables , are the regression coefficients which are obtained by maximum likelihood in conjunction with their standard errors , and is the categorical response of variables that represents the probability of a particular outcome. In this particular application, are the slope rockmass parameters of interest, and is the probability of having stable and failed statuses. The calculation of LR is finished through the SPSS software, and logistic functions for circular failure mechanism and wedge failure mechanism cases are given as follows: The threshold of the above two logistic regression models is 0.5, and the results are listed in Tables 1 and 2.

2.7. Neural Networks Models

Neural networks (NNs) have long been used in problems such as this, with a lot of data, many variables, and the possibility of noise in the data.

Each input point is a high-dimensional vector. The neural network is organized in a series of layers, where the input vector enters at the left side of the network, which is then projected to a “hidden layer.” Each unit in the hidden layer is a weighed sum of the values in the first layer. This layer then projects to an output layer, which is where the desired answer appears.

In the present work, a multi-layer perceptron network model is used. Training took place for the specific range of values that cover the training dataset. Trying to achieve the best network’s performance, several networks with different architectures were developed using all of the possible variations of the backpropagation algorithms available in MATLAB 2010b. The final network architecture for the prediction of safety factor against circular failure it is (6-10-1), whereas in the case of wedge failure is (8-12-1). The learning rate was set to 0.01 and the error goal was set to 0.0001. The results for circular failure mechanism and wedge failure mechanism are listed in Tables 1 and 2, respectively.

3. Comparisons and Discussions

The ROC curve is used to evaluate and compare the established RF, SVM, Bayes, Fisher, LR, and NN classification models in slope stability evaluation. ROC is a graphical plot which illustrates the performance of a binary classifier system as its discrimination threshold is varied [45]. It is created by plotting the fraction of true positives out of the positives (TPR = true positive rate) versus the fraction of false positives out of the negatives (FPR = false positive rate), at various threshold settings.

ROC analysis provides tools to select possibly optimal models and to discard suboptimal ones independently from (and prior to specifying) the cost context or the class distribution. ROC analysis is related in a direct and natural way to cost/benefit analysis of diagnostic decision making.

In the present study, the stable and failed statuses of slope were considered a two-class prediction problem (binary classification), in which the outcomes were labeled either as positive ( , stable) or negative ( , failed). There are four possible outcomes from a binary classifier. If the outcome from a prediction is and the actual value is also , then it is called true positive (TP); however, if the actual value is , then it is said to be false positive (FP). Conversely, a true negative (TN) value has occurred when both the prediction outcome and the actual value are , and a false negative (FN) value is when the prediction outcome is while the actual value is .

An experiment from positive and negative was defined for instances. The four outcomes can be formulated in a 2 × 2 contingency table or a confusion matrix, as follows in Table 3.

The specificity or the true negative rate (TNR) is defined as the percentage of slope which is correctly identified as being failed: The quantity 1 − specificity is the false positive rate and is the percentage of slopes that are incorrectly identified as being stable statuses. The sensitivity or the true positive rate (TPR) is defined as the percentage of slope which is correctly identified as being stable status: For the cases of circular failure mechanism, the proportions of TP, TN, FP, and FN for the RF, SVM, Bayes, Fisher, LR, and NN classification models are shown in Figure 2(a). ROC curves are shown in Figure 3(a). And the areas under the ROC curves of RF, SVM, Bayes, Fisher, LR, and NN classification models are listed in Table 4. Figure 2(a) shows that the accuracy of RF model is the highest with the TP of 15 and TN of 29. Figure 3(a) and Table 4 show that RF has the biggest area (0.941), followed by NN (0.895), SVM (0.882), Bayes (0.877), Fisher (0.860), and LR (0.819). For the cases of wedge failure mechanism, the proportions of TP, TN, FP, and FN for RF, SVM, Bayes, Fisher, LR, and NN classifiers are shown in Figure 2(b). ROC curves are shown in Figure 3(b). And the areas under ROC curves of RF, SVM, Bayes, Fisher, LR, and NN classifiers are listed in Table 5. Figure 2(b) shows that the accuracy of SVM models is the highest with the TP of 12 and TN of 9. Figure 3(b) and Table 5 show that SVM has the biggest area (0.95), and followed by Bayes’ and Fisher models, and LR, which have the same area (0.908), and then followed by RF (0.9) and NN (0.783). Figures 2(a) and 2(b) clearly show that both the RF and SVM models have the highest TP and TN. It is suggested that the evaluated methods for different failure mechanism slopes are different, and RF and SVM models can be the preferred ones for circular failure and wedge failure landslides, respectively. It is noted that the trained and calibrated models are influenced by the size of training samples, and the reliability and applicability of the proposed models in this paper can be improved with increasing training samples.

4. Conclusions

This paper demonstrates the applicability and feasibility of the RF, SVM, Bayes (NBC), Fisher, LR, and NN classification models to evaluate the rockmass stability of slopes. Samples of both circular failure mechanism and wedge failure mechanism were considered to establish and calibrate discriminant models. The classification performances of different modeling approaches are analyzed and compared by ROC curves. Results show that the established RF, SVM, Bayes, Fisher, LR, and NN classification models can evaluate the slope statuses with a high accuracy. RF models have the highest accuracy for slope cases for circular failure mechanism, while SVM models have the highest accuracy for slope cases for wedge failure mechanism. Both the RF and SVM models have the highest TP and TN. It is suggested that the established models for different failure mechanism slopes are different, and RF and SVM models can be the preferred ones for evaluating the stability of circular failure and wedge failure landslides, respectively.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The support of the National Natural Science Foundation of China (50934006), National Basic Research (973) Program of China (2010CB732004), China Scholarship Council, Doctoral Candidate Innovation Research Support Program by Science & Technology Review (kjdb201001-7), and Scholarship Award for Excellent Doctoral Student from Ministry of Education of China is gratefully acknowledged.