Research Article  Open Access
Landslide Occurrence Prediction Using Trainable Cascade Forward Network and Multilayer Perceptron
Abstract
Landslides are among the dangerous natural phenomena that hinder development on Penang Island, Malaysia, and finding a reliable method to predict their occurrence remains an active research topic. In this paper, two artificial neural network models, the Multilayer Perceptron (MLP) and the Cascade Forward Neural Network (CFNN), are used to produce a landslide hazard map of Penang Island. The two models were trained and compared using eleven learning algorithms: Levenberg-Marquardt, Broyden-Fletcher-Goldfarb, Resilient Back Propagation, Scaled Conjugate Gradient, Conjugate Gradient with Beale, Conjugate Gradient with Fletcher-Reeves updates, Conjugate Gradient with Polak-Ribière updates, One Step Secant, Gradient Descent, Gradient Descent with Momentum and Adaptive Learning Rate, and Gradient Descent with Momentum. The performance of landslide prediction depends on the input factors as well as on the prediction method; in this work, 14 input factors were used. The prediction accuracy of the networks was verified using the area under the Receiver Operating Characteristic curve. The best prediction accuracy was achieved by the CFNN trained with the Levenberg-Marquardt learning algorithm: 82.89% on the training data set and 81.62% on the testing data set.
1. Introduction
Landslide hazard is a particular case of natural hazard, which is defined as the probability of occurrence, within a specified period of time and within a given area, of a potentially damaging phenomenon [1, 2]. Every year, landslides around the world cause loss of life and substantial financial losses. However, the main causes behind the occurrence of landslides are still not fully specified. Different factors, such as geological, topographic, physical, and human causes (for example, disregard for sustainable forms of development), contribute to landslide occurrences [3]. Therefore, many studies have been conducted and different techniques have been applied to predict the occurrence of landslides. These techniques range from logical, experience-based analyses to complex mathematical and computer-based systems.
Over the last two decades, keen interest has been shown in the application of artificial neural networks (ANNs). They have been widely applied in forecasting, decision making, the food industry, the agriculture sector, and many other applications [4–7]. The popularity of ANNs is due in part to their computational simplicity, finite parameterization, and stability. Different ANN architectures, such as the MLP, radial basis function (RBF) networks, and recurrent neural networks (RNN), have been proposed in the literature [8]. Among these models, the one most commonly used for landslide prediction is the MLP [9, 10].
ANNs are among the techniques that produce good accuracy when used to predict the occurrence of landslides [11]. However, an ideal method for predicting landslide occurrence has not yet been agreed upon [12]. Therefore, an intelligent computer system is proposed to enable automatic prediction of landslides using two ANN models, the MLP and the CFNN.
Penang Island has been the subject of many studies. Pradhan and Lee (2010) produced a landslide hazard map for Penang Island using an MLP neural network. Their analysis used five training sites on Penang Island and nine factors: slope angle, slope aspect, curvature, distance from drainage, distance from lineament, geology, land cover, soil, and rain precipitation [13].
Pradhan et al. (2010) also investigated the application of an artificial neural network model and the cross-application of its weights at three study areas in Malaysia, namely, Penang Island, Cameron Highlands, and Selangor. The weight of each factor was calculated. The factors were slope angle, slope aspect, plan curvature, altitude, stream power index, wetness index, distance from drainage, distance from road, distance from faults, geology, land use, soil texture, soil material, vegetation index, and topography. The results show that applying weights derived from the same test area gave slightly higher accuracy than applying weights derived from another (cross-applied) area [10].
Lim et al. (2011) used probabilistic methods such as frequency ratio, statistical index, certainty factor and landslide susceptibility analysis, and logistic regression to produce landslide hazard maps for Penang Island. In their study, twelve factors including four topographic factors were used. The importance of the input factors was not estimated in their study [14].
Oh and Pradhan (2011) applied an adaptive neuro-fuzzy inference system (ANFIS) with seven factors (altitude, slope angle, plan curvature, distance from drainage, distance from road, soil texture, and stream power index) to a part of Penang Island covering only 8.064 km² [15].
Pang et al. (2012) used a Decision Tree (DT) to produce a landslide hazard map for Penang Island with the same twelve factors used in [14]. The DT model was constructed using the DT algorithm. The DT method improved the landslide hazard map: the percentage of past landslide events increased in three risk levels (most hazardous, hazardous, and moderate) and decreased in the nonhazardous level [16].
A Digital Elevation Model (DEM) is generally used as the basic source for extracting topographic factors such as slope aspect and curvatures. It is also one of the core database sources for several GIS applications [17]. For this study, a DEM with a resolution of 5 m per pixel was used to extract the slope angle, slope aspect, profile curvature, plan curvature, and general curvature.
Fourteen factors were used as the input features for the MLP and CFNN. These factors are slope angle, slope aspect, profile curvature, plan curvature, general curvature, distance from the road, distance from the fault lines, elevation, distance from the drainages, soil texture, land cover, vegetation cover, geology, and rain precipitation as a triggering factor. The MLP and CFNN were trained with eleven learning algorithms to find the most accurate prediction results. The 11 learning algorithms were Levenberg-Marquardt (LM), Broyden-Fletcher-Goldfarb (BFG), Resilient Back Propagation (Rp), Scaled Conjugate Gradient (SCG), Conjugate Gradient with Beale (CGB), Conjugate Gradient with Fletcher-Reeves updates (CGF), Conjugate Gradient with Polak-Ribière updates (CGP), One Step Secant (OSS), Gradient Descent (GD), Gradient Descent with Momentum and Adaptive Learning Rate (GDX), and Gradient Descent with Momentum (GDM).
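As a rough illustration of how a slope angle can be derived from a gridded DEM, the sketch below applies central finite differences to the interior cells of a small elevation grid. Only the 5 m cell size comes from the text; the function name `slope_degrees`, the toy grid, and the finite-difference scheme are assumptions made for illustration, since the GIS tool and algorithm actually used are not stated.

```python
import math

def slope_degrees(dem, cell_size=5.0):
    """Slope angle in degrees for the interior cells of a DEM (list of rows).

    Hypothetical helper: uses central differences and leaves border
    cells as None because they lack neighbours on one side.
    """
    rows, cols = len(dem), len(dem[0])
    slope = [[None] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            # Elevation change per metre in the column (x) and row (y) directions.
            dz_dx = (dem[r][c + 1] - dem[r][c - 1]) / (2 * cell_size)
            dz_dy = (dem[r + 1][c] - dem[r - 1][c]) / (2 * cell_size)
            slope[r][c] = math.degrees(math.atan(math.hypot(dz_dx, dz_dy)))
    return slope

# Toy 3x3 DEM (metres): a plane rising 5 m per 5 m cell in the x direction.
toy_dem = [[0.0, 5.0, 10.0],
           [0.0, 5.0, 10.0],
           [0.0, 5.0, 10.0]]
center_slope = slope_degrees(toy_dem)[1][1]
```

On this toy plane the gradient is 1 m/m in x and 0 in y, so the centre cell has a slope of 45 degrees. Production GIS packages typically use larger neighbourhoods and handle borders and missing data differently.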
The organization of this paper is as follows. The CFNN and MLP are explained as landslide prediction methods in Section 2. Section 3 introduces the study area and describes the data collection and factor extraction. Results of the prediction performance are presented in Section 4. Conclusions are drawn in Section 5. Figure 1 describes the methodology for this work.
2. MLP and CFNN
The popularity of the MLP and CFNN comes from their stability, simplicity of application, and smaller structure size for a particular problem, as compared to other structures [18]. The network learns the relationship between pairs of input (factor) and output (response) vectors by altering the weight and bias values [19]. Figure 2 shows an example of a standard MLP and CFNN. Each consists of three layers in the order of input, hidden, and output layer. Each layer consists of independent processing units called neurons [20].
These neurons receive inputs; each input value is multiplied by a weight (the strength of the input). The weighted inputs are passed through a mathematical function that determines the activation value of the neuron, which is then passed to the next layer. The output from the hidden layer is given by

$$h_j = F\left(\sum_{i=1}^{n_i} w_{ij}^{1} x_i + b_j^{1}\right), \quad 1 \le j \le n_h, \tag{1}$$

where $h_j$ is the output from the $j$th hidden unit, and $x_i$ and $w_{ij}^{1}$ denote the inputs and the weight from the $i$th input to the $j$th hidden unit in the first layer, respectively. $b_j^{1}$ is the bias for the $j$th hidden unit and $F(\cdot)$ is the transfer function. For the hidden layer, the tan sigmoid function was used.

The predicted output of the $k$th node in the output layer is denoted as $\hat{y}_k$, which can be expressed as in

$$\hat{y}_k = \sum_{j=1}^{n_h} w_{jk}^{2} h_j, \quad 1 \le k \le m, \tag{2}$$

where $w_{jk}^{2}$ denotes the weights from the hidden layer to the output layer and $m$ denotes the number of output neurons. Combining (1) and (2), the complete representation of the output for the MLP network is obtained as in

$$\hat{y}_k = \sum_{j=1}^{n_h} w_{jk}^{2} F\left(\sum_{i=1}^{n_i} w_{ij}^{1} x_i + b_j^{1}\right), \quad 1 \le k \le m. \tag{3}$$

The CFNN shares the same structure and working methodology as the MLP. However, the CFNN includes a weight connection from the input to the output layer and from each layer to the successive layers [21]. As shown in Figure 2, for the CFNN with $m$ output nodes, $n_h$ hidden nodes, and $n_i$ input nodes, the output of the $k$th neuron, $\hat{y}_k$, in the output layer is given by

$$\hat{y}_k = \sum_{i=1}^{n_i} w_{ik}^{io} x_i + \sum_{j=1}^{n_h} w_{jk}^{2} F\left(\sum_{i=1}^{n_i} w_{ij}^{1} x_i + b_j^{1}\right), \quad 1 \le k \le m, \tag{4}$$

where $w_{ik}^{io}$ denotes the weight of the direct connection from the $i$th input to the $k$th output node.

Because neural networks have many neurons, adjusting the neural weights without a learning algorithm may be quite difficult. For that reason, various learning algorithms have been developed for two purposes: minimizing the error between the actual output and the predicted output, and building up the weights, $w_{ij}^{1}$ and $w_{jk}^{2}$, for the input factors [22, 23]. In this paper, MLP and CFNN were trained with eleven learning algorithms. Detailed descriptions of the learning algorithms can be found in [24]. Both MLP and CFNN with different learning algorithms are assessed based on their performance in producing a landslide hazard map of Penang Island.
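The structural difference between the two networks can be sketched in a few lines of Python. This is a minimal illustration, not the authors' Matlab implementation: the function names, the weight layout (`w1[i][j]`, `w2[j][k]`, `w_io[i][k]`), the toy weight values, and the choice of a linear output layer without bias are all assumptions. The only details taken from the text are the tan sigmoid (tanh) hidden layer and the CFNN's extra direct connection from the inputs to the output layer.

```python
import math

def hidden_outputs(x, w1, b1):
    """Hidden activations: tanh of the weighted inputs plus bias, per hidden unit."""
    return [math.tanh(sum(w1[i][j] * x[i] for i in range(len(x))) + b1[j])
            for j in range(len(b1))]

def mlp_forward(x, w1, b1, w2):
    """MLP output: weighted sum of hidden activations (linear output assumed)."""
    h = hidden_outputs(x, w1, b1)
    n_out = len(w2[0])
    return [sum(w2[j][k] * h[j] for j in range(len(h))) for k in range(n_out)]

def cfnn_forward(x, w1, b1, w2, w_io):
    """CFNN output: the MLP output plus a direct input-to-output weighted sum."""
    y_mlp = mlp_forward(x, w1, b1, w2)
    return [y_mlp[k] + sum(w_io[i][k] * x[i] for i in range(len(x)))
            for k in range(len(y_mlp))]

# Toy network with arbitrary weights: 2 inputs, 2 hidden units, 1 output.
x = [0.5, 1.0]
w1 = [[0.1, -0.2], [0.3, 0.4]]   # w1[i][j]: input i -> hidden j
b1 = [0.0, 0.1]                  # hidden biases
w2 = [[0.7], [-0.5]]             # w2[j][k]: hidden j -> output k
w_io = [[0.2], [0.3]]            # w_io[i][k]: input i -> output k (CFNN only)

y_mlp = mlp_forward(x, w1, b1, w2)
y_cfnn = cfnn_forward(x, w1, b1, w2, w_io)
```

With these toy values the two outputs differ by exactly the direct term, 0.2 × 0.5 + 0.3 × 1.0 = 0.4, and setting every entry of `w_io` to zero reduces the CFNN to the MLP.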
3. Data Collection and Preparation for the Neural Network
For this work, 14 factors were investigated and analyzed, and the data for these factors were collected and extracted for the study area. This study focuses on Penang Island, which is shown in Figure 3. The study area lies between 5°15′ and 5°30′N latitude and 100°10′ and 100°20′E longitude and occupies an area of 285 km². Penang is one of the thirteen states of Malaysia. The state is bounded to the north and east by the state of Kedah, to the south by the state of Perak, and to the west by the Straits of Malacca and Sumatra (Indonesia). It consists of both the island of Penang and a coastal strip on the mainland known as Province Wellesley. Only the island of Penang is the study area in this research work. Penang Island experiences frequent landslides, particularly during the rainy seasons [14, 15, 25]. Penang Island has a tropical climate with high temperatures of 29°C to 32°C and humidity ranging from 65% to 96%. Topographic elevations vary between 0 m and 820 m above sea level, and the slope angle ranges from 0° to 87°. Flat lands make up 43.28% of the island. Geological data from the Department of Minerals and Geosciences show that Feringgi granite, Batu Maung granite, clay, and sand granite represent more than 72% of the study area's geology. Vegetation cover consists mainly of forests and fruit plantations.
The geographical database of Penang Island was compiled from data obtained through various government agencies. Maps of geology, roads, fault lines, elevation, drainage, soil texture, land cover, vegetation cover, and rain precipitation were obtained from the Department of Survey and Mapping Malaysia (JUPEM), the Department of Agriculture Malaysia (DOA), the Department of Minerals and Geosciences Malaysia (JMG), the Geographic Information System Center of Penang (Pusat PeGIS), the Malaysian Meteorological Department (MMD), and the Department of Irrigation and Drainage Malaysia (JPS). Topographic factors, which include slope angle, slope aspect, profile curvature, plan curvature, and general curvature, were extracted from the elevation data [16]. Landslide occurrence locations were also collected and determined. The range of each factor and the ratio of occurrence on the study area are shown in Table 1.

The data were prepared for the neural network as a training data set and a testing data set. The data were normalized to the range between 0 and 1, for each of the factors individually, based on

$$x_{\text{norm}} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}, \tag{5}$$

where $x$ is the sample to be normalized and $x_{\min}$ and $x_{\max}$ are the minimum and the maximum sample values for every single input factor. The neural network outputs are represented by an output of 1 for landslide and 0 for no landslide. An effective neural network requires a comprehensive training data set. Therefore, 137572 data samples were selected for each factor in this analysis, where 68786 samples represent landslides and 68786 samples represent no landslides. Two-thirds of the data (91715) were used for training and the remaining one-third (45857) was used for testing. The two neural networks, MLP and CFNN, were trained using the Matlab software.
To determine the network parameters, experiments were carried out by varying the number of hidden neurons from 1 to 100. For each number of hidden neurons, the network was trained by varying the number of epochs from 1 to 1000. The purpose was to find the number of epochs that produced the best generalization for each number of hidden neurons. The number of epochs and hidden neurons that produced the minimum mean squared error on the testing set was noted and the corresponding prediction accuracy was determined.
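The per-factor normalization and the two-thirds/one-third split can be checked with a short Python sketch. The function names and the sample elevation values are hypothetical; the sample counts are the ones reported above. How the samples were ordered or shuffled before splitting is not stated, so the sketch only reproduces the split sizes.

```python
def min_max_normalize(values):
    """Scale one factor's values to [0, 1] using that factor's own min and max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant factor carries no information; map it to 0 (assumption).
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def split_two_thirds(n_samples):
    """Return (training size, testing size) for a two-thirds/one-third split."""
    n_train = round(n_samples * 2 / 3)
    return n_train, n_samples - n_train

# Hypothetical elevation samples in metres (0 m to 820 m is the reported range).
elevation = [0.0, 205.0, 410.0, 820.0]
normalized = min_max_normalize(elevation)

# 68786 landslide samples + 68786 no-landslide samples, as reported.
n_total = 68786 + 68786
n_train, n_test = split_two_thirds(n_total)
```

Here `normalized` is `[0.0, 0.25, 0.5, 1.0]`, and the split gives 91715 training and 45857 testing samples, matching the reported figures.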
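The parameter search described above amounts to scanning a grid of hidden-neuron counts and epoch counts and keeping the pair with the lowest testing MSE. The sketch below shows only that selection logic. `select_best` is a hypothetical helper, and `toy_mse` is a made-up stand-in for "train the network and measure the testing MSE"; it does not represent measured results.

```python
def select_best(mse_fn, hidden_range, epoch_range):
    """Return (mse, n_hidden, n_epochs) with the lowest MSE over the grid."""
    best = None
    for n_hidden in hidden_range:
        for n_epochs in epoch_range:
            mse = mse_fn(n_hidden, n_epochs)
            # Strict comparison keeps the first configuration on ties.
            if best is None or mse < best[0]:
                best = (mse, n_hidden, n_epochs)
    return best

def toy_mse(n_hidden, n_epochs):
    """Made-up error surface with a single minimum at 12 neurons, 300 epochs."""
    return 0.06 + ((n_hidden - 12) / 100) ** 2 + ((n_epochs - 300) / 1000) ** 2

best_mse, best_hidden, best_epochs = select_best(
    toy_mse, range(1, 101), range(1, 1001)
)
```

In practice a single training run of 1000 epochs can record the testing MSE after every epoch, so the inner loop need not retrain the network from scratch for each epoch count.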
4. Results and Discussions
Two measures were used for the neural network performance analysis: the accuracy and the mean squared error (MSE). The MSE is calculated after each training epoch and is defined as the average squared error between the actual output and the predicted output. The MSE at epoch $t$ is given by

$$E_{\text{MSE}}(t) = \frac{1}{N} \sum_{n=1}^{N} \left( y(n) - \hat{y}(n, t) \right)^{2}, \tag{6}$$

where $y(n)$ and $\hat{y}(n, t)$ are the actual output and the predicted output for the $n$th sample for a given set of estimated parameters after $t$ epochs, respectively, and $N$ is the number of data that were used to calculate the MSE.
To verify the accuracy of each model, the Receiver Operating Characteristic (ROC) curve was used and the Area Under the Curve (AUC) was calculated for all the models. The AUC is a widely accepted measure for evaluating prediction models in natural hazard studies, and the AUC value is reported here as the accuracy.
The ROC curve plots the false positive rate on the x-axis against the true positive rate (1 − false negative rate) on the y-axis. The plot shows the tradeoff between the two rates, and the AUC is a summary indicator computed from the ROC curve that describes the accuracy of the model in predicting landslides. An AUC of 0.5 means that the model does not predict any better than a random approach, while an AUC of 1 indicates perfect prediction.
Table 2 shows the performance on the training data set achieved by the standard MLP and CFNN with the eleven learning algorithms and the 14 input factors. The results in Table 2 show that performance varies considerably across the neural network models and the learning algorithms. The best performance was obtained by the CFNN model with the LM learning algorithm, with an accuracy of 82.89% and an MSE of 0.0620. The worst accuracy of 71.15%, with an MSE of 0.1839, was obtained by the MLP model with the GDM learning algorithm.
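The MSE definition, the average squared difference between actual and predicted outputs, is simple enough to verify by hand. The sketch below uses a hypothetical helper name and made-up outputs for four samples, with targets of 1 for landslide and 0 for no landslide as in the text.

```python
def mean_squared_error(actual, predicted):
    """Average squared difference between actual and predicted outputs."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted)) / len(actual)

# Made-up example: targets are 1 (landslide) or 0 (no landslide).
actual = [1, 0, 1, 0]
predicted = [0.9, 0.2, 0.6, 0.1]
mse = mean_squared_error(actual, predicted)
```

The squared errors are 0.01, 0.04, 0.16, and 0.01, so the MSE is 0.22 / 4 = 0.055.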
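One way to compute the AUC without drawing the curve is the pairwise formulation: the AUC equals the probability that a randomly chosen landslide sample receives a higher network output than a randomly chosen no-landslide sample, with ties counted as one half. This equals the trapezoidal area under the ROC curve. The sketch below is an illustration with made-up scores, not the procedure used in the paper, and its quadratic cost makes it suitable only for small samples.

```python
def roc_auc(labels, scores):
    """AUC via pairwise comparison of positive and negative scores.

    labels: 1 for landslide, 0 for no landslide.
    scores: network outputs, where higher means more likely landslide.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0      # positive correctly ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(pos) * len(neg))

# Made-up example: three landslide and three no-landslide samples.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.35, 0.1]
auc = roc_auc(labels, scores)
```

In this example 7 of the 9 positive-negative pairs are ranked correctly, so the AUC is 7/9, roughly 0.78. A model that gives every sample the same score gets 0.5.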

For the MLP, the best accuracy was achieved by the LM algorithm (81.57% with MSE of 0.0910) and the worst by the GDM algorithm (71.15% with MSE of 0.1839). For the CFNN, the worst performance was obtained with the GD algorithm (71.24% with MSE of 0.1607). The LM algorithm gave the best accuracy regardless of the neural network model: 82.89% with MSE of 0.0620 for the CFNN and 81.57% with MSE of 0.0910 for the MLP.
Overall, the CFNN model achieved better accuracy and MSE than the MLP model with six learning algorithms, that is, LM, Rp, CGB, CGF, GD, and GDM, while with the remaining five, BFG, SCG, OSS, CGP, and GDX, the MLP achieved better accuracy than the CFNN.
Table 3 shows the results for the testing data set, obtained using the same network parameters. As expected, the test accuracy followed the training accuracy, with the CFNN trained with the LM algorithm achieving the best accuracy. Figures 4, 5, 6, and 7 show the ROC curves of the CFNN and MLP with the 11 learning algorithms applied to the testing data set.

5. Conclusion
In this paper, fourteen factors were collected and applied as input factors for ANN models. Two neural network models, MLP and CFNN, were compared using eleven learning algorithms. With the 14 factors, the networks predicted landslide occurrence on Penang Island with an accuracy of up to 81.62% on the testing data set. The comparison shows that the CFNN trained with the LM algorithm gave the best accuracy on both the training (82.89%) and testing (81.62%) data sets and can be adopted for landslide prediction. Applying the CFNN to landslide prediction in other study areas could be a subject of future work.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This research was supported by Ministry of Education, Malaysia (Grant no. 203/PJJAUH/6711279), Japan Science and Technology Agency (JST)/Japan International Cooperation Agency (JICA), and Science and Technology Research Partnership for Sustainable Development (SATREPS). The authors would also like to thank the JKR, JPS, JMG, and PEGIS, Malaysia, for the data used in this research.
References
[1] D. J. Varnes, Landslide Hazard Zonation: A Review of Principles and Practice, vol. 3 of Natural Hazards, UNESCO, International Association of Engineering Geologists, Commission on Landslides and Other Mass Movements on Slopes, Paris, France, 1984.
[2] D. J. Varnes, Landslide Hazard Zonation: A Review of Principles and Practice, Natural Hazards, vol. 3, pp. 176, UNESCO, International Association of Engineering Geologists, Commission on Landslides and Other Mass Movements on Slopes, Paris, France, 1984.
[3] J. N. Hutchinson, “Keynote paper: landslide hazard assessment,” in Landslides, Proc. Sixth Int. Symp. on Landslides, February, Christchurch, New Zealand, D. H. Bell, Ed., vol. 3, pp. 1805–1841, A.A. Balkema, Rotterdam, The Netherlands, 1995.
[4] G. Zhang, B. E. Patuwo, and M. Y. Hu, “Forecasting with artificial neural networks: the state of the art,” International Journal of Forecasting, vol. 14, no. 1, pp. 35–62, 1998.
[5] L. Sanzogni and D. Kerr, “Milk production estimates using feed forward artificial neural networks,” Computers and Electronics in Agriculture, vol. 32, no. 1, pp. 21–30, 2001.
[6] M. Kumar, N. S. Raghuwanshi, R. Singh, W. W. Wallender, and W. O. Pruitt, “Estimating evapotranspiration using artificial neural network,” Journal of Irrigation and Drainage Engineering, vol. 128, no. 4, pp. 224–233, 2002.
[7] M. Lashkarbolooki, Z. S. Shafipour, and A. Z. Hezave, “Trainable cascade-forward back-propagation network modeling of spearmint oil extraction in a packed bed using SC-CO₂,” The Journal of Supercritical Fluids, vol. 73, pp. 108–115, 2013.
[8] M. S. Alkhasawneh, U. K. Bt Ngah, T. L. Tien, and N. A. B. Mat Isa, “Landslide susceptibility hazard mapping techniques review,” Journal of Applied Sciences, vol. 12, no. 9, pp. 802–808, 2012.
[9] S. Lee, J. Ryu, K. Min, and J. Won, “Development of two artificial neural network methods for landslide susceptibility analysis,” in Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (IGARSS '01), vol. 5, pp. 2364–2366, 2001.
[10] B. Pradhan, L. Saro, and M. F. Buchroithner, “A GIS-based back-propagation neural network model and its cross-application and validation for landslide susceptibility analyses,” Computers, Environment and Urban Systems, vol. 34, no. 3, pp. 216–235, 2010.
[11] S. Lee, J. Choi, and K. Min, “Probabilistic landslide hazard mapping using GIS and remote sensing data at Boun, Korea,” International Journal of Remote Sensing, vol. 25, no. 11, pp. 2037–2052, 2004.
[12] F. Guzzetti, A. Carrara, M. Cardinali, and P. Reichenbach, “Landslide hazard evaluation: a review of current techniques and their application in a multiscale study, Central Italy,” Geomorphology, vol. 31, no. 1–4, pp. 181–216, 1999.
[13] B. Pradhan and S. Lee, “Delineation of landslide hazard areas on Penang Island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models,” Environmental Earth Sciences, vol. 60, no. 5, pp. 1037–1054, 2010.
[14] K.-W. Lim, L. T. Tay, and H. Lateh, “Landslide hazard mapping of Penang island using probabilistic methods and logistic regression,” in Proceedings of the IEEE International Conference on Imaging Systems and Techniques (IST '11), pp. 273–278, May 2011.
[15] H.-J. Oh and B. Pradhan, “Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area,” Computers & Geosciences, vol. 37, no. 9, pp. 1264–1276, 2011.
[16] P. K. Pang, L. T. Tay, and H. Lateh, “Landslide hazard mapping of Penang Island using decision tree model,” in Proceedings of the International Conference on Systems and Electronic Engineering (ICSEE '12), December 2012.
[17] Q. Zhou and X. Liu, “Analysis of errors of derived slope and aspect related to DEM data properties,” Computers & Geosciences, vol. 30, no. 4, pp. 369–378, 2004.
[18] M. Barletta and A. Gisario, “An application of neural network solutions to laser assisted paint stripping process of hybrid epoxy-polyester coating on aluminum substrates,” Surface and Coatings Technology, vol. 200, no. 24, pp. 6678–6689, 2006.
[19] M. S. Al-Batah, N. A. Mat Isa, K. Z. Zamli, Z. M. Sani, and K. A. Azizli, “A novel aggregate classification technique using moment invariants and cascaded multilayered perceptron network,” International Journal of Mineral Processing, vol. 92, no. 1–2, pp. 92–102, 2009.
[20] S. Haykin, Neural Networks: A Comprehensive Foundation, 2nd edition, 1999.
[21] M. Kurban and U. B. Filik, “A new approach for next day load forecasting integrating artificial neural network model with weighted frequency bin blocks,” in Neural Information Processing, M. Ishikawa, K. Doya, H. Miyamoto, and T. Yamakawa, Eds., vol. 4985, pp. 703–712, Springer, Berlin, Germany, 2008.
[22] C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, Oxford, UK, 1995.
[23] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” in Parallel Distributed Processing: Explorations in the Microstructure of Cognition, D. E. Rumelhart and J. L. McClelland, Eds., vol. 1, pp. 318–362, MIT Press, Cambridge, Mass, USA, 1986.
[24] K. K. Aggarwal, Y. Singh, P. Chandra, and M. Puri, “Bayesian regularization in a neural network model to estimate lines of code using function points,” Journal of Computer Sciences, vol. 1, no. 4, pp. 505–509, 2005.
[25] New Straits Times, Don't Over-Develop Penang to Avoid Landslides, 2012, http://www.nst.com.my/latest/dontoverdeveloppenangtoavoidlandslides1.193992.
Copyright
Copyright © 2015 Mohammad Subhi Albatah et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.