Abstract

An experimental investigation was conducted to explore the fundamental difference among the Mamdani fuzzy inference system (FIS), Takagi–Sugeno FIS, and the proposed flood forecasting model, known as hybrid neurofuzzy inference system (HN-FIS). The study aims finding which approach gives the best performance for forecasting flood vulnerability. Due to the importance of forecasting flood event vulnerability, the Mamdani FIS, Sugeno FIS, and proposed models are compared using trapezoidal-type membership functions (MFs). The fuzzy inference systems and proposed model were used to predict the data time series from 2008 to 2012 for 31 subdistricts in Bandung, West Java Province, Indonesia. Our research results showed that the proposed model has a flood vulnerability forecasting accuracy of more than 96% with the lowest errors compared to the existing models.

1. Introduction

Flood disaster [1] is one of the significant problems in some countries including Indonesia. Flood disaster occurs mostly in populated areas. The high rainfall in many cities increases the risk of flooding. The flood occurs mainly because of a high rate of rainfall for extended periods in the wet season. The drainage system could not control the problem of rising water volume in a river or a channel. The critical thing in meteorology, hydrology (e.g., flood warning), environmental policy, and agriculture, is the accurate measurement of rainfall [2]. As a result of flood, people loss their homes and their crops, miss the school education, and even worse, lose their life.

In October 2016, some regions in Indonesia were affected by the flood, and the disaster management officials reported that the flash flood in the Bandung city caused death of one person and damaged thousands of homes. The National Disaster Management Authority, also known as BNPB, informed that there was 77 mm of rainfall in the town in just 1.5 hours around midday [3, 4]. Most of the areas of the city were inundated with flood water between 120 cm and 200 cm deep [3, 4]. In the same month, a part of Gorontalo, Indonesia, was affected by flood; at least 1,500 homes were damaged, and around 4,500 people were forced to evacuate their homes [5, 6]. In December 2016, a part of West Nusa Tenggara Province, Indonesia, was hit by two floods in the space of days, forcing over 100,000 people to leave their homes. BNPB informed that thousands of homes were damaged, and streets were left with flood water from one meter to three meters deep [4, 6]. On January 26, 2017, flood hit North Sulawesi, Indonesia. This flood affected 11 villages in Gorontalo Utara and damaged over 700 homes, some schools, and agricultural land, forcing more than 100 people to leave their places [4, 7]. On March 3, 2017, at least six people died, two seriously injured, and thousands displaced due to floods in Indonesia's West Sumatra Province. The flood also caused power and communication outage, and over one hundred electrical substations were shut down, leaving almost 15,000 houses without electricity [4, 8].

The answer to overcome the flood disaster by applying the fuzzy system is assisting, predicting, and deciding these events. In 1965, Lotfi, a mathematician who created the theory of fuzzy logic, derived the result of the insufficiency of Boolean algebra for many real-world problems [9, 10]. In fact, as most of the information is imprecise, one of the human’s greatest abilities is to process inaccurate problems and vague information efficiently. The Oxford English Dictionary defined the word “Fuzzy” as blurred, indistinct, imprecisely defined, confused, or vague. The fuzzy systems are knowledge-based or rule-based systems. The heart of a fuzzy system is the so-called knowledge-based fuzzy IF-THEN rules [11].

Many researchers have argued on meteorological forecasting, with the purpose to help assessing the disaster impact in some regions. Forecasting is also used in other areas such as detection and prediction of diseases [12, 13]. Asklany et al. [14] proposed the probabilistic prediction using two skill scores, namely, the Brier Score and Friction Score. Those researchers mentioned that the rainfall event prediction was highly accurate with the data aggregated by the stations with increasing output toward the real-time event. Otherwise, the fuzzy inference system output precedes the recorded maximum data in six hours before the rainfall. Li-Chiu et al. [15] introduced a model to predict the stream-flow an hour after a flood event occurs. They also compared the result using a fuzzy exemplar-based inference system with backpropagation neural network to show that a fuzzy model performs better than the neural network. de Bryun et al. [16] described a method based on fuzzy arithmetic to estimate the possible range of flow rates and water level based on possible rainfall events by the forcing and uncertainty model. A previous research [17] explained a rainfall level detection system to predict the weather using the Mamdani method. In [18], the rainfall event was analyzed using the defuzzification method. In [19], the genetic algorithm (GA) was discussed to obtain a higher accuracy of the rainfall forecasting system. In [20], the agricultural irrigation control using the fuzzy method for determining water quantities was studied. In [21], the Mamdani FIS was studied to predict the rainfall events in the Khorasan region. In [22], flood events were predicted using numerical weather prediction (NWP) with improved accuracy. In [23], the main objective is to predict the high-risk area by water level using Artificial Neural Network in Masantol, Pampanga. In [24], a flood event prediction model based on SVM and boosting algorithm was presented. Mitra et al. [25] explained that flood forecasting employs the Internet of Things and artificial neural networks based on the water level in a river basin. In [26], the flood forecasting system development in the upper reaches of the Zhangweihe River Basin was discussed. In [27], the grey model for flood forecasting based on the rainfall and the flood index in Nanjing, China, was studied. All the previous studies described flood forecasting based on different methods and employed rainfall parameters. Gilbert Brunet described meteorological forecasting using numerical weather prediction (NWP) and obtained results with good quality and accuracy with the complexity of numerical computing; however, a certain percentage for the accuracy was not obtained [28]. Mencattini et al. utilized the type-2 fuzzy logic system in their system which aims at meteorological forecasting. The measurement consists of humidity and temperature that was obtained from the Neuronica Lab at Politecnico of Torino (Italy). This research achieved accurate forecasting results even many hours in advance with a mean absolute error of 33.8% for humidity and 8.2% for temperature [29]. Hansen and Riordan employed case-base reasoning (KNN method) and fuzzy set theory to predict the airport weather. The data tested were more than 300,000 hours of airport weather observation recorded for 36 years. The testing methods used to solve the problem were cloud ceiling and visibility of those airports produced by 6-hour predictions. The result showed good accuracy for the airport weather prediction [30]. Ding et al. employed the artificial neurofuzzy inference system (ANFIS) and clustering technique. The ANFIS was used to make sure of the minimal errors of the parameters of membership functions (MFs) and the design of MFs. The result was efficient and close to that of actual load forecasting in practice. However, the result had drawbacks of the neural network and was sensitive to the noise [31]. Mountis and Levermore utilized the fuzzy logic, artificial neural network (ANN), and neurofuzzy method to forecast the weather at the Manchester International Airport for the years 1982–1995, which examined only summer data. The accuracy of the forecast depends on how well its parameters are defined and adjusted by the ANN method. The results of this research provided satisfactory MAE values: neurofuzzy 17 %, ANN 18%, and fuzzy logic 30% [32]. Ahmad et al. in their research employed the ANN method with the supervised learning technique for forecasting. The study area in this research was Senai, Johor, Malaysia. Data inputs for the forecasting were pressure, time of day, dry bulb temperature, wet bulb temperature, dew point, wind speed, wind direction, cloud cover, and rainfall in the past hour. The accuracy was 78% [33]. Qin et al. employed ANNs with a gradient descent technique RPROP with dynamic tunneling technique to train the actual data of past 24 months in 1999–2000 with the data tested of six months (2001/1–2001/3, 2001/7–2001/9) from Chongqing, China. The result gave faster convergence and global optimization for forecasting. The result is close to the real data [34]. In [35], the flood prediction employing artificial neural network was described, and the flood water level was successfully predicted 24 hours, 48 hours, and 72 hours ahead of time. The results showed MAE varying from 0.6 to 0.9 and RMSE varying from 0.05 to 0.11. In 2012, in a study case in Tancheon, South Korea, Choi et al. employed neurofuzzy system to forecast the flood [36]. The results showed the average RMSE was 0.367%.

This study proposes a hybrid approach based on the neural network and fuzzy inference system for flood event vulnerability, namely, hybrid neurofuzzy inference system (HN-FIS). The HN-FIS is a model which can automatically learn and also obtain the output which can present the essence of fuzzy logic. The system was applied in 31 subdistricts in Bandung. The flood forecasting depends on several variable inputs: population density, altitude of the area, and rainfall in time series from 2008 to 2012. The main contributions of this paper are (i) presenting a hybrid forecasting for flood vulnerability based on the neural network and fuzzy inference system for accurate flood forecasting employing data variables which utilized Bandung database for flood vulnerability forecasting and (ii) developing an effective hybrid forecasting approach for flood vulnerability with higher accuracy.

2. Study Area

This study used data collected from Bandung, West Java Province, Indonesia (Figure 1). Geographical conditions of the subdistricts have total area cover about 176238.67 Ha, which lies between longitudes 107022′ and 108050′ east and latitudes 6041′ and 7019′ south. Most of the area of Bandung is located between the surrounding hills and mountains. To the north lies Mount Bukittunggul with a height of 2200 meters and Mount Tangkuban Perahu with a height of 2076 meters, which borders West Bandung and Purwakarta. On the south, there is Mount Patuha with a height of 2249 meters, Mount Malabar with a height of 2321 meters, Mount Papandayan with a height of 2262 meters, and Mount Guntur with a height of 2249 meters. Bandung has mountainous areas with an average slope of from 0–8%, 8–15%, to above 45%. Bandung has a tropical climate that is influenced by the monsoon climate with average rainfall between 2000 mm and 3000 mm per year (Table 1). Air temperature ranges from 12°C to 24°C, which results in air humidity of about 78% in the rainy monsoon season and 70% in the hot dry season.

The primary variables (Tables 13 used to obtain the flood forecasting value were divided into four parameters (population density, altitude of the area, rainfall, and vulnerability of flood). The definitions of the main parameters and the fuzzy value are presented below:(1)Population density: the population density of the subdistricts in which the people are located. The value zero means the population density is very low (less than or equal to 50 persons/km2). The value one means the population density is very high (that is, greater than 400 persons per square kilometer).(2)Altitude of the area: the distance above sea level of the land, mountain, sea bed, or any other place. If the altitude of an area is less than 200 meters above sea level or coastal area, it is shown in fuzzy as low level (value 0), and the altitude of area greater than 350 meters above sea level or mountain area means the altitude in high level (value 1).(3)Rainfall: according to the rate of rainfall, it is classified as low level “0,” light rain which happens when the precipitation rate is less than 20 mm per hour, and high level “1,” very heavy rain which happens when the precipitation rate is more than 100 mm per hour.(4)Vulnerability of flood: the inability to resist a flood or to respond when the flood occurs: 0 = safe (when the area is secured from flood) and 1 = danger (when the area is under a threat of flood).

3. Material Parameters

The flood vulnerability in 31 subdistricts in Bandung is predicted using the Mamdani system, Sugeno system, and proposed flood forecasting model (HN-FIS). It consists of three inputs for vulnerability of flood level: population density, altitude of the area, and rainfall. The population density is in the range of 350 to 9000 people/km2. The altitude of the area is in the range of 0 to more than 1000 meters above sea level (masl). The rainfall is in the range of 0 to more than 200 mm. All the fuzzy models in this research were applied in the trapezoidal type trying to find the best one for the prediction of the vulnerability of flood event. The classification of fuzzy sets employed in the flood forecasting method is presented in Table 4. There are three or four indexes to indicate the parameters according to Tables 13, respectively. The inputs have three or four membership functions as shown in Tables 57 presented for the population density, altitude of the area, and rainfall (also shown in Figures 2(a)2(c)), respectively. The output (the vulnerability of flood) is taken in values ranging from 0 to more than 374, presented for three conditions (safe, alert, and danger) as shown in Table 8 and Figure 3.

According to Table 5, the population density has four fuzzy classifications such as very low, low, high, and overpopulation, respectively (Table 2).

Table 6 describes the fuzzy classification for the altitude of the three-level fuzzy area parameters, such as low, moderate, and high.

In Table 7, the fuzzy classification was divided into four levels such as low, moderate, high, and extreme rainfall.

Table 8 provides the fuzzy classification for the output of the vulnerability of flood. It has three levels fuzzy: safe, alert, and danger.

4. Distributed Implementation of Hybrid Flood Forecasting Model

Based on the measurement and theoretical analysis, both Mamdani and Sugeno models required a significant number of forecasting to obtain a higher level of accuracy for the vulnerability of flood. The models considered the parameters that are in flood forecasting. We present the Mamdani model and the Sugeno model for practically distributed flood prediction.

4.1. Mamdani Fuzzy Inference System (Mamdani FIS)

Mamdani and Assilian proposed the first type of fuzzy inference system (FIS) in 1975 [37]. A Mamdani FIS has fuzzy inputs and fuzzy output. The architecture of the Mamdani FIS to show the mapping from input space into output space can be seen in Figure 4 (referred from [38]).

According to Figure 4, the system of the crisp input source is first transformed by fuzzifier into a set of linguistic variables in X. The fuzzy inference engine using the input variables and the rules to decide on the fuzzy rule base derives a set of conclusions in V. Defuzzifier purpose to convert into a crisp number which corresponds to the output of the system [39].

4.2. Sugeno Fuzzy Inference System (Sugeno FIS)

Takagi and Sugeno proposed the first fuzzy inference system, namely Sugeno FIS, in 1985 [40] and by Sugeno and Kang in 1988 [41]. A Sugeno FIS has fuzzy inputs and a crisp output.

Referring to the same assumptions as for the Mamdani FIS, the architecture for the Sugeno FIS is illustrated in Figure 5 (according to [38]).

In this short of fuzzy inference system, only the antecedents of the rules are fuzzy, and it means the rules act as an inference mechanism themselves [38, 41]. The main difference of this architecture which compared with Mamdani FIS is that the Sugeno FIS does not require a defuzzification to obtain a crisp result output from the rules consequents. The crisp result can be obtained employing a weighted average of the rules crisp consequents using the firing strength level as weights [38, 41, 42].

4.3. Proposed Flood Forecasting Model

Takagi and Sugeno [40] presented an adaptive neurofuzzy inference system that was obtained from the neural network and fuzzy logic [43] by catching the advantages of both in one framework. The neural network has the capability of automatic learning. However, this model cannot describe how it acquires the output from decision making. On the other hand, the fuzzy logic can obtain output out of the fuzzy logic decision. However, it does not have the ability of learning automatically [44]. Combining neural network and fuzzy logic can generate input and output data pairs, and it has been successfully used in diverse fields at solving nonlinear issues and indicating problems [45]. In this study, the Sugeno fuzzy multilayer, which is equivalent to a neural network, and the Mamdani fuzzy inference system were combined to form a hybrid neurofuzzy inference system (HN-FIS). The advantage of the proposed model is its capability of automatically learning and obtaining an output of fuzzy logic decision more clearly, which can exhibit human judgment reasonably.

Considering Figures 4 and 5 have the same rule base and fuzzification for the variables, there are several defuzzifiers which can be chosen for a Mamdani FIS that originate similar results in a Sugeno FIS, which means an inevitable overlap between both types of systems. The Mamdani FIS is more widely used, particularly for decision support applications, and mostly refers to the intuitive and interpretability nature of the rule base. On the other hand, the Sugeno FIS do not have a linguistic term, and this interpretability is partially lost [41, 46]. However, since Sugeno FIS rule’s consequents can have as many parameters per rule as input values, this translates into more degrees of freedom in its design than a Mamdani FIS, thus providing more flexibility [41]. Mendel reaches this conclusion by comparing the number of possible design parameters for both Mamdani FIS and Sugeno FIS for certain choices of input and output variables [41].

According to that fuzzy inference system, many parameters can be employed in the consequents of the rules of a Sugeno FIS which reasonably approximates a Mamdani FIS. This session described how the proposed flood forecasting model (hybrid neurofuzzy inference system (HN-FIS)) works.

The Takagi-Sugeno (Sugeno) fuzzy model and Mamdani fuzzy model are two great fuzzy rule-based inference systems. The Sugeno fuzzy inference system works well with linear techniques and guarantees continuity of the output surface [40, 47]. However, the Sugeno fuzzy model has difficulties in dealing with the multiparameter synthetic evaluation. It has difficulties in assigning weight to each input and fuzzy rule. The Mamdani fuzzy model has had some advantages such as its intuitive, widespread acceptance, and well suitable to human cognition [37, 48, 49]. The researchers employed the Mamdani model and the Sugeno model as a proposed flood forecasting model (hybrid neurofuzzy inference system), which shows the advantages of those models in the output statement, which is more readibility and easy to understand even by the layperson.

A function needs to be assigned to specify the operation of the Mamdani fuzzy model entirely with the following steps:(1)Operator OR or operator AND to the rule firing strength computation with OR’ed or AND’ed references(2)Consequent membership function calculated from the implication operator based on a given firing strength(3)Aggregate operator used to produce overall output membership function by combining the aggregated qualified consequents(4)Defuzzification operator aims to transform an output membership function to a crisp single output value

If the first step is the AND operator, the second step is a product, the third step is the sum, and the fourth step is the centroid of the area (COA) [50, 51], we can derive the following equations. The advantage of applying hybrid neurofuzzy inference system (HN-FIS) model is the ability of learning because of differentiability during computation.

Equations (1) and (2) [37] provide the sum-product composition. The final crisp output when using centroid defuzzification is equal to the weighted average of the centroid of consequent membership functions:where is the weighted factor of ; is the ith fuzzy rule; is the firing strength of ; and is the area of the consequent membership functions of .where and are the area and the center of the consequent membership function, , respectively.

The rules of the HN-FIS model are given as follows:Rule 1(r1): if P is P1, A is A1, and R is R1, then Z=B1.Rule 2 (r2): if P is P2, A is A2, and R is R2, then Z=B2.Rule n(rn): if P is PnA is An and Rf is Rfn, then Z=Bn.

According to the rules, the HN-FIS model can be expressed as shown in Figure 6.

P, A, and R represent the inputs, which are population density (Table 5), altitude of the area (Table 6), and rainfall (Table 7). P1, P2, P3, and P4 represent the membership functions of population density. A1,A2, and A3 represent the membership functions of the altitude of the area. R1,R2,R3, and R4 represent the membership functions of rainfall. The firing strength denoted as , , …, . B1, B2, …, Bn represents the following parameters which need to be adjusted. The consequent parameter B is a multiplication of and (according to (2)). The membership function of the vulnerability output is denoted as .

The HN-FIS architecture is composed of five layers, and Figure 6 illustrates the output of each layer.First layer: fuzzification layer

The membership function is the generalized trapezoidal function, denoted as follows:

P is the crisp value (real value) of population density, A is the crisp value of altitude of the area, and R is the crisp value of rainfall. {si, ti, ui, } is the premise parameter or the set parameter which is used to denote the membership functions in this model.Second layer: the layer of rule or the layer of inference

In this layer, the product method is generated for the firing strength .Third layer: implication layer

The product of this layer comes from the implication operator.Fourth layer: aggregation layer

The result of this layer is the sum of all implication operators in the implication layer. The following parameters are denoted by Bi.Fifth layer: defuzzification layer

The defuzzification (D) method and center of the area (COA) were achieved to produce the crisp output .

In this paper, the trapezoidal functions generalized were used for the type of membership functions (MFs) of the inputs and had four nonlinear parameters to be adjusted ({si, ti, ui, vi}). The MFs of population density have four nonlinear parameters (shown in Table 5 and Figure 2(a)). The MFs of the altitude of the area have three nonlinear parameters (shown in Table 6 and Figure 2(b)). The MFs of rainfall have four nonlinear parameters (shown in Table 7 and Figure 2(c)). In this model, premise parameters are 48, and following parameters are 96. Hence, the total number of the nonlinear parameter is 140.

4.4. Evaluation Criteria for Model Performance

If Mi is the measured value for the number of subdistricts and Pi is the prediction value in the same subdistricts, then the error (Ei) is defined as

Since there are measured values and predictions for n subdistricts, there will be n error terms, and the standard statistical measures can be defined as follows.

Mean absolute error (MAE) is the average of all absolute errors, meaning the amount of all absolute errors divided by the number of errors. The equation of MAE is as follows:

Root mean square error (RMSE) is the square root of the average of squared differences between the measured value and the predicted value shown in the following equation:

The mean absolute error is defined by first making each error positive by taking its absolute value and then averaging the result in the square root. The RMSE is defined by the similar idea of the mean absolute error. In RMSE, the errors are made positive by squaring each one, and then the squared root errors are averaged. The MAE has the advantage of being more interpretable and easier to describe nonspecialists. The RMSE has the advantage of being easier to handle mathematical problems. Each of these statistics deals with measures of accuracy whose size depends on the scale of the data [32, 52].

5. Results and Discussion

The discussion of the results begins with explanation of the performance of the proposed vulnerability of the flood forecasting model based on the neurofuzzy system approach, namely HN-FIS. The flood forecasting models are developed employing MATLAB 2017 software [53]. The results are presented as follows.

In our experiment, three graphics illustrated in Figure 7 are the result of the vulnerability of flood forecasting employing three models: Mamdani, Sugeno, and HN-FIS, respectively, applied in 31 subdistricts in Bandung, Indonesia. Those models have used to design of experiment to find the best model, which could affect the accuracy of flood vulnerability forecasting. Figure 7 shows also that all of the models have similar values from the measured data and predicted data. Further analysis showed that the high accuracy achieved by three models to decide the vulnerability of flood in Bandung (shown in Figure 8) is based on Equation (10).

In order to evaluate the performance of the proposed model, other commonly used techniques such as the Mamdani model and the Sugeno model were employed for comparison purposes. The forecasting errors obtained are presented in Table 9. For MAE measurement, the average error is 1.7797%, 1.4035%, and 1.2556% for the Mamdani model, the Sugeno model, and the proposed flood forecasting model (HN-FIS), respectively. These average values in RMSE measurement are 0.0382%, 1.9479%, and 0.0371% for the Mamdani model, the Sugeno model, and the proposed model (HN-FIS), respectively. The results showed that the proposed model reduces on flood vulnerability forecasting errors considerably in 31 subdistricts in Bandung, indicating the great improvement in flood vulnerability forecasting accuracy.

Figure 9 illustrates MAE and RMSE values in three models for flood vulnerability forecasting. The proposed model outperforms the other models and forecasted values correctly following the trend of flood variation.

Compared with the other methods presented in the literature using other databases, the proposed hybrid model provides reliable flood vulnerability forecasting, as shown in Table 9. In [35], the accuracy obtained in terms of MAE from 0.627 to 0.9357 and RMSE from 0.0523 to 0.1154 for flood prediction of 24 hours to 72 hours ahead of time. In [36], the accuracy of the average RMSE was 0.367% for flood forecasting in Tancheon Basin in Korea. The RMSE obtained with the proposed model varies from 0.0126% to 0.0548% for flood vulnerability forecasting. It means if an error is small, then accuracy will be close to real data and the model will give better flood vulnerability forecasting result. The proposed model achieved improvement in accuracy compared to the existing algorithms.

6. Conclusions

Neural network adopts a linear equation in the consequent part, which cannot present human assessment reasonably. In this case, we propose the hybrid model based on neural network and fuzzy inference system (HN-FIS) which has greater advantages in the following part and intuitive part of fuzzy reasoning. The proposed model has been constructed by a hybrid technique of the Mamdani model and Sugeno model based on the neurofuzzy inference system approach. The HN-FIS model can show its readability and understandability and present the essence of fuzzy logic more clearly. The current study aimed to determine and optimize the performance of the proposed model (HN-FIS). As supported by measurement and the predicted values based on simulation, the proposed model compares favorably with the Mamdani model and the Sugeno model in the capabilities of predicting the vulnerability of flood in 31 subdistricts in Bandung, Indonesia. The most apparent finding to emerge from this study is that three model flood forecasting (Mamdani, Sugeno, and proposed model) achieved the performance of more than 96%. However, the proposed model (HN-FIS) achieved the lowest error rate in both RMSE and MAE (0.0371% and 1.2556%, respectively) and obtained the best performance in flood vulnerability forecasting compared with existing models.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

There are no conflicts of interest to declare in this paper.

Acknowledgments

The authors thank Prof. Hou Rongtao and Dr. Irfan Dwiguna Sumitra for their support in this research.