Abstract

Extraction of water bodies from satellite imagery has been widely explored in the recent past. Several approaches have been developed to delineate water bodies from different satellite imagery varying in spatial, spectral, and temporal characteristics. The current study puts forward an automatic approach to extract the water body from a Landsat satellite imagery using a perceptron model. Perceptron involves classification based on a linear predictor function that merges few characteristic properties of the object commonly known as feature vectors. The feature vectors, combined with the weights, sum up to provide an input to the output function which is a binary hard limit function. The feature vector in this study is a set of characteristic properties shown by a pixel of the water body. Low reflectance of water in SWIR band, comparison of reflectance in different bands, and a modified normalized difference water index are used as descriptors. The normalized difference water index is modified to enhance its reach over shallow regions. For this study a threshold value of 2 has been proved as best among the three possible threshold values. The proposed method accurately and quickly discriminated water from other land cover features.

1. Introduction

Mapping of natural resources like forest and water bodies using satellite imagery has gained much importance in the recent past. Both forest and water resources are subject to intense exploitation and monitoring them at regular intervals is imperative for their sustainable management. Water bodies, which play a key role in the global carbon cycle and climate variations, are mapped in spatiotemporal domain to analyze and assess the extent and rate of their degradation and disappearance. Geospatial tools are proving to be advantageous for such impact assessment for the implementation of conservation measures [14].

Researchers across the globe have used different satellite data varying in spatial, spectral, and temporal characteristics to generate thematic maps of land use land cover or maps with special emphasis on water bodies [511]. At the same time, various techniques have been adopted to extract these features from satellite imagery and each method has its own merits and demerits.

Visual interpretation of satellite data provides the best delineation of water bodies of varied sizes but is time consuming, especially when working with high resolution data [12, 13]. The simple and common approach of unsupervised classification which uses an interactive self-organizing data analysis technique [14] provides results with very low accuracy, when there is spectral overlap between water bodies with other classes. In contrast, supervised classification presents more accurate and reliable outputs than unsupervised method [15] but may vary when used for high resolution data [16, 17]. Moreover, the supervised technique requires sufficiently large spectral training data sets and is not a fully automated method [1820]. Further it does not take into account the spatial features of the objects [21].

The method of fractal characterization classifies the features based on their texture either smooth or rough. Since water bodies exhibit a smooth texture compared to the other landscape features like vegetation and buildings in the satellite imagery, they can be easily extracted using the fractal method. However the method does not take into account the spectral features of the objects; hence different classes with varied spectral characteristics but with similar textures are classified as one class. Further, the results may also differ significantly with image resolution.

The use of the normalized difference water index (NDWI) method maximizes the reflectance properties of water by minimizing the low reflectance of near infrared (NIR) and maximizing the reflectance in the green wavelength [22, 23]. Studies show that this method yields better results for deeper and worse for shallower parts of the water body.

The threshold method is one of the most widely used algorithms for the extraction of water bodies from satellite imagery. The method is based on the fact that the reflected radiance of water in short-wave-infrared (SWIR) band is lower than that of other objects like vegetation, buildings, bare soil, and roads. Each pixel that passes the threshold test is classified as water body along with some other objects which are not truly a water class, providing false positive results. This method also has constraints related to size and shadows.

Whatever be the approach, generally the user is interested in a method which is fast, accurate, and automated. Towards this objective, researchers adopted hybrid approaches using different algorithms like decision tree classifier [24, 25], neural network [26], and other methods [2734]. Similarly the current study puts forward an automatic approach to extract water bodies from a satellite imagery using a perceptron [35] model for classification.

The Perceptron model has been extensively utilized for object recognitions in the domain of image processing [3644]. Over the past years a considerable increase in the availability of remotely sensed satellite data and the use of single layer and multilayer perceptron models have been reported. Use of perceptron models by several researchers showed better classification results than the conventional multispectral classification method [36] and also requires less training data [4547].

Compared to single layer, multilayer perceptron model (MLP) proposed by Rumelhart et al. [48] is most commonly implemented artificial neural networks (ANN) in various geospatial applications. More specifically, in different domain, such as urban planning, land use, land cover mapping, forestry, change detection, for example, Patra et al. [49] in their change detection study used MLP model with context sensitive semisupervised techniques to differentiate changed and unchanged pixels without prior knowledge of ground inventory. Mishra et al. [50] for Muzaffarpur city, India, B. Ahmed and R. Ahmed [51] for Dhaka city, predicted future LULC changes using MLP modeler. Similarly Gopal and Woodcock [52], Erbek et al. [53], and Petra et al (2014) also employed MLP for pattern recognition and change detection studies. Similarly, MLP was used by Fiset et al. [54] for image matching; Özkan and Erbek [55], Oliveira et al. [56], and Fierens and Rosin [57] for classification and feature extraction from satellite images; Kotsiantis [58], Freund and Schapire [59], Canargo and Yoneyama [60], and Pal and Mitra [61] for Hyperspectral data classification. With respect to forestry, Vehtari and Lampinen [62] used MLP to identify tree trunks in digital images and Boschetti et al. [63] for image fusion of hyperspectral data with pan data to extract vegetation under storey information. Lippit et al. [64] applied MLP to map selective logging sites in deciduous and mixed-deciduous forest in Massachusetts and Li et al. [65] and Jensen et al. [66] for forest age estimations; Chaudhuri and Parui implemented MLP [67] in defense application to identify target objects. Goswami et al. [68] have shown the strength of MLP to identify objects in an image automatically. Their research shows that the perceptron can be effectively used to extract water bodies from satellite data with good accuracy.

This paper presents an automatic approach to extract the water body from a Landsat satellite imagery using a single layered perceptron model based on a linear predictor function that merges few characteristic properties of the object commonly known as feature vectors. The feature vectors, combined with the weights, sum up to provide an input to the output function which is a binary hard limit function. The feature vector in this study is a set of characteristic properties shown by a pixel of the water body. Low reflectance of water in SWIR band, comparison of reflectance in different bands, and a modified normalized difference water index are used as descriptors. The normalized difference water index is modified to enhance its reach over shallow regions.

2. Materials and Methods

Landsat ETM data of 29th October, 2001, of Hyderabad city, India, were used as test material to extract water bodies (Figure 1). Out of the seven bands of Landsat data, five bands were used in the current approach, namely, blue (0.45~5.2), green (0.52~0.60), red (0.63~0.69), near infrared (NIR—0.76~0.90), and short wave infrared (SWIR—1.55~1.75). The method put into use here is a classification based on perceptron which is an algorithm for supervised classification. Perceptron involves classification based on a linear predictor function that combines some characteristic properties of the object commonly known as descriptors or feature vectors. A feature vector is an attribute that represents the object; the more the feature vectors, the easier the process of classification. These feature vectors are combined with weights for the construction of a predictor function. The perceptron is a binary classifier and maps the input “” to a value “” using a hard limit function (Figure 2).

To execute classification using perceptron, the first step is to define feature vectors by finding out the descriptors that are characteristics of a water body, which are either pixel based or object based. In the current study only pixel based information is used. Once vector features are defined the next step is to initialize the weights for each vector. These weights depend on the spatial and temporal properties of the objects. For example, the reflectance value for a band shown by a water body in summer is different from what it shows during monsoon season (flood times, more precisely) because of increased water level and sediment deposition and other impurities. Hence there is no universal value for the weights for each descriptor and the optimal values might change depending on the time and location. The values of weights usually vary from 0 to 1, where a weight of 0 indicates that the descriptor no longer shows any characteristic property of the object and a weight of 1 means that this property must be shown by the object.

After finding the feature vectors and initializing the weights, a weighted sum was calculated. This weighted sum served as the input to the output function. The output function is a hard limit binary function and while performing the process of classification it uses a threshold value. The threshold value depends on the feature vectors and the weights. In this case we choose the maximum possible value of the weighted sum displayed by a water body pixel and the calculation of such value was done by using the extreme values. The weighted sum acquired for each pixel is compared with the threshold value and in this way the classification is done.

2.1. Feature Vectors: Three Feature Vectors Are Used
2.1.1. Low Reflected Radiance of Water in SWIR Band

The reflected radiance of an object that is captured in a remote sensing image depends upon the extent of electromagnetic radiation absorbed by the object; that is, the more it absorbs, the less it reflects. The water body absorbs more in the infrared region. Hence the reflected radiance of water in infrared and short-wave-infrared bands is lower than that of other objects like vegetation, buildings, bare soil, and roads and so forth. The reflectance values for some pixels of water body and land features are shown in Figure 3.

Further from Figure 4 it is clear that the water bodies show a distinctly low reflectance radiance in all infrared bands including band 5, that is, short-wave-infrared band.

2.1.2. Reflectance in B5 < B2 and Reflectance in B4 > B3

(a) The reflectance of water in band 5 is less than reflection of water in band 2. (b) The reflection of water in band 4 is greater than the reflection in band 3.

The above two properties derived by analyzing the pixel values are true for water body but not for other objects like soil, sand, roads, vegetation, or buildings. Hence by applying these two conditions water bodies can be easily extracted.

2.1.3. Modified Normalized Difference Water Index (MNDWI)

The equation for NDWI is (Green − NIR)/(Green + NIR). The selection of these wavelengths maximizes the reflectance properties of water, that ismaximize the typical reflectance of water features by using green wavelengths;minimize the low reflectance of NIR by water features;maximize the high reflectance of NIR by terrestrial vegetation and soil features.

However NDWI does not extract shallow parts of the water body and is unable to separate built-up structures from water feature [22]. Therefore to increase the level of details in the result we have used the summation of NDWI and a modified NDWI (MNDWI) and then did the binarization of result. The index used is where MNDWI is (Blue − NIR)/(Blue + NIR).

2.2. Binarization of Index

After calculation of this index it is binarized with respect to zero. All the values above zero are changed to 1 and below zero to zero. This binarization helps in two ways: first, all the water pixels become 1 and all the nonwater pixel becomes zero. This helps in selecting the threshold for output function. Second, root out the diminishing effect of the negative values while calculating the weighted sum.

2.3. Threshold Value

The threshold value determines whether a pixel belongs to a water body or not. The maximum value for each of the three feature vectors can be 1. Hence the maximum value of the weighted sum cannot be more than 3. If the pixel follows only one of the three descriptors, then the value of the weighted sum would be 1, if it follows two descriptors then the weighted sum would be 2, if it follows all the three descriptors, then the value would be 3, and if it does not follow any of the descriptors, then its value would be 0.

This approach involves a comparative study taking into account all the three possible thresholds. The most optimum of the three thresholds is 2 as it ensures that the pixel shows at least two of the three characteristic properties. After the output function finishes its job, the water bodies get extracted. But along with the water bodies a portion of small regions or single pixels is also extracted, which are usually referred to as noise. The main reason for the existence of noise is the presence of small water bodies formed as a result of heavy rains or any other reason. The only factor that could separate these unwanted results from the permanent water bodies is the size factor and can be achieved by the following two methods.(1) Single pixel removal. Tracing the neighborhood of each pixel, if the pixel is not found lying in the decided range, it is removed from the “water body” class.(2) Size parameter. After implementing a single pixel removal, noise might still exist. So now the task is to find out the maximum size of cluster of pixels that is not the part of water body but still exists in the image. There is a risk of losing some part of the water body region if an appropriate value is not chosen. There is also a possibility that size of few water bodies is itself lesser than that size-threshold limit. Size parameter provides flexibility in the process of noise removal. The value of size-threshold parameter can be set according to the desired size of the water body. If the focus is only on extracting big lakes, the size-threshold can be increased accordingly. If the focus is on extracting all possible water sources, then the size-threshold parameter can even be completely removed.

3. Results and Discussion

For deeper parts of the water body the reflectance values in band 5 are less than 35 but for shallow parts, the reflectance values are high. For marshy regions it goes beyond 55. A comparative study for extraction of water bodies using different values of the SWIT-threshold is carried out starting from the values 40 to 65. Similarly the size parameter is also flexible and depends upon the minimum size of the water body that we are interested in. So the results are tabulated keeping in view different threshold and varying the size parameter (Table 1).

The study is carried out taking into account all the three possible perceptron-threshold values. A perceptron-threshold value of 1 indicates that the pixel classified as water body follows one of the three descriptors. A perceptron-threshold value 1 brings into the result a lot of noise. The results suffer from the same problem of false positive results as the normal SWIR-threshold method. The number of false positive results increases with the increase in threshold value of the SWIR band as more and more nonwater pixels get included in the result.

Perceptron-threshold value of 2 indicates that the pixel classified as a water body pixel follows at least two of the three descriptors. This is the best and most optimum value for perceptron-threshold as it neither allows nonwater pixels to get included nor does it cut down the true positive results. A perceptron-threshold of 3 indicates that the pixels follow all the three descriptors. This perceptron-threshold removes some true positive results but the results acquired by this threshold are very reliable and are independent of the increase in threshold value for the SWIR band.

From Table 1 (showing the accuracy percentage of the results) it is clear that, for a perceptron-threshold value of 1, area classified as water body increases with the increase in SWIR-threshold value. It happens because the shallow parts of the water body reflect more than the deeper parts as a result, the reflectance value being directly proportional to the amount of light reflected increases. And hence if we decrease the SWIR-threshold value these shallow parts are removed. For a perceptron-threshold value of 2, the area increases with increase in SWIR-threshold to a certain value after which there is no effect of increase in the SWIR-threshold. Also the magnitude of gradient between two consecutive SWIR-threshold values is less than that for perceptron-threshold of 1. For perceptron-threshold value of 3, there is a very little increase in moving from SWIR values 40 to 50 but after that the value gets stabilized and has no effect on increasing the SWIR value. It happens because the perceptron-threshold of 3 ensures that the pixel follows all the three descriptors. Noise included as a result of increasing the SWIR value is removed by the other two descriptors. Figure 5 shows the result of taking the SWIR threshold of 60 and size parameter of 30. It is also very clear that the increase in size parameter reduces the area classified as water body.

The water bodies extracted by this method are compared with the thematic map prepared for the same area by Prasad et al. [3] using the same satellite. The results are highly correlated with reference to their map. Also the results were validated with the ground survey data using GPS.

4. Conclusions

In the present study we put forward an automatic method to extract water bodies from remote sensing data in addition to the previous methods that used different algorithms and satellite data. The proposed method accurately and quickly discriminated water from nonwater features using ETM data. The characteristic properties of water, such as low reflectance values in the SWIR band, higher value of reflectance in band 4 than in band 2 and a higher reflectance in band 2 than band 5, were used as a basis in the study. These properties supplementing with an NDWI are used as the feature vectors to improve the classification. Besides, binarization of the modified NDWI is done so as to make a selection of threshold easy and to reduce the effects of negative values while calculating the weighted sum. For this study a threshold value of 2 has proved to be the best among the three possible threshold values possible. A threshold value of 1 includes noise and a threshold value of 3 cuts down the true positive results. If only big and permanent water bodies are to be extracted then a size parameter included in the method can come in handy. The proposed perceptron model works competently for ETM data and researchers in the domain of global water research can implement this model to extract the water bodies with acceptable accuracy and precision.

However, the model should be tested for its sensitivity and performance by considering resolution, season of satellite data in addition to land use, and land cover patterns around water bodies, which are not examined in the present study. The study used ETM data from one time period and was able to delineate water bodies to the best level. However it needs to be checked using diverse temporal data and different seasons for the same sensor (ETM). Also there is a need to assess the efficiency of the proposed algorithm with satellite data from varied sensors. Further, for ETM data the study has come up with a threshold value of 2 as best in extracting water bodies, which needs to be standardized across various spectral and spatial resolutions of satellite data. The observation of suppressing true positive values by threshold value of 3 has to be scrutinized using multiple satellite data to define an average threshold for water body extraction. The proposed method was proved to be accurate in isolating water bodies of larger size and there is a need to improve model by emphasizing a common threshold that can extract even smaller water bodies too in the view of the current global water crisis.

The resolution of data definitely matters in precisely delineating water boundaries. The proposed model has to be tested using high spatial resolution data, such as IKONOS and QuickBird, which are preferred by researchers for better accuracy of land feature mapping. Classification of high resolution satellite data is challenging and if suggested model works out well then water bodies from data can be extracted with more accuracy and in a short time.

Along with satellite data the process of water body mining will also depend on the land use land cover of the landscape understudy. The current study used the landscape structure of the urban scenario of the rapidly growing Hyderabad city. It is necessary to examine the capability of the proposed method considering different landscape structure such as water bodies in densely forested landscape, or in a rural scenario with intermingling of water bodies among agricultural lands having different crops or water bodies adjacent to large rivers and streams. In a sense landscape matters because the land use classes around the water bodies vary in different landscapes. Further water bodies are extracted by differentiating water from nonwater features which are boundary classes for the water bodies and discriminating those classes may defy the context of feature spectral overlap.

In view of the above, the future research has great research scope in the field of perceptron model to design and develop a potential unique method that can work on varied satellite data and extract the water body with more accuracy including smaller size water bodies.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors duly acknowledge Global Land Facility Programme for providing satellite data used in the study and give thanks to Anonymous reviewer for his constructive comments.