Advances in Meteorology

Volume 2016, Article ID 3092671, 14 pages

http://dx.doi.org/10.1155/2016/3092671

## Terrain Segmentation of Greece Using the Spatial and Seasonal Variation of Reference Crop Evapotranspiration

^{1}Department of Life Sciences and Biotechnology, University of Ferrara, Via L. Borsari 46, 44121 Ferrara, Italy

^{2}Environmental Conservation & Management, Faculty of Pure and Applied Sciences, Open University of Cyprus, Latsia, P.O. Box 12794, 2252 Nicosia, Cyprus

^{3}Department of Hydraulics, Soil Science and Agricultural Engineering, Aristotle University of Thessaloniki, University Campus, 54124 Thessaloniki, Greece

Received 13 May 2015; Accepted 8 September 2015

Academic Editor: Jorge E. Gonzalez

Copyright © 2016 Vassilis Aschonitis et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

The study presents a combination of techniques for the integrated analysis of reference crop evapotranspiration ($ET_o$) in a GIS environment. The analysis is performed for Greece and includes the use of (a) the ASCE-standardized Penman-Monteith method for the estimation of 50-year mean monthly $ET_o$, (b) cross-correlation and principal components analysis for the analysis of the spatiotemporal variability of $ET_o$, (c) $k$-means clustering for terrain segmentation into regions with similar temporal variability of $ET_o$, and (d) general linear models for the description of $ET_o$ based on cluster attributes. Cross-correlation revealed a negative correlation of $ET_o$ with both elevation and latitude and a weak positive correlation with longitude. The correlation between $ET_o$ and elevation was maximized during the warm season, while the correlation with latitude was maximized during winter. The first two principal components accounted for 97.9% of the total variance of mean monthly $ET_o$. $k$-means segmented Greece into 11 regions/clusters. The categorical factor of cluster number, together with the parameters of elevation, latitude, and longitude, described $ET_o$ satisfactorily through general linear models, verifying the robustness of the cluster analysis. This research effort can contribute to hydroclimatic studies and to environmental decision support in relation to water resources management in agriculture.

#### 1. Introduction

The evapotranspiration rate that can be achieved under no water restrictions from a well-watered ideal grass surface is called reference crop evapotranspiration ($ET_o$), and it is one of the most important hydroclimatic parameters for the implementation of various hydrological and agricultural applications [1]. Various methods have been developed for the assessment of $ET_o$ [2, 3], and the ASCE-standardized Penman-Monteith method, which is an update of the FAO-56 Penman-Monteith, has been proposed by the ASCE-EWRI Task Committee as the most precise method for $ET_o$ estimations [2].

The intra-annual and spatial variability of $ET_o$ for regions of Greece has been investigated using different methods by many authors [4–18]. The latest and most detailed mean monthly $ET_o$ estimations at country scale, based on the FAO Penman-Monteith method [1], were performed by Dalezios et al. [5] using 66 meteorological stations covering a period of 15 years. The authors used the kriging technique for the development of 50 km resolution grids. Mardikis et al. [11] tested different interpolation methods for $ET_o$ based on 93 meteorological stations and provided improved methods that include the effect of elevation.

In recent years, the use of climatic models has generated high resolution grids of various climatic parameters, which enabled the development of global $ET_o$ maps. Significant works of long-term mean $ET_o$ estimations at global scales have been performed by many authors [19–22]. $ET_o$ maps using the FAO-56 Penman-Monteith and ASCE-standardized Penman-Monteith methods, which are the most demanding methods in terms of climatic data and elaboration, have been developed at resolutions of 10 arc-min [20] and 0.5 degrees [21] (1 degree = 60 arc-min = 3600 arc-sec ≈ 111 km at equator). The highest spatial resolution $ET_o$ maps (30 arc-sec) have been developed [22] using the simple method of Hargreaves [23], which requires only temperature data.
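To make the grid resolutions quoted above more tangible, the following minimal sketch converts an angular resolution to an approximate cell size. It assumes a spherical Earth with 111 km per degree (the figure quoted in the text); the function name `cell_size_km` and the reference latitude of 38° N (roughly mid-Greece) are illustrative choices, not part of the original study.

```python
import math

KM_PER_DEGREE = 111.0  # approximate length of one degree at the equator


def cell_size_km(resolution_arcsec, latitude_deg=0.0):
    """Return the (north-south, east-west) cell size in km for a resolution in arc-sec."""
    degrees = resolution_arcsec / 3600.0
    north_south = degrees * KM_PER_DEGREE
    # East-west spacing shrinks with the cosine of latitude (spherical approximation).
    east_west = north_south * math.cos(math.radians(latitude_deg))
    return north_south, east_west


# Resolutions mentioned in the text, evaluated at an assumed latitude of 38 N.
for label, arcsec in [("30 arc-sec", 30), ("10 arc-min", 600), ("0.5 degrees", 1800)]:
    ns, ew = cell_size_km(arcsec, latitude_deg=38.0)
    print(f"{label}: {ns:.2f} km N-S, {ew:.2f} km E-W")
```

Under these assumptions a 30 arc-sec cell is a little under 1 km north to south, which is why such grids are commonly described as ~1 × 1 km.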

Grids of mean monthly $ET_o$ can be used in an attempt to capture the spatial and seasonal patterns and to segment the terrain into regions with distinct seasonal $ET_o$ variability. Common approaches for such analysis are the typical multivariate methods such as correlation analysis, principal components, factorial kriging analysis, multi-Gaussian co-kriging, regression analysis, and so forth, which have successfully been used for other environmental quality parameters [24–30]. In addition, terrain segmentation techniques have been applied to multitemporal land surface temperature (LST) data in an attempt to define subregions with different seasonal LST variability, to assess its sensitivity to climatic change by revealing thermal anomalies, and to support environmental analysis [31–36]. Similar regionalization and segmentation approaches have also been applied successfully in order to define subregions with different seasonal precipitation patterns [37–39].

The objectives of the study are (a) to develop high resolution grids (30 arc-sec) of 50-year mean monthly $ET_o$ for Greece using the ASCE-standardized Penman-Monteith method, (b) to analyze the spatiotemporal variability of $ET_o$, and (c) to provide a terrain segmentation scheme for Greece based on the spatiotemporal variation of $ET_o$. The techniques of cross-correlation, principal components analysis (PCA), cluster analysis, and general linear models (GLMs) were used. Cross-correlation and PCA are used to capture the dependence of the spatiotemporal variation of mean monthly $ET_o$ on geographical attributes (latitude and longitude), topography (elevation), and seasonality. $k$-means clustering is used to segment the Greek territory into regions/clusters in which the mean monthly $ET_o$ observations approximate representative mean values with similar temporal variability. GLMs are used (a) to parameterize the effect of clusters and, together with elevation, latitude, and longitude, to build models that can provide estimations of mean monthly $ET_o$, (b) to verify the robustness of the cluster analysis, and (c) to assess the contribution/effect of each cluster in the mean monthly $ET_o$ estimations. This effort will be valuable in assisting environmental decision support and agricultural planning in relation to water resources management.

#### 2. Data and Methods

##### 2.1. Study Area and Data

The study area is Greece (South-East Europe), which is confined between the 34° and 42° parallels N, with a meridional extent from 19° to 28° E. Greece has a typical Mediterranean climate: relatively cold and rainy winters, relatively warm and dry summers, and, generally, extended periods of sunshine. The spatial heterogeneity of climate is mainly attributed to the mountain range of Pindos located in the central part of Greece, while significant differences in the severity of winter are observed between the central-north continental territory and the islands, where the winter is milder (Hellenic National Meteorological Service: http://www.hnms.gr/hnms/english/climatology/climatology_html).

The analysis was based on climatic data, which were obtained from the following databases:

(i) Database of [40]: this database provides mean monthly values for the parameters of maximum, minimum, and mean temperature at 30 arc-sec spatial resolution (~1 × 1 km). The data are provided as grids of mean monthly values of the period 1950–2000 (http://www.worldclim.org/). The database also includes a revised version of the GTOPO30 DEM based on SRTM DEM at 30 arc-sec spatial resolution. The data were produced using observation-based datasets after elaboration with the thin-plate smoothing spline algorithm implemented in the ANUSPLIN package for interpolation, using latitude, longitude, and elevation as independent variables.

(ii) Database of [41]: this database provides mean monthly values of parameters such as solar radiation, specific humidity, wind speed at 10 m above surface (which was converted to 2 m above surface), precipitation, and temperature for the period 1948–2006 at 0.5 degrees spatial resolution. The dataset was constructed by combining a suite of global observation-based datasets and it is available in the form of NetCDF files (Network Common Data Form) of monthly values of each year for the period 1948–2006 (http://hydrology.princeton.edu/data.pgf.php). Resampling to 30 arc-sec spatial resolution was performed using the bilinear resampling scheme. The data were converted to grids of mean monthly values of the period 1950–2000.
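The conversion of wind speed from 10 m to 2 m mentioned above can be sketched as follows. The text does not state which formula was applied; this example assumes the logarithmic wind profile given in FAO-56 for a short grass surface, and the function name `wind_speed_at_2m` is hypothetical.

```python
import math


def wind_speed_at_2m(u_z, z=10.0):
    """Convert wind speed u_z (m/s) measured at height z (m) to its 2 m equivalent."""
    # Assumed FAO-56 logarithmic wind profile; not confirmed as the study's method.
    return u_z * 4.87 / math.log(67.8 * z - 5.42)


# Example: a 10 m wind speed of 3 m/s is scaled down by a factor of about 0.75.
print(wind_speed_at_2m(3.0))
```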

##### 2.2. Reference Crop Evapotranspiration

The estimation of $ET_o$ using the ASCE-standardized Penman-Monteith method is performed by the following equation [2]:

$$ET_o = \frac{0.408\,\Delta\,(R_n - G) + \gamma\,\dfrac{C_n}{T + 273}\,u_2\,(e_s - e_a)}{\Delta + \gamma\,(1 + C_d\,u_2)} \qquad (1)$$

where $ET_o$ is the daily reference crop evapotranspiration (mm d^{−1}), $R_n$ is the daily net radiation at the crop surface (MJ m^{−2} d^{−1}), $u_2$ is the mean daily wind speed at 2 m height above the soil surface (m s^{−1}), $T$ is the mean daily air temperature (°C), $G$ is the daily soil heat flux density at the soil surface (MJ m^{−2} d^{−1}), $e_s$ is the mean daily saturation vapour pressure (kPa), $e_a$ is the mean daily actual vapour pressure (kPa), $\Delta$ is the slope of the saturation vapour pressure-temperature curve (kPa °C^{−1}), $\gamma$ is the psychrometric constant (kPa °C^{−1}), and $C_n$ and $C_d$ are constants, which vary according to the time step and the reference crop type and describe the bulk surface resistance and aerodynamic roughness. The short reference crop corresponds to clipped grass of 12 cm height and surface resistance of 70 s m^{−1}, where the constants $C_n$ and $C_d$ have the values 900 and 0.34, respectively. The tall reference crop corresponds to full cover alfalfa of 50 cm height and surface resistance of 45 s m^{−1}, where the constants $C_n$ and $C_d$ have the values 1600 and 0.38, respectively [2].

In this study, $ET_o$ is estimated using (1) for the commonly used short reference crop as follows: (a) mean monthly values of maximum, minimum, and mean temperature were obtained from the database of [40], while (b) wind speed, specific humidity, and solar radiation were obtained from the database of [41]. The specific humidity was converted to actual vapour pressure [42] before its use. The equations used for intercalculations in the ASCE method are given in [2, 43, 44]. All the calculations were performed in the ArcGIS 9.3 (ESRI) environment. Twelve rasters (one for each month) of mean monthly $ET_o$ plus one of mean annual $ET_o$ for the period 1950–2000 were developed for the Greek territory.
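As an illustration of how (1) can be evaluated for a single grid cell, the sketch below implements the ASCE-standardized equation for the short reference crop. The helper formulas (saturation vapour pressure, slope of the vapour pressure curve, atmospheric pressure from elevation, and the conversion of specific humidity to actual vapour pressure) are the commonly used FAO-56/ASCE expressions and are assumptions here, since the text only cites [2, 42–44] for them. Net radiation is taken as a given input rather than derived from solar radiation, and all function names are hypothetical.

```python
import math


def sat_vapour_pressure(t_c):
    """Saturation vapour pressure (kPa) at air temperature t_c (deg C)."""
    return 0.6108 * math.exp(17.27 * t_c / (t_c + 237.3))


def slope_svp(t_c):
    """Slope of the saturation vapour pressure curve (kPa per deg C)."""
    return 4098.0 * sat_vapour_pressure(t_c) / (t_c + 237.3) ** 2


def atmospheric_pressure(elev_m):
    """Atmospheric pressure (kPa) estimated from elevation (m)."""
    return 101.3 * ((293.0 - 0.0065 * elev_m) / 293.0) ** 5.26


def actual_vapour_pressure(q, pressure_kpa):
    """Actual vapour pressure (kPa) from specific humidity q (kg/kg); assumed conversion."""
    return q * pressure_kpa / (0.622 + 0.378 * q)


def asce_pm(rn, g, t_mean, u2, es, ea, delta, gamma, cn=900.0, cd=0.34):
    """Equation (1); the default cn and cd are the short reference crop constants."""
    numerator = 0.408 * delta * (rn - g) + gamma * cn / (t_mean + 273.0) * u2 * (es - ea)
    return numerator / (delta + gamma * (1.0 + cd * u2))


def eto_short_reference(t_max, t_min, t_mean, rn, u2, q, elev_m, g=0.0):
    """ET_o (mm/d) for one cell from temperatures, net radiation, wind, and humidity."""
    pressure = atmospheric_pressure(elev_m)
    gamma = 0.000665 * pressure  # psychrometric constant (kPa per deg C)
    es = 0.5 * (sat_vapour_pressure(t_max) + sat_vapour_pressure(t_min))
    ea = actual_vapour_pressure(q, pressure)
    return asce_pm(rn, g, t_mean, u2, es, ea, slope_svp(t_mean), gamma)


# Illustrative inputs only (not data from the study).
print(eto_short_reference(t_max=21.5, t_min=12.3, t_mean=16.9,
                          rn=13.28, u2=2.078, q=0.0087, elev_m=100.0))
```

In a raster workflow the same arithmetic would be applied cell by cell to the monthly grids; the scalar form is used here only to keep the sketch short.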

##### 2.3. Methodology for Terrain Segmentation Based on the Annual and Seasonal Variation of a Hydroclimatic Parameter

A set of four statistical methods, namely, cross-correlation analysis, principal component analysis, $k$-means cluster analysis, and general linear models [45–48], is used in this study. Correlation analysis and principal component analysis (PCA) reveal the temporal and spatial patterns evident within a multitemporal dataset. PCA is a linear transformation technique that produces a set of images known as principal components (PCs), which are uncorrelated with one another and are ordered in terms of the amount of variance they explain from the original image set [49, 50]. PCs are computed from the linear combination of eigenvectors and the corresponding pixel values of the initial images [51]. PCA has traditionally been used in remote sensing as a means of data compaction, since it is usually observed that the first 2 or 3 components are able to explain the majority of the variability in data values. Later components thus tend to be dominated by noise effects. By rejecting these later components, the volume of data is reduced with no appreciable loss of information [52]. Standardized principal components analysis [53] is applied (the data of each month are standardized to mean 0.0 and standard deviation 1.0), so that the images are not weighted according to their variance.
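The standardized PCA described above can be sketched with NumPy as an eigendecomposition of the correlation matrix of the twelve monthly layers. This is a minimal illustration on synthetic data, not the software procedure used in the study; the function name `standardized_pca` and the synthetic seasonal curve are assumptions.

```python
import numpy as np


def standardized_pca(stack):
    """PCA of an (n_pixels, n_months) array after per-month standardization.

    Returns the explained-variance ratios, the eigenvectors (one per column),
    and the PC scores of every pixel.
    """
    z = (stack - stack.mean(axis=0)) / stack.std(axis=0)
    corr = (z.T @ z) / z.shape[0]            # correlation matrix of the monthly layers
    eigvals, eigvecs = np.linalg.eigh(corr)  # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]        # reorder by decreasing explained variance
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    scores = z @ eigvecs                     # linear combination of eigenvectors and pixel values
    return eigvals / eigvals.sum(), eigvecs, scores


# Synthetic example: one seasonal curve scaled per pixel, plus noise.
rng = np.random.default_rng(0)
months = np.arange(12)
seasonal = 60.0 + 90.0 * np.sin(np.pi * months / 11.0)
amplitude = rng.uniform(0.7, 1.3, size=(500, 1))
stack = amplitude * seasonal + rng.normal(0.0, 2.0, size=(500, 12))

ratio, vectors, scores = standardized_pca(stack)
print("share of variance in the first two PCs:", ratio[:2].sum())
```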

$k$-means cluster analysis was used to partition the multitemporal (12-dimensional) imagery of $ET_o$ into $k$ exclusive clusters. It begins by initializing $k$ cluster centroids, then assigns each pixel to the cluster whose centroid is nearest, updates the cluster centroids, and repeats the process until the stopping criteria are satisfied [51]. The analysis uses the Euclidean distance for calculating the distances between pixels and cluster centroids. The underlying idea of cluster analysis is that the cluster centroids represent the mean expression of the derived clusters. Clustering of the multitemporal data sets is therefore expected to define groups of pixels with a rather common centroid curve that expresses their average monthly $ET_o$ variability [32]. Elevation, latitude, and longitude statistics per cluster are computed in order to assist interpretation. Finally, the clusters were interpreted according to their centroids and their spatial arrangement [36].
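The iterative procedure described above (initialize centroids, assign pixels by Euclidean distance, update centroids, repeat) can be sketched as follows. The random initialization from data points and the convergence test on centroid movement are assumptions for illustration; the study's software may use different initialization and stopping rules.

```python
import numpy as np


def kmeans(pixels, k, max_iter=100, seed=0):
    """Basic k-means on an (n_pixels, 12) array of monthly values."""
    rng = np.random.default_rng(seed)
    # Assumed initialization: k distinct pixels chosen at random.
    centroids = pixels[rng.choice(len(pixels), size=k, replace=False)]
    for _ in range(max_iter):
        # Euclidean distance from every pixel to every centroid.
        dist = np.linalg.norm(pixels[:, None, :] - centroids[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        # Update each centroid to the mean monthly curve of its members.
        new_centroids = np.array([
            pixels[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break  # stopping criterion: centroids no longer move
        centroids = new_centroids
    return labels, centroids


# Synthetic example: two groups of pixels with different seasonal amplitude.
rng = np.random.default_rng(1)
curve = 50.0 + 100.0 * np.sin(np.pi * np.arange(12) / 11.0)
low = curve + rng.normal(0.0, 2.0, size=(40, 12))
high = 1.5 * curve + rng.normal(0.0, 2.0, size=(40, 12))
labels, centroids = kmeans(np.vstack([low, high]), k=2)
print(centroids.round(1))
```

Each row of `centroids` is the mean monthly curve of one cluster, which is the quantity used to interpret the clusters.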

General multiple linear regression analysis was performed using Statgraphics Centurion software (StatPoint Technologies). The parameterization of segmentation effects was made using an integer cluster number CN ranging from 1 to $k$, where $k$ is the number of clusters. The CN categorical factor together with the quantitative factors of latitude (Lat), longitude (Lon), and elevation ($z$) were used as independent variables to describe the mean monthly values of $ET_o$. The values of the aforementioned parameters were extracted from 29765 randomly selected positions homogeneously distributed over the entire Greek territory. The selection was performed using the Sampling Design Tool developed by NOAA (National Oceanic and Atmospheric Administration of USA) based on the procedure of “stratified random sampling.” For each month, the mean values of the dependent variable (mean monthly $ET_o$) of the sampling positions belonging to each cluster were compared using ANOVA-LSD. This analysis was performed in order to verify that the sampling procedure retained the differences between clusters, which were derived by $k$-means using the whole number of pixels of each cluster. Square root transformation was used for the Lat, Lon, and $z$ values, while BoxCox transformation [54] was used for the mean monthly $ET_o$ values, to avoid or reduce deviations from normality of the dependent and independent variables. The selection of the parameters for the regression model was performed using their variance inflation factor (VIF) in order to avoid multicollinearity effects. VIF values above 10 are usually considered to indicate serious multicollinearity [45].
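Thus, the following linear model was built to describe the mean monthly $ET_o$ (mm) for each month:

$$ET_{o,i}^{BC} = a_i + \sum_{j=1}^{n} b_{ij}\,I_j + c_i\,\sqrt{\mathrm{Lon}} + d_i\,\sqrt{\mathrm{Lat}} + e_i\,\sqrt{z} \qquad (2)$$

where $ET_{o,i}^{BC}$ is the BoxCox-transformed mean monthly $ET_o$, $a_i$ is the intercept of the regression model, $b_{ij}$ are the regression coefficients of the categorical factor $I_j$, which is regulated by the cluster number CN, $c_i$, $d_i$, and $e_i$ are the regression coefficients of the quantitative factors Lon, Lat (decimal degrees), and elevation $z$ (m), respectively, and $i$ is the month (starting with $i = 1$ for January and ending with $i = 12$ for December). The value of $n$ is equal to the number of clusters $k$ minus 1 ($n = k - 1$), whereas the categorical factor $I_j$ is regulated by the cluster number CN as follows: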

The general form of the BoxCox transformation of the dependent variable $ET_{o,i}$ for each month $i$ is given by the following formula [54]:

$$ET_{o,i}^{BC} = \frac{\left(ET_{o,i} + \lambda_2\right)^{\lambda_1} - 1}{\lambda_1\,g_i^{\lambda_1 - 1}}, \qquad g_i = \left[\prod_{m=1}^{N_i}\left(ET_{o,i,m} + \lambda_2\right)\right]^{1/N_i}$$

where $\lambda_1$ and $\lambda_2$ are the power and shift parameters of the BoxCox transformation, respectively, and $N_i$ is the number of observations (samples) for month $i$. Note that $g_i$ is the geometric mean of $ET_{o,i} + \lambda_2$. In order to improve the efficiency of the model, the power parameter $\lambda_1$ was optimized, while the shift parameter $\lambda_2$ was set to 0, based on the default optimization procedure incorporated in the statistical software. The optimal transformation is the one that minimizes the mean squared error of the transformed dependent variable [54].
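A minimal sketch of a geometric-mean-normalized BoxCox transformation is given below. It assumes the normalized two-parameter form of Box and Cox, with the logarithmic limit for a zero power parameter; the exact expression implemented in the statistical software may differ (for example, by an additive constant), and the function name `boxcox_normalized` is hypothetical.

```python
import math


def boxcox_normalized(values, power, shift=0.0):
    """Apply a BoxCox transformation scaled by the geometric mean of the shifted data."""
    shifted = [v + shift for v in values]
    # Geometric mean of the shifted observations.
    g = math.exp(sum(math.log(s) for s in shifted) / len(shifted))
    if power == 0.0:
        return [g * math.log(s) for s in shifted]  # limiting case as power -> 0
    return [(s ** power - 1.0) / (power * g ** (power - 1.0)) for s in shifted]


# Illustrative values only (not data from the study); shift set to 0 as in the text.
print(boxcox_normalized([20.0, 45.0, 110.0, 160.0], power=0.5))
```

The normalization by the geometric mean keeps the transformed values on a comparable scale across different power parameters, which is what allows the mean squared errors of candidate transformations to be compared directly.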

Outliers were not removed during the regression analysis because of the large number of data (there is no automatic procedure to remove outliers). Autocorrelation of the residuals was tested using the Durbin-Watson test [55, 56], which provides values between 0 and 4, where a value of 2 indicates no autocorrelation. ANOVA was used to estimate the statistical significance of the regression model, while the squared correlation coefficient adjusted for degrees of freedom was used to evaluate the explanatory power of the model. Type III sums of squares analysis was used to estimate the statistical significance of the independent variables CN, Lat, Lon, and elevation ($z$).
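The regression workflow described in this section can be sketched with ordinary least squares in NumPy: a categorical cluster factor expanded into indicator columns, square-root-transformed covariates, the adjusted squared correlation coefficient, and the Durbin-Watson statistic. The data below are synthetic, the 0/1 indicator coding with the last cluster as reference level is an assumption (the coding used by the statistical software is not confirmed here), and the dependent variable stands in for the transformed mean monthly values.

```python
import numpy as np


def design_matrix(cn, lon, lat, elev, k):
    """Intercept, k-1 cluster indicators, and square-root-transformed covariates."""
    # Assumed 0/1 coding: cluster k is the reference level.
    indicators = np.column_stack([(cn == j).astype(float) for j in range(1, k)])
    return np.column_stack([np.ones(len(cn)), indicators,
                            np.sqrt(lon), np.sqrt(lat), np.sqrt(elev)])


def fit_ols(X, y):
    """Least-squares fit returning coefficients, adjusted R^2, and Durbin-Watson."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    n, p = X.shape
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    r2_adj = 1.0 - (ss_res / (n - p)) / (ss_tot / (n - 1))
    durbin_watson = float((np.diff(resid) ** 2).sum() / ss_res)
    return coef, r2_adj, durbin_watson


# Synthetic sample: 3 clusters, coordinates within the extent of Greece.
rng = np.random.default_rng(42)
n_samples, k = 600, 3
cn = rng.integers(1, k + 1, n_samples)
lon = rng.uniform(19.0, 28.0, n_samples)
lat = rng.uniform(34.0, 42.0, n_samples)
elev = rng.uniform(0.0, 2000.0, n_samples)
cluster_effect = np.array([5.0, -3.0, 0.0])[cn - 1]
y = (50.0 + cluster_effect + 2.0 * np.sqrt(lon) - 4.0 * np.sqrt(lat)
     - 0.3 * np.sqrt(elev) + rng.normal(0.0, 0.5, n_samples))

X = design_matrix(cn, lon, lat, elev, k)
coef, r2_adj, dw = fit_ols(X, y)
print("adjusted R^2:", r2_adj, "Durbin-Watson:", dw)
```

With independent synthetic noise the Durbin-Watson statistic is expected to fall close to 2, the value that indicates no autocorrelation of the residuals.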

#### 3. Results and Discussion

##### 3.1. Spatial Variation of Mean Annual $ET_o$

The 30 arc-sec resolution (~1 × 1 km) maps of elevation and mean reference crop evapotranspiration for the period 1950–2000 estimated by the ASCE standardized Penman-Monteith method are given in Figures 1(a) and 1(b), respectively, while the frequency distributions of pixel values are given, respectively, in Figures 2(a) and 2(b).