Abstract

Explicit information of tree species composition provides valuable materials for the management of forests and urban greenness. In recent years, scholars have employed multiple features in tree species classification, so as to identify them from different perspectives. Most studies use different features to classify the target tree species in a specific growth environment and evaluate the classification results. However, the data matching problems have not been discussed; besides, the contributions of different features and the performance of different classifiers have not been systematically compared. Remote sensing technology of the integrated sensors helps to realize the purpose with high time efficiency and low cost. Benefiting from an integrated system which simultaneously acquired the hyperspectral images, LiDAR waveform, and point clouds, this study made a systematic research on different features and classifiers in pixel-wised tree species classification. We extracted the crown height model (CHM) from the airborne LiDAR device and multiple features from the hyperspectral images, including Gabor textural features, gray-level co-occurrence matrix (GLCM) textural features, and vegetation indices. Different experimental schemes were tested at two study areas with different numbers and configurations of tree species. The experimental results demonstrated the effectiveness of Gabor textural features in specific tree species classification in both homogeneous and heterogeneous growing environments. The GLCM textural features did not improve the classification accuracy of tree species when being combined with spectral features. The CHM feature made more contributions to discriminating tree species than vegetation indices. Different classifiers exhibited similar performances, and support vector machine (SVM) produced the highest overall accuracy among all the classifiers.

1. Introduction

The spatial composition of tree species is essential for forest inventory and analysis, which benefits the conservation and exploitation policies of the forests. A great deal of forest management requires the information at tree species level [14]. Both governments and companies have spent a lot of money on forest surveys.

However, it is challenging to discriminate between tree species owing to the diversity of spatial distributions and the complexity of growing environments. In many cases, trees are close to each other, which cause mutual occlusion; besides, the lush weeds and stones bring much noise, which increases the difficulty in segmentation and classification [5]. The observation from a single perspective is unlikely to effectively distinguish fine tree species.

Remote sensing techniques have played an important role in tree species identification [6]. Various sensors have been utilized in tree species classification. Multispectral sensors like Landsat TM and ETM+ have helped to map forest cover and estimate vegetation parameters [7, 8]. Very high resolution (VHR) satellite sensors like Quickbird, Ikonos, and GeoEye have been known to be useful to discriminate tree species with high density of spatial distributions [911]. However, given the lack of spectral information, VHR do not yield good classification results. With the development of satellite sensors, hyperspectral data have provided more spectral bands, thus contribute to more accurate recognition and discrimination of specific species [1214]. More recently, approaches featuring integrating multiple sensors have greatly improved tree species classification. One of the most frequently used methods is to combine spectral instruments with Light Detection and Ranging (LiDAR) sensors [1518]. LiDAR data reveal the vertical structure and the tree height information which can be indicative in tree species identification. Meanwhile, researchers have exploited multiple features in the pixel-wised tree species classification [19, 20]. Different dimensionality reduction (DR) algorithms have been compared for tree species identification [21]. Gray-level co-occurrence matrix (GLCM) [22] has been employed to classify forest tree species by airborne hyperspectral images [23]. In addition, vegetation indices have also helped to identify tree species [24].

However, most studies which took into account multiple features in tree species classification identified tree species under specific growth environment and evaluated the accuracies. But the multifeature matching problem has not been discussed. Besides, few studies detected the contributions of different combinations of features and classifiers in tree species classification, especially in the circumstances with different configurations and growing environments of tree species.

As a result, several problems still exist: (1) separate sensors lead to the inconsistency both in time and space. (2) Studies on a single area with specific growing environment have got restricted conclusions, which may not be representative in a larger geographical range. (3) Multiple features and multiple classifiers should be compared systematically, and more indicative features need to be found.

In this study, the experiments were carried out in two areas with different tree species configurations and growing environments. We extracted multiple features from an integrated system which can simultaneously acquire hyperspectral information and LiDAR data. To detect the contributions of different features in the pixel-wised tree species classification, we designed different schemes considering different groups of features and classifiers.

2. Materials

2.1. Study Area

The work has been carried out in two study areas in Central and East China. One is located in Changshu City, Jiangsu Province, East China (Figure 1(b)). The location is near Chang Jiang River. The climate belongs to a subtropical monsoon. There mainly exist four tree species in the scene, among which Cinnamomum camphora (CC) and Punica granatum (PG) are the dominating tree species.

The other study area is located at Huanshui Park in the north of Henan Province, Central China (Figure 1(a)). The climate belongs to a monsoon of medium latitudes. The tree composition of the area has heterogeneous properties. Seven tree species including both coniferous trees and broadleaf trees cover the main area. Platanus acerifolia (PA) which is the dominating tree species covers about 50% of the whole region (mainly distributed around the roads or space areas). Other species are mainly distributed in the central area.

There are strong contrasts between the growing environments of the two study areas. Changshu area commonly has the same kinds of trees flocking together. However, Huanshui Park has more tree species and heterogeneous compositions. Most tree species in the study areas are a representative of the monsoon climate. In addition, the two study areas have different illumination conditions: Changshu area has a brighter illumination, while Huanshui Park has a darker illumination.

2.2. Data Collection
2.2.1. Airborne Image Data

The images have been acquired through LiCHy (LiDAR-CCD-Hyperspectral) airborne system from the Chinese Academy of Forestry (CAF), which is an integrated system comprising LMS-Q680i, DigiCAM-60, AisaEAGLE sensors, and a GPS/IMU in the same platform. LiCHy is a synthetic system that simultaneously acquires the hyperspectral images, LiDAR waveform and point clouds. It is capable of measuring the vegetation vertical structure, horizontal pattern, and foliar spectra at a very high spatial resolution. The flight altitude is about 1000 meters. Hyperspectral image data acquired by AisaEAGLE airborne hyperspectral sensor have 64 bands with spectral resolution of 9.2 nm and a spectral range from 400 nm to 970 nm. The spatial resolution of the hyperspectral image is 0.6 m for Changshu area and 0.5 m for Huanshui Park. The image size is for Changshu area and for Huanshui Park. The airborne LiDAR scan data have been collected by LMS-Q680i laser scanner at a wavelength of 1550 nm. The LiDAR and HSI data have been georeferenced using GPS and IMU data by the data provider.

The field data and the images for Huanshui Park were obtained on June 2013, while for Changshu study area, they were obtained on August 2013.

2.2.2. Field Data

To record the tree samples on the images and confirm that the samples were reliable, we acquired the ground reference data by field investigation, photo interpretation, and GPS devices. Based on the ground investigation, 761 pixels in the image were collected for Changshu study area and 1173 pixels were collected for the Huanshui Park scene. 10% randomly selected pixels of each class were employed as training samples. The accuracy will be calculated by 10 trails. Tables 1 and 2 list the information of the ground truth for both study areas.

3. Methods

3.1. Feature Extraction
3.1.1. Principal Component Analysis

Principal components analysis (PCA) [25] acquires the information from the original spectral vectors by multiplying a transformation matrix. PCA is exploited as a conventional linear dimensionality reduction (DR) algorithm without class label information and has a small calculated amount. In this study, we extracted the first 10 principal components (PC) from the hyperspectral images. The feature numbers were previously determined. 10 features not only avoided large calculation amount but also contained the major spectral information.

3.1.2. Gabor Textural Features

2D-Gabor textural features have performed very well in many applications of pattern recognition [2628], but have not been widely used in tree species classification. We considered the calculation process in [26]: a Gabor function is defined as where is the image location in the spatial domain. The frequency vector determines the scales and directions of Gabor functions. It is defined as

In our experiment, parameter was fixed to 2. Scale parameter ranged from 0 to 3 and direction parameter ranged from 0 to 7, which meant 4 scales and 8 directions. and were integers. Parameter was fixed to 2π representing the number of oscillations under the Gaussian envelope. According to [28], the textural layers derived from the Gabor filters are the real part of convolving the image with different and

We extracted Gabor textural features from the first PC, for the first PC contained the most information.

3.1.3. Vegetation Indices

Based on hyperspectral bands, we extracted four common vegetation indices (VI), including the normalized difference vegetation index (NDVI), ratio vegetation index (RVI), green chlorophyll index (CIgreen), and enhanced vegetation index (EVI). The computational formulas are shown as follows:

VI has been indicators of the vegetation [2931]. The experiments would discuss whether they are contributive to the improvement of tree species classification. In the experiments, wavelengths of NIR, , GREEN, and were, respectively, 800 nm, 666 nm, 515 nm, and 407 nm.

3.1.4. Canopy Height Model

LiDAR point clouds were labeled by the TerraScan software. A digital surface model (DSM) was derived from LiDAR aboveground returns. A digital elevation model (DEM) was derived from ground surface points with the resolution coinciding with the hyperspectral images. As a result, we modeled the canopy height by subtracting DSM from DEM. In the study, CHM was directly acquired by data processing.

3.1.5. Gray-Level Co-Occurrence Matrix

Gray-level co-occurrence matrix (GLCM) is a common approach of textural presentation featuring by detecting the spatial correlation of the gray levels of the images. GLCM is obtained by calculating the gray-level conditions of two pixels keeping a certain distance in the image. The study extracted eight GLCM features from the first PC, namely mean, variance, homogeneity, contrast, dissimilarity, entropy, second moment, and correlation. The computational process was realized by ENVI software using the default parameters with a window size. The computational formulas can be found in [22].

3.2. Separability Analysis

Separability analysis is an important task of our work. Jeffries-Matusita (J-M) distance [32] is used to analyze the separability of tree species. It provides the information about the separability between two classes. It also reflects the contributions of a certain group of features, e.g., vegetation indices and Gabor textural features. The J-M distance between the th class and the th class can be described as where and means the likelihood probabilities of and . It can be rewritten as where can be described as where and stand for the mean vectors of and . and stand for the covariance matrices of and .

J-M distance ranges from 0 to . We used the squared J-M distance to describe the separability of two classes.

3.3. Classification Scheme
3.3.1. Classifiers

The study employed four classifiers to identify the tree species.

-nearest neighbor (NN) [33] classifier is one of the simplest classifiers in machine learning. A sample to be predicted has nearest samples (known samples) in the feature space. If the majority of the samples belong to one class, the predicted sample belongs to the class.

Maximum-likelihood classifier (MLC) [34] assumes the distribution functions of all classes are subject to the normal distribution. The posterior probabilities of all predicted samples are calculated based on Bayes discriminant criterion [35]. The class corresponding to the highest posterior probability is the predicted class.

Logistic regression (LR) [36, 37] is a kind of nonlinear regression based on sigmoid function. Logistic regression defines the odds of an event as the ratio of the probability of occurrence to that of nonoccurrence. Like MLC, the predicting process is featured by comparing between the probabilities.

Support vector machine (SVM) [3840] has been widely used in classification problems for remote sensing images in recent years, especially for hyperspectral images [41, 42]. SVM is featured by mapping the initial feature space to a higher-dimensional space with the help of a kernel function, so as to “linearly” separate different classes.

3.3.2. Classification Schemes

In the data processing, we masked the tree cover by setting the threshold on CHM and NDVI. Specifically, the areas with CHM values higher than 2 and NDVI values higher than 0.15 were under consideration. Table 3 lists 6 designed schemes, which reflect different groups of features. By taking into account different schemes, we will detect the contributions of different features like PC, VI, CHM, GLCM, and Gabor by means of separability analysis and classification accuracy. In addition, the experiments were carried out, respectively, by 4 conventional classifiers under each scheme. The classifiers were NN, MLC, LR, and SVM. Since the performance of NN classifier is influenced by the value [43, 44], we selected different values in the experiments. The parameter of the NN classifier was, respectively, set to 3, 5, 7, and 10. The penalty factor and Gamma coefficient of SVM were determined by cross-validation. The experiments were based on 10 independent trails.

As discussed above, the overall work of the study can be illustrated by Figure 2.

4. Results

4.1. Separability by Different Features

Tables 4 and 5 list the computational results of J-M distances for both study areas. In general, Gabor textural features were the only group of features by which all the squared J-M distances reached 2 for both study areas, which means Gabor textural features gave the highest separability among all the feature groups. PC and GLCM features also led to high separability between all class pairs, but we discovered that the class pairs had similar separability by PC and GLCM. For example, tree species CC and PG had the lowest separability for Changshu dataset by PC, as well as GLCM; tree species FC had higher separability between all other tree species by PC for the dataset at Huanshui Park, while for GLCM the circumstance was nearly the same. In addition, even though PC and GLCM had present very high J-M values (near to 2), the combination of them did not lead to an obvious increase. So PC and GLCM might have a strong correlation. VI yielded moderate J-M values. CHM was able to discriminate certain tree species, e.g., SB and PG had much higher J-M values by CHM. However, it did not work well with a larger number of tree species.

4.2. Classification Results

Tables 6 and 7 list the classification results of the tree species in both study areas, including the overall accuracy (OA) and kappa index.

It can be discovered from Tables 6 and 7 that we could hardly find good classification performance by only using PC features in specific tree species classification for both study areas, no matter which classifier was employed (scheme 1). Given the enough feature numbers and better properties, Gabor textural features improved the accuracy to the greatest extent (scheme 5) for both datasets. We further analyzed Gabor features with different scales and directions (Figure 3). The accuracy increased fast when the direction parameter was not greater than 4. When it was greater than 4, the accuracy maintained a high level and increased slowly. Though vegetation indices had indicative properties to tell the differences between vegetation and nonvegetation areas, they made less contribution to discriminating specific tree species (scheme 2). CHM features reflected the tree height information, but the feature did not improve the accuracy by a large margin with a higher number of tree species, for the height of several tree species may be close to each other (scheme 3). The classification results were in accordance with the statistical results of the separability analysis. The combination of GLCM did not give rise to the increase in the accuracy in the specific classification of tree species. The results might be caused by a strong correlation between spectral features and GLCM features. PC combined with Gabor textural features had already yielded fine results, while the accuracy had little improvement when VI and CHM were added. The classification results demonstrated the effectiveness of Gabor textural features in the identification of tree species with different numbers and configurations.

Different classifiers exhibited similar performances, and the accuracies had similar variation trends along with different input features. Particularly, NN classifier did not fit Gabor textural features well with a larger . When more features were involved, the accuracy increased slowly with a larger . While for a smaller , the accuracy increased by a larger margin with more features. For example, for the dataset of Huanshui Park, the accuracy in scheme 1 was increased by 16.93% compared with scheme 6 when was 3. However, only 11.51% for , 7.16% for , and 6.10% for . So a small should be selected when more features were combined. Logistic regression exhibited similar performance with NN when was 3. Compared with other classifiers, the results obtained by SVM showed a discernible increase for both study areas. Figure 4 shows the land cover maps of both study areas obtained by SVM for scheme 1 and scheme 6. The white region indicated the nonforest area.

The bar chart (Figure 5) showed that the accuracies increased obviously with the combination of Gabor textural features. However, neither VI nor CHM gave rise to remarkable improvements. Generally, the SVM classifier exhibited a discernible better performance and yielded the highest accuracy when more features were combined.

5. Discussion

5.1. Selection of Feature Groups

According to the experimental results of the two datasets, combined features performed better in both accuracy and visual perspective. The land cover maps (Figure 4) showed significant differences between scheme 1 (PC only) and scheme 6 (PC+Gabor+VI+CHM). Specially, Gabor textural features extracted from the hyperspectral images were strongly recommended in tree species classification. Both the separability analysis and the classification results demonstrated that Gabor textural features made more contributions to discriminating tree species in both heterogeneous and homogeneous distribution. Gabor textural features need little computation time and had enough feature numbers. In the experiments, we found that it was easy to select the parameters of Gabor textural features. When the direction parameter was greater than 4 and the scale parameter was greater than 2, the accuracy maintained a high level. In this study, we selected 4 scales and 8 directions, thus 32 Gabor features were yielded. The number of features was similar to the other applications of Gabor features [27, 28]. The reason for the effectiveness of Gabor textures in tree species classification may also lie in the fact that Gabor features are less sensitive to illumination variations [45]. Woods have complex illumination conditions because of complex spatial distributions. This also explains why Gabor features performed well in the two study sites with different illumination conditions.

We believe the PC features were necessary because they contained the major information of hyperspectral images. VI did not make too much sense in discriminating specific tree species when being concatenated with other features according to the statistics of the accuracy (Tables 6 and 7). VI is effective in the identification between vegetation and nonvegetation, because the vegetation is always green. However, many tree species have a similar green degree, which makes it hard to identify tree species by VI. Nevertheless, we believe VI is necessary in the whole procedure, because it helped to extract the green cover in the first step. CHM always gave more improvement in accuracy than VI when being combined with spectral features. In addition, CHM made different contributions by different numbers and configurations of tree species. Figure 6 showed the CHM distribution of the ground truth for both dataset. CHM only separated certain trees but did not work well in complex situations. Specially, if certain tree species had a wide range of tree height (like FC in Huangshui Park), it may not be easily discriminated from other tree species by CHM. So we recommend employing CHM to identify tree species with a small number of species or to just extract individual trees. The conventional GLCM textural features did not help to improve the accuracy in tree species classification when being concatenated with spectral features.

5.2. Differences in the Performances of the Classifiers

In the specific classification problems of tree species, different classes had similar performances. Samples of different class labels are likely to be located in the neighbors in the feature space, which would influence the prediction of the classifier. SVM outperformed all other classifiers with any combination scheme of features. NN could not exploit Gabor features well with a larger . The neighbors would involve more distant samples which would work on the classifier when was large. In addition, researchers have found that the conventional NN classifier may not handle the high-dimensional data well [46, 47]. The highest overall accuracy did not mean SVM identified all the tree species well. The confusion matrices, respectively, obtained by SVM and NN () classifiers (Tables 8 and 9) showed that SVM better identified PC, GB, PA, and AJ, while for CD, SJ, and FC, SVM did not perform better than NN (). In this case, a classification strategy featuring by selecting and combining the most accurate processing chain (a group of features associated with a certain classifier) for each class would be considered in future studies [48, 49].

6. Conclusions

In this study, multiple features of tree species were extracted from an airborne integrated system, including hyperspectral sensors and LiDAR devices with the same geographic reference. The experiments had been conducted in two study areas in China with different tree species configurations and growing environments. Then, a systematic research had been made to detect the contributions of different features and performances of different classifiers in a pixel-wised tree species classification.

The study highlighted the importance of employing multiple features to identify tree species. Through the experiments, we got which level of accuracy can be achieved by different features and classifiers in the pixel-wised tree species classification. The main conclusions of the study were as follows: (i) multiple features gave better identification results than the single group of features. (ii) Gabor textural features were effective in tree species classification in both heterogeneous and homogeneous growing environments. In contrast, the conventional GLCM textural features made less contribution in tree species classification. However, not all classifiers fit Gabor features well, like the NN classifier with a larger . (iii) When the number of species was small, CHM made more contributions to identifying tree species, while for a bigger number of species and close tree height, CHM did not make much sense. iv) SVM always outperformed other classifiers under different circumstances in tree species classification, but it could not best identify all tree species. v) The integrated system with hyperspectral sensors and LiDAR device is necessary in the regional tree species identification.

This study will improve the detection of the individual trees [5053] when layers of segmented tree objects are merged, thus providing more details for the applications of forests at tree level. As future work, more features should be exploited in tree species classification. Especially, classification maps of tree species from different scales should be integrated, which will be significant work for the tree species classification and the forest management. In addition, multiple features and different classifiers might be considered simultaneously by an approach featuring exploiting the best processing chain (a classifier combined with a group of features) for each specific tree class.

Abbreviations

GLCM:Gray-level co-occurrence matrix
CHM:Crown height model
VHR:Very high resolution
LiDAR:Light detection and ranging
DR:Dimensionality reduction
LiCHy:LiDAR-CCD-hyperspectral
CAF:Chinese Academy of Forestry
PCA:Principal components analysis
PC:Principal components
VI:Vegetation indices
NDVI:Normalized difference vegetation index
RVI:Ratio vegetation index
CIgreen:Green chlorophyll index
EVI:Enhanced vegetation index
DSM:Digital surface model
DEM:Digital elevation model
J-M:Jeffries-Matusita
NN:-nearest neighbor
MLC:Maximum-likelihood classifier
SVM:Support vector machine.

Data Availability

Two kinds of data are involved in this study. One is the airborne image data, including hyperspectral image data with 64 bands and CHM products acquired from LiDAR sensors. All image data are available. The other is field data, including the locations, photos, tree names, and chlorophyll value of the leaf. The field data are available after we submit an application to the Chinese Academy of Forestry (CAF).

Conflicts of Interest

The authors declare no conflict of interest.

Authors’ Contributions

Guang Yang is the main author who proposed the basic idea and completed the experiments and the manuscript. Yaolong Zhao provided the useful suggestions on designing the approaches involved in this study. Baoxin Li and Jiangbo Jing helped to modify the manuscript. Yuanyong Dian provided the data source along with the data processing.

Acknowledgments

The study was funded by the National Natural Science Foundation of China (NSFC) (Grant no. 41871292) and the Guangzhou People’s Livelihood Science and Technology Project (Grant nos. 201803030034 and 201802030008).