Abstract

Recently, numerous studies have attempted to determine forest height using remote sensing techniques that not only have the benefits of fast data acquisition, processing, and analysis but are also cost-effective. However, if there was insufficient data to apply the latest remote sensing techniques, we need to consider many kinds of datasets as possible. In this study, we tried to determine forest height using discrete-return LiDAR data, SRTM, satellite L-band SAR data, and Optical data. We experimented with the differences between LiDAR DSM and DTM, as well as SRTM DSM and LiDAR DTM. In addition, we applied an SBAS algorithm and linear regression to the dataset. From the quantitative evaluation, the RMSE and R2 of the LiDAR-derived forest height (3.22 m and 0.43, resp.) and the SRTM-derived forest height (2.90 m and 0.50, resp.) were both reasonably good, especially when we consider data acquisition time differences and measurement errors in mountainous areas. Moreover, we slightly improved the RMSE and R2 from 2.90 m and 0.50, respectively, to 2.75 m and 0.54, respectively, by correcting the SRTM using the SBAS algorithm. Furthermore, we merged the datasets using linear regression and obtained improved forest heights with RMSE and R2 values of 2.68 m and 0.56, respectively. To generate a forest height map, we used NDVI from Optical imagery and masked heights below 2 m from each sensor. Thus, we excluded urban areas, “bare earth surfaces,” and mountain streams from each sensor’s imagery. Finally, we generated a forest height map by overlapping the datasets. The results of this study indicate that each sensor has the potential for not only determining forest height but also extracting complementary forest area information. Furthermore, this study demonstrates the potential for improvement using the SBAS algorithm and linear regression.

1. Introduction

Recently, remote sensing techniques have received a lot of attention for extracting forest information. Remote sensing techniques are efficient methods for extracting forest information not only due to fast data acquisition, processing, and analysis but also because they are cost-effective. Of the forest information which can be acquired using remote sensing data, forest height plays an important role for estimation of forest volume and aboveground biomass [1]. For these reasons, numerous studies have attempted to estimate forest height using remote sensing data such as LiDAR (Light Detection and Ranging), Optical images, and SAR (Synthetic Aperture Radar).

Each sensor has different wavelength band for data acquisition. For example, Optical images operate with a passive sensor that uses the visible spectrum, while on the other hand, LiDAR and SAR operate active sensors using near-infrared and microwave bands, respectively. In this sense, merging the various remote sensing data has the potential to provide reliable results for forest height extraction.

LiDAR is one of the most powerful methods for estimating forest height; therefore, many studies have attempted to merge LiDAR with Optical imagery. Shendryk et al. [2] combined LiDAR data and multispectral image (SPOT-5) to estimate forest canopy height using linear regression. Erdody and Moskal [3] merged airborne LiDAR and high-resolution color near-infrared imagery to estimate canopy fuel metrics in a ponderosa pine. In addition to the fusion of LiDAR and Optical images, the merging of SAR and Optical imagery has been studied [48].

It is worth mentioning that there are several drawbacks to each sensor. Though it has been successfully used for canopy height mapping, LiDAR data are difficult to update regularly because of the high cost [9]. Additionally, there have been studies concerned specifically with full-waveform LiDAR [10], but this data is generally not commercially available. In the case of Optical data, it has the benefit of extracting horizontal area rather than vertical structure. Meanwhile, most studies which use SAR data focus on polarimetric interferometry ([11], Damawan et al., 2014) or with using the difference in wavelengths sensitivity, such as the difference between the X-band and P-band, to calculate elevation [7]. However, it is difficult to acquire polarization and differing wavelength data, except from an airborne SAR system.

In this paper, we used available data, such as discrete-return LiDAR data, SRTM (Shuttle Radar Topography Mission), satellite L-band SAR data, and Optical data, instead of the latest data, such as full-waveform LiDAR and X- and P-band SAR data, which have been shown in previous studies to be well-suited for forest height calculation. LiDAR DTM (Digital Terrain Model) provided us with consistent terrain for a base map, and SRTM, which was constructed from short 6 cm wavelength, provided reflections from the surface of the forest. Abundant SAR satellite data allowed us to update the SRTM using the SBAS (Small Baseline Subset) algorithm [12] for the terrain error. Generally, Optical images has been primarily used for land cover classification, especially, NDVI (Normalized Difference Vegetation Index) which has been validated as an effective measure for identifying pixels characterized as vegetation. In summary, the determination of forest height proceeded as follows: (1) we used a variety of remote sensing data, not only air-flight data but also SAR satellite data to overcome data insufficiencies in the absence of available state-of-the-art data. (2) By applying the SBAS algorithm to the SAR images to reduce the topography error and then estimate the forest height, we extended the application of SAR satellite images based on the SBAS algorithm.

This paper is organized as follows. First, we describe the collection of the field data and provide details of the remote sensing data acquisition (LiDAR, SAR, and Optical image). We then present the methods used in the processing, and finally, we discuss the results.

2. Study Area and Data

2.1. Study Area

The study area for this investigation was an approximately 20 km2 area within the Jangsan Mountain in Busan, Korea (Figure 1). The altitude of the mountain is 634 m and the shape of the mountain is entirely conical. This area is primarily composed of pine forest with reed grass at the top of the hill. Pine trees are easy to identify top of the tree because of its shape. Therefore, measuring pine tree height is easier than broad-leaved trees (Figure 2). Also, in the area, we could collect remote-sensed data such as SAR, LiDAR, and Optical data. An in situ forest height survey was conducted throughout the entire area during September and October 2012 using laser height measurements except for artificial construction (Table 1). We determined forest height by averaging laser height measurements taken three times in opposing directions. Also, tree species composition has a major effect on forest height measurements. Therefore, we measured 90 points of in situ tree heights to only pine trees to compare with remote-sensed datasets (Table 2).

2.2. Remote-Sensed Data

We obtained SAR images from ALOS PALSAR over the study area for the period between 2007 and 2010 (Table 3). We produced 30 interferograms and used SRTM data not only to correct the topographic phase contribution but to also to estimate forest height as a DSM (Digital Surface Model). The LiDAR data were acquired in February 2007 with an LEICA ALS50 sensor, and aerial photography was acquired to generate NDVI data over the study area in April 2012. A variety of remote sensors has different resolution. In our study, we used a 5 m resolution LiDAR elevation model, 2.5 m aerial photography, 30 m SRTM DEM and 30 m SAR images using multi-look. Therefore, we conducted down- and upsampling for 5 m resolution after image processing.

During the processing, remote-sensed datasets include tree height error induced by acquisition time difference and sensor specification. However, these error sources are at cm level and do not need to be considered while ground-based tree height measurements might be more important, especially in mountain topography.

3. Method

3.1. Extract NDVI Using Optical Images

Optical sensors give pixel values that represent only the intensity of the light that is directly reflected from the forest surface. Therefore, this sensor is usually used for horizontal forest parameters, such as forest area, by interacting with the entire tree in particular bands. These distinct bands allow us to classify land cover, especially using the NDVI method. NDVI is defined as [13] where and are spectral reflectance measurements acquired in the near-infrared and red (visible) regions, respectively.

Several arguments have been made that NDVI is related not only to classification but also to tree height. Wang et al. [14] found that tree height increases are related to changes in NDVI from the previous year using NOAA/AVHRR (National Oceanic and Atmospheric Administration/Advanced Very High Resolution Radiometer) satellite imagery. Therefore, we used NDVI, calculated using high-geometrical-resolution multispectral aerial images to classify forest area and to extract forest height. We obtained an NDVI map, downsampled to 5 m resolution using cubic interpolation. After knowing that NDVI has considerable noise in complex urban areas, we applied 11 × 11 median filtering to remove this noise (Figure 3).

3.2. LiDAR

LiDAR systems measure the range from a sensor to a target using the TOF (time of flight) of a high-frequency laser pulse. Each laser pulse is recorded as a point and provides the vertical difference between the last return (assumed as DTM) and the first return (assumed as DSM). Forest height can be determined using the difference between the DTM and the DSM [15] (Figure 4). We used discrete-return LiDAR data which was one of the available remote sensing datasets. Another role of LiDAR data is that it provides high precision DTM, which rarely changes compared to DSM.

3.3. SBAS Using L-Band SAR

As seen above, in cases of forest height estimation using SAR, most studies are focused on polarimetry and the penetration capabilities of the bands. Considered in this framework, it is necessary to use polarization data, such as TerraSAR-X, and air vehicles with long wavelengths, such as a P-band SAR sensor. These systems, however, are still experimental or generally are not commercially available in our study area; therefore, instead of these systems, we used ALOS PALSAR SAR imagery to overcome data deficiency and we increased accuracy using the SBAS algorithm. We generated a differential interferogram to apply the SBAS algorithm by using the interferograms simulated from the SRTM DEM that were multi-looked using three looks in range and twelve looks in azimuth, respectively. We used SRTM DEM for the differential interferogram, in spite of having higher accuracy LiDAR DTM because the latter covered only a small part of the SAR image. As a result, we estimated the topography error using the SBAS algorithm with the differential interferogram. For the SBAS algorithm, the differential phase is as follows: where is the phase and are time variables for master and slave acquisition time, is surface displacement, and are azimuth and slant-range pixel coordinates, is radar wavelength, is the perpendicular baseline, R is the slant-range distance from sensor to pixel, and is the look angle. The first term accounts for the deformation phase, the second term accounts for topography error, the third is the atmosphere effect on the phase, and the last term accounts for phase noise by decorrelation or noise effect.

From (2), the first term of the deformation phase was not considered because the study area is stable and the third term of the atmosphere effect was neglected because we assumed that the sum of the atmospheric effects are random with respect to time similar to phase noise, so that is and . As a result, we defined topography error (Δz) iteratively as [16]

The topography error shown in Figure 5(b) from the SBAS algorithm represents the C-band-derived SRTM DEM error. Therefore, we estimated forest height from the difference between the SRTM DEM and the LiDAR DTM using a regression model (Figure 5(a)) and then we made a forest height map using DEM correction (Figure 5(c)). Moreover, we estimated and analyzed forest height using linear regression on combined LiDAR, SAR, and NDVI data.

4. Results and Discussion

4.1. Estimated Forest Height

We conducted comparisons between the SRTM-derived forest height and the topographically corrected SRTM-derived forest height with respect to in situ dataset (Figures 6(a) and 6(b)). The root mean square error of the estimated forest height from the first method to the second method was a little improved, from 2.90 to 2.75 as well as the R2 increasing from 0.50 to 0.54. In the case of the LiDAR data, the RMSE and R2 were calculated at 3.22 m and 0.43, respectively (Figure 6(c)); however, when the data acquisition differences between LiDAR and the in situ data are taken into consideration, particularly for the mountainous topography of the study area, these results show that this method is viable for determining forest height.

To improve the results, we merged the data through linear combination as follows: where and f are coefficients and c and f are offset values obtained from each tree height data and a least squares technique.

From (4), the RMSE of the forest height derived from the combined data improved by approximately 10 cm and 50 cm from the SAR-derived forest height and the LiDAR-derived forest height, respectively. This approach may be the most viable method to determine forest height (Figure 7(a)). In the case of NDVI, there have been accounts that there is a relationship between NDVI and tree height [14]; however, it was not possible to estimate forest height using this method for this study because most of the trees in the study area have minor stem diameters and tree rings (Figure 7(b)).

4.2. Forest Area Mapping

We extracted forest height based on LiDAR DTM; however, this method led to unexpected forest areas such as urban area (artificial buildings) or field of reeds. To remove urban areas from the forest height map, we calculated NDVI values using aerial Optical imagery which has NIR and R-bands. First, we masked below 2 m in height from LiDAR- and SAR-derived height maps because we only collect over 2 m in height trees. From the first masking results, the urban area remained as a forest area having both LiDAR- and SAR-derived height maps in common because artificial structures were above the threshold. Furthermore, SAR dataset excluded mountain streams in valley areas. To be precise, there are thin forest areas along the streams that the L-band penetrates more deeply than dense forest areas. Additionally, the “bald earth surface” areas signified construction dated after the SRTM DEM was acquired because the “bald earth surfaces” are not presented in the SAR-derived forest area map. Second, we used an NDVI value for threshold 0.4 as a nonforest area to remove the urban area.

Finally, we generated a forest area map by overlapping three datasets (Figure 8) and generated a forest height map using LiDAR, SAR, NDVI data, and a linear combination of each dataset and forest area (Figure 9). To compare results, we generated a height profile (Figure 10). The profile shows that LiDAR-derived forest height well represents detailed changes while SAR-derived forest height represents the overall pattern well. If we used two merged datasets, we represent not only overall pattern but also detailed change over the forest area. However, NDVI factor have no influence to forest height retrieval.

5. Conclusion

Remote sensing techniques are efficient methods of extracting forest height quickly, with cost-effective tools. However, there is data insufficiency for our study area with regard to the latest data, such as full-waveform LiDAR and X- and P-band SAR data. Instead, we determined forest height using common remote data, such as discrete-return LiDAR data, SRTM, satellite L-band SAR data, and Optical data. Moreover, in order to improve the results, we applied the SBAS algorithm for SRTM DEM error correction and combined each dataset using linear regression. We then extracted the forest areas by overlapping NDVI and forest height data above 2 m allowing the sensors to complement each other. Using this procedure, we generated a forest height map as an end product with an RMSE accuracy of 2.68 m.

In general terms, this study shows the various ways in generating a forest height map using limited data. In spite of different acquisition times, it should be noted that each sensor has the potential for not only determining forest height but also extracting complementary forest area. Furthermore, there is potential for improvement using the SBAS algorithm and linear regression. For those who may encounter a lack of available of data to generate forest height map, this study offers one possible approach.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (no. NRF-2017R1A2B4003258) and Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2015M1A3A3A02013416).