Assessment of Different Remote Sensing Data for Forest Structural Attributes Estimation in the Hyrcanian forests

Aim of study: The objective of the study was the comparative assessment of various spatial resolutions of optical satellite imagery including Landsat-TM, ASTER, and Quickbird data to estimate the forest structure attributes of Hyrcanian forests, Golestan province, northernIran. Material and methods : The 112 square plots with area of0.09 ha were measured using a random cluster sampling method and then stand volume, basal area, and tree stem density were computed using measured data. After geometric and atmospheric corrections of images, the spectral attributes from original and different synthetic bands were extracted for modelling. The statistical modelling was performed using CART algorithm. Performance assessment of models was examined using the unused validation plots by RMSE and bias measures. Main Results: The results showed that model of Quickbird data for stand volume, basal area, and tree stem density had a better performance compared to ASTER and TM data. However, estimations by ASTER and TM imagery had slightly similar results for all three parameters. Research highlights : This study exposed that the high-resolution satellite data are more useful for forest structure attributes estimation in the Hyrcanian broadleaves forests compared with medium resolution images without consideration of images costs. However, regarding to be free of the most medium resolution data such as ASTER and TM,ETM+ or OLI images, these data can be used with slightly similar results. Keywords : Forest structure attributes; quickbird; ASTER; TM; CART algorithm; Hyrcanian forests.


Introduction
The forest has a significant role in human life. It not only has to provide goods for consumption but it also has an ecological, environmental, and aesthetic role in human life. Forest structural attributes such as volume, basal area, and number of trees per unit area are important data needed for effective forest management (Gebreslasie et al., 2010). Hyrcanian forests comprise a diverse vegetation cover in the north of Iran and are increasingly fragmented, degraded and converted to other forms of land use (Mohammadi et al., 2010). The forest's structural attributes are traditionally gathered by ground-based field measurements using hand-held equipment. These measurements are generally expen-2 that using high-resolution image types had better estimates compared to medium resolution images (i.e., SE% of 46, 50, 56 in stem volume and 38, 42, 47 for basal area estimation, respectively for aerial photos, SPOT-XS and Landsat-TM).
It has been confirmed that using high-spatial resolution imagery may lead to estimates of high-precision results, especially for the extraction of forest attributes, due to its ability for finer detection and recognition of spectral reflections of the canopy crown and lower mixed pixels. However, medium spatial resolution imagery has been the most popularly used data in stand and plot level estimations. Comparative studies in mixed and multi-layered forests such as Hyrcanian forests could be useful to determine suitable image sources to extract forest structure attributes.
In recent years, non-parametric algorithms such as decision tree-based algorithms (Breiman et al., 1984) and their variants like CART (classification and regression tree) have been widely used in different studies due to their simple interpretation, high precision, and ability to characterize complex interactions among variables (Cutler et al., 2007). Non-parametric algorithms have obvious advantages over parametric-based algorithms for multisource predictive forest mapping. One major drawback of parametric-based algorithms is that they assume a particular statistical distribution in the dataset, which is usually not compatible with multisource data. Nevertheless, non-parametric-based data have recently been used in a multisource forest inventory for estimating forest characteristics due to their advantages including large geographic coverage and large spectral range (Tuominen & Haakana, 2005). In the past two decades, many researchers (see table 1) have focused on the extraction and retrieving of forest stand parameters such as stand volume, basal area, DBH, and tree stem density using medium-to-high resolution optical sensor data.
This research has used different remote sensing sources from aerial photos to satellite based images, with various spatial, radiometric, or spectral resolutions. In some occasions, the results of previous studies were not satisfactory for managers according to forest condition such as forest composition, forest structure, and topography complexity. The comparative studies for investigation into the capability of various image sources based on spatial or spectral characteristics can help to choose suitable images for the extraction of the desired information. Few research studies have compared the effect of different spatial resolutions from different image sources including very high-spatial resolution (airborne-or space-borne systems) to medium spatial resolution imagery in a same forest. For instance, Hyyppa et al., (2000) used various image sources, such as aerial photographs, SPOT Pan, SPOT XS, and Landsat-TM, and compared the accuracy of retrieving the following forest stand variables: stem volume, mean tree height and basal area. They found

Field data
The in-situ data were gathered in the summer of 2010 (in line with season of images) and on 23 clusters with 5 plots and 112 plots in the natural stands (95 plots) and planted stands (17 plots) with homogenous unities in terms of slopes, aspects, and forest types. The distance between plots in each cluster was 75 m and the plots were square with an area of 30×30 (0.09 ha) m (Fig. 1). The coordinates of the centre of each plot were accurately registered using a DGPS device in a post-processing kinematics (PPK) method. In each plot, the species name, stand height (measured to the nearest tree) and diameter at breast height (DBH) of trees with a diameter greater than 7.5 cm at breast height were taken. Singletree volume in plots was calculated using a local volume table, containing diameter at breast height (d 1.3 ) and height, to estimate the volume of different species in the plots. Plot level volume was computed throughout, adding total singletree volumes. Finally, the volume per hectare (m 3 /ha) was estimated using the total volume of all trees in each plot. The measured DBH of trees was used to determine basal area. In addition, the tree density was computed through counting of measured trees in each plot.

Remote sensing data
Data used for this study included Quickbird images from 7 October 2007, ASTER images from 3 July 2006, algorithms make no assumption on data distribution, and therefore avoid the significant error source. This means that they are free from assumptions of any given probability distribution, and observations are assumed independent of each other (Sironen et al., 2010). Many studies have shown that non-parametric methods provide better estimation results. In some studies, such as Sarunas (1997), it has been demonstrated that even with small training samples, non-parametric estimation algorithms provide better results than parametric ones. Among non-parametric algorithms, tree based algorithms are more famous and more commonly used for both forest attribute estimation and classification. In this paper, the capability of Quickbird, ASTER, and TM data were compared to estimate forest structural attributes using classification and regression trees algorithm (CART) as one of the non-parametric algorithms.
In the past several years, classification and regression tree-based analyses have been implemented in several software programs, and used in many remotesensing applications (Huang & Jensen, 1997;Lawrence & Wright, 2001;Cooke & Jacobs, 2005). The use of non-parametric methods for land cover classification has increased in the past decade. Aertsen et al., (2010) investigated the performance of non-parametric techniques such as CART compared to parametric techniques for the prediction of site index in Mediterranean mountainous forests. Moisen & Frescino (2002) and Wang et al., (2005) evaluated these techniques for the prediction of several species-independent forest characteristics in the interior western United States and for the spatial prediction of site index of Lodge pole pines in Canada. These studies concluded that nonparametric approaches were more effective compared to parametric ones.
The objective of research was comparison of various medium (TM and ASTER) and high (Quickbird) spatial resolution satellite images by CART algorithm for estimation the quantitative forest attributes of the Shastkalateh's forest as a part of the Hyrcanian forests with mixed and multi-layered hardwood stands, in the Golestan province, north of Iran.

Study area
The Hyrcanian vegetation zone is a green belt stretching over the northern slopes of Alborz Mountain and covers the southern coasts of the Caspian Sea. This zone is rich in point of species diversity and includes 80 woody species (trees and shrubs) dominated with 4 estimation of atmospheric transmittance (COST) absolute radiometric correction model (Chavez & Pat, 1996) was applied on each image. This model consists of a modification of the dark-object subtraction (DOS) method by including a simple multiplicative correction for the effect of atmospheric transmittance.

Image processing
After geometric rectification and atmospheric corrections, some suitable processing analyses including tasselled cap transformation (greenness, brightness, and wetness components), standardized principal components (PCA), Pansharpning (for Quickbird images), vegetation indices (Table 3) and texture analyses were performed to produce useful and correlated artificial for quantifying and enhancing biophysical characteristics. The texture analysis is done on VNIR band of ASTER and Quickbird images with grey level co-occurrence matrix (GLCM) indices (Mean, Variance, Homogeneity, Contrast, Dissimilarity, Entropy, Angular second moment, Correlation, GLDV angular second moment, GLDV entropy, GLDV mean, GLDV contrast, Inverse difference). These were included in the statistical analysis via main bands for prediction of forest structural attributes in the regression modelling. and Landsat-5 TM images from 17 September 2010. The dates of images are ranging during summer season so that trees do not have phenology differences due to completing leaf growing and there were not considerable cover changes in the period of five years. The projection system was UTM zone 40N, Datum WGS 1984. Table 2 summarizes the acquisition of the data in detail.

Pre-processing of satellite data
In this study, the image datasets were georeferenced to the UTM coordinate system using ground control points (GCPs) and second-order polynomial equations. The Quickbird images were georeferenced and orthorectified using 24 GCPs collected from DGPS and a digital elevation model (DEM) derived from a 1:25,000 topography map (Yazdani, 2011). The ASTER and TM images were rectified with Quickbird images using 25 and 30 GCPs, respectively. The total root mean square errors (RMSE) of images were obtained approximately of 0.36 and 0.25 for ASTER and TM images, respectively. The geometric precision of images was also verified using road and river vectors.
In this study, due to a lack of some information required for accurate atmospheric models, the cosine Assessment of different remote sensing data for forest structural attributes estimation in the Hyrcanian forests ignated as terminal nodes. Pruning of trees is often necessary to avoid over-fitting of data, often accomplished by setting aside a portion of the training data for pruning. A cross validation methodology is applied for pruning to outcome prediction. Regression trees are insensitive to outliers, and can accommodate missing data in predictor variables using surrogates. For more details on the CART model, refer to Breiman et al., (1984).

Model evaluation
Of the plots used, 85% (95 plots) were used for modelling and the remaining 15% (17 plots) were used to evaluate the model outputs. The reliability of estimates was measured by adjusted coefficient of determination (R 2 adj), root mean square error ( (Makela & Pekkarinen, 2004).

Statistical analyses
In this study, statistical analyses were performed by R-2.14.2 (2011-12-22) software (R core Team, 2012). The spectral values of the main and artificial images of the three sensors were extracted by the Zonal statistics function in ArcGIS software corresponding to ground plots. Using 85% of plots (95 plots), the CART algorithm was conducted to evaluate relationships between forest structural attributes of volume, basal area, and number of stem trees per hectare as dependent and spectral data extracted from main and artificial images as independent variables. The CART algorithm is a nonparametric modelling approach that can explain responses of a dependent from a set of independent continuous or categorical variables. The CART models recursively partitioned the data to find increasingly homogeneous subsets based on independent variable splitting criteria using variance-minimizing algorithms. This model produces outcomes that are unaffected by monotone transformations and differing scales of measurement among predictors. Regression trees are insensitive to outliers and can accommodate missing data in predictor variables using surrogates. The dependent data are partitioned into a series of descending left and right child nodes derived from parent nodes. Once the partitioning has ceased, the child nodes are des-   Mohammadi et al., 2010;Gebreslasie et al., 2010;Wolter et al., 2009;Kalbi et al., 2014). In contrast, the obtained RMSE values were lower than RMSE of different studies (Tokola & Heikkila, 1997;Fazakas et al., 1999;Hyyppa et al., 2000, Hyvonen, 2002Makela & Pekkarine, 2004;Huiyan et al., 2006;Muukkonen & Heiskanen, 2005). The regression tree model for stand volume estimation using Quickbird images as the best image, which could produce better estimations, is shown in fig. 2 (for ASTER and TM images please see Figures S1 and S2 [supplementary]). The models indicate that blue, mean (VNIR1) and mean of blue bands were the most important variables to model the stand volume, and could be explained by three image dates ( Table 5). The performance results of using different data sources by CART for stand volume estimation showed that Quickbird images could produce estimations with lower absolute and percentage RMSE and bias compared to using ASTER and TM data. However, estimations produced using ASTER and TM data were almost similar and had very slight differences. For volume estimation, the results obtained from TM data were better than those obtained from the ASTER data were. Comparative results of implementations are given in Table 6.

Estimating basal area
The results of CART model implementation to create the relationship between spectral values of used images and basal area are shown in the table 5 as the best predictors of basal area with the adjusted R 2 of 0.50, 0.73, and 0.70 and RMSE of 2.44, 10.03, and 9.08 m 2 ha -1 , for Quick bird, ASTER and TM images, respectively ( Table 6).
The R 2 values obtained in this study to predict basal area using Quickbird image were lower than ones were obtained by other researches (Gebreslasie et al., 2010;Wolter et al., 2009;Kalbi et al., 2014). In addition, the RMSE values were lower than those values obtained Where, ŷ i is predicted value, y i is the observed value, ȳ is the mean of the observed values, and n is the number of observations in the test dataset.

Descriptive statistics of field data
The preliminary descriptive statistics results showed that stand volume, basal area and tree density ranged from 22.84 to 647.82 m 3 /ha, 3.29 to 52.57 (m 2 /ha) and 111.10 to 966.57 (n/ha), respectively. The mean stand volume was 294.64 m 3 /ha with standard deviation of 141.21 m 3 /ha, the mean basal area was 25.40 m 2 /ha with standard deviation of 10.09 m 2 /ha and the mean tree density was 366.98 n/ha with a standard deviation of 194.55. Table 4 represents a full range of stand structure attributes in the study area.

Estimating stand volume
The results of CART model implementation to create the relationship between spectral values of used images and stand volume are shown in the table 5 as the best predictors of stand volume, with the adjusted R 2 of 0.61, 0.76 and 0.71 and RMSE of 50.96, 120.92, 102.39 m 3 /ha, for Quickbird, ASTER and TM images, respectively ( Table 6).
The obtained R 2 values of this study were higher than obtained by others (Trotter et al., 1997;Hall et  The results of regression tree model for basal area estimation using Quickbird images as the best image, which could produce better estimations, is shown in Fig. 3 (for ASTER and TM images please see Figures  S3 and S4 [supplementary]). The results indicated that GLDV entropy, PCA1, and greenness were the most important variable determining basal area, could be explained by three image dates above, respectively ( Table 5). The results of using different data sources in CART performances for estimate basal area/ha showed that Quickbird images could produce estimations with lower absolute and percentage RMSE and bias compared to using ASTER and TM data. However, estimation using ASTER and TM data produced slightly by direct estimation to predict basal area (Gebreslasie et al., 2010;Kalbi et al., 2014).
The R 2 values obtained in this study using ASTER images were higher than those that obtained through direct estimation to predict basal area (Gebreslasie et al., 2010;Wolter et al., 2009;Kalbi et al., 2014).
The R 2 values obtained in this study using TM images were higher than those that obtained through direct estimation to predict basal area (Hyyppa et al., 2000;Gebreslasie et al., 2010). In addition, the obtained RMSE values were higher than those obtained by direct estimation and used to predict basal area (Gebreslasie et al., 2010;Wolter et al., 2009;Kalbi et al., 2014).    (Table 6). The obtained R 2 values using Quickbird were lower than ones that obtained through direct estimation to predict tree density by Sivanpillai et al., (2006), Gebreslasie et al., (2010), Mohammadi et al., (2010) and Kalbi et al., (2014). In addition, RMSE values obtained in this study were lower than values obtained when direct estimation was used to predict tree density by Sivanpillai et al., (2006). similar results. Comparative results of implementations are given in Table 6.

Estimating tree density/ha
The results of CART model implementation to create the relationship between spectral values of used images and basal area are shown in the table 5 as the best predictors of basal area with the adjusted R 2 of Assessment of different remote sensing data for forest structural attributes estimation in the Hyrcanian forests ly similar (Table 6). Fig. 5 show maps of estimates by CART algorithm using Quickbird data in the study area.

Conclusions
The relationships between reflectance data recorded by the spectral data and forest structural attributes were analysed through the CART algorithm in this study.
Performance assessment of models was examined using RMSE and bias on the unused plots. The results showed that Quickbird satellite data for each three attributes (stand volume, basal area, and tree stem density) have better results than ASTER and TM satellites. However, estimations of ASTER and TM images showed slightly similar results. It seems that the results obtained from TM data were better than results obtained from ASTER data. This priority could be due to using texture analysis on the ASTER image, and retrieval quantities variables using texture analysis ASTER image cannot be very precise.
Although results of this study are valuable and important for extracting and retrieving forest quantity information, and it could provide valuable information The R 2 values obtained in this study using ASTER images were higher than obtained through direct estimation to predict tree density by Sivanpillai et al., (2006) and Gebreslasie et al., (2010). In addition, the RMSE values obtained in this study were higher than results obtained the studies that were used direct estimation to predict stand volume (Kalbi et al., 2014 andShataee et al., 2012). The RMSE and R 2 values of tree density values were favourably compared to those obtained by Sivanpillai et al., (2006); Gebreslasie et al., (2010);and Freitas et al., (2005). The regression tree model for tree density estimation using Quickbird images as the best image, which could produce better estimations, is shown in Fig. 4 (for ASTER and TM images please see Figures S5 and S6 [supplementary]). The results indicated that entropy, greenness, and PCA3 were the most important variables in determining tree density/ha, which could be explained by the three-mentioned images, respectively (Table 5). The results of CART performances by different data sources to estimate tree density showed that Quickbird images could produce estimations with lower absolute and relative RMSE and bias compared to ASTER and TM data. However, estimations of both data were slight-  about changes in stand structure and help forest resource managers to devise suitable management plans. However, the outcomes of this study should be again tested in similar forests elsewhere and/or be adopted in other types of forests, which have same composition and structure. We suggest that the procedure adopted in this study is tested in other areas while investigating the effects of other satellite data.
The results of modelling showed that in spite of priority of Quickbird data compared to ASTER and TM data to estimate the forest attributes, none of these estimation is not enough for accurate variable mapping for executive planning. The causes of do not be adequate the estimations refer to some things. First, these images are coming from optical remote sensing sources, which are producing the 2D information from reflections of canopy surfaces; however, for estimation of stand volume using the 3D information such as using Lidar data could be improve the estimations. Second, complexity of forest stands in points of multi-layers and species composition can be effect on the inaccurate estimations. Using of combination the optical with Lidar data can be useful for improving the estimations.
In conclusion, the results of this study demonstrated that the reflectance values recorded by satellite data are related to spatial resolution. Updating information periodically through satellite remote sensing technology could provide valuable information about changes in stand structure and help forest resource managers to devise suitable management plans.