Introduction

One economic incentive mechanism for reducing greenhouse gas emissions from deforestation and forest degradation in developing countries is that commonly referred to as reducing emissions from deforestation and forest degradation (REDD+). Establishing a robust and transparent system to support local and national measurement, reporting and verification (MRV) of biomass resources and forestry activities are required for the successful implementation of a REDD+ regime. These MRV requirements are fixed by UNFCCC and need to be fulfilled by the national or local monitoring system over each target forest area. Without precise and credible MRV systems, it will not be possible to compensate countries or communities efficiently, effectively and equitably for reducing their emissions.

At the same time, the quantification of heterogeneity in forest areas has long been an objective of forest inventories. Heterogeneity and the methodology required for assessing it are highly dependent on scale. The spatial and temporal variation of a given property will often depend on the spatial and temporal scale at which that property was sampled, and on the size of the minimum mapping unit. The information levels used in forestry reporting are hierarchically divided into: (1) tree level, (2) stand level, (3) farm level, (4) region level and (5) country level [1], and data collection is normally based on measured sample units or subjective field observations derived from the reporting units. The spatial pattern of these reporting units can be mapped using remote sensing techniques or field observations. Permanently documented practices and local biometric models are also needed in order to establish a reliable monitoring system.

The procedure for estimating above-ground (AG) carbon employs certain specific coefficients and variables, as described below:

$$ \begin{array}{cc}\mathrm{Stem}\ \mathrm{biomass}\hfill & =\mathrm{Stem}\ \mathrm{volume} \times \mathrm{Wood}\ \mathrm{gravity}/\mathrm{density}\ \mathrm{factor}\mathrm{s}\hfill \\ {}\mathrm{Tree}\ \mathrm{biomass}\ \mathrm{A}\mathrm{G}\hfill & = \mathrm{Stem}\ \mathrm{biomass} \times \mathrm{Biomass}\ \mathrm{expansion}\ \mathrm{factor}\hfill \\ {}\mathrm{Tree}\ \mathrm{carbon}\ \mathrm{A}\mathrm{G}\hfill & =\mathrm{Tree}\ \mathrm{biomass} \times \mathrm{Carbon}\ \mathrm{factor}\hfill \\ {}\hfill \mathrm{Forest}\ \mathrm{area}\ \mathrm{carbon}\ \mathrm{A}\mathrm{G}\hfill & =\mathrm{Area}\ x\ \mathrm{basal}\ \mathrm{area} \times \mathrm{Tree}\ \mathrm{carbon}\ \mathrm{A}\mathrm{G}\hfill \end{array} $$

Stem volume/size is typically a function of height, tree diameter and species. Tree heights can be measured using specific remote sensing sensors like LiDAR, interferometric synthetic aperture radar (InSAR), or by photogrammetry, while tree diameter requires field measurements, and species can partly be estimated in boreal areas using near infrared observations of crown shape, e.g. [2]. Although homogenous forest stand can be mapped with optical data, the species recognition is very difficult task in mixed-species stands and tropical areas. Several researchers, e.g. [3, 4] have used hyperspectral data to assess species and found the low changes to detect specific species. Wood gravity factors are in turn highly species-specific (Table 1) and dependent on the status of the forest ecosystem, while biomass expansion factor (BEF) describes stem forms and the allocation of the biomass to components. BEF is an easy, reliable method for estimating biomass components when the trees are uniform and do not have a complex branch structure, but this is very rarely the case in the tropics. BEF and the carbon conversion factor can be unique for each tree species, or else the results can be computed by using the same factor for all species. In addition to tree height, basal area and the area of the target forest can be almost directly measured using LiDAR techniques, e.g. [5••, 6••], when other sensors have rather low capacity to penetrate through canopy and response signals are not too noisy. Local-scale field measurements are integral to MRV, and forest biomass should be estimated using field measurements by applying allometric models and factors assessed by sampling and weighing dried vegetation components. Even so, an international default values have been given by [7], but very often rough estimates (e.g. 0.50) have been employed in the calculations required for remote sensing-based applications without detailed species separation. It suggests that these represent an averaging method and should not be encouraged for use in assessing species variability. Remote sensing has nevertheless been successfully used for mapping total woody vegetation cover, biomass and other vegetation parameters such as basal area and stand age, e.g. [810]. Many radar techniques have been tested for global mapping over the last few decades (see [40]), but still many organisation rely on optical remote sensing, which can indirectly indicate the amount of biomass present when Landsat ETM type of data is used from the right season [11••].

Table 1 Statistics for tree species wood density factors (t/m3) in specific tropical areas as listed by the FAO [13]

The Tier approach introduced by the IPCC has been used to describe levels of methodological quality and complexity in the MRV assessment system. Tier-1 applies when no local field data or models are available. Tier-2 has been proposed when species-specific volume equations exist but there are no local biomass conversion or expansion factors [7]. Tier-3 applies when a local species-specific biomass equation is available for calculating either the total or partial biomass and organised data collection with permanent sample plots has been implemented for monitoring purposes.

Use of the Tier-2 approach is severely limited by the poor accuracy of BEF estimates [12]. The BEFs used in the [7] were developed by [13] for tropical rain forests and were designed to convert industrial volumes into biomass values; a 10-cm-tree diameter limit was also applied to the modelling approach. In dry tropical forests, the merchantable volume is not limited to the trunk but can also include the stump and branches of up to 3 cm in diameter. Availability of wood gravity values is one limiting factor and source of error, however, and when it is not possible to find a species-specific wood gravity estimate, it is preferable to use the average wood gravity for the family or functional group rather than the IPPC average [12].

Although MRV guidelines do not explicitly state accuracy requirements, it has been asserted in previous studies that remote sensing missions should entail biomass errors that remain within 20 % of field estimates and should not exceed 50 Mg ha−1 for a global biomass map at a resolution of 1 ha [12].

State of Forests - Total Estimates and Forest Area Coverage

Recent reviews of methods used to estimate deforestation have highlighted two means of monitoring large areas of forest: the ‘wall-to-wall’ approach, or sampling. Point information obtained from sample plots is normally generalised according to the following methods:

  1. 1.

    A predefined sampling scheme for predefined reporting areas or forest stands, in which sets of remote sensing material with different spatial resolutions can be included within the same multi-phase sampling frame, e.g. [14, 15, 16••],

  2. 2.

    Interpolation based partly on geographical distance and partly on remote sensing data, as in KNN-type imputation, or alternatively generalisation methods, as used in many national forest mapping systems, e.g. [1719] or

  3. 3.

    Modelling without any strict sampling design for remote sensing-based segments or forest stands, e.g. [20].

When a multi-phase sampling and design-based frame is employed, we have ‘text book’ estimates to derive reliability estimates and guarantee verification of the inventory process. Forest inventory is not typically a standard design-based approach, and we need to consider complex model-based estimators [14, 15, 16••]. When very high resolution remote sensing data are used to impute field data using satellite imagery, additional modelling is needed, and an additional phase will have to be included in the sampling frame. Detailed attributes typically call for an additional phase when the original RS data do not bear any actual relation to the required attribute (e.g. species-specific tree growth).

Satellite imagery-based wall-to-wall approaches [2123] have been used to map the full extent of a forest area and its changes by means of various sensors. This is typically done using hybrid approaches that combine automated digital segmentation, classification or fully automated techniques [11••, 2427]. The sampling approach consists of identifying partially representative land cover observations by means of field sample plots or visual interpretation of satellite images in order to estimate the extent of the forest cover and of deforestation over an entire area [28, 29]. The presence of clouds, shadows and haze, and the season of the year, can nevertheless affect the quality and completeness of the data obtained from optical sensors, especially in tropical areas.

State of Forests - Biomass and Vegetation Structure

Several studies in countries such as India [30], Bolivia and Brazil [31, 32], Malaysia [33] and Costa Rica [34] have pointed to significant correlations between biomass and reflectance measured at different optical wavelengths. The limitation to measuring only the upper part of the canopy results in large uncertainties, however, and the methods used may not be applicable under all conditions [35, 36]. Steininger [31], for example, found the canopy reflectance–biomass relationship to saturate at around 150 Mg/ha, while [37] reported that the above-ground biomass, below-ground biomass and necromass estimates for the large geographical expanse of the Brazilian Amazon could vary from minimum estimates of 78 billion Mg up to a maximum of 186 billion Mg. Another approach for estimating biomass by remote sensing is based on canopy density [38, 39], as presented on tree cover percentage maps. The SAR-based technologies are rather sensitive to environmental conditions on a per scene basis [40] and have many challenges in worldwide mapping campaigns. Topographic reliefs, precise viewing angles and wet conditions are difficult to control in large area SAR data mosaic processing. Typically, SAR-based biomass estimates have lower precision at same resolution compared to optical data [41••, 42].

Grinand et al. [27] concluded that the use of multi-date images is a practical way of reducing classification errors, enabling better characterisation of small complex changes in land cover status. However, small geometric and thematic errors such as confusion between the sub-categories of land cover classes (e.g. two different forest strata or seasonal thicket/dry forest alternation) compromised the true detection of changes. Secondly, new machine-learning algorithms could be used to enhance the interpretation process in the case of complex datasets or land cover patterns [25]. The random forests algorithm was found to improve accuracy [27] and the related visual output when a supplementary training plot was added, and the availability of very high resolution (VHR) images such as the QuickBird/Google Earth images can partially meet this need for information without extensive field surveys. It is also concluded that 10-m-resolution images would greatly improve deforestation monitoring and would also enable the detection of regeneration and degradation [41••, 43].

The recent development of very high resolution SAR sensors (e.g. TerraSAR) and processing techniques have increased the expectations towards a feasible SAR mapping concept [40]. Saatchi et al. [44] have demonstrated the high sensitivity of P- and L-band airborne SAR observations to above-ground biomass (AGB) <300 Mg/ha on a one-hectare scale. Still, SAR sensors have mainly been used in experimental and landscape level. Saatchi et al. [34] demonstrated multi-step biomass mapping exercise using a combination of data in situ inventory plots and LiDAR samples of forest structure to estimate carbon storage, plus optical and microwave imagery (1-km resolution) to extrapolate over the landscape. Results were planned to be used for areas exceeding 10,000 ha. Recently, spaceborne LiDAR used together with radargrammetric processing [45] has yielded promising results to provide small area estimates from SAR data, as it allows measurement of the height and vertical structure of a forest. It is by far one of the most accurate options among the various sensors available for estimating biomass on a local scale.

During the last decade, airborne light detection and ranging (LiDAR) has proved able to solve major difficulties by making it possible to measure high-resolution characteristics of forest structure, including height and fine-scale canopy density, or variability in height. One critical feature for forest studies is that the vertical canopy height profile statistics derived from LiDAR have been shown to correlate very well with field-based estimates of vegetation volume, density and structure [6••, 4648] even under variable tropical conditions. Currently, LiDAR is operationally cost-effective mapping method for detailed forest management purposes or based on sampling, e.g. [10]. The spaceborne sensors needed to use LiDAR approaches for large-scale wall-to-wall mapping and monitoring efforts will not be available before the end of this decade [40].

Although the literature on the estimation of tropical forest attributes, especially species-specific stem volume, which can be used indirectly as a measure of carbon via a biomass expansion factor, is still sparse, the statistical relationships between LiDAR metrics and field measurements of total biomass and stem volume have been studied in tropical forests in various parts of world. Zolkos, Goetz and Dubayah [49] conducted a meta-analysis of reported biomass accuracy estimates using more than 70 refereed papers and a variety of remote sensing platforms (airborne and spaceborne) and sensor types (optical, radar and LiDAR). The study was based on six methodologically different categories of biomass estimation: (i) airborne discrete return LiDAR, (ii) airborne full-waveform LiDAR, (iii) spaceborne full-waveform LiDAR, (iv) multi-sensor (i.e. LiDAR plus radar or passive optical), (v) radar and (vi) passive optical. Their conclusion was that the mean of the multiple R 2 was similar for LiDAR and multi-sensor models (0.70–0.80), while that for the radar and passive optical models was considerably lower (R 2 = 0.50–0.59). Thus, this comparison of various forest attributes estimated using different sensors also suggests that those based on LiDAR can be used to predict a wider range of forest attributes with better R 2 values [49].

An attractive alternative to attempting to map areas wall-to-wall by aircraft acquisition methods consists of designs that use multi-phase or multi-stage sampling based on Landsat-type optical data and limited airborne LiDAR [50]. Strips (flight lines) can be flown and field plots placed within these acquisition boundaries, for example, and they can then be extended to larger areas using additional data sources, [51]. The following observations can be made use of to guide the planning of new multi-phase survey campaigns [49]:

  1. (i)

    Above-ground biomass models developed from airborne LiDAR metrics are significantly more accurate than those using radar or passive optical data.

  2. (ii)

    Above-ground biomass models developed from multi-sensor metrics are more variable than LiDAR-only models, and they do not always improve biomass estimates.

  3. (iii)

    Model accuracy varies with forest type, species variability and many underlying factors. Even the quality of field measurements can be open to improvement, since complex stem forms are typical in the tropics.

  4. (iv)

    Modelling errors decline relative to the magnitude of the field biomass with increasing plot size, on account of the averaging effect of a large sampling unit. Small training plots show great variability in surveys, but large numbers of small plots is a more practical solution than a few large plots [52].

Rahlf et al. [53] similarly compared the accuracy of timber volume predictions in a survey area in southern Norway based on four three-dimensional remote sensing datasets: airborne laser scanning (LiDAR), stereo aerial photogrammetry, satellite interferometric synthetic aperture radar (InSAR) based on the TanDEM-X mission and satellite radargrammetry based on the TerraSAR-X mission. They fitted linear-mixed effects models with vegetation height and density metrics obtained from the remote sensing datasets as explanatory variables and used the cross-validated root mean squared error (RMSE) relative to the observed mean as a measure of the goodness-of-fit. Airborne laser scanning (ALS) provided the most accurate prediction at the plot level, with RMSE = 19 %, followed by photogrammetry (31 %), InSAR (42 %) and radargrammetry (44 %). The order of performance of the various methods was the same at the stand level, with RMSE values of 12–23 %.

Techniques for Assessing Change in Tropical Forest Areas

Consistent and accurate detection of tropical forest changes is fundamental to the reliable estimation of greenhouse gas emissions and successful implementation of climate mechanisms such as REDD+ [7]. Satellite-based remote sensing at medium spatial resolution (10–30 m) supported by field observations is an appropriate tool for assessing historical and future changes in forest area and carbon stocks in most tropical countries [5456].

Following the opening of the Landsat archive, many studies have demonstrated the operational capabilities of optical medium-resolution satellite imagery for detecting changes and trends in forest cover on scales ranging from local to global [57, 58], in spite of restrictions imposed on data availability by the frequent cloud cover. Persistent cloud cover inhibits full optical coverage from Landsat-like sensors even when compositing is performed over a period of 1–2 years. A number of optical and C- and L-band radar satellite constellations have provided an additional source of time series data since the mid-1990s, but factors such as wetness differences, atmospheric conditions and the imaging angles of the sensors have quite often caused difficulties in their interpretation.

The available remote sensing-based change detection techniques are mainly based on post-classification, or direct classification of image-to-image change vectors [59••]. Both pixel and object-based approaches have been applied, and it is not immediately obvious which is the optimal one [59••]. While the object of the image is theoretically the most powerful unit for comparison purposes, scene illumination/atmospheric effects, changes in viewing geometry, scale and the need for identification of small ‘sub-area’ changes and various segmentation parameters always entail problems that affect the consistency of the results [59••]. Relative pixel-based change vector analysis with subsequent field checks is still a highly valid method for detecting changes [59••]. When using an ordinal logistic regression (OLR) approach and Landsat NDVI time series data, magnitude of the changes as calculated from the differences between the observed and expected values over a given monitoring period has been found to be an essential predictor of small-scale forest disturbances [60••]. This type of method is principally relevant to tropical countries engaged in REDD+ for whom data availability and complex forest change dynamics limit the ability to track forest disturbances reliably over time.

It is doubtful whether the losses in biomass per hectare over time due to forest degradation can be accurately quantified using optical or radar remote sensing [27]. The same would appear to be the case for aerial photos and data obtained from low-flying unmanned aircraft (UAVs). Obviously, it depends on the degree of degradation, which is very much basal area- and forest density-related phenomena. The remote sensing field has recently achieved advances in the assessment of AGB by combining different types of optical sensor with LiDAR and/or radar data, as the latter techniques are able to sense and record forest structure metrics (e.g. maximum canopy height) and penetrate both cloud and canopy cover [61, 62]. Such surveys have been made at a national level for relatively small countries (e.g. Panama) with a high degree of accuracy and have allowed quantification of a reference level for forest carbon stocks [61]. This image fusion type of approach exhibits great potential for the future monitoring of stock level changes, provided that data acquisition with the current technology can be rendered sufficiently cost-efficient.

A Case Study in the Planning of a Carbon Inventory: Different Forest Populations and Remote Sensing Alternatives

The variability in forest attributes directly affects the amount of fieldwork required. When variation is high, more field plots are required, which may be compensated for by a high R 2 between the field and remote sensing data. The total cost of the fieldwork required for the measurements was estimated during the test inventories, given that the number of plots needed in each reporting are can be determined with reference to the correlation with remote sensing data.

As an example, provincial variation in the Lao PDR was found to be high [63], as seen from the decreasing trend in this and the importance of accurate remote sensing information as illustrated in Fig. 1. As the R 2 value between the field data and the remote sensing data increases, the need for fieldwork decreases rapidly. The size of the sample plots is also important in this respect, however, for if they are too small, the variation between them will be very high. By contrast, however, the inclusion probability for small areas of large trees will then be so low that relatively large plots are required. It is nevertheless recommended that, where possible, whole plots should fall into the same forest type, so that remote sensing procedures can utilise plot information directly as a ground truth training area. It is shown in Fig. 1 that almost double the number of small field plots is required relative to large plots when there is a poor correlation between the field and remote sensing data.

Fig. 1
figure 1

Comparison of field data needs in different provinces and at different R 2 levels. When a strong correlation exists between remote sensing and the field data, there is little need for field data even in provinces with wide variations [59••]

The cost and quality of remote sensing are normally interconnected. Medium-resolution satellite data are quite cheap per areal unit and an entire country can easily be covered at low cost, whereas VHR satellite data and ALS data are normally used on a sampling basis. Depending on the occurrence of autocorrelation in the forest population, certain types of information can be collected from a single sampling unit. For the comparison of different sets of material, cost estimates can be prepared for a number of scenarios. The required sampling-based number of remote sensing data plots in a 25 km2 area in Laos was determined using autocorrelation values, and the coefficients of determination (R 2) were estimated for land use and biomass based on the different sets of material. The values for biomass in the ALS data were based on pilot data.

The cost of field data must be added to the image data costs in order to estimate the total cost of the inventory system. A medium-resolution image mosaic covering the whole country is normally needed, together with additional VHR/ALS imagery for proper field sampling. Thus, the final system is a combination of full coverage, remote sensing sampling and field plots. According to price assumptions in Laos, the most efficient way was to use ALOS AVNIR data, laser scanning and field data. In this case, field plots were planned for the forest areas only, and additional cost figures and plots were needed for the other land use strata (forest, non-forest, uncertain). However, we need to calculate the number of images needed, availability and field data cost factors always according to the actual situation.

Discussion

Among the remote sensing approaches reviewed here, it is LiDAR that has demonstrated the most promising performance when estimating stem volume and basal area in a mixed-species tropical forest. Fused wall-to-wall optical data are always needed when large areas are to be covered for mapping purposes. VHR imagery can give a similar response in small-scale mapping, however, so that airborne CIR and LiDAR are both feasible solutions. The availability of the various data sources and their price relations can vary significantly between areas and service providers.

In terms of methodology, it is feasible to extend the popular Scandinavian approaches to the estimation of boreal forest attributes [8] to this tropical context. Among the data sources tested in different parts of world, it was LiDAR that proved to be the most accurate and competent, thus complementing the findings of other investigations focused on tropical areas. Land cover monitoring has been demonstrated in the tropics on several occasions [24, 50], but inaccurate LiDAR measurements are still prone to occur in hilly or mountainous regions, with errors extending up to several metres [64]. Furthermore, the observation concerning the effectiveness of hybrid models concurs with that of [65], who, when comparing LiDAR-only and joint LiDAR–radar models, concluded that there was little to be gained from a combination of sensors [6••]. The fusion of satellite imagery, strips of LiDAR data and field plots is a promising approach for the inventory of tropical forests [34]. Airborne LiDARs also enable an accurate direct estimation of the forest canopy cover (CC), and thus a sample of LiDAR strips can be used as reference data for creating CC maps which are based on satellite images [39].

With respect to the cost of data procurement, LiDAR is the most costly despite its low pulse density of 1 pulse/m2, the second most expensive being Airborne CIR. Studies with pulse densities from 25 pulses/m2 [48] down to about 1.5 pulses/m2 [45] have indicated that high pulse density is not requisite for estimation of forest biomass. Optical or radar satellite data, in relative terms, are only a tenth of the price of LiDAR. Taking the cost-effectiveness in a tropical context into consideration, Landsat-like data could potentially be useful for obtaining land cover but economical estimates, while LiDAR data provide an alternative that would satisfy a demand for accuracy required in forest management planning and below 40 % at the plot level [6••, 20]. The needs for accurate remote sensing data are well justified in forest areas with high variability.

Forest variables may be predicted accurately in the case of a tropical forest if the relations used are derived from that site. Predictive relations typically decline in accuracy when they are applied to a region other than that for which they had been developed. Site-specific fieldwork is still important even when considering remote sensing-based inventories in tropical forests. Model-based inference used together with synthetic estimation is a relevant and useful approach for estimation and inference purposes in non-sampled areas [14, 51]. As noted, potential applications include remote tropical and boreal forests for which access is difficult and sampling is either limited or even impossible.

Conclusion and Recommendations

It may be stated that remote sensing-based applications are definitely key tools for REDD+ MRV monitoring, but we still need to safeguard the good practices that are already in use in traditional forest inventories [66]. We still need to implement a sampling frame and design that meets the accuracy demands and satisfies the practical cost constraints. The fieldwork needs to be planned so that it achieves adequate spatial and temporal representation and coverage without any appreciable design bias effects. We should also develop procedures for inaccessible areas, which will normally call for the use of remote sensing tools. Accuracy assessment and the provision of confidence intervals as estimates of the uncertainty involved are aspects that are very often forgotten, because little attention is paid to proper designing of the sampling and fieldwork. All previous forest inventories and spatial images can be used when planning a new data collection methodology. Although free Landsat data are available and we have high quality LiDAR instruments, we need to collect and allocate the field plots carefully, especially in tropical forests where snake-like tree stems are very common, and the tree species have unknown wood density factors.

The role of airborne LiDAR is to provide ground truth type of information from large areas. The REDD MRV will need good ground sample for basal area and canopy cover change detection. LiDAR will be also be feasible for forest management planning maps and planning operational REDD activities, but cost-effective mapping of carbon stocks for regional- and national-scale assessments will rely on a combination of satellite imagery and ground-based inventory samples. The distribution of tree species and different forest types need to be measured from the field, and information can be generalised using satellite data with 1–30 m resolution. The global scale maps with the ground resolution 100 m–1 km are mainly for visualisation purposes and can be used to extrapolate distribution of information collected by 1–30 m remote sensing sensors and field plots.

The innermost REDD+ concept is still in the realm of forest inventory, and therefore sources of estimate uncertainty actually arise from the same factors that have long been studied by foresters. Standard sampling theory assumes that the quantities entering the estimators are observed without error, but if measurement errors and model related errors cannot be ignored, uncertainty will be underestimated. Although the importance of proper handling of uncertainties was explicitly advocated by IPCC in its good practice guidance for greenhouse gas reporting, it remains unclear how uncertainty assessments should be conducted. McRoberts [67] demonstrated that the advocated type of assessment through confusion matrices is misleading. There is urgent need to improve reliability estimation techniques (e.g. model-based inference) for less ideal forest inventory situations to improve new information services.