Mangrove ecosystem species mapping from integrated Sentinel-2 imagery and field spectral data using random forest algorithm

Nirmawana Simarmata; Ketut Wikantika; Soni Darmawan; Agung Budi Harto; Anjar Dimara Sakti; Aki Asmoro Santo

doi:10.1117/1.JRS.18.014509

6 February 2024 Mangrove ecosystem species mapping from integrated Sentinel-2 imagery and field spectral data using random forest algorithm

Nirmawana Simarmata, Ketut Wikantika, Soni Darmawan, Agung Budi Harto, Anjar Dimara Sakti, Aki Asmoro Santo

Author Affiliations +

Journal of Applied Remote Sensing, Vol. 18, Issue 1, 014509 (February 2024). https://doi.org/10.1117/1.JRS.18.014509

Abstract

Mangroves maintain coastal balance and have the greatest potential for carbon sequestration. Most mapping studies on mangroves have focused on their extent and distribution and rarely featured mangrove species. Therefore, the objective of our study is to investigate mangrove species mapping from integrated Sentinel-2 imagery and field spectral data using a random forest (RF) algorithm. Study areas are located in East and South Lampung, Indonesia. The field samples used represented 144 points of mangrove species. The classification method used an RF algorithm and four models with varying parameters: model 1 with Sentinel-2; model 2 with both Sentinel-2 and field spectral data; model 3 with Sentinel-2, field spectral data, and spectrally transformed data; and model 4 only with spectrally transformed data. The results showed that Rhizophora mucronata, Sonneratia alba, Avicennia lanata, and Avicennia marina were the most common mangrove species in these areas, with reflectance values in the range of 0.002 to 0.493, 0.006 to 0.833, 0.014 to 0.768, and 0.002 to 0.758. Permutation importance (PI) that affects the classification model is the red band, near-infrared, and green normalized difference vegetation index, where the most PI in model 3 is 0.283. The highest level of agreement for mangrove species is found in model 3. Model 3 is the best parameter for RF classification that showed the best mapping accuracy, with the overall accuracy, user accuracy, producer accuracy, and kappa value being 81.25%, 81.68%, 81.25%, and 0.80, respectively.

1. Introduction

Mangrove ecosystems support various services,¹^–³ including nutrient cycling, and serve as a vehicle for fisheries production that dominates the areas between tides along tropical⁴ and subtropical coasts.⁵^–⁷ Despite mangroves making only up 0.7% of tropical forests worldwide,⁸^,⁹ they contribute $\sim 50 %$ of carbon stocks.¹⁰^,¹¹ Human activities have reduced the area of mangrove forests by 30% to 50% in the last half century due to excessive exploitation/logging,²^,¹¹^,¹² conversion into pond areas, and the development of the areas.²^,¹³

Lampung is a coastal area covered by various mangroves¹⁴ including Rhizophora mucronata, which grows on muddy soil types and, occasionally, on sandy reefs.¹⁵ R. mucronata is distributed in the East Coast of Lampung, mainly in Pasir Sakti, East Lampung Regency. Sonneratia alba is another mangrove that grows in sandy mud. The lower leaves of Avicennia lanata and A. marina exhibit a similar morphology while being salty, exhibiting elliptical leaf tips,¹⁶ and being distributed in sandy areas with fine mud near estuaries in Ketapang subdistrict, South Lampung.¹⁷

Given the several types of scattered species that reflect the development and condition of mangroves, it is necessary to identify and classify mangroves. Several methods have been used previously for identification of mangrove by only in situ measurements¹⁸^,¹⁹ or by utilizing satellite imagery.²⁰^,²¹ In situ mangrove monitoring provides the most accurate information on mangrove distribution; however, data collection through field surveys remains challenging due to limited accessibility to mangroves.²² Mangroves are located in relatively wet areas and subjected to high tides.² Thus remote sensing has become a practical way to map and monitor mangrove forests. Remote sensing provides land cover information using pixel-based analysis.²³

Furthermore, integration of artificial intelligence, mathematical algorithms, and big data analysis with high-resolution sensing imaging data has become more common.²⁴^,²⁵ These data can be collected on a regular basis over a wide geographic area, enabling precise, and accurate monitoring of mangrove forest ecosystems.²² The use of remote sensing technologies is expanding, along with the demand for spatial data. Remote sensing data are essential for extracting parameters and biophysical data in identifying mangrove forests.²⁶ These data in coastal areas can be utilized to monitor mangroves. Passive (optical) and active system synthetic aperture radar images are the two forms of remote sensing images that can be used for land monitoring.²⁶

The classification and segmentation of coastal objects, including mangrove cover and tidal marsh, allows for determining the extent of each object. Using machine learning techniques, including support vector machines (SVM),⁵^,²²^,²⁵^–²⁸ support vector regression, artificial neural network,²⁹ random forest (RF),²²^,³⁰^,³¹ decision tree, symbolic regression,³² extreme gradient boosting regression,³³^,³⁴ light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost), allows for the retrieval of data on mangrove distribution. RF has an excellent biomass modeling ability³⁵ and can increase the precision of land cover mapping, wetland mapping,³⁶ and mangrove species classification.⁵^,³⁴^,³⁷

Mangrove identification is performed based on mangrove canopy properties. This can be analyzed using vegetation index transformation.²⁴^,³⁸ One technique for changing vegetation indices is the green normalized difference vegetation index (GNDVI).¹⁷ This technique shows parameters related to vegetation,²⁸^,³⁹ such as green foliage biomass and area, which are essential for vegetation division. In addition, since mangroves are located in relatively wet areas, a moisture index, such as the normalized difference moisture index (NDMI)¹⁷ and a wetness index, such as the normalized difference water index (NDWI) can be used to accurately identify them.

Earlier studies have not differentiated mangrove species and focused only on mangrove forest size and distribution.⁴⁰ Mangrove species composition and distribution data are critical to understanding mangrove ecosystem functions and ensuring sustainable mangrove conservation.²⁹ However, mangroves of a single species typically form tiny patches or thin strips that are invisible on satellite images. Furthermore, using machine learning models and remote sensing data for mangrove species mapping is challenging since there is no clear zoning between species due to the spectral similarity of mangroves.²⁸ This underlies the importance of mangrove mapping at the species level. Correctly identifying species will provide insights into the relationships between them. In addition, mangrove species classification allows the monitoring of a particular species population over time. This will help in detecting changes in the population size, distribution, and health of mangrove species. Previous studies have used machine learning models for mangrove species mapping using SVM classification with Worldview-2 images and aerial photographs.²⁸^,⁴¹ Behera et al.³⁸ identified the mangrove species Heritiera fomes, Excoecaria agallocha, and Avicennia officinalis using reflectance/backscatter bands and vegetation indices derived from Sentinel-2, AVIRIS-NG, and Sentinel-1. Meanwhile, Paramanik et al.⁴² only used AVIRIS-NG hyperspectral imagery, which successfully identified the species H. fomes, E. agallocha, and A. officinalis and two of their combinations (H. fomes–E. agallocha and A. officinalis–E. agallocha), and Ghorbanian et al.²² were limited to mapping mangrove ecosystems using Sentinel-1 and Sentinel-2 images using RF. For developing the classification model, the parameter used was vegetation index;³⁸ this is in contrast to the present study, which not only used the vegetation index but also the wetness and moisture indices. In addition, the directly measured reflectance value was also a parameter used for further modeling, which improved the accuracy of classification mapping.

Mangrove species can be categorized by changing the algorithm or applying new information.²⁸ Mapping and classifying mangrove species can be accomplished using a combination of RF algorithms and field spectroradiometers. Herein, the leaf area index,⁴³ vegetation texture and index,⁴⁴ humidity, and wetness were used as parameters. High-resolution data are useful for classifying mangrove species; nonetheless, they are not available for all areas.⁴⁵

Previous studies have investigated mangrove reflectance using satellite data; however, observations of the spectral properties of mangrove biota using satellite images are insufficient, owing to the challenge of recognizing mangrove species solely from the canopy.⁴⁶ This is supported by previous research showing that object reflectance using a field spectroradiometer can be directly measured for mangrove species mapping.⁴⁵ Other findings have indicated that each mangrove species has a unique spectral reflectance and can be easily identified and mapped with adjacent wavelength bands in the near-infrared (NIR) region.¹⁵

Information on mangrove species is essential for mangrove management; however, insufficient research related to species classification has been conducted, especially in Lampung, Indonesia. Therefore, the objective of this research is to investigate mangrove species mapping using integrated Sentinel-2 imagery and field spectral data processed by an RF algorithm. The novelty of this study lies in the determination of the best parameters for RF classification, thus enhancing the accuracy of identifying mangrove species for the management, monitoring, and rehabilitation of mangroves in study areas.

2. Materials and Methods

2.1.

Study Site

The study was chosen along the coastlines of Lampung, Indonesia, specifically targeting the east⁴⁷ and south areas and located between 5°25′30″S–5°51′0″S and 105°30′0″E–105°52′0″E. Tidal swamp plains are found along the east coast with an elevation of $\sim 0.5$ to 1 m above the mean sea level, and sedimentation areas based on the rising tide characterize the research site in these two districts (East and South Lampung) (Fig. 1).

Fig. 1

(a) Study area (RGB Sentinel-2A: August 6–15, 2022), (b) sampling points in Pasir Sakti, and (c) sampling points in Ketapang.

2.2.

Datasets

2.2.1.

Image data

The image data used in this study were from the Sentinel-2A image, which has 13 channels and various resolutions.⁴⁸ Sentinel-2A imagery is suitable for classifying land cover as they contain four channels with a spatial resolution of 10 m. Sentinel-2A observes the Earth in various spectral ranges, including visible light and NIR. The Sentinel-2A data parameters are listed in Table 1.

Table 1

Sentinel-2A data parameters.

Parameter	Dataset	Spatial resolution (m)
Red band (1)	Sentinel-2A imagery	10
Green band (2)	Sentinel-2A imagery	10
NIR band (3)	Sentinel-2A imagery	10
Red band (4)	Field spectrometer	10
Green band (5)	Field spectrometer	10
NIR band (6)	Field spectrometer	10
Density (GNDVI) (7)	Sentinel-2A imagery	10
Humidity (NDMI) (8)	Sentinel-2A imagery	10
Water index (NDWI) (9)	Sentinel-2A imagery	10

2.2.2.

Field data

The spectral values of each mangrove species were collected from August 6 to 15, 2022. The sample locations and number of sample points were determined based on the raw pixel values of mangrove objects on Sentinel-2A, specifically the red band, green band, and NIR band.⁴⁵ Additionally, the determination considers access to the sampling location as well since some locations are difficult to reach. The pixel values obtained are plotted onto a map to determine the reflectance value of mangrove species in the field.⁴⁵ The obtained reflectance data were used for preparing a spectral library that served as the basis for the construction of classification models. The number of trees taken was 144, with an average tree height of 7 to 15 m. Based on the results of field measurements, four mangrove species were observed, namely: A. marina, A. lanata, S. alba, and R. mucronata. Each species was observed in several different locations by measuring the diameter at breast height (DBH) of several trees in each location. The average DBH of S. alba in Ketapang was 55 cm, that of R. mucronata in Pasir Sakti was 17 cm, and that of both A. marina and A. lanata in Ketapang was 91 cm. The number of samples in each plot varies depending on the size of the tree as the larger the tree DBH, the fewer the number of trees. The range of the number of trees in a sample plot was 10 to 15 trees.

The sample points were evenly distributed in mangrove areas to represent each mangrove species. The spectral reflectance of the mangroves was measured using a field spectroradiometer,⁴⁵ a commonly used method to analyze the spectrum of light reflected or emitted.⁴⁹ Reflectance data were collected at the leaf level by selecting mangrove leaves that represent various species or conditions in the area. The spectroradiometer was placed above the leaf to measure the reflectance of light at various wavelengths, ranging from ultraviolet (315 nm) to SWIR (1100 nm).

The sample plots are determined based on the regulations of the head of the geospatial information agency number 3 of 2014, which include technical guidelines for collecting and processing mangrove geospatial data as well as the Sentinel-2A imagery data. The plot size used is $10 \times 10 m$ .⁵⁰ Each plot had a different number of samples based on the diameter and size of the mangrove canopy. The number of samples in a plot was 5 to 10. The types of species in the study area were similar in some areas because they shared common traits that grouped them together in certain parts of the region. Measurements using a field spectroradiometer can identify mangrove species based on the resulting spectrum patterns.

2.3.

Methodological Approach

SNAP 9.0.0 version was used for Sentinel-2A image preprocessing, whereas Endmapbox was used for processing of field spectroradiometer data, and quantum GIS was used for RF classification. The spectral bands used were red, green, and NIR. The methodology included radiometric and geometric correction, vegetation index transformation, moisture, and water indices, direct spectral measurements in the field, classification, accuracy testing, and mangrove species mapping. Fig. 2 outlines the research process.

Fig. 2

Flowchart of the methodology applied in this study.

Accuracy testing with a confusion matrix method is required for RF algorithm classification.⁵¹ It is possible to determine whether the classification results are sourced from two separate classes by comparing the derivatives of each predicted class in the matrix to the derivative of the actual class in each row. For remote sensing picture classification, the most effective and practical validation tool is the confusion matrix method.⁵² The vast majority of operations are consolidated into the error matrix, which use producer accuracy (PA), user accuracy (UA), and overall accuracy (OA) as indicators; this method is therefore successful.⁵³ A test of the classification results’ correctness was performed to gauge the degree of precision of the usage map produced by the digital classification technique. Although the samples from the training area and accuracy test were different, the accuracy of the accuracy test sample was more commonly acknowledged because it was taken in a different area.⁵⁴

2.3.1.

Image preprocessing

The preprocessing analysis was divided into two main stages: radiometric and geometric corrections. In the radiometric correction analysis, the type of image used was Sentinel-2A MSI Multispectral Instrument, level 1C,⁵⁵ a product that has not been radiometrically and atmospherically corrected; therefore, pixel errors due to atmospheric influences must be minimized. The best image with $< 10 %$ cloud cover was chosen for preprocessing. The selected recording period was between January 1 and December 31, 2022. The best data were the Sentinel-2A image recording on August 7, 2022, which had the minimum cloud cover such that the objects beneath were clearly visible. For each of the image’s multispectral channels, radiometric calibration was carried out by translating digital values (DN) into radians.

The radian image was converted to top of atmosphere reflectance after radiometric calibration. The main objective was to correct for differences in reflectance values due to variations in the Earth–Sun distance on each recording date.⁵⁶ These differences can be significant owing to the differences in geographical conditions and the time of image recording. The fast line-of-sight atmospheric analysis of spectral hypercubes atmospheric correction method was used for the correction to reduce atmospheric impacts.⁵⁶ Sentinel-2A image processing was used to facilitate the analysis of the mangrove cover during the geometric correction stage. The geometric correction used in this study was image-to-map, with the reference data being an Indonesian Landform map based on ground control points (GCPs) collected directly in the field. The interpolation method used was a nearest-neighbor algorithm that only retrieves the nearest pixel value shifted to a new position. Six GCPs were used in the geometric correction, with a root-mean-square error value of 0.33 pixel.

2.3.2.

Mangrove reflectance

The ASD HandHeld 2: hand-held visible NIR spectroradiometer (ASD Inc., Alpharetta, Georgia, United States) was used for mangrove reflectance measurements. The spectroradiometer was calibrated with a white reference prior to use⁴⁵ and recalibrated in cases where a significant difference in light intensity was noticed during use or when a “saturation alert” warning was issued.¹⁵ A total of 144 reflectance values were obtained, and measurements were performed between 09.00 and 14.00 WIB to reduce the influence of weather at the research site. The angular position of the spectroradiometer sensor was set at 45 deg to the direction of sunlight to avoid shadows on the target object.⁵⁷ Measurements were made on land and partly using boats owing to difficult access to the sites.

The presence of mangroves above the water was sufficient to affect the spectral value due to the possibility of water reflection interfering with the mangrove reflectance value. The supported data output format was .txt, with data collection conducted from August 6 to 15, 2022. The Sentinel-2A image recording date was August 7, 2022. Table 2 lists a comparison of the wavelengths used in the Sentinel-2A image and the spectroradiometer. The results of the in situ measurements were used to develop a spectrum library⁵⁸ to compare the spectral image to the reference spectrum. The spectrum library was used as the reference to compare the Sentinel-2A image reflectance value to the field reflectance value.⁵⁹ The pixel values of mangrove species were used as the target spectrum for spectral matching-based object classification.

Table 2

Wavelength bands used by Sentinel-2A and the spectroradiometer.

Sentinel-2A		Spectroradiometer
Wavelength (nm)	Band name	Wavelength (nm)	Band name
433 to 453	Coastal aerosol	315 to 400	Violet
458 to 523	Blue	400 to 525	Blue
543 to 578	Green	525 to 605	Green
650 to 680	Red	605 to 655

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.

Citation Download Citation

Nirmawana Simarmata, Ketut Wikantika, Soni Darmawan, Agung Budi Harto, Anjar Dimara Sakti, and Aki Asmoro Santo "Mangrove ecosystem species mapping from integrated Sentinel-2 imagery and field spectral data using random forest algorithm," Journal of Applied Remote Sensing 18(1), 014509 (6 February 2024). https://doi.org/10.1117/1.JRS.18.014509

Received: 10 June 2023; Accepted: 9 January 2024; Published: 6 February 2024

Access the abstract

JOURNAL ARTICLE
29 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

KEYWORDS

Reflectivity

Data modeling

Near infrared

Vegetation

Ecosystems

Performance modeling

Image classification

1.

Introduction

2.

Materials and Methods