Skip to main content

Advertisement

Log in

Comparison and integration of feature reduction methods for land cover classification with RapidEye imagery

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Feature reduction (FR) methods can effectively reduce the feature set and improve the accuracy for land cover classification (LCC) using high resolution remote sensing data with high dimensional or strongly correlated feature sets. However, FR methods have rarely been applied for LCC in arid regions with complex geographic environments, especially for the integration of feature selection (FS) and feature extraction (FE) methods. This study investigated the comparison and integration of FR methods for LCC in part of Dunhuang Basin, northwestern China, which is a typical inland arid region and groundwater-dependent ecosystems. Five spectral bands and 9 vegetation indices features that derived from RapidEye satellite imagery were used with support vector machines algorithm. Two wrapper FS methods, based on random forest algorithm (varSelRF and Boruta packages in R software), were used. Three FE methods (principal component analysis, PCA; independent component analysis, ICA; and minimum noise fraction transformation, MNF), were employed to extract a reduced number of reconstructed new features. Integration of varSelRF and PCA methods (varSelRF-PCA) was attempted. All 14 features were relevant, indicated by Boruta method; only 6 features, including the red-edge band selected by the varSelRF module, had higher importance. All the five FR methods could improve classification accuracy, but only varSelRF achieved significant improvement. The varSelRF outperformed the FE methods, followed by MNF, PCA, and ICA. The proposed varSelRF-PCA model significantly improved classification accuracy and outperformed all the FS or FE methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Adelabu S, Mutanga O, Adam E (2014) Evaluating the impact of red-edge band from Rapideye image for classifying insect defoliation levels. ISPRS J Photogramm Remote Sens 95:34–41

    Article  Google Scholar 

  2. Barron OV, Emelyanova I, Van Niel TG (2014) Mapping groundwater-dependent ecosystems using remote sensing measures of vegetation and moisture dynamics. Hydrol Process 28:372–385

    Article  Google Scholar 

  3. Belgiu M, Drăguţ L (2016) Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogramm Remote Sens 114:24–31

    Article  Google Scholar 

  4. Bharti KK, Singh PK (2015) Hybrid dimension reduction by integrating feature selection with feature extraction method for text clustering. Expert Syst Appl 42:3105–3114

    Article  Google Scholar 

  5. Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  MATH  Google Scholar 

  6. Cánovas-García F, Alonso-Sarría F (2015) Optimal combination of classification algorithms and feature ranking methods for object-based classification of submeter resolution Z/I-imaging DMC imagery. Remote Sens 7:4651–4677

    Article  Google Scholar 

  7. Chen P, Tremblay N, Wang J, Vigneaulta P (2010) New index for crop canopy fresh biomass estimation. Spectrosc Spectr Anal 30:512–517 (in Chinese)

    Google Scholar 

  8. Chen W, Li X, Wang Y, Chen G, Liu S (2014a) Forested landslide detection using LiDAR data and the random forest algorithm: a case study of the three gorges, China. Remote Sens Environ 152:291–301

    Article  Google Scholar 

  9. Chen W, Sun Z, Wang Y, Ma R (2014b) Major scientific issues on water demand studying for groundwater-dependent vegetation ecosystems in inland arid regions. Earth Sci: J China U Geosci 39:1340–1348 (In Chinese)

    Google Scholar 

  10. Chen G, Li X, Chen W, Cheng X, Zhang Y, Liu S (2014c) Extraction and application analysis of landslide influential factors based on LiDar dem: a case study in the three gorges area, China. Nat Hazards 74:509–526

    Article  Google Scholar 

  11. Chen X, Fang T, Huo H, Li D (2015) Measuring the effectiveness of various features for thematic information extraction from very high resolution remote sensing imagery. IEEE Trans Geosc Remote Sens 53:4837–4851

    Article  Google Scholar 

  12. Chen W, Wang Y, Li X, Zou Y, Liao Y, Yang C (2016) Land use/land cover change and driving effects of water environment system in Dunhuang Basin, northwestern China. Environ Earth Sci 75:1027

    Article  Google Scholar 

  13. Development Core Team R (2013) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna

    Google Scholar 

  14. Diaz-Uriarte R (2010) VarSelRF: variable selection using random forests; R package version 0.7–3. TU Wien, Vienna

    Google Scholar 

  15. Duro DC, Franklin SE, Dubé MG (2012) Multiscale object-based image analysis and feature selection of multi-sensor earth observation imagery using random forests. Int J Remote Sens 33:4502–4526

    Article  Google Scholar 

  16. ENVI (2009) The Environment for Visualizing Images (ENVI) 4.7. User’s Guide: Research System Institute

  17. Fan H (2013) Land-cover mapping in the Nujiang grand canyon: integrating spectral, textural, and topographic data in a random forest classifier. Int J Remote Sens 34:7545–7567

    Article  Google Scholar 

  18. Fassnacht FE, Neumann C, Förster M, Buddenbaum H, Ghosh A, Clasen A, Josh PK, Koch B (2014) Comparison of feature reduction algorithms for classifying tree species with hyperspectral data on three central European test sites. IEEE J Sel Top Appl Earth Observ Remote Sens 7:2547–2561

    Article  Google Scholar 

  19. Ge X, Ni J, Li Z, Hu R, Ming X, Ye Q (2013) Quantifying the synergistic effect of the precipitation and land use on sandy desertification at county level: a case study in Naiman banner, northern China. J Environ Manag 123:34–41

    Article  Google Scholar 

  20. Gitelson A, Merzlyak MN (1994) Spectral reflectance changes associated with autumn senescence of Aesculus hippocastanum L. and Acer platanoides L. leaves. Spectral features and relation to chlorophyll estimation. J Plant Physiol 143:286–292

    Article  Google Scholar 

  21. Gitelson AA, Kaufman YJ, Merzlyak MN (1996) Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens Environ 58:289–298

    Article  Google Scholar 

  22. Gou S, Gonzales S, Miller GR (2015) Mapping potential groundwater-dependent ecosystems for sustainable management. Groundwater 53:99–110

    Article  Google Scholar 

  23. Green RO, Eastwood ML, Williams O (1988) A transformation for ordering multispectral data in terms of image quality with implications for noise removal. IEEE Trans Geosci Remote Sens 26:65–74

    Article  Google Scholar 

  24. Han L, Wan X, Fang F, Zhang Z (2013) Desertification assessments of Hexi regions in Gansu province by remote sensing. Arid Land Geogr 36:131–138 (In Chinese)

    Google Scholar 

  25. Han L, Zhang Z, Zhang Q, Wan X (2015) Desertification assessments in the Hexi corridor of northern China’s Gansu Province by remote sensing. Nat Hazards 75:2715–2731

    Article  Google Scholar 

  26. Hao C, Wu S (2006) The effects of land-use types and conversions on desertification in mu us Sandy land of China. J Geogr Sci 16:57–68

    Article  Google Scholar 

  27. Helwig NE (2015) Ica: independent component analysis; R package version 1.0–1. TU Wien, Vienna

    Google Scholar 

  28. Huang X, Zhang L (2013) An SVM ensemble approach combining spectral, structural, and semantic features for the classification of high-resolution remotely sensed imagery. IEEE Trans Geosci Remote Sens 51:257–272

    Article  Google Scholar 

  29. Hyvarinen A (1999) Fast and robust fixed-point algorithms for independent component analysis. IEEE Transactions on Neural Netw 10:626–634

    Article  Google Scholar 

  30. Hyvarinen A, Karhunen J, Erkki O (2001) Independent component analysis. Wiley, New York

    Book  Google Scholar 

  31. Jones T, Coops NC, Sharma T (2010) Assessing the utility of airborne hyperspectral and LiDAR data for species distribution mapping in the coastal Pacific northwest, Canada. Remote Sens Environ 114:2841–2852

    Article  Google Scholar 

  32. Jordan CF (1969) Derivation of leaf-area index from quality of light on the forest floor. Ecology 50:663–666

    Article  Google Scholar 

  33. Key T, Warner TA, McGraw JB, Fajvan MA (2001) A comparison of multispectral and multitemporal information in high spatial resolution imagery for classification of individual tree species in a temperate hardwood forest. Remote Sens Environ 75:100–112

    Article  Google Scholar 

  34. Kim HO, Yeom JM (2014) Effect of red-edge and texture features for object-based paddy rice crop classification using RapidEye multi-spectral satellite image data. Int J Remote Sens 35:7046–7068

    Google Scholar 

  35. Kursa MB, Rudnicki WR (2010) Feature selection with the Boruta package. J Stat Softw 36:1–13

    Article  Google Scholar 

  36. Ladha L, Deepa T (2011) Feature selection methods and algorithms. Int J Comp Sci Eng 3:1787–1797

    Google Scholar 

  37. Laliberte AS, Browning DM, Rango A (2012) A comparison of three feature selection methods for object-based classification of sub-decimeter resolution UltraCam-L imagery. Int J Appl Earth Obs Geoinf 15:70–78

    Article  Google Scholar 

  38. Li X, Cheng X, Chen W, Chen G, Liu S (2015) Identification of forested landslides using LiDar data, object-based image analysis, and machine learning algorithms. Remote Sens 7:9705–9726

    Article  Google Scholar 

  39. Li X, Chen W, Cheng X, Wang L (2016) A comparison of machine learning algorithms for mapping of complex surface-mined and agricultural landscapes using ZiYuan-3 stereo satellite imagery. Remote Sens 8:514

    Article  Google Scholar 

  40. Liu P, Choo KKR, Wang L Huang F (2016) SVM or deep learning? A comparative study on remote sensing image classification. Soft Comput. doi:10.1007/s00500-016-2247-2

  41. Lv J, Wang X, Zhou Y, Qian K, Wan L, Eamus D, Tao Z (2013) Groundwater-dependent distribution of vegetation in Hailiutu River catchment, a semi-arid region in China. Ecohydrology 6:142–149

    Article  Google Scholar 

  42. Ma Y, Wu H, Wang L, Huang B, Ranjan R, Zomaya A, Jie W (2015) Remote sensing big data computing: challenges and opportunities. Futur Gener Comp Syst 51:47–60

    Article  Google Scholar 

  43. Manandhar R, Odeh IOA, Ancev T (2009) Improving the accuracy of land use and land cover classification of Landsat data using post-classification enhancement. Remote Sens 1:330–344

    Article  Google Scholar 

  44. Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42:1778–1790

    Article  Google Scholar 

  45. Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F (2014) e1071: Misc functions of the Department of Statistics (e1071); R package version 1.6–4. TU Wien, Vienna

    Google Scholar 

  46. Namdar M, Adamowski J, Saadat H, Sharifi F, Khiri A (2014) Land-use and land-cover classification in semi-arid regions using independent component analysis (ICA) and expert classification. Int J Remote Sens 35:8057–8073

    Article  Google Scholar 

  47. Pal M, Foody GM (2010) Feature selection for classification of hyperspectral data by SVM. IEEE Trans Geosci Remote Sens 48:2297–2307

    Article  Google Scholar 

  48. Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2:559–572

    Article  MATH  Google Scholar 

  49. Peerbhay KY, Mutanga O, Ismail R (2013) Commercial tree species discrimination using airborne AISA eagle hyperspectral imagery and partial least squares discriminant analysis (PLS-DA) in Kwazulu–Natal, South Africa. ISPRS J Photogramm Remote Sens 79:19–28

    Article  Google Scholar 

  50. Peerbhay KY, Mutanga O, Ismail R (2014) Does simultaneous variable selection and dimension reduction improve the classification of Pinus forest species? J Appl Remote Sens 8:085194

    Article  Google Scholar 

  51. Petropoulos GP, Kalaitzidis C, Vadrevu KP (2012) Support vector machines and object-based classification for obtaining land-use/cover cartography from Hyperion hyperspectral imagery. Comput Geosci 41:99–107

    Article  Google Scholar 

  52. Pontius RG, Millones M (2011) Death to kappa: birth of quantity disagreement and allocation disagreement for accuracy assessment. Int J Remote Sens 32:4407–4429

    Article  Google Scholar 

  53. Rouse JW, Haas RH, Schell JA (1974) Monitoring the vernal advancement and Retrogradation (Greenwave effect) of natural vegetation. Texas A&M University, College Station

    Google Scholar 

  54. Saadat H, Adamowski J, Bonnell R, Sharifi F, Namdar M, Ale-Ebrahim S (2011) Land use and land cover classification over a large area in Iran based on single date analysis of satellite imagery. ISPRS J Photogramm Remote Sens 66:608–619

    Article  Google Scholar 

  55. Sawilowsky S (2007) Mann–Whitney U test (Wilcoxon rank-sum test). In: Salkind N (ed) Encyclopedia of measurement and statistics. SAGE, Newbury Park, pp 566–568

    Google Scholar 

  56. Schuster C, Förster M, Kleinschmit B (2012) Testing the red edge channel for improving land-use classifications based on high-resolution multi-spectral satellite data. Int J Remote Sens 33:5583–5599

    Article  Google Scholar 

  57. Song W, Liu P, Wang L (2016) Sparse representation-based correlation analysis of non-stationary spatiotemporal big data. Int J Digital Earth 9(9):892–913

    Article  Google Scholar 

  58. Stumpf A, Kerle N (2011) Object-oriented mapping of landslides using random forests. Remote Sens Environ 115:2564–2577

    Article  Google Scholar 

  59. Sun A, Chen T, Niu R, Trinder JC (2016) Land use/cover change and the urbanization process in the Wuhan area from 1991 to 2013 based on MESMA. Environ Earth Sci 75(17):1214

    Article  Google Scholar 

  60. Vapnik V (1995) The nature of statistical learning theory. Springer-Verlag, Inc., New York

    Book  MATH  Google Scholar 

  61. Wang X, Chen F, Dong Z (2005) The relative role of climatic and human factors in desertification in semiarid China. Globa Environ Change 16:48–57

    Article  Google Scholar 

  62. Wang L, Song W, Liu P (2016a) Link the remote sensing big data to the image features via wavelet transformation. Cluster Comput 19:793–810

    Article  Google Scholar 

  63. Wang L, Ma Y, Yan J, Chang V, Zomaya AY (2016b) pipsCloud: high performance cloud computing for remote sensing big data management and processing. Futur Gener Comp Syst. doi:10.1016/j.future.2016.06.009

    Google Scholar 

  64. Wang L, Zhang J, Liu P, Choo KKR, Huang F (2017) Spectral–spatial multi-feature-based deep learning for hyperspectral remote sensing image classification. Soft Comput 21:213–221

  65. Xu D, Li C, Zhuang D, Pan J (2011) Assessment of the relative role of climate change and human activities in desertification: a review. J Geogr Sci 21:926–936

    Article  Google Scholar 

  66. Yan WY, Shaker A, El-Ashmawy N (2015) Urban land cover classification using airborne LiDAR data: a review. Remote Sens Environ 158:295–310

    Article  Google Scholar 

  67. Zha F, Ma M, Chen S, Liu Y, Li Y, Huang S (2015) Remote sensing Lithologic classification of multispectral data based on the vegetation inhibition method in the vegetation coverage area. Earth Sci 40(8):1403–1408 (In Chinese)

    Google Scholar 

Download references

Acknowledgements

This research was jointly supported by the China Geological Survey project (No. 12120115063201), the Fundamental Research Funds for Central Universities, China University of Geosciences (Wuhan) (No. CUGL150417), and the China Scholarship Council (No. 201406415051).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weitao Chen.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, X., Chen, W., Cheng, X. et al. Comparison and integration of feature reduction methods for land cover classification with RapidEye imagery. Multimed Tools Appl 76, 23041–23057 (2017). https://doi.org/10.1007/s11042-016-4311-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-4311-4

Keywords

Navigation