Skip to main content

Advertisement

Log in

Flood susceptibility analysis and its verification using a novel ensemble support vector machine and frequency ratio method

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Flood is one of the most commonly occurred natural hazards worldwide. Severe flood occurrences in Kelantan, Malaysia cause damage to both life and property every year. Due to the huge losses in this area, development of appropriate flood modeling is required for the government. Remote sensing and geographic information system techniques can support overall flood management as they can produce rapid data collection and analysis for hydrological studies. The existing models for flood mapping have some weak points that may improve through more sophisticated and ensemble methods. The current research aimed to propose a novel ensemble method by integrating support vector machine (SVM) and frequency ratio (FR) to produce spatial modeling in flood susceptibility assessment. In the literature, mostly statistical and machine learning methods are used individually; however, their integration can enhance the final output. The FR model can perform bivariate statistical analysis and evaluate the correlation between the flooding and classes of each conditioning factors. The weights achieved by FR can be assigned to each conditioning factor and the resulted factors can be used in SVM analysis. In order to examine the efficiency of the proposed ensemble method and to show the proficiency of SVM, another machine learning algorithm such as decision tree (DT) was applied and the results were compared. To perform the methods, the upper catchment of the Kelantan basin in Malaysia was chosen. First, a flood inventory map with a total of 155 flood locations were extracted from various sources over the study area. The flood inventory map was randomly divided into two dataset; 70 % (115 flood locations) for the purpose of training and the remaining 30 % (40 flood locations) was used for validation. The spatial database included digital elevation model, curvature, geology, river, stream power index, rainfall, land use/cover, soil type, topographic wetness index and slope. For model validation, area under curve method was used and both success and prediction rate curves were calculated. The validation results for ensemble method showed 88.71 and 85.21 % for success rate and prediction rate respectively. The DT model showed 87.00 and 82.00 % for the success rate and prediction rate respectively. It is evident that the accuracies were increased using the ensemble method. The acquired results proved the efficiency of the proposed ensemble method as rapid, accurate and reasonable in flood susceptibility assessment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Abdullah AF, Vojinovic Z, Rahman AA (2013) A methodology for processing raw LiDAR data to support urban flood modelling framework: Case Study—Kuala Lumpur Malaysia. In: Rahman AA, Boguslawski P, Gold C, Said M (eds) Developments in multidimensional spatial data models. Springer, Berlin, pp 49–68

    Chapter  Google Scholar 

  • Adeli H, Hung SL (1994) Machine learning: neural networks, genetic algorithms, and fuzzy systems. Wiley, New York

    Google Scholar 

  • Akgun A, Dag S, Bulut F (2008) Landslide susceptibility mapping for a landslide-prone area (Findikli, NE of Turkey) by likelihood-frequency ratio and weighted linear combination models. Environ Geol 54(6):1127–1143

    Article  Google Scholar 

  • Ayalew L, Yamagishi H (2005) The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains. Cent Jpn Geomorphol 65(1):15–31

    Article  Google Scholar 

  • Bahremand A, De Smedt F, Corluy J, Liu Y, Poorova J, Velcicka L, Kunikova E (2007) WetSpa model application for assessing reforestation impacts on floods in Margecany–Hornad Watershed. Slovak Water Resour Manag 21(8):1373–1391

    Article  Google Scholar 

  • Belousov A, Verzakov S, Von Frese J (2002) Applicational aspects of support vector machines. J chemom 16(8–10):482–489

    Article  CAS  Google Scholar 

  • Bhaduri K, Wolff R, Giannella C, Kargupta H (2008) Decision-tree induction distributed decision-tree induction in peer-to-peer systems. Stat Anal Data Min 1(2):85–103

    Article  Google Scholar 

  • Bou Kheir R, Greve MH, Abdallah C, Dalgaard T (2010) Spatial soil zinc content distribution from terrain parameters: a GIS-based decision-tree model in Lebanon. Environ Pollut 158(2):520–528

    Article  CAS  Google Scholar 

  • Caniani D, Pascale S, Sdao F, Sole A (2008) Neural networks and landslide susceptibility: a case study of the urban area of Potenza. Nat Hazards 45(1):55–72

    Article  Google Scholar 

  • Carrara A, Pike RJ (2008) GIS technology and models for assessing landslide hazard and risk. Geomorphology 94(3):257–260

    Article  Google Scholar 

  • Chapelle O, Vapnik V, Bousquet O, Mukherjee S (2002) Choosing multiple parameters for support vector machines. Mach Learn 46(1–3):131–159

    Article  Google Scholar 

  • Chau K, Wu C, Li Y (2005) Comparison of several flood forecasting models in Yangtze River. J Hydraul Eng 10(6):485–491

    Article  Google Scholar 

  • Chen YR, Yeh CH, Yu B (2011) Integrated application of the analytic hierarchy process and the geographic information system for flood risk assessment and flood plain management in Taiwan. Nat Hazards 59(3):1261–1276

    Article  Google Scholar 

  • Choi J, Oh HJ, Won JS, Lee S (2010) Validation of an artificial neural network model for landslide susceptibility mapping. Environ Earth Sci 60(3):473–483

    Article  Google Scholar 

  • Cloke H, Pappenberger F (2009) Ensemble flood forecasting: a review. J Hydrol 375(3):613–626

    Article  Google Scholar 

  • Cortes C, Vapnik V (1995) Support-vector networks. Mach learn 20(3):273–297

    Google Scholar 

  • Damaševičius R (2010) Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2):339–353

    Article  Google Scholar 

  • Dawson CW, Abrahart R, Shamseldin A, Wilby R (2006) Flood estimation at ungauged sites using artificial neural networks. J Hydrol 319(1):391–409

    Article  Google Scholar 

  • Du J, Fang J, Xu W, Shi P (2013) Analysis of dry/wet conditions using the standardized precipitation index and its potential usefulness for drought/flood monitoring in Hunan Province China. Stoch Env Res Risk Assess 27(2):377–387

    Article  Google Scholar 

  • Glenn E, Morino K, Nagler P, Murray R, Pearlstein S, Hultine K (2012) Roles of saltcedar (< i > Tamarix </i > spp.) and capillary rise in salinizing a non-flooding terrace on a flow-regulated desert river. J Arid Environ 79:56–65

    Article  Google Scholar 

  • Gokceoglu C, Sezer E (2009) A statistical assessment on international landslide literature (1945–2008). Landslides 6(4):345–351

    Article  Google Scholar 

  • Gokceoglu C, Sonmez H, Nefeslioglu HA, Duman TY, Can T (2005) The 17 March 2005 Kuzulu landslide (Sivas, Turkey) and landslide-susceptibility map of its near vicinity. Eng Geol 81(1):65–83

    Article  Google Scholar 

  • Horritt M, Bates P (2002) Evaluation of 1D and 2D numerical models for predicting river flood inundation. J Hydrol 268(1):87–99

    Article  Google Scholar 

  • Imrie C, Durucan S, Korre A (2000) River flow prediction using artificial neural networks: generalisation beyond the calibration range. J Hydrol 233(1):138–153

    Article  Google Scholar 

  • Jayakrishnan R, Srinivasan R, Santhi C, Arnold J (2005) Advances in the application of the SWAT model for water resources management. Hydrol Process 19(3):749–762

    Article  Google Scholar 

  • Jebur MN, Pradhan B, Tehrany MS (2013a) Using ALOS PALSAR derived high-resolution DInSAR to detect slow-moving landslides in tropical forest: Cameron Highlands, Malaysia. Geomat Nat Haz Risk (ahead-of-print): 1–19. doi: 10.1080/19475705.2013.860407

  • Jebur MN, Shafri HZM, Pradhan B, Tehrany MS (2013b) Per-pixel and object-oriented classification methods for mapping urban land cover extraction using SPOT 5 imagery. Geocarto Int. doi:10.1080/10106049.2013.848944

    Google Scholar 

  • Jebur M, Pradhan B, Tehrany M (2014a) Manifestation of LiDAR-derived parameters in the spatial prediction of landslides using novel ensemble evidential belief functions and support vector machine models in GIS. IEEE J Sel Topics Appl Earth Observ. doi:10.1109/JSTARS.2014.2341276

    Google Scholar 

  • Jebur MN, Pradhan B, Tehrany MS (2014b) Detection of vertical slope movement in highly vegetated tropical area of Gunung pass landslide, Malaysia, using L-band InSAR technique. Geosci J 18(1):61–68

    Article  Google Scholar 

  • Jebur MN, Pradhan B, Tehrany MS (2014c) Optimization of landslide conditioning factors using very high-resolution airborne laser scanning (LiDAR) data at catchment scale. Remote Sens Environ 152:150–165

    Article  Google Scholar 

  • Jung IW, Chang H, Moradkhani H (2011) Quantifying uncertainty in urban flooding analysis considering hydro-climatic projection and urban development effects. Hydrol Earth Syst Sci 15(2):617–633

    Article  Google Scholar 

  • Kanevskij M, Pozdnoukhov A, Timonin V (2009) Machine learning for spatial environmental data: theory, applications and software. EPFL Press, Lausanne

    Book  Google Scholar 

  • Keerthi SS, Lin CJ (2003) Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Comput 15(7):1667–1689

    Article  Google Scholar 

  • Kia MB, Pirasteh S, Pradhan B, Mahmud AR, Sulaiman WNA, Moradi A (2012) An artificial neural network model for flood simulation using GIS: Johor River Basin Malaysia. Environ Earth Sci 67(1):251–264

    Article  Google Scholar 

  • Kisi O, Nia AM, Gosheh MG, Tajabadi MRJ, Ahmadi A (2012) Intermittent streamflow forecasting by using several data driven techniques. Water Resour Manag 26(2):457–474

    Article  Google Scholar 

  • Kusiak A, Li M, Zhang Z (2010) A data-driven approach for steam load prediction in buildings. Appl Energy 87(3):925–933

    Article  Google Scholar 

  • Lee S, Talib JA (2005) Probabilistic landslide susceptibility and factor effect analysis. Environ Geol 47(7):982–990

    Article  CAS  Google Scholar 

  • Lee S, Pradhan B (2007) Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models. Landslides 4(1):33–41

    Article  Google Scholar 

  • Lee MJ, Kang JE, Jeon S (2012) Application of frequency ratio model and validation for predictive flooded area susceptibility mapping using GIS. In: IEEE international geoscience and remote sensing symposium (IGARSS), Munich, 895–898, 2012

  • Li XH, Zhang Q, Shao M, Li YL (2012) A comparison of parameter estimation for distributed hydrological modelling using automatic and manual methods. Adv Mater Res 356–360:2372–2375

    Article  Google Scholar 

  • Li GF, Xiang XY, Tong YY, Wang HM (2013) Impact assessment of urbanization on flood risk in the Yangtze River Delta. Stoch Env Res Risk Assess 27(7):1683–1693

    Article  Google Scholar 

  • Liao X, Carin L (2009) Migratory logistic regression for learning concept drift between two data sets with application to UXO sensing. IEEE Trans Geosci Remote Sens 47:1454–1466

    Article  Google Scholar 

  • Liu Y, De Smedt F (2005) Flood modeling for complex terrain using GIS and remote sensed information. Water Resour Manag 19(5):605–624

    Article  Google Scholar 

  • Marjanović M, Kovačević M, Bajat B, Voženílek V (2011) Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol 123(3):225–234

    Article  Google Scholar 

  • Mattera D, Haykin S (1999) Support vector machines for dynamic reconstruction of a chaotic system. In: Schölkopf B, Burges C, Smola AJ (eds) Advances in kernel methods. MIT Press, Cambridge, pp 211–241

    Google Scholar 

  • Merz B, Kreibich H, Thieken A, Schmidtke R (2004) Estimation uncertainty of direct monetary flood damage to buildings. Nat Hazard Earth Sys 4(1):153–163

    Article  Google Scholar 

  • Messner F, Meyer V (2006) Flood damage, vulnerability and risk perception–challenges for flood damage research. Springer, Amsterdam, pp 149–167

    Google Scholar 

  • Mu T, Nandi AK (2007) Breast cancer detection from FNA using SVM with different parameter tuning systems and SOM–RBF classifier. J Frankl Inst 344(3):285–311

    Article  Google Scholar 

  • Mukerji A, Chatterjee C, Raghuwanshi NS (2009) Flood forecasting using ANN, neuro-fuzzy, and neuro-GA models. J Hydrol Eng 14(6):647–652

    Article  Google Scholar 

  • Nefeslioglu H, Gokceoglu C, Sonmez H (2008) An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng Geol 97(3):171–191

    Article  Google Scholar 

  • Oh HJ, Pradhan B (2011) Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput Geosci 37(9):1264–1276

    Article  Google Scholar 

  • Ohlmacher GC, Davis JC (2003) Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas USA. Eng Geol 69(3):331–343

    Article  Google Scholar 

  • Pierdicca N, Pulvirenti L, Chini M, Guerriero L, Ferrazzoli P (2010). A fuzzy-logic-based approach for flood detection from Cosmo-SkyMed data. In: IEEE international geoscience and remote sensing symposium (IGARSS), Honolulu, 4796–4798, 2010

  • Platt J (1999) Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: Somala AJ, Bartlett P, Schölkopf B, Schuurmans D (eds) Advances in large margin classifiers. MIT Press, cambridge, pp 61–74

    Google Scholar 

  • Pourghasemi HR, Pradhan B, Gokceoglu C (2012) Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed. Iran. Nat Hazards 63(2):965–996

    Article  Google Scholar 

  • Pradhan B (2009) Groundwater potential zonation for basaltic watersheds using satellite remote sensing data and GIS techniques. Cent Eur J Geosci 1(1):120–129

    Article  Google Scholar 

  • Pradhan B (2010) Flood susceptible mapping and risk area delineation using logistic regression, GIS and remote sensing. J Spat Hydrol 9(2):1–18

    Google Scholar 

  • Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365

    Article  Google Scholar 

  • Pradhan B, Buchroithner MF (2010) Comparison and validation of landslide susceptibility maps using an artificial neural network model for three test areas in Malaysia. Environ Eng Geosci 16(2):107–126

    Article  Google Scholar 

  • Pradhan B, Lee S (2010) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Model Softw 25(6):747–759

    Article  Google Scholar 

  • Pradhan B, Shafie M (2009) Flood hazard assessment for cloud prone rainy areas in a typical tropical environment. Disaster Adv 2(2):7–15

    Google Scholar 

  • Pradhan B, Youssef A (2011) A 100-year maximum flood susceptibility mapping using integrated hydrological and hydrodynamic models: Kelantan River Corridor Malaysia. J Flood Risk Manag 4(3):189–202

    Article  Google Scholar 

  • Pradhan B, Shafiee M, Pirasteh S (2009) Maximum flood prone area mapping using RADARSAT images and GIS: Kelantan River Basin. Int J Geoinf 5(2):49–61

    Google Scholar 

  • Pradhan B, Hagemann U, Shafapour Tehrany M, Prechtel N (2013) An easy to use ArcMap based texture analysis program for extraction of flooded areas from TerraSAR-X satellite image. Comput Geosci 63:34–43. doi:10.1016/j.cageo.2013.10.011

    Article  Google Scholar 

  • Pradhan B, Abokharima MH, Jebur MN, Tehrany MS (2014) Land subsidence susceptibility mapping at Kinta Valley (Malaysia) using the evidential belief function model in GIS. Nat Hazards 73(2):1–24

    Article  Google Scholar 

  • Roe BP, Yang HJ, Zhu J, Liu Y, Stancu I, McGregor G (2005) Boosted decision trees as an alternative to artificial neural networks for particle identification. Nucl Instrum Methods Phys Res Sect A 543(2):577–584

    Article  CAS  Google Scholar 

  • Sahoo G, Schladow S, Reuter J (2009) Forecasting stream water temperature using regression analysis, artificial neural network, and chaotic non-linear dynamic models. J Hydrol 378(3):325–342

    Article  Google Scholar 

  • Saito H, Nakayama D, Matsuyama H (2009) Comparison of landslide susceptibility based on a decision-tree model and actual landslide occurrence: the Akaishi Mountains Japan. Geomorphology 109(3):108–121

    Article  Google Scholar 

  • Samui P (2008) Slope stability analysis: a support vector machine approach. Environ Geol 56(2):255–267

    Article  Google Scholar 

  • Sezer EA, Pradhan B, Gokceoglu C (2011) Manifestation of an adaptive neuro-fuzzy model on landslide susceptibility mapping: Klang Valley Malaysia. Expert Syst Appl 38(7):8208–8219

    Article  Google Scholar 

  • Smith K, Ward R (1998) Floods: physical processes and human impacts. Wiley, Chichester

    Google Scholar 

  • Song S, Zhan Z, Long Z, Zhang J, Yao L (2011) Comparative study of SVM methods combined with voxel selection for object category classification on fMRI data. PLoS One 6(2):e17191

    Article  CAS  Google Scholar 

  • Stewart M, Bates P, Anderson M, Price D, Burt T (1999) Modelling floods in hydrologically complex lowland river reaches. J Hydrol 223(1):85–106

    Article  Google Scholar 

  • Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240(4857):1285–1293

    Article  CAS  Google Scholar 

  • Talei A, Chua LHC, Quek C (2010) A novel application of a neuro-fuzzy computational technique in event-based rainfall–runoff modeling. Expert Syst Appl 37(12):7456–7468

    Article  Google Scholar 

  • Tehrany MS, Pradhan B, Jebu MN (2013a) A comparative assessment between object and pixel-based classification approaches for land use/land cover mapping using SPOT 5 imagery. Geocarto Int 2:1–19. doi:10.1080/10106049.2013.768300

    Google Scholar 

  • Tehrany MS, Pradhan B, Jebur MN (2013b) Remote sensing data reveals eco-environmental changes in urban areas of Klang Valley, Malaysia: contribution from object based analysis. J Indian Soc Remote 41(4):981–991

    Article  Google Scholar 

  • Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012a) Landslide susceptibility assessment in vietnam using support vector machines, decision tree, and Naive Bayes Models. Math Probl Eng 171:12–29

    Google Scholar 

  • Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012b) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211

    Article  Google Scholar 

  • Tierney KJ, Lindell MK, Perry RW (2001) Facing the unexpected: disaster preparedness and response in the United States. Joseph Henry Press, Washington

    Google Scholar 

  • Tiwari MK, Chatterjee C (2010) Uncertainty assessment and ensemble flood forecasting using bootstrap based artificial neural networks (BANNs). J Hydrol 382(1):20–33

    Article  Google Scholar 

  • Umar Z, Pradhan B, Ahmad A, Jebur MN, Tehrany MS (2014) Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia. Catena 118:124–135

    Article  Google Scholar 

  • Van Westen C, Van Asch TW, Soeters R (2006) Landslide hazard and risk zonation—why is it still so difficult? Bull Eng Geol Environ 65(2):167–184

    Article  Google Scholar 

  • Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. San Francisco Morgan Kaufmann, San Francisco

    Google Scholar 

  • Wu SJ, Lien HC, Chang CH (2010) Modeling risk analysis for forecasting peak discharge during flooding prevention and warning operation. Stoch Env Res Risk Assess 24(8):1175–1191

    Article  Google Scholar 

  • Yalcin A, Reis S, Aydinoglu A, Yomralioglu T (2011) A GIS-based comparative study of frequency ratio, analytical hierarchy process, bivariate statistics and logistics regression methods for landslide susceptibility mapping in Trabzon NE Turkey. Catena 85(3):274–287

    Article  Google Scholar 

  • Yao X, Tham L, Dai F (2008) Landslide susceptibility mapping based on support vector machine: a case study on natural slopes of Hong Kong China. Geomorphology 101(4):572–582

    Article  Google Scholar 

  • Yilmaz I (2010) Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and support vector machine. Environ Earth Sci 61(4):821–836

    Article  CAS  Google Scholar 

  • Yu J, Qin X, Larsen O (2013) Joint Monte Carlo and possibilistic simulation for flood damage assessment. Stoch Env Res Risk Assess 27(3):725–735

    Article  Google Scholar 

  • Zhuang L, Dai H (2006) Parameter optimization of kernel-based one-class classifier on imbalance learning. J Comput 1(7):32–40

    Article  Google Scholar 

  • Zou Q, Zhou J, Zhou C, Song L, Guo J (2013) Comprehensive flood risk assessment based on set pair analysis-variable fuzzy sets model and fuzzy AHP. Stoch Env Res Risk Assess 27(2):525–546

    Article  Google Scholar 

Download references

Acknowledgments

Authors would like to thank Malaysian Center for Remote Sensing and Department of Surveying, Malaysia for providing various datasets for this research. Thanks are also due to the Malaysian Meteorological Service Department for providing rainfall data for the research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Biswajeet Pradhan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tehrany, M.S., Pradhan, B. & Jebur, M.N. Flood susceptibility analysis and its verification using a novel ensemble support vector machine and frequency ratio method. Stoch Environ Res Risk Assess 29, 1149–1165 (2015). https://doi.org/10.1007/s00477-015-1021-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-015-1021-9

Keywords

Navigation