Abstract
Flood is one of the most commonly occurred natural hazards worldwide. Severe flood occurrences in Kelantan, Malaysia cause damage to both life and property every year. Due to the huge losses in this area, development of appropriate flood modeling is required for the government. Remote sensing and geographic information system techniques can support overall flood management as they can produce rapid data collection and analysis for hydrological studies. The existing models for flood mapping have some weak points that may improve through more sophisticated and ensemble methods. The current research aimed to propose a novel ensemble method by integrating support vector machine (SVM) and frequency ratio (FR) to produce spatial modeling in flood susceptibility assessment. In the literature, mostly statistical and machine learning methods are used individually; however, their integration can enhance the final output. The FR model can perform bivariate statistical analysis and evaluate the correlation between the flooding and classes of each conditioning factors. The weights achieved by FR can be assigned to each conditioning factor and the resulted factors can be used in SVM analysis. In order to examine the efficiency of the proposed ensemble method and to show the proficiency of SVM, another machine learning algorithm such as decision tree (DT) was applied and the results were compared. To perform the methods, the upper catchment of the Kelantan basin in Malaysia was chosen. First, a flood inventory map with a total of 155 flood locations were extracted from various sources over the study area. The flood inventory map was randomly divided into two dataset; 70 % (115 flood locations) for the purpose of training and the remaining 30 % (40 flood locations) was used for validation. The spatial database included digital elevation model, curvature, geology, river, stream power index, rainfall, land use/cover, soil type, topographic wetness index and slope. For model validation, area under curve method was used and both success and prediction rate curves were calculated. The validation results for ensemble method showed 88.71 and 85.21 % for success rate and prediction rate respectively. The DT model showed 87.00 and 82.00 % for the success rate and prediction rate respectively. It is evident that the accuracies were increased using the ensemble method. The acquired results proved the efficiency of the proposed ensemble method as rapid, accurate and reasonable in flood susceptibility assessment.
Similar content being viewed by others
References
Abdullah AF, Vojinovic Z, Rahman AA (2013) A methodology for processing raw LiDAR data to support urban flood modelling framework: Case Study—Kuala Lumpur Malaysia. In: Rahman AA, Boguslawski P, Gold C, Said M (eds) Developments in multidimensional spatial data models. Springer, Berlin, pp 49–68
Adeli H, Hung SL (1994) Machine learning: neural networks, genetic algorithms, and fuzzy systems. Wiley, New York
Akgun A, Dag S, Bulut F (2008) Landslide susceptibility mapping for a landslide-prone area (Findikli, NE of Turkey) by likelihood-frequency ratio and weighted linear combination models. Environ Geol 54(6):1127–1143
Ayalew L, Yamagishi H (2005) The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains. Cent Jpn Geomorphol 65(1):15–31
Bahremand A, De Smedt F, Corluy J, Liu Y, Poorova J, Velcicka L, Kunikova E (2007) WetSpa model application for assessing reforestation impacts on floods in Margecany–Hornad Watershed. Slovak Water Resour Manag 21(8):1373–1391
Belousov A, Verzakov S, Von Frese J (2002) Applicational aspects of support vector machines. J chemom 16(8–10):482–489
Bhaduri K, Wolff R, Giannella C, Kargupta H (2008) Decision-tree induction distributed decision-tree induction in peer-to-peer systems. Stat Anal Data Min 1(2):85–103
Bou Kheir R, Greve MH, Abdallah C, Dalgaard T (2010) Spatial soil zinc content distribution from terrain parameters: a GIS-based decision-tree model in Lebanon. Environ Pollut 158(2):520–528
Caniani D, Pascale S, Sdao F, Sole A (2008) Neural networks and landslide susceptibility: a case study of the urban area of Potenza. Nat Hazards 45(1):55–72
Carrara A, Pike RJ (2008) GIS technology and models for assessing landslide hazard and risk. Geomorphology 94(3):257–260
Chapelle O, Vapnik V, Bousquet O, Mukherjee S (2002) Choosing multiple parameters for support vector machines. Mach Learn 46(1–3):131–159
Chau K, Wu C, Li Y (2005) Comparison of several flood forecasting models in Yangtze River. J Hydraul Eng 10(6):485–491
Chen YR, Yeh CH, Yu B (2011) Integrated application of the analytic hierarchy process and the geographic information system for flood risk assessment and flood plain management in Taiwan. Nat Hazards 59(3):1261–1276
Choi J, Oh HJ, Won JS, Lee S (2010) Validation of an artificial neural network model for landslide susceptibility mapping. Environ Earth Sci 60(3):473–483
Cloke H, Pappenberger F (2009) Ensemble flood forecasting: a review. J Hydrol 375(3):613–626
Cortes C, Vapnik V (1995) Support-vector networks. Mach learn 20(3):273–297
Damaševičius R (2010) Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2):339–353
Dawson CW, Abrahart R, Shamseldin A, Wilby R (2006) Flood estimation at ungauged sites using artificial neural networks. J Hydrol 319(1):391–409
Du J, Fang J, Xu W, Shi P (2013) Analysis of dry/wet conditions using the standardized precipitation index and its potential usefulness for drought/flood monitoring in Hunan Province China. Stoch Env Res Risk Assess 27(2):377–387
Glenn E, Morino K, Nagler P, Murray R, Pearlstein S, Hultine K (2012) Roles of saltcedar (< i > Tamarix </i > spp.) and capillary rise in salinizing a non-flooding terrace on a flow-regulated desert river. J Arid Environ 79:56–65
Gokceoglu C, Sezer E (2009) A statistical assessment on international landslide literature (1945–2008). Landslides 6(4):345–351
Gokceoglu C, Sonmez H, Nefeslioglu HA, Duman TY, Can T (2005) The 17 March 2005 Kuzulu landslide (Sivas, Turkey) and landslide-susceptibility map of its near vicinity. Eng Geol 81(1):65–83
Horritt M, Bates P (2002) Evaluation of 1D and 2D numerical models for predicting river flood inundation. J Hydrol 268(1):87–99
Imrie C, Durucan S, Korre A (2000) River flow prediction using artificial neural networks: generalisation beyond the calibration range. J Hydrol 233(1):138–153
Jayakrishnan R, Srinivasan R, Santhi C, Arnold J (2005) Advances in the application of the SWAT model for water resources management. Hydrol Process 19(3):749–762
Jebur MN, Pradhan B, Tehrany MS (2013a) Using ALOS PALSAR derived high-resolution DInSAR to detect slow-moving landslides in tropical forest: Cameron Highlands, Malaysia. Geomat Nat Haz Risk (ahead-of-print): 1–19. doi: 10.1080/19475705.2013.860407
Jebur MN, Shafri HZM, Pradhan B, Tehrany MS (2013b) Per-pixel and object-oriented classification methods for mapping urban land cover extraction using SPOT 5 imagery. Geocarto Int. doi:10.1080/10106049.2013.848944
Jebur M, Pradhan B, Tehrany M (2014a) Manifestation of LiDAR-derived parameters in the spatial prediction of landslides using novel ensemble evidential belief functions and support vector machine models in GIS. IEEE J Sel Topics Appl Earth Observ. doi:10.1109/JSTARS.2014.2341276
Jebur MN, Pradhan B, Tehrany MS (2014b) Detection of vertical slope movement in highly vegetated tropical area of Gunung pass landslide, Malaysia, using L-band InSAR technique. Geosci J 18(1):61–68
Jebur MN, Pradhan B, Tehrany MS (2014c) Optimization of landslide conditioning factors using very high-resolution airborne laser scanning (LiDAR) data at catchment scale. Remote Sens Environ 152:150–165
Jung IW, Chang H, Moradkhani H (2011) Quantifying uncertainty in urban flooding analysis considering hydro-climatic projection and urban development effects. Hydrol Earth Syst Sci 15(2):617–633
Kanevskij M, Pozdnoukhov A, Timonin V (2009) Machine learning for spatial environmental data: theory, applications and software. EPFL Press, Lausanne
Keerthi SS, Lin CJ (2003) Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Comput 15(7):1667–1689
Kia MB, Pirasteh S, Pradhan B, Mahmud AR, Sulaiman WNA, Moradi A (2012) An artificial neural network model for flood simulation using GIS: Johor River Basin Malaysia. Environ Earth Sci 67(1):251–264
Kisi O, Nia AM, Gosheh MG, Tajabadi MRJ, Ahmadi A (2012) Intermittent streamflow forecasting by using several data driven techniques. Water Resour Manag 26(2):457–474
Kusiak A, Li M, Zhang Z (2010) A data-driven approach for steam load prediction in buildings. Appl Energy 87(3):925–933
Lee S, Talib JA (2005) Probabilistic landslide susceptibility and factor effect analysis. Environ Geol 47(7):982–990
Lee S, Pradhan B (2007) Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models. Landslides 4(1):33–41
Lee MJ, Kang JE, Jeon S (2012) Application of frequency ratio model and validation for predictive flooded area susceptibility mapping using GIS. In: IEEE international geoscience and remote sensing symposium (IGARSS), Munich, 895–898, 2012
Li XH, Zhang Q, Shao M, Li YL (2012) A comparison of parameter estimation for distributed hydrological modelling using automatic and manual methods. Adv Mater Res 356–360:2372–2375
Li GF, Xiang XY, Tong YY, Wang HM (2013) Impact assessment of urbanization on flood risk in the Yangtze River Delta. Stoch Env Res Risk Assess 27(7):1683–1693
Liao X, Carin L (2009) Migratory logistic regression for learning concept drift between two data sets with application to UXO sensing. IEEE Trans Geosci Remote Sens 47:1454–1466
Liu Y, De Smedt F (2005) Flood modeling for complex terrain using GIS and remote sensed information. Water Resour Manag 19(5):605–624
Marjanović M, Kovačević M, Bajat B, Voženílek V (2011) Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol 123(3):225–234
Mattera D, Haykin S (1999) Support vector machines for dynamic reconstruction of a chaotic system. In: Schölkopf B, Burges C, Smola AJ (eds) Advances in kernel methods. MIT Press, Cambridge, pp 211–241
Merz B, Kreibich H, Thieken A, Schmidtke R (2004) Estimation uncertainty of direct monetary flood damage to buildings. Nat Hazard Earth Sys 4(1):153–163
Messner F, Meyer V (2006) Flood damage, vulnerability and risk perception–challenges for flood damage research. Springer, Amsterdam, pp 149–167
Mu T, Nandi AK (2007) Breast cancer detection from FNA using SVM with different parameter tuning systems and SOM–RBF classifier. J Frankl Inst 344(3):285–311
Mukerji A, Chatterjee C, Raghuwanshi NS (2009) Flood forecasting using ANN, neuro-fuzzy, and neuro-GA models. J Hydrol Eng 14(6):647–652
Nefeslioglu H, Gokceoglu C, Sonmez H (2008) An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng Geol 97(3):171–191
Oh HJ, Pradhan B (2011) Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput Geosci 37(9):1264–1276
Ohlmacher GC, Davis JC (2003) Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas USA. Eng Geol 69(3):331–343
Pierdicca N, Pulvirenti L, Chini M, Guerriero L, Ferrazzoli P (2010). A fuzzy-logic-based approach for flood detection from Cosmo-SkyMed data. In: IEEE international geoscience and remote sensing symposium (IGARSS), Honolulu, 4796–4798, 2010
Platt J (1999) Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: Somala AJ, Bartlett P, Schölkopf B, Schuurmans D (eds) Advances in large margin classifiers. MIT Press, cambridge, pp 61–74
Pourghasemi HR, Pradhan B, Gokceoglu C (2012) Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed. Iran. Nat Hazards 63(2):965–996
Pradhan B (2009) Groundwater potential zonation for basaltic watersheds using satellite remote sensing data and GIS techniques. Cent Eur J Geosci 1(1):120–129
Pradhan B (2010) Flood susceptible mapping and risk area delineation using logistic regression, GIS and remote sensing. J Spat Hydrol 9(2):1–18
Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365
Pradhan B, Buchroithner MF (2010) Comparison and validation of landslide susceptibility maps using an artificial neural network model for three test areas in Malaysia. Environ Eng Geosci 16(2):107–126
Pradhan B, Lee S (2010) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Model Softw 25(6):747–759
Pradhan B, Shafie M (2009) Flood hazard assessment for cloud prone rainy areas in a typical tropical environment. Disaster Adv 2(2):7–15
Pradhan B, Youssef A (2011) A 100-year maximum flood susceptibility mapping using integrated hydrological and hydrodynamic models: Kelantan River Corridor Malaysia. J Flood Risk Manag 4(3):189–202
Pradhan B, Shafiee M, Pirasteh S (2009) Maximum flood prone area mapping using RADARSAT images and GIS: Kelantan River Basin. Int J Geoinf 5(2):49–61
Pradhan B, Hagemann U, Shafapour Tehrany M, Prechtel N (2013) An easy to use ArcMap based texture analysis program for extraction of flooded areas from TerraSAR-X satellite image. Comput Geosci 63:34–43. doi:10.1016/j.cageo.2013.10.011
Pradhan B, Abokharima MH, Jebur MN, Tehrany MS (2014) Land subsidence susceptibility mapping at Kinta Valley (Malaysia) using the evidential belief function model in GIS. Nat Hazards 73(2):1–24
Roe BP, Yang HJ, Zhu J, Liu Y, Stancu I, McGregor G (2005) Boosted decision trees as an alternative to artificial neural networks for particle identification. Nucl Instrum Methods Phys Res Sect A 543(2):577–584
Sahoo G, Schladow S, Reuter J (2009) Forecasting stream water temperature using regression analysis, artificial neural network, and chaotic non-linear dynamic models. J Hydrol 378(3):325–342
Saito H, Nakayama D, Matsuyama H (2009) Comparison of landslide susceptibility based on a decision-tree model and actual landslide occurrence: the Akaishi Mountains Japan. Geomorphology 109(3):108–121
Samui P (2008) Slope stability analysis: a support vector machine approach. Environ Geol 56(2):255–267
Sezer EA, Pradhan B, Gokceoglu C (2011) Manifestation of an adaptive neuro-fuzzy model on landslide susceptibility mapping: Klang Valley Malaysia. Expert Syst Appl 38(7):8208–8219
Smith K, Ward R (1998) Floods: physical processes and human impacts. Wiley, Chichester
Song S, Zhan Z, Long Z, Zhang J, Yao L (2011) Comparative study of SVM methods combined with voxel selection for object category classification on fMRI data. PLoS One 6(2):e17191
Stewart M, Bates P, Anderson M, Price D, Burt T (1999) Modelling floods in hydrologically complex lowland river reaches. J Hydrol 223(1):85–106
Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240(4857):1285–1293
Talei A, Chua LHC, Quek C (2010) A novel application of a neuro-fuzzy computational technique in event-based rainfall–runoff modeling. Expert Syst Appl 37(12):7456–7468
Tehrany MS, Pradhan B, Jebu MN (2013a) A comparative assessment between object and pixel-based classification approaches for land use/land cover mapping using SPOT 5 imagery. Geocarto Int 2:1–19. doi:10.1080/10106049.2013.768300
Tehrany MS, Pradhan B, Jebur MN (2013b) Remote sensing data reveals eco-environmental changes in urban areas of Klang Valley, Malaysia: contribution from object based analysis. J Indian Soc Remote 41(4):981–991
Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012a) Landslide susceptibility assessment in vietnam using support vector machines, decision tree, and Naive Bayes Models. Math Probl Eng 171:12–29
Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012b) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211
Tierney KJ, Lindell MK, Perry RW (2001) Facing the unexpected: disaster preparedness and response in the United States. Joseph Henry Press, Washington
Tiwari MK, Chatterjee C (2010) Uncertainty assessment and ensemble flood forecasting using bootstrap based artificial neural networks (BANNs). J Hydrol 382(1):20–33
Umar Z, Pradhan B, Ahmad A, Jebur MN, Tehrany MS (2014) Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia. Catena 118:124–135
Van Westen C, Van Asch TW, Soeters R (2006) Landslide hazard and risk zonation—why is it still so difficult? Bull Eng Geol Environ 65(2):167–184
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. San Francisco Morgan Kaufmann, San Francisco
Wu SJ, Lien HC, Chang CH (2010) Modeling risk analysis for forecasting peak discharge during flooding prevention and warning operation. Stoch Env Res Risk Assess 24(8):1175–1191
Yalcin A, Reis S, Aydinoglu A, Yomralioglu T (2011) A GIS-based comparative study of frequency ratio, analytical hierarchy process, bivariate statistics and logistics regression methods for landslide susceptibility mapping in Trabzon NE Turkey. Catena 85(3):274–287
Yao X, Tham L, Dai F (2008) Landslide susceptibility mapping based on support vector machine: a case study on natural slopes of Hong Kong China. Geomorphology 101(4):572–582
Yilmaz I (2010) Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and support vector machine. Environ Earth Sci 61(4):821–836
Yu J, Qin X, Larsen O (2013) Joint Monte Carlo and possibilistic simulation for flood damage assessment. Stoch Env Res Risk Assess 27(3):725–735
Zhuang L, Dai H (2006) Parameter optimization of kernel-based one-class classifier on imbalance learning. J Comput 1(7):32–40
Zou Q, Zhou J, Zhou C, Song L, Guo J (2013) Comprehensive flood risk assessment based on set pair analysis-variable fuzzy sets model and fuzzy AHP. Stoch Env Res Risk Assess 27(2):525–546
Acknowledgments
Authors would like to thank Malaysian Center for Remote Sensing and Department of Surveying, Malaysia for providing various datasets for this research. Thanks are also due to the Malaysian Meteorological Service Department for providing rainfall data for the research.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tehrany, M.S., Pradhan, B. & Jebur, M.N. Flood susceptibility analysis and its verification using a novel ensemble support vector machine and frequency ratio method. Stoch Environ Res Risk Assess 29, 1149–1165 (2015). https://doi.org/10.1007/s00477-015-1021-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-015-1021-9