Zoning of Flash Flood Generation Probability Using Machine Learning Techniques (A Case Study: Karaj Dam Watershed, Iran)

Document Type : Original Article

Authors

1 Department of Watershed Management, University of Agricultural Sciences and Natural Resources, Gorgan

2 Department of Watershed Management, University of Agricultural Sciences and Natural Resources, Gorgan, Iran

3 Department of Forest, University of Agricultural Sciences and Natural Resources, Gorgan, Iran

4 Department of Watershed Management, University of Agricultural Sciences and Natural Resources, Gorgan, Iran.

5 Department of Water Engineering, Faculty of Agriculture, University of Maragheh. Iran.

‎10.22052/deej.2025.256422.1098

Abstract

Introduction: Flooding stands as a major hydro-climatic extreme event and one of the most severe natural disasters, posing significant threats to socio-environmental systems. Given its multifaceted and dynamic nature, floods are influenced by numerous factors, with both natural and anthropogenic agents collectively impacting their magnitude, intensity, extent, and duration. A fundamental step in managing flood-prone areas is to identify the spatial extent of potential floods. By spatially analyzing and investigating floodplains, a risk-based management framework can be established, leading to effective preventive measures. Therefore, flood prevention and management necessitate the initial delineation of high flood-potential areas using modeling methods, followed by the identification of the underlying drivers of these events. In recent years, Iran has experienced an intensifying frequency and impact of flash floods, resulting in considerable damage. The Karaj Dam Watershed in Alborz Province, in particular, has been severely affected, creating substantial challenges for travelers, tourists, and local residents. This ongoing issue underscores the urgent need for enhanced flood management strategies. Consequently, this research focuses on the spatial analysis and susceptibility assessment of flash flood-prone areas. Our aim is to provide essential and practical insights to inform planning, decision-making, and the implementation of various flood mitigation measures by relevant authorities and stakeholders.
 
Materials and Methods: To investigate the probability of flash flood generation across the Karaj Dam Watershed in Alborz Province, we first compiled a comprehensive map of past flash flood events. This involved field visits, consultation with local resources, and integrating data from various sources within the ArcGIS 10.8 software environment. Based on an extensive review of relevant literature, we identified fourteen variables hypothesized to influence flash flood occurrence. To ensure an optimal combination of predictors, we applied the Variant Inflation Factor (VIF) test for feature selection, screening out redundant variables. For model development and validation, the flash flood event data was divided using a 10-fold cross-validation method, with 70% allocated for training and 30% for validation. We then processed and analyzed the spatial data, including environmental variables and hydrologic information, using three distinct machine learning algorithms: Maximum Entropy, Random Forest, and Support Vector Machine. The accuracy of each susceptibility model was rigorously evaluated using comprehensive criteria, including the Relative Operating Characteristic (ROC) curve and the Kappa index. The model demonstrating the most acceptable accuracy was subsequently selected to generate a spatial probability map of flash flood generation for the study area. The insights derived from this research offer valuable contributions to flood hazard assessment and management efforts.
Results: Our analysis revealed that the Support Vector Machine (SVM) model proved to be the most suitable algorithm for determining flash flood susceptibility within the study area. This model achieved an impressive Area Under the Curve (AUC) value of 0.88 and a Kappa coefficient of 0.68, demonstrating strong predictive capability and outperforming both the Maximum Entropy and Random Forest models. The resulting flood susceptibility map indicates that 53% of the study area, approximately 46,200 hectares, exhibits moderate, high, or very high flash flood potential. Conversely, the remaining 47% of the watershed, about 38,860 hectares, falls into the low and very low flood probability classes. Furthermore, our research identified the key contributing factors to flash flooding in the region. In order of significance, elevation, drainage density, slope, and average annual rainfall emerged as the most important drivers influencing flood occurrence.
 
Discussion and Conclusions: Historically, the Karaj Dam Watershed has experienced a unique interplay of climatic and geomorphological factors that amplify flash flood risk. Extreme cold conditions at higher altitudes lead to substantial winter snow accumulation. When this is followed by heavy rainfall in spring and summer, it causes a rapid increase in water flow, resulting in significant flash flooding. This aligns perfectly with the watershed's natural characteristics: its steep slopes limit the soil's capacity to absorb rainwater, and when combined with high drainage density and intense, short-duration rainfall, these factors significantly elevate the risk of flash flooding. The findings from this research are crucial for both improving our understanding of flood phenomena and for practically identifying flood-prone areas. By pinpointing the key contributing factors to flash flood occurrences, we can prioritize management measures tailored to the region's specific conditions, ultimately aiming to reduce flood-related damages. The generated flood susceptibility map serves as a vital tool, providing a roadmap for implementing effective preventive measures. Ultimately, this research offers an effective framework for developing robust flood hazard management plans. It can significantly contribute to reducing human losses and economic damages by guiding informed decision-making and planning processes. We strongly recommend that responsible organizations and agencies utilize these insights, including the detailed zoning of flood susceptibility and the identified effective factors, to optimize financial resource allocation and ensure efficient use of public resources in flood mitigation efforts.

Keywords

Main Subjects


  1. Akbari, M., Ownegh, M., Asgari, H. R., Sadoddin A., Khosravi H. (2016). Soil erosion risk assessment using the CORINE model (Case Study: Semi-arid region in Golestan Province). Desert Ecosystem Engineering, 5 (12), 63-78.
  2. Asadi Nalivan, O., Teimori, A., Rahmati, A., & Kornejadi, A. (2108). River management and flood control using unstructured methods. Noroozi Publications. 204p. Gorgan Iran.
  3. Bai, M., Tahmasbipour, N., Zinivand, H., & Sadoddin, A., Kh, M. (2022). Predicting the effect of intervention scenarios in the vulnerability of local communities in a flood Zone (The Qarachai River, Ramyan-Golestan province, Iran). Journal of Water and Soil Conservation, 30 (2), 51-72.
  4. Bronstert, A. (2003). Floods and climate change: interactions and impacts. Risk Analysis: An International Journal, 23 (3), 545-557.
  5. Choubin, B., Moradi, E., Golshan, M., Adamowski, J., Sajedi-Hosseini, F., & Mosavi, A. (2019). An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Science of the Total Environment, 651, 2087-2096.
  6. Cristianini, N., & Scholkopf, B. (2002). Support vector machines and kernel methods: the new generation of learning machines. American Association for Artificial Intelligence. A .I. Magazine23 (3), 31-31.
  7. Guo, Q., & Liu, Y. (2010). ModEco: an integrated software package for ecological niche modeling. Ecography33 (4), 637-642.
  8. Haghizadeh, A., Siahkamari, S., Haghiabi, A. H., & Rahmati, O. (2017). Forecasting flood-prone areas using Shannon’s entropy model. Journal of Earth System Science, 126, 1-11.
  9. Jakkula, V. (2006). Tutorial on support vector machine (SWM). School of EECS, Washington State University, 37 (2.5), 3.
  10. Kjeldsen, T. R. (2010). Modelling the impact of urbanization on flood frequency relationships in the UK. Hydrology Research, 41 (5), 391-405.
  11. Kornejady, A., Ownegh, M., & Bahremand, A. (2017). Landslide susceptibility assessment using maximum entropy model with two different data sampling methods. Catena152, 144-162.
  12. Kourgialas, N.N., & Karatzas, G. P. (2011). Flood management and a GIS modelling method to assess flood-hazard areas- a case study. Hydrological Sciences Journal–Journal des Sciences Hydrologiques, 56 (2), 212-225.
  13. Laluzaei, A., Sadoddin, A., Zare Garizi, A., & Sheikh, V.B. (2022). Process-based flood type separation for partial duration series (Sample study: the Gorganrood River Sub-Basins). Journal of Iran Water Resources Research, 3 (62), 18-30.
  14. Lee, S., Kim, J., Jung, H., Lee, M., & Lee, S. (2017). Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea. Geomatics, Natural Hazards and Risk, 8 (2), 1185-1203.
  15. Miller, A. J. (1990). Flood hydrology and geomorphic effectiveness in the central Appalachians. Earth Surface Processes and Landforms, 15 (2), 119-134.
  16. Nachappa, T.G., Tavakkoli, S., Gholamnia, Kh. Ghorbanzadeh, O., Rahmati, O., & Blaschke, T. (2020). Flood susceptibility mapping with machine learning, multi-criteria decision analysis and ensemble using Dempster Shafer Theory. Journal of Hydrology, 590, 1252.
  17. Rahmati, O., Zeinivand, H., & Besharat, M. (2016). Flood hazard zoning in Yasooj region, Iran, using GIS and multi-criteria decision analysis. Geomatics, Natural Hazards and Risk, 7 (3), 1000-1017.
  18. Sadler J.M., Goodall, J.L., Morsy, M.M., & Spencer, K. (2018). Modeling Urban Coastal Flood Severity from Crowd-Sourced Flood Reports Using Poisson Regression and Random Forest. Journal of Hydrology.
  19. Singh, A. K. (2010). Landslide management: concept and philosophy. Disaster Prevention and Management: An International Journal, 19 (1), 119-134.
  20. Shao, G., Pang, B., Xu, Z., Peng, D., & Xu, L. (2019). Assessment of urban flood susceptibility using semi-supervised machine learning model. Science of the Total Environment, 629, 940-949.
  21. Special Reporting Committee on Iran Floods in 2019. (2020). Final report: narration, analysis, lesson learnt, and recommendations. University of Tehran.
  22. Tehrany, M. S., Pradhan, B., & Jebur, M. N. (2014a). Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. Journal of Hydrology, 512, 332–343.
  23. Tehrany, M. S., Lee, M. J., Pradhan, B., Jebur, M. N., & Lee, S. (2014b). Flood susceptibility mapping using integrated bivariate and multivariate statistical models. Environmental Earth Sciences72 (10), 4001-4015.
  24. Tehrany, M. S., Pradhan, B., Mansor, S., & Ahmad, N. (2015). Flood susceptibility assessment using GIS-based support vector machine model with different kernel types. Catena125, 91-101.
  25. Wang, Z., Lai, C., Chen, X., Yang, B., Zhao, S., & Bai, X. (2015). Flood hazard risk assessment model based on random forest. Journal of Hydrology, 527, 1130–1141.