Performance benchmarking on several regression models applied in urban flash flood risk assessment

  • Original Paper
Natural Hazards

Abstract

To evaluate the performance of regression models applied in urban flash flood risk assessment, historical urban flash flood occurrence points were used to build Voronoi polygon networks for calculating Ripley’s K values, which serve as the risk values and as the predictands in regression. The first-level risk indicators in the risk assessment (hazard, vulnerability, sensitivity and exposure), together with the sensitivity sub-indicators of imperviousness and terrain factor, were used as the predictors in the regression model. Five methods were then compared for how well they fit the relationship between the predictors and the predictands: a linear regression equation (LRE), a nonlinear power-form function (PF), a simplified power-form function (SPF), a support vector machine (SVM) model and a random forests (RF) model. Benchmarking on the samples first showed that the SPF is the best of the regression equations, whereas the full PF equation could not be solved because of insufficient sample data. The SVM model performs better than the SPF and LRE regression equations, and the SVM with a nonlinear polynomial kernel function is slightly better than the SVM with a nonlinear Gaussian kernel function. The RF model gave the best regression fit of all, with a relative bias index of −0.009 and a relative mean squared error of 0.0773, and it largely resolves the problems of overfitting, outliers and noise in regression. The variable importance (VI) evaluated by the RF model indicated that the four most important risk factors are imperviousness, terrain factor, vulnerability and exposure, with VI values of 0.38, 0.16, 0.11 and 0.1, respectively. Unexpectedly, the hazard factor is the least important, with a VI value of 0.04. The hazard is homogeneous and nearly invariant under the regional climate background, which gives it only a minor role in the risk contribution. The model performance evaluation suggests that the artificial intelligence RF model should be recommended as the common-use model for aftermath meteorology-related risk assessment. In addition, the VI analysis tools of RF were recognized as a welcome toolbox for risk analysis.
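The abstract reports a relative bias index and a relative mean squared error but does not define them, and it does not give the PF or SPF equations. The sketch below is therefore only an illustration under stated assumptions: relative bias is taken as the summed error divided by the summed observations, relative MSE as the mean squared error divided by the variance of the observations, and the power-form fit is a hypothetical one-predictor form y = a·x^b solved by least squares in log space. The function names and the sample numbers are made up for the example and are not taken from the paper.

```python
import math


def relative_bias(observed, predicted):
    """Assumed definition: sum of (predicted - observed) over sum of observed."""
    return sum(p - o for o, p in zip(observed, predicted)) / sum(observed)


def relative_mse(observed, predicted):
    """Assumed definition: mean squared error over the variance of observed."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mse = sum((p - o) ** 2 for o, p in zip(observed, predicted)) / n
    var_obs = sum((o - mean_obs) ** 2 for o in observed) / n
    return mse / var_obs


def fit_power_form(x, y):
    """Fit y = a * x**b by ordinary least squares on log(y) = log(a) + b*log(x).

    Hypothetical one-predictor form; requires strictly positive x and y.
    """
    lx = [math.log(v) for v in x]
    ly = [math.log(v) for v in y]
    n = len(lx)
    mean_lx = sum(lx) / n
    mean_ly = sum(ly) / n
    b = (sum((u - mean_lx) * (v - mean_ly) for u, v in zip(lx, ly))
         / sum((u - mean_lx) ** 2 for u in lx))
    a = math.exp(mean_ly - b * mean_lx)
    return a, b


# Synthetic, noise-free data generated from y = 2 * x**1.5 (illustrative only).
x = [0.1, 0.2, 0.4, 0.6, 0.8]
y = [2.0 * v ** 1.5 for v in x]
a, b = fit_power_form(x, y)

# Made-up observed and predicted risk values for the two metrics.
obs = [1.0, 2.0, 3.0, 4.0]
pred = [1.1, 1.9, 3.2, 3.9]
print(a, b, relative_bias(obs, pred), relative_mse(obs, pred))
```

Under these definitions a relative bias near zero means over- and under-predictions roughly cancel, and a relative MSE well below 1 means the model explains most of the variance in the observed values. The paper's own definitions may differ.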

Acknowledgements

This study has been supported by the Open Grants of the State Key Laboratory of Severe Weather (2021LASW-A18) and Natural Science Foundation of Beijing Municipality (8222018).

Funding

This work was supported and funded by the Open Grants of the State Key Laboratory of Severe Weather (2021LASW-A18) and the Natural Science Foundation of Beijing Municipality (8222018).

Author information

Corresponding author

Correspondence to Haibo Hu.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Hu, H., Yu, M., Zhang, X. et al. Performance benchmarking on several regression models applied in urban flash flood risk assessment. Nat Hazards 120, 3487–3504 (2024). https://doi.org/10.1007/s11069-023-06341-y

  • DOI: https://doi.org/10.1007/s11069-023-06341-y
