基于可解释机器学习的山洪致灾因子研究

    Causing Factors Analysis of Flash floods Based on Explainable Artificial Intelligence

    • 摘要: 为支撑山洪防灾减灾的科学化与精准化,本研究融合遥感数据与可解释机器学习技术(XAI)开展了山洪发生概率预测和致灾因子分析研究。首先从遥感反演数据中提取了地理位置、地形、气象、人类活动和土地覆盖5个方面的18个致灾因子。然后,结合山洪灾害调查数据采用XGBoost机器学习模型构建了山洪发生概率预测模型。最后,在浙江省全省和分区域不同尺度引入SHAP解释方法量化了致灾因子的单因子重要性、影响效应及多因子交互作用,揭示了致灾因子影响和交互的区域异质性。独立验证结果显示,模型的预测准确率达0.90、精确率达0.87,表明模型具备有效判断山洪发生概率的能力。在全省尺度下,山洪发生概率的预测结果与历史山洪点位空间分布模式高度一致,进一步证实模型具备较高的外推鲁棒性。致灾因子重要性排名揭示了不透水面比例是影响山洪发生概率最重要的致灾因子,而经纬度坐标的突出贡献则凸显了在山洪预测中考虑地理空间关系的重要性。在区域尺度,多因子交互作用分析成功识别出了不同区域内致灾因子差异化的影响模式。本研究证实了XAI在山洪发生概率预测和致灾因子分析中有效性,为深入理解山洪致灾机理、精准量化山洪触发条件等相关研究提供了科学参考。

       

      Abstract: To support the scientific and precise prevention and mitigation of flash floods, this study integrated remote sensing data and explainable artificial intelligence (XAI) techniques to conduct systematic research on flash flood occurrence probability prediction and disaster-causing factor analysis. Firstly, 18 influencing factors covering five dimensions—geographical location, topography, meteorology, human activities, and land cover—were extracted from remote sensing inversion datasets. Secondly, combined with historical flash flood survey data, the XGBoost machine learning algorithm was adopted to construct a flash flood occurrence probability prediction model. Finally, the SHAP (SHapley Additive exPlanations) interpretation method was introduced at both provincial and sub-regional scales to quantify the single-factor importance, impact effects, and multi-factor interaction effects of disaster-causing factors, and to reveal the regional heterogeneity in the influence and interaction of disaster-causing factors. Independent verification results showed that the model achieved a prediction accuracy of 0.90 and a precision of 0.87, demonstrating that it has the ability to effectively assess flash flood occurrence probability. At the provincial scale, the spatial distribution of predicted flash flood occurrence probability was highly consistent with the historical flash flood location pattern, which further confirmed that the model has strong extrapolation robustness. The disaster-causing factor importance ranking indicated that the impervious surface ratio is the most critical factor affecting flash flood occurrence probability, while the high importance of latitude and longitude coordinates highlights the necessity of considering geospatial relationships in flash flood prediction. At the sub-regional scale, multi-factor interaction analysis successfully identified the differentiated influence patterns of disaster-causing factors in different regions. This study verified the effectiveness of XAI in flash flood prediction and disaster-causing factor analysis, and provides a scientific reference for understanding the flash flood disaster-causing mechanism and accurately quantifying flash flood triggering conditions.

       

    /

    返回文章
    返回