中长期径流预报模型对比与集成应用

张云帆; 常高松; 赵国茂; 鞠玉梅; 甘亚斌

摘要: 本研究旨在针对四川省9个主要江河控制站点，系统评估多种中长期径流预报模型的应用效果，并探索模型集成方法以提升径流预报的准确性和鲁棒性。研究选取了6种中长期径流预报模型，包括线性模型（广义线性模型，GLM）、机器学习模型（随机森林，RF；梯度提升回归树，BRT；Cubist模型）及神经网络模型（长短期记忆网络，LSTM；BP神经网络）。基于1980—2024年实测径流、降雨数据以及88种大气环流指数和26种海温指数，采用贝叶斯优化方法结合10折交叉验证（率定期1980—2008年，验证期2009—2024年）对模型参数进行率定，并评估模型的泛化能力。此外，通过贝叶斯模型平均（BMA）方法、分位数平均方法和集成选择方法对高性能模型进行集成，并分析特征重要性以揭示影响径流的关键因子。结果显示，LSTM、RF、BRT和Cubist模型在多指标综合表现中优于传统线性模型（GLM）及浅层网络模型（BP）。在率定期，RF模型表现最优，其纳什-斯特鲁特林夫系数（NSE）中值为0.91，平均绝对百分比误差（MAPE）为13.7%。在验证期，LSTM和BRT模型展现出较强的泛化能力，LSTM的NSE中值从0.79下降至0.63，BRT的MAPE中值从21.7%上升至25.2%。特征重要性分析表明，西藏高原-1指数和亚洲区极涡强度指数是岷江五通桥站径流的主控因子，且同一指数不同滞时对径流的影响效应可能相反，其非线性作用与水汽输送路径调整密切相关。BMA集成模型在综合性能上展现出相对优势，其验证期NSE中值为0.67，与RF模型并列第一，PMAE中值为25.2%，仅次于BRT模型，有效约束了水文不确定性。本研究通过多模型对比与集成，为复杂流域径流模拟提供了优化方案，混合建模策略提升了预测的鲁棒性，具有重要的实践意义。

Abstract: This study aims to systematically evaluate the performance of multiple medium-to-long-term runoff forecasting models at nine major river control stations in Sichuan Province, China, and explore ensemble methods to enhance forecast accuracy and robustness. Six models were selected: a linear model (Generalized Linear Model, GLM), machine learning models (Random Forest, RF; Boosted Regression Trees, BRT; Cubist), and neural network models (Long Short-Term Memory, LSTM; Back Propagation, BP). Utilizing observed runoff and precipitation data (1980-2024), along with 88 atmospheric circulation indices and 26 sea surface temperature indices, model parameters were calibrated via Bayesian optimization combined with 10-fold cross-validation (calibration period: 1980-2008; validation period: 2009-2024) to assess generalization capability. High-performing models were integrated using Bayesian Model Averaging (BMA), Quantile Model Averaging, and Ensemble Selection. Feature importance analysis was conducted to identify key drivers of runoff. Results demonstrated that LSTM, RF, BRT, and Cubist models significantly outperformed the traditional GLM and shallow BP model across multiple metrics. During calibration, RF achieved optimal performance (median Nash-Sutcliffe Efficiency, NSE = 0.91; median Mean Absolute Percentage Error, MAPE = 13.7%). In validation, LSTM (median NSE decreased from 0.79 during calibration to 0.63) and BRT (median MAPE increased from 21.7% to 25.2%) exhibited relatively stronger stability. Feature importance analysis revealed the Tibetan Plateau Index-1 and the Asian Polar Vortex Intensity Index as the dominant controlling factors for runoff at the Wutongqiao Station on the Min River. The impact of the same climatic index with different time lags on runoff could be contrasting, linked to its nonlinear interaction with seasonal adjustments in moisture transport pathways. The BMA ensemble model showed relative performance, matching the best single model (RF) in validation period median NSE (0.67) and ranking second only to BRT in median MAPE (25.2%), effectively constraining hydrological uncertainty. This study provides optimization solutions for runoff simulation in complex basins through multiple model comparison and integration, and the hybrid modeling strategy enhances the robustness of predictions, demonstrating significant practical significance.

中长期径流预报模型对比与集成应用

Model Comparison and Ensemble for Medium-to-Long-Term Runoff Forecasting