基于集成学习方法的鄱阳湖水质反演模型研究

    Research on Poyang Lake Water Quality Inversion Model Based on Ensemble Learning Method

    • 摘要: 针对传统统计模型与单一机器学习算法在水质遥感反演中的精度瓶颈问题,本研究提出基于多源遥感影像的集成学习优化方法。研究采用多元线性回归与3种机器学习模型(K近邻、支持向量回归和多层感知机)以及3种集成策略(Bagging、Boosting和Stacking)进行对比,分别实现鄱阳湖水温、溶解氧和pH值的反演。研究结果表明:基于Ridge回归的Stacking策略在水温反演中表现最优,R2值达0.829,较单模型中表现最佳的KNN提升了8.2%;Bagging中的随机森林模型在pH值反演中效果最佳,R2值达0.868,较单模型中表现最佳的KNN模型提升12.9%,在溶解氧反演中表现突出,RF模型R2值为0.868,较单模型中表现最佳的KNN提升11.7%。通过对影像的波段特征组合优化,验证了少波段数据在集成学习框架中的有效性。反演结果表明,集成学习方法相较于单模型方法显著提升了卫星数据在水质反演方面的能力,为湖泊环境监测提供了一种高效的解决方案。

       

      Abstract: Aiming at the accuracy limitations of traditional statistical models and single machine learning algorithms in water quality remote sensing inversion, this study proposes an ensemble learning optimization method based on multi-source remote sensing imagery. The research employs multiple linear regression and three machine learning models (K-Nearest Neighbors, Support Vector Regression, and Multilayer Perceptron), along with three ensemble strategies (Bagging, Boosting, and Stacking), to invert water temperature, dissolved oxygen, and pH values in Poyang Lake. The results show that the Stacking strategy based on ridge regression performs best in water temperature inversion, and the R2 value is 0.829, which is 8.2% higher than the k nearest neighbor which performs best in the single model; the Random Forest model within the Bagging ensemble demonstrates the best performance in pH remote sensing inversion, with an R2 value of 0.868, representing a 12.9% improvement over the optimal single model (KNN); The Random Forest model also excels in dissolved oxygen inversion, achieving an equally high R2 value of 0.868, which is 11.7% higher than the best single model (KNN). By optimizing the combination of spectral band features in the imagery, the study successfully validates the effectiveness of using limited spectral bands within the ensemble learning framework. The results demonstrate that ensemble learning methods significantly enhance the capability of satellite data in water quality inversion compared to single-model approaches, providing an efficient solution for lake environmental monitoring.

       

    /

    返回文章
    返回