高级检索

    基于SHAP-XGBoost模型的亚热带土壤铜元素高光谱反演及区域适应性研究

    Hyperspectral inversion of copper content in subtropical soils based on the SHAP-XGBoost model and its cross-regional adaptability

    • 摘要: 高光谱遥感是土壤环境监测的新技术,其中特征光谱提取方法和反演模型是该技术的核心,且目前针对反演模型的区域适应性研究较少。该文采用光谱变换对高光谱数据进行预处理,通过Shapley加法解释(Shapley additive explanation, SHAP)方法进行土壤铜特征波段的筛选,极端梯度提升(extreme gradient boosting, XGBoost)、随机森林(random forest, RF)等模型对福建省福州市土壤采样点铜含量进行高光谱反演与对比分析,并应用SHAP-XGBoost模型在福建省莆田市荔城区进行了适用性验证。研究结果表明: ①在建模区福州市,原始高光谱数据的微分变换能够显著增强光谱特征; 基于SHAP方法筛选的土壤铜特征波段集中在400~500 nm和1 900~2 300 nm; 利用SHAP-XGBoost模型反演土壤铜元素效果最佳,决定系数(R2)为0.73,均方根误差(root mean squared error, RMSE)为6.4,平均绝对误差(mean absolute error, MAE)为5.38,残差(residual prediction deviation, RPD)为1.92,相较于LightGBM,RF和偏最小二乘法(partial least squares regression, PLSR)模型各指标分别提升了4.3%,17.7%和49.0%; ②在验证区莆田市荔城区,土壤铜元素反演精度为: R2为0.75、RMSE为5.01,RPD为2.01; ③不确定性分析发现福州市与莆田市荔城区的平均变异系数均在10%以下,表明SHAP-XGBoost模型鲁棒性较强,具备良好的区域适应性。

       

      Abstract: Hyperspectral remote sensing has emerged as a novel technology for monitoring soil environments, the core of which lies in the extraction of characteristic spectral features and the construction of inversion models. However, the cross-regional adaptability of inversion models remains under-studied. In this study, hyperspectral data were pre-processed through spectral transform. The characteristic bands of soil copper were identified using Shapley additive explanations (SHAP). Employing the extreme gradient boosting (XGBoost) and random forest (RF) models, this study conducted hyperspectral inversion and comparative analysis of copper content in soil samples from Fuzhou City. The cross-regional adaptability of the SHAP-XGBoost model was validated in Licheng District, Putian City. The results show that in the modeling area (Fuzhou City), the differential transformation of the raw hyperspectral data could significantly enhance the spectral features. The characteristic bands of soil copper identified using SHAP included 400-500 nm and 1900-2300 nm. The SHAP-XGBoost model proved to be the optimal inversion model for copper content in soils, achieving a coefficient of determination (R2) of 0.73, a root mean square error (RMSE) of 6.4, a mean absolute error (MAE) of 5.38, and a residual prediction deviation (RPD) of 1.92. Compared to the light gradient boosting machine (LightGBM), RF, and partial least squares regression (PLSR) models, the SHAP-XGBoost model improved R2 by 4.3%, 17.7%, and 49.0%, respectively. In the validation area (Licheng District), the inversion results of the SHAP-XGBoost model for soil copper showed a R2 value of 0.75, a RMSE of 5.01, and a RPD of 2.01. Uncertainty analyses reveal that the average coefficients of variation in Fuzhou City and Licheng District were both below 10%, indicating the strong robustness and excellent cross-regional adaptability of the SHAP-XGBoost model.

       

    /

    返回文章
    返回