高级检索+

耦合水云模型与随机森林提升拔节期冬小麦表层土壤水分反演精度

Enhancing the surface soil moisture retrieval accuracy for winter wheat at the jointing stage by coupling the water cloud model with random forest

  • 摘要: 针对农田土壤水分遥感反演中单一机理模型在复杂植被覆盖条件下泛化性不足、单一机器学习模型物理可解释性差的问题,该研究以河南省原阳县这一华北平原典型冬小麦主产区为研究对象,聚焦于植被覆盖度较高、反演挑战性强的拔节生育期,融合Sentinel-1 SAR与Sentinel-2多光谱影像,结合地面实测土壤体积含水量,系统对比了经典水云模型(water cloud model)与6种典型机器学习模型,并优选机器学习模型,并在此基础上构建了一种机理与机器学习深度级联耦合的 WC-RFM(water cloud-random forest model)模型。通过提取雷达后向散射特征、光学植被指数及水云模型物理中间变量,并进行严格的特征筛选与参数敏感性分析,优选出模型的输入变量集,采用五折交叉验证进行模型评估。结果表明:随机森林在单一机器学习模型中表现最优(R2为0.871,均方根误差为0.020 m3/m3);耦合模型WC-RFM进一步提升了反演精度与稳定性(验证集R2为0.910,均方根误差为0.015 m3/m3),模型有效克服了中高植被覆盖下传统物理模型的局限性,同时弥补了纯数据驱动模型缺乏物理约束的缺陷,且稳健性优于单一模型(均方根误差标准差为±0.002 m3/m3),为实现华北平原冬小麦拔节期高精度、高空间分辨率的农田土壤水分动态监测提供了可靠方法。

     

    Abstract: Soil moisture (SM) is one of the most key variables on agricultural productivity, land–atmosphere interactions and hydrological processes. It is often required to exact retrieval of the cropland surface soil moisture (SSM) from the remote sensing. Particularly under moderate to high vegetation cover, the strong vegetation attenuation and scattering effects can substantially degrade the radar sensitivity to soil conditions. Existing approaches can rely generally on either physically models or data-driven machine learning (ML). However, the single mechanistic models can often suffer from the limited generalizability and accuracy under complex canopy conditions. Whereas the purely data-driven models can usually lack the physical interpretability and robustness, although they can capture the nonlinear relationships. The jointing stage of the winter wheat, a critical phenological period can be characterized by the high vegetation coverage and strong soil–vegetation interaction, leading to the substantial difficulty for the soil moisture retrieval. This study aims to enhance the retrieval accuracy of the SSM for the winter wheat at the jointing stage by coupling the water-cloud model with random forest. A study area was selected from a typical winter wheat-producing area at Yuanyang County, Henan Province in the North China Plain. Multi-source remote sensing data from Sentinel-1 C-band synthetic aperture radar (SAR) and Sentinel-2 multispectral imagery were integrated with in situ measurements of the surface (0–10 cm) volumetric soil moisture during the satellite overpass period. A systematic evaluation was performed on a classical physical model—the water cloud model (WCM)—and six widely-used ML algorithms, including the support vector regression, multilayer perceptron, random forest (RF), gradient boosting, k-nearest neighbor, and decision tree models. A cascaded hybrid framework, termed WC-RFM (water cloud–random forest model), was proposed to fully exploit the complementary strengths of the physical modeling and machine learning. The WCM served as a front-end mechanistic module to explicitly characterize the vegetation attenuation and soil backscattering. Key physical intermediate variables were derived from the WCM, including the water content, attenuation factors, vegetation backscattering, and soil backscattering components. The physically meaningful features were subsequently incorporated into the RF model. A feature pool was constructed with SAR backscattering and optical vegetation indices. The stability and interpretability of the model were obtained after feature selection using correlation analysis and multicollinearity diagnostics, as well as parameter sensitivity analysis of the WCM. The performance was evaluated using a 5-fold cross-validation. The results demonstrate that the random forest model was achieved the best performance among the standalone ML models, with a coefficient of determination (R2) of 0.871 and a root mean squared error (RMSE) of 0.020 m3/m3. The wc-rfm hybrid model was further improved the retrieval accuracy and robustness, with an R2 of 0.910 and an RMSE of 0.015 m3/m3 on the test set, as well as the lowest performance variability (RMSE standard deviation of ±0.002 m3/m3). Compared with the standalone WCM and RF models, the WC-RFM was effectively mitigated the performance degradation of the physical models under dense vegetation, while compensating for the absence of the physical constraints in purely data-driven approaches. Spatial mapping results revealed that there was the outstanding heterogeneity of the soil moisture in the study area, which was consistent with the regional irrigation patterns, soil texture variability, and hydrological conditions, further indicating the reliability of the model. Overall, a physically interpretable water cloud model was coupled with machine learning can significantly enhance the retrieval accuracy of the soil moisture under vegetation conditions. The WC-RFM framework can provide a promising approach for the high-resolution, high-accuracy dynamic monitoring of the cropland soil moisture during key growth stages of the winter wheat in the North China Plain. Multi-temporal meteorological variables and multi-growth-stage observations can be further integrated to strengthen the spatiotemporal applicability of the framework for agricultural monitoring.

     

/

返回文章
返回