基于机器视觉的日光温室内部积水检测方法

赵俊陶; 芦天罡; 吕芯悦; 李友丽; 喻黎明; 张钟莉莉

doi:10.11975/j.issn.1002-6819.202511024

基于机器视觉的日光温室内部积水检测方法

Detecting waterlogging in solar greenhouses using machine vision

摘要

摘要: 设施农业在全球食品生产中具有举足轻重的地位，近年来，受气候变化影响，洪涝等极端天气事件频发，设施农业受灾监测需求日益迫切。为解决温室环境下积水区域尺度差异大、复杂背景干扰强的问题，该研究结合温室生产已部署的图像传感器，提出了一种用于日光温室内部积水区域监测的残差多尺度级联编码-解码网络（residual multiscale cascade encoder–decoder，RMCED）。该网络采用ResNet50中间四层作为编码器，以充分提取图像的深层次特征；解码器部分则引入了高效多尺度级联注意力解码器(efficient multi-scale cascaded fully-convolutional attention decoder，EMCAD)，通过多尺度特征融合逐步恢复图像的空间分辨率，从而提高对积水区域细节的刻画能力。为弥合编码器与解码器之间的语义差异和尺度不匹配，研究引入了长跳跃连接，并设计了改进光照感知卷积块注意力模块（lighting-aware convolutional block attention module，LA-CBAM）。该模块在传统卷积注意力模块（convolutional block attention module，CBAM）的基础上进行改进，由多尺度卷积核与Sobel卷积融合构成。其中，多尺度卷积核用于提取不同尺度的特征信息， Sobel卷积用于捕获梯度变化并增强边缘信息表征，从而提升模型对温室内不同大小积水区域特征的识别能力，并增强其在复杂光照条件下的鲁棒性。试验结果表明，所提出的模型在日光温室积水数据集上表现出较好性能，精确率达到93.22%，为设施农业洪涝灾害监测提供了一种有效的技术方法。

Abstract: Protected agriculture plays a critical role in ensuring global food security by enabling stable crop production under controlled environmental conditions. However, climate change–driven extreme weather events have increased flood-related risks, posing growing threats to the safety, productivity, and sustainability of solar greenhouse cultivation. In practical greenhouse production, timely and accurate monitoring of waterlogging is essential for reducing crop losses, improving environmental regulation, and supporting scientific management decisions. Nevertheless, accurate segmentation of greenhouse waterlogging remains challenging because of limited monitoring visibility, uneven illumination, strong specular reflections, and the pronounced scale diversity of inundated areas. These adverse conditions often result in blurred boundaries, ambiguous textures, and unstable visual characteristics between flooded and non-flooded regions. In addition, satellite- and drone-based remote sensing is often unsuitable for greenhouse-scale applications because of resolution limitations and operational constraints. To address these challenges, this study developed an end-to-end image-based segmentation framework, termed the Residual Multiscale Cascade Encoder–Decoder (RMCED) network, for delineating waterlogging areas inside solar greenhouses. The proposed architecture adopted the middle four stages of ResNet-50 as the encoder to balance network depth and computational cost while extracting multi-level semantic features. This encoder design enabled the model to effectively capture both low-level texture patterns and high-level contextual representations, thereby improving discrimination between waterlogged areas and surrounding background regions. The decoder was designed as a multiscale cascaded attention decoder, which progressively restored spatial resolution through the hierarchical fusion of low- and high-level features. This design enhanced the network’s ability to identify fine spatial boundaries and accurately localize small-scale flooded regions that are often overlooked by conventional models. To ensure effective information flow, long skip connections were introduced between the encoder and decoder, thereby reducing feature degradation and semantic loss during feature transmission and improving the preservation of structural details. A key contribution of this work was the introduction of a Lighting-Aware Convolutional Block Attention Module (LA-CBAM). Compared with the original Convolutional Block Attention Module, LA-CBAM incorporated multi-kernel convolution filters and Sobel-gradient operations to improve both illumination robustness and edge sensitivity. The multi-kernel design enabled the model to capture contextual cues across different receptive fields and to better adapt to the diverse spatial scales of greenhouse waterlogging patterns. Meanwhile, the Sobel operation extracted gradient-based texture transitions and boundary-related information, strengthening boundary awareness under varying light intensities and reflective interference. As a result, the proposed network maintained strong discrimination capability even in complex greenhouse environments with reflections and uneven brightness, and it showed better adaptability to irregular and fragmented flooded regions. The model was trained and validated on a self-constructed dataset of 571 pixel-wise annotated images collected in a solar greenhouse under natural operating conditions. All images were resized and normalized using ImageNet statistics to match the pretrained ResNet-50 encoder. Comparative experiments with mainstream segmentation models, including U-Net, DeepLabV3+, and PSPNet, showed that RMCED outperformed representative baselines on the proposed dataset, achieving a precision of 93.22%. Ablation studies further demonstrated that LA-CBAM enhanced the model’s perception of key features in waterlogged areas, while the multiscale cascaded decoder enhanced edge continuity and object completeness. Overall, this study presented an efficient and robust framework for greenhouse waterlogging monitoring, providing a promising technical basis for flood risk assessment, waterlogging management, and related image-based monitoring tasks in protected agricultural systems.

HTML全文

参考文献(32)

施引文献

资源附件(0)