Abstract:
Wheat is one of the most extensively cultivated crops and is central to global food security. Wheat yellow dwarf disease, caused by the barley yellow dwarf virus complex (BYDVs), is a major viral threat that can occur year-round and is widespread in the Huang-Huai wheat region, posing a substantial risk to wheat yield and quality. Infected plants typically exhibit leaf yellowing, stunted growth, delayed maturity, and shriveled grains. Manual detection provides only a preliminary diagnosis based on visual symptoms, while laboratory confirmation often requires costly equipment or specialized expertise, limiting large-scale field screening. In this study, DSFNet, a field-scale detection model, was proposed for drone-acquired multispectral imagery, designed around the multi-stage development of wheat yellow dwarf disease. Two components were integrated - dual-stream adaptive synergistic fusion (DASF) and spatiotemporal joint feature mapping (SFFM) - to achieve precise identification of the disease. The encoder incorporated the DASF module, in which convolutional neural networks (CNNs) and Transformers were combined to dynamically balance local and global information. The CNN branch focused on fine-grained features, such as lesion edges, textures, and colors, to detect contour and structural variations. In contrast, the Transformer branch employed self-attention mechanisms to capture long-range dependencies, modeling semantic relationships among regions and revealing the spatial distribution patterns of the disease on a global scale. Adaptive weight allocation in the DASF module fused the CNN's local features with the Transformer's global context, yielding rich feature representations that combine spatial detail with semantic depth. The decoder integrated the SFFM module to jointly exploit spatial and frequency-domain information, preserving spatial features while incorporating frequency-domain analysis.
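The adaptive weight allocation described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name, the gating projection `w_gate`, and all shapes are illustrative assumptions; it only shows the general pattern of fusing two feature streams with learned, per-pixel softmax weights.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dasf_fuse(local_feat, global_feat, w_gate):
    """Hypothetical sketch of DASF-style adaptive fusion.

    local_feat, global_feat: (H, W, C) maps from the CNN and Transformer
    branches; w_gate: (2C, 2) assumed learned projection mapping the
    concatenated features to two per-pixel fusion weights.
    """
    # Concatenate the two streams channel-wise: (H, W, 2C)
    concat = np.concatenate([local_feat, global_feat], axis=-1)
    # Per-pixel adaptive weights, normalized to sum to 1: (H, W, 2)
    weights = softmax(concat @ w_gate, axis=-1)
    # Convex combination of the two branches at every pixel
    return weights[..., :1] * local_feat + weights[..., 1:] * global_feat
```

Because the weights are a per-pixel convex combination, the fused value always lies between the local and global responses, letting the network lean on whichever branch is more informative at each location.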
Regions with substantial grayscale variation were captured, enhancing multi-scale feature detection. Finally, the multi-scale features were fused with the original features via a multi-layer perceptron (MLP) for precise segmentation. Field experiments were conducted on a multispectral image dataset spanning multiple growth stages, diverse lighting conditions, and multiple plots. DSFNet's performance was compared against mainstream models, including SegFormer, UNet, DBFormer, SFFNet, DECSNet, DeepLabV3+, and PSPNet. DSFNet outperformed all baseline models, achieving a mean pixel accuracy (mPA) of 92.66% and a mean intersection over union (mIoU) of 86.77%, improving mPA by 3.96, 2.21, 3.65, 2.82, 3.61, 0.28, and 1.95 percentage points and mIoU by 5.77, 5.53, 5.43, 5.23, 4.49, 2.25, and 1.90 percentage points, respectively. On an independent test dataset, DSFNet maintained high accuracy, with mPA and mIoU values of 89.71% and 82.56%, improvements of 6.35-10.16 percentage points over the other models. DSFNet also demonstrated strong robustness and transferability in complex, real-world scenarios across different acquisition periods. Overall, this work presents an efficient and scalable solution for large-scale field detection of wheat yellow dwarf disease. By combining optimal feature fusion, frequency-spatial interaction, and adaptive semantic modeling, it provides a practical framework for precision agriculture and offers technical support for the rapid, accurate, and intelligent monitoring of crop health in the prevention and control of wheat viral diseases.
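The frequency-domain step in the decoder (capturing regions with substantial grayscale variation) can be sketched as a Fourier high-pass enhancement. This is an assumption-laden illustration, not the SFFM module itself: the function name, the radial cutoff, and the additive fusion are all hypothetical choices that show the general spatial-frequency idea.

```python
import numpy as np

def sffm_enhance(feat, cutoff=0.25):
    """Hypothetical sketch of SFFM-style frequency-domain enhancement.

    feat: (H, W) feature/grayscale map. A radial high-pass mask in the
    Fourier domain keeps high-frequency components (sharp grayscale
    changes, e.g. lesion boundaries), which are added back to the
    preserved spatial features.
    """
    H, W = feat.shape
    # Centered 2-D spectrum of the feature map
    spectrum = np.fft.fftshift(np.fft.fft2(feat))
    # Radial high-pass mask: suppress frequencies near the centre (low freq.)
    yy, xx = np.mgrid[:H, :W]
    radius = np.hypot(yy - H / 2, xx - W / 2)
    mask = radius > cutoff * min(H, W) / 2
    # Back to the spatial domain: only high-frequency detail survives
    high = np.fft.ifft2(np.fft.ifftshift(spectrum * mask)).real
    # Fuse: preserved spatial features + high-frequency detail
    return feat + high
```

On a perfectly flat region the high-pass component vanishes and the features pass through unchanged, while sharp transitions such as lesion edges are amplified.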