Abstract:
Accurate and timely detection of citrus leaf diseases in natural outdoor environments is critical for effective orchard management and sustainable agricultural production. However, this task faces significant challenges due to the small size of disease lesions, complex backgrounds with overlapping leaves and branches, and highly variable illumination caused by weather, shadows, and sun exposure. To address these issues, this study proposes an improved YOLOv5-based algorithm designed for robust citrus leaf disease detection under real-world conditions. The proposed method introduces three key enhancements to the original YOLOv5 architecture: a high-resolution detection head for small targets, an advanced attention mechanism for feature refinement, and an optimized loss function for precise localization.

To improve the detection of small disease spots, which are often only a few pixels in size, a new high-resolution small-target detection head, named H0, is incorporated into the network. This head is connected to shallow layers of the backbone that preserve high spatial resolution, enabling the model to detect minute pathological features that standard detection heads typically miss. By fusing features from the deep neck network with those from the shallow backbone layers, the model achieves an enhanced multi-scale feature representation. This cross-level fusion strengthens the model's ability to recognize small lesions while maintaining contextual awareness, significantly improving detection sensitivity for early-stage diseases.
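As a rough illustration only (the exact layer indices, channel widths, and anchor/class counts are not specified here, so the values below are assumptions), an extra high-resolution head of this kind can be sketched in PyTorch as follows:

```python
# Minimal sketch, not the authors' exact configuration: an extra high-resolution
# detection scale that fuses an upsampled neck feature map with a shallow,
# high-resolution backbone feature map. Channel sizes and output layout are assumed
# (3 anchors x (4 box + 1 objectness + 3 classes) channels as an example).
import torch
import torch.nn as nn

class H0Head(nn.Module):
    def __init__(self, backbone_ch=64, neck_ch=128, num_outputs=3 * (5 + 3)):
        super().__init__()
        # Reduce the neck feature, then upsample it to the shallow feature's resolution.
        self.reduce = nn.Conv2d(neck_ch, backbone_ch, kernel_size=1)
        self.up = nn.Upsample(scale_factor=2, mode="nearest")
        # Fuse the concatenated shallow and upsampled features before prediction.
        self.fuse = nn.Sequential(
            nn.Conv2d(backbone_ch * 2, backbone_ch, kernel_size=3, padding=1),
            nn.BatchNorm2d(backbone_ch),
            nn.SiLU(),
        )
        self.pred = nn.Conv2d(backbone_ch, num_outputs, kernel_size=1)

    def forward(self, shallow_feat, neck_feat):
        # shallow_feat: high-resolution backbone map, e.g. (B, 64, 160, 160)
        # neck_feat:    deeper neck map,              e.g. (B, 128, 80, 80)
        x = self.up(self.reduce(neck_feat))
        x = torch.cat([shallow_feat, x], dim=1)
        return self.pred(self.fuse(x))
```

In this sketch the fused map keeps the shallow layer's resolution (four times finer than the usual coarsest-to-finest YOLOv5 scales at the same input size), which is what lets the new head separate few-pixel lesions from the background.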
To enhance the model's discriminative power in complex scenes, an improved attention module called MR-CBAM (multi-scale fusion residual structure convolutional block attention module) is introduced in the feature extraction stage. Whereas the standard CBAM applies channel and spatial attention directly to the incoming features, MR-CBAM first passes the input through a multi-scale residual block that processes it along parallel convolutional paths with different kernel sizes. This allows the model to capture contextual information at several scales and to distinguish subtle disease patterns from background clutter such as leaf veins or soil. The fused multi-scale features are then refined by the CBAM structure, which recalibrates the feature maps by emphasizing informative channels and spatial regions, while the residual connection ensures stable gradient propagation, aiding convergence and preserving fine detail. This design markedly improves the model's robustness under challenging lighting conditions such as overexposure or low light.
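A minimal sketch of this idea is given below; the branch kernel sizes, reduction ratio, and exact ordering are illustrative assumptions rather than the module's published specification:

```python
# Minimal sketch of the described MR-CBAM idea (structure assumed): parallel
# convolutions with different kernel sizes, 1x1 fusion, CBAM-style channel and
# spatial attention, and a residual connection back to the input.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MRCBAM(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        # Multi-scale residual block: parallel paths with different receptive fields.
        self.branches = nn.ModuleList([
            nn.Conv2d(channels, channels, k, padding=k // 2) for k in (3, 5, 7)
        ])
        self.fuse = nn.Conv2d(3 * channels, channels, kernel_size=1)
        # Channel attention: shared MLP over average- and max-pooled descriptors.
        self.mlp = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(),
            nn.Conv2d(channels // reduction, channels, 1),
        )
        # Spatial attention over channel-wise average and max maps.
        self.spatial = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, x):
        # Multi-scale fusion of the parallel branches.
        feat = self.fuse(torch.cat([b(x) for b in self.branches], dim=1))
        # Channel attention.
        avg = self.mlp(F.adaptive_avg_pool2d(feat, 1))
        mx = self.mlp(F.adaptive_max_pool2d(feat, 1))
        feat = feat * torch.sigmoid(avg + mx)
        # Spatial attention.
        sa = torch.cat([feat.mean(dim=1, keepdim=True),
                        feat.max(dim=1, keepdim=True).values], dim=1)
        feat = feat * torch.sigmoid(self.spatial(sa))
        # Residual connection preserves the original features and stabilizes gradients.
        return x + feat
```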
To achieve more accurate localization, especially for irregularly shaped lesions, the GIoU (generalized intersection over union) loss is adopted for bounding box regression. Beyond measuring overlap, GIoU penalizes the portion of the smallest enclosing box that is covered by neither the predicted nor the ground-truth box, so it provides meaningful gradients even when the two boxes do not intersect. This leads to faster convergence and more precise bounding box predictions, which is crucial for accurate disease localization in cluttered scenes.
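For reference, a minimal implementation of the GIoU loss for axis-aligned boxes in (x1, y1, x2, y2) format is sketched below; the training pipeline itself presumably relies on the loss routines already available in the YOLOv5 code base:

```python
# Minimal sketch of the GIoU loss for axis-aligned boxes (x1, y1, x2, y2).
import torch

def giou_loss(pred, target, eps=1e-7):
    # Intersection area.
    x1 = torch.max(pred[..., 0], target[..., 0])
    y1 = torch.max(pred[..., 1], target[..., 1])
    x2 = torch.min(pred[..., 2], target[..., 2])
    y2 = torch.min(pred[..., 3], target[..., 3])
    inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)

    # Union area and IoU.
    area_p = (pred[..., 2] - pred[..., 0]) * (pred[..., 3] - pred[..., 1])
    area_t = (target[..., 2] - target[..., 0]) * (target[..., 3] - target[..., 1])
    union = area_p + area_t - inter
    iou = inter / (union + eps)

    # Smallest enclosing box; GIoU subtracts its "empty" fraction from IoU.
    cx1 = torch.min(pred[..., 0], target[..., 0])
    cy1 = torch.min(pred[..., 1], target[..., 1])
    cx2 = torch.max(pred[..., 2], target[..., 2])
    cy2 = torch.max(pred[..., 3], target[..., 3])
    enclose = (cx2 - cx1) * (cy2 - cy1)
    giou = iou - (enclose - union) / (enclose + eps)
    return 1.0 - giou  # 0 when boxes coincide, approaching 2 when far apart
```

Because the loss approaches 2 rather than saturating at 1 for disjoint boxes, non-overlapping predictions still receive a useful training signal.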
To validate the proposed method, a comprehensive citrus leaf disease dataset covering citrus canker, greasy spot, and scab was collected under diverse natural conditions. Experimental results show that the improved model achieves an AP (average precision) of 91.5%, a recall of 90.2%, an mAP@0.5 of 89.8%, and an mAP@0.5:0.95 of 86.7%, gains of 2.1, 2.6, 1.6, and 1.4 percentage points, respectively, over the original YOLOv5. These results confirm the effectiveness of the proposed enhancements in boosting detection accuracy and reliability, offering a promising solution for practical citrus disease monitoring systems.