高级检索+

基于PSMF-YOLO的无人机遥感图像枇杷花蕾小目标检测方法

A small object detection method for loquat flower buds in UAV remote sensing imagery based on PSMF-YOLO

  • 摘要: 为解决枇杷花蕾因呈簇状生长、体积小且与背景叶片颜色相近难以被准确识别的问题,该研究提出了一种基于无人机遥感图像与深度学习结合的目标检测算法PSMF-YOLO。首先,利用无人机采集图像并构建了枇杷花蕾数据集;在此基础上,基于YOLOv11n模型引入P2小目标检测层,以保留更多空间细节,提升对花蕾的检测能力;其次,在网络中加入SPD-Conv的模块,使空间到深度变换的细节保留,有助于提高低分辨率图像和小目标的检测精度;将C2PSA与MSDA多尺度扩张注意力机制结合,通过多尺度扩张卷积与动态特征融合,实现全局场景理解的跨层次感知;最后,引入Focaler-DIoU损失函数解决小目标样本不平衡和边界框回归不精准问题,提升模型在复杂场景下的检测鲁棒性。试验结果表明,PSMF-YOLO模型在平均精度均值、准确率、召回率指标上比基础模型分别高4.9、2.3、4.2个百分点,较表现最好的YOLOv5s基础模型分别高0.8、0.1、0.6个百分点。该研究为农业遥感小目标检测提供有效方法,为果蔬智能精准管理提供技术支持。

     

    Abstract: Loquat flower buds are typically observed to develop in dense and clustered distributions during their natural growth cycle. To ensure optimal nutrient allocation, prevent fruit size disparities, and guarantee the final commercial quality of the fruits, timely and precise thinning operations during the early bud stage are fundamentally required. However, accurate identification and spatial localization of these targets remain significantly challenging in practical agricultural scenarios. This difficulty primarily stems from the inherently small volumetric size of the buds, their clustered growth patterns, and their severe visual similarity in color to the complex surrounding background foliage. To comprehensively address these critical issues and facilitate automated agricultural operations, this study proposes a novel target detection algorithm, termed PSMF-YOLO, which integrates unmanned aerial vehicle (UAV) remote sensing imagery with advanced deep learning techniques. Initially, a specialized loquat flower bud dataset was constructed using high-resolution images acquired by UAVs operating in real-world orchard environments. Building upon this empirical data, an improved detection model was developed using the YOLOv11n architecture as the baseline to effectively balance computational efficiency and detection accuracy. To systematically optimize the network for these microscopic targets, several crucial structural modifications were sequentially implemented. First, a dedicated P2 small-object detection layer was introduced into the architecture. This layer leverages shallower network layers to preserve high-resolution spatial details, thereby fundamentally enhancing the model's intrinsic capability to accurately detect minute and densely packed targets. Second, an advanced Space-to-Depth Convolution (SPD-Conv) module was incorporated into the feature extraction network. By retaining detailed information during the space-to-depth transformation and avoiding the redundant information loss typically caused by traditional stride convolutions, this module significantly improves detection accuracy, particularly for low-resolution images and small objects. Third, the C2PSA module was combined with a multi-scale dilated attention (MSDA) mechanism. This integration achieves comprehensive cross-level perception through multi-scale dilated convolutions and dynamic feature fusion strategies, enabling the network to establish a more effective global context modeling capability. Finally, a Focaler-DIoU bounding box regression loss function was employed. This targeted loss function effectively alleviates the inherent sample imbalance commonly encountered in small-object detection tasks and improves the precise localization of bounding box regression, thereby substantially enhancing the overall robustness of the model when deployed under highly complex orchard conditions. Comprehensive experimental results demonstrate the superior performance of the proposed PSMF-YOLO model. Specifically, the enhanced model achieves remarkable improvements of 4.9, 2.3, and 4.2 percentage points in mean average precision (mAP0.5), precision (P), and recall (R) metrics, respectively, when directly compared with the baseline YOLOv11n model. Furthermore, when evaluated against various mainstream object detection frameworks, the PSMF-YOLO model outperforms the best-performing alternative baseline, the YOLOv5s model, by margins of 0.8, 0.1, and 0.6 percentage points across the aforementioned core evaluation metrics. In conclusion, these quantitative results clearly indicate that the proposed PSMF-YOLO method provides a highly effective and accurate algorithmic solution for UAV-based small-target detection tasks in agricultural remote sensing. Consequently, this study offers robust technical support for the broader implementation of intelligent and precise orchard management systems.

     

/

返回文章
返回