高级检索+

无人机遥感视频影像结合改进YOLO11的柑橘追踪计数方法

Citrus tracking and counting using UAV remote sensing video imagery with improved YOLO11

  • 摘要: 针对无人机遥感尺度下,柑橘估产过程中存在跟踪计数误差大的问题,该研究提出一种轻量化 YOLO11-PMSL模型与ByteTrack-DIoU算法相融合的无人机视频流柑橘果实跟踪计数方法。首先,在YOLO11n架构的基础上,通过重构特征金字塔结构,将检测头层级精简为P2~P4三级结构,从而显著增强微小目标感知能力;其次引入C3k2-MSEIE(C3k2-multi-scale edge information enhance)多尺度边缘增强模块,通过自适应尺度融合与轮廓强化机制,有效提升果实轮廓表征能力;进一步,采用SIoU(scylla-IoU)损失函数替代CIoU(complete-IoU)损失函数,引入方向敏感性约束以期提升检测框定位质量与训练稳定性,最后通过LAMP(layer adaptive magnitude-based pruning)方法模型剪枝,去除冗余的权重,减少参数量和浮点运算量,压缩模型体积。在ByteTrack-DIoU算法中嵌入区域计数防抖机制,进一步解决遮挡导致的ID跳变问题。结果表明,改进后YOLO11-PMSL目标检测模型精确率(P)、均值平均精度(mAP0.5)分别提高3.3、9.3个百分点。剪枝后,与原始的YOLO11n相比,在保持精度总体提升的同时,模型的参数量,浮点运算量和模型大小分别降低了86.05%、26.98%和76.36%,检测速度由84.83帧/s提高到140.12帧/s。与传统 SORT、DeepSORT 和BotSORT算法相比,ByteTrack-DIoU算法对多目标跟踪准确率分别提高5.5、5.7和 4.3个百分点,跟踪计数平均精度达88.4%。该方法可准确地实现果园柑橘果实的跟踪计数,为柑橘产量预测提供有效的技术方案。

     

    Abstract: Citrus yield estimation is often required for low missed detection rates, tracking, and counting errors using UAV remote sensing, particularly for dense fruit occlusion and small target size. In this study, citrus tracking and counting were proposed using UAV remote sensing video imagery with improved YOLO11. The video data was first collected by a DJI Phantom 4 UAV at an angle of approximately 45°. A citrus target detection was constructed on the tracking dataset. An automatic citrus counting was realized using UAV video streams. A lightweight YOLO11-PMSL model was combined with an improved ByteTrack algorithm. The detection head layer was simplified to a three-level structure (P2-P4) using the YOLO11n network architecture. The feature pyramid structure was reconstructed. Deep redundant modules were removed to fuse the high-resolution shallow features, and then perceive small targets. Secondly, the C3k2-MSEIE multi-scale edge module was introduced after adaptive scale fusion and contour enhancement. The local details were expressed to extract the fruit contours. The overall morphological features of the fruit were preserved, with better feature expression in the densely populated fruit areas. Subsequently, the loss function was replaced by SIoU. A direction-sensitive constraint was introduced to improve the localization accuracy and training stability of the detection boxes. Finally, the LAMP was used to prune the model for the removal of redundant weights. The number of parameters and floating-point operations was then reduced to compress the model size for model lightweighting. The ByteTrack algorithm framework was improved, rather than using IoU in spatial location measurement. The accuracy and stability of fruit tracking were further enhanced in complex orchard environments. The similarity metric in ByteTrack was replaced with the DIoU. Simultaneously, a region-counting anti-shake mechanism was embedded in the algorithm. The target ID jump problem under occlusion was effectively solved for the accurate counting of citrus fruits. Experimental results showed that the YOLO11-PMSL model effectively improved the performance of the model. Specifically, compared with the original YOLO11n object model, better performance was achieved in the feature pyramid into a P2-P4 three-level structure. The number of model parameters was reduced to 116 m, the model size was compressed to 2.6 MB, and the recall and mAP0.5 metrics were significantly improved by 7.9 and 5.1 percentage points, respectively. The more lightweight model was verified by the higher accuracy for small targets. The precision, recall, and mAP0.5 were improved by 2.2, 10.7, and 8.7 percentage points, respectively, with the C3k2-MSEIE edge module. Once the loss function was replaced from CIoU to SIoU, the convergence speed was accelerated to further improve its performance. The LAMP algorithm was used to prune the model, fully meeting the lightweight deployment of edge terminals. The performance remained at the baseline level before pruning. While the number of parameters, floating-point operations, and model size were significantly reduced from before. Ultimately, the precision, recall, and mAP0.5 were improved by 3.3, 11.6, and 9.3 percentage points, respectively, in the object detection task. In terms of lightweighting, the number of parameters, model size, and floating-point operations were reduced by 86.05%, 76.36%, and 26.98%, respectively, compared with the original model. The detection speed was improved by 65.18%. The YOLO11-PMSL model achieved a detection accuracy and speed on the citrus dataset. In the object tracking task, the ByteTrack multi-object tracking algorithm achieved an accuracy of 92.8% and a tracking precision of 81.7%. Compared with the SORT, DeepSORT, and BotSort algorithms, the tracking accuracy was improved by 5.5, 5.7, and 4.3 percentage points, respectively, and the tracking precision was improved by 19.2, 19.4, and 10.8 percentage points, respectively. The average accuracy of citrus counting reached 88.4%, compared with manual counting. The counting error was smaller than that of manual counting. Citrus counting was effectively realized in farmland scenarios. This finding can provide a technical approach for citrus yield prediction.

     

/

返回文章
返回