高级检索+

基于YOLO v5s的自然场景油茶果识别方法

Camellia oleifera Fruit Detection in Natural Scene Based on YOLO v5s

  • 摘要: 针对油茶果体积小、分布密集、颜色多变等特点,为实现自然复杂场景下油茶果的快速精准定位,并依据果实的疏密分布,确定恰当的自动振荡采收装置夹持位置,利用YOLO v5s卷积神经网络模型,开展了自然环境下油茶果图像检测方法研究,用3 296幅油茶果图像制作PASCAL VOC的数据集,对网络进行了150轮训练,得到的最优权值模型准确率为90.73%,召回率为98.38%,综合评价指标为94.4%,平均检测精度为98.71%,单幅图像检测时间为12.7 ms,模型占内存空间为14.08 MB。与目前主流的一阶检测算法YOLO v4-tiny和RetinaNet相比,其精确率分别提高了1.99个百分点和4.50个百分点,召回率分别提高了9.41个百分点和10.77个百分点,时间分别降低了96.39%和96.25%。同时结果表明,该模型对密集、遮挡、昏暗环境和模糊虚化情况下的果实均能实现高精度识别与定位,具有较强的鲁棒性。研究结果可为自然复杂环境下油茶果机械采收及小目标检测等研究提供借鉴。

     

    Abstract: In view of the characteristics of small size, dense distribution and changeable color of Camellia oleifera fruit, in order to realize the rapid and accurate identification of Camellia oleifera fruit in complex natural scene, and determine the appropriate clamping position for the automatic oscillating harvesting device according to the density distribution of the fruit, the YOLO v5 s convolutional neural network model was used to carry out research on the image detection method of Camellia oleifera fruit in the natural scene. Through data enhancement, totally 3 296 Camellia oleifera fruit images were obtained to make the PASCAL VOC data set. After 150 rounds of training, the optimal weight model was got. The accurate rate was 90.73%, the recall rate was 98.38%, the comprehensive evaluation index was 94.4%, the average detection accuracy was 98.71%, the single image detection time was 12.7 ms, and the memory size of the model was 14.08 MB. Compared with the current mainstream first-stage detection algorithms YOLO v4-tiny and RetinaNet, its accuracy rate was increased by 1.99 percentage points and 4.50 percentage points, the recall rate was increased by 9.41 percentage points and 10.77 percentage points, and the time was reduced by 96.39% and 96.25%, respectively. In addition, the weight file of the YOLO v5 s model was small, indicating that its network was simpler and had the advantage of rapid deployment. It could be transplanted to edge devices in the future to provide algorithm reference for the vision system of the Camellia oleifera fruit automatic harvesting device. Through comparative experiment, the results also showed that the model can achieve high-precision recognition and positioning of fruits in dense, occluded, dim environments and fuzzy blur conditions, and it had strong robustness. The research results can provide a reference for the research of mechanical harvesting of Camellia oleifera fruit under the natural complex environment.

     

/

返回文章
返回