基于改进YOLO11的荔枝果实品种实时精准识别算法

黄铭杰; 蔡伟强; 张子健; 方展桌; 郭炜伦; 龙拥兵; 兰玉彬; 王建华

doi:10.11975/j.issn.1002-6819.202412219

摘要: 精准高效的荔枝品种识别是实现采集后荔枝品质智能化检测的重要一环。针对目前深度学习算法因不同荔枝果实表皮和形状存在细微性差别而无法精准识别的问题，该研究将目标检测模型YOLO11进行改进，提出一种基于改进YOLO11的荔枝果实品种识别模型SCL-YOLO11。首先，在YOLO11的主干网络，将C2PSA注意力模块替换为具有通道注意力和空间注意力C2f_SimAM注意力模块，提高模型关注和加权图像不同特征维度的能力；其次，在C3k2模块中应用具有大核深度可分离卷积CMUNeXt，提高模型对荔枝果实几何特征的感知和计算能力；测试结果表明，改进后的SCL-YOLO11模型识别准确率为99.61%，相比于VGG-19、VIT、AlexNet、RestNet-50、YOLOv8、YOLO11模型，分别提高了27.10、17.99、12.47、7.05、4.83、2.89个百分点，该模型参数量为1.27M，计算量为3.1G，相较于YOLO11模型分别降低了17.5、6.1个百分点。改进后的SCL-YOLO11模型能对具有细微纹理差异和形状差异的荔枝果实进行实时精准的品种识别，并且降低了网络规模，可为采集后荔枝果实智能化品质检测设备研发提供技术参考。

Abstract: Litchi is one of the most important fruits widely planted in southern China. However, manual identification cannot fully meet the large-scale production at present, due to the time-consuming and laborious. Particularly, Litchi is rich in a variety of resources, leading to the large similarity among varieties. Furthermore, the rapid and accurate identification of litchi varieties is often required in the process of intelligent production and sales. In this study, a recognition model (SCL-YOLO11) was proposed for the litchi fruit variety using improved YOLO11. Firstly, the C2f module (CSP2f, cross-stage partial fusion) of YOLO11model was added with the SIMAM attention mechanism to replace the original C2PSA module. Different feature dimensions of litchi images were focused to automatically adjust the features of each spatial position in the key areas; Secondly, CMUNeXt convolution with the large kernel and separable depth was applied to the C3K2 module, in order to improve the perception of the litchi geometric features under various occlusion. The experimental results show that the recognition accuracy of the improved SCL-YOLO11 model was 99.61%, which was improved by 27.10%, 15.06%, 12.47%, 7.05%, 4.83% and 2.89%, respectively, compared with the VGG-19, VIT, AlexNet, ResNet-50, YOLOv8 and YOLO11 model. The parameter size of the model was 1.27M, and the calculation was 3.1G, which was reduced by 17.5% and 6.1%, respectively, compared with the YOLO11 model. The improved SCL-YOLO11 model was used to carry out real-time and accurate variety recognition for the litchi fruits with subtle texture differences and shape differences. The network scale was then reduced after data collection. Furthermore, the attention mechanism was fused with the C2f module. The weight of the SimAM attention mechanism was adjusted for the features of the C2f module after splicing. The enhanced features were outputted after segmentation. The accuracy and recall rate of the improved model were enhanced by 1.29% and 1.09%, respectively, compared with the original. In the convolution module, the CMUNeXt block with the large-core depth separable convolution was applied in the parallel convolution layer of the C3K2 module. The diverse geometric features of the litchi were then extracted to maintain the depth of the feature extraction in the C3K2 module. A series of tests were conducted to verify the improved SCL-YOLO11 model in the natural environment. The data sets were selected with different light intensity and occlusion conditions. The recognition accuracy of the SCL-YOLO11 model was 96.89%, 97.87%, and 98.48% under the conditions of insufficient illumination, strong light, and shadow, respectively; whereas the recognition accuracy of the SCL-YOLO11 model was 98.18% and 96.89% under the condition of the fruit and branch-leaf occlusion, respectively. There were better recognition accuracy and robustness, compared with the YOLOv8 and YOLO11. In a simulation, the reasoning time of the improved SCL-YOLO11 model was much less than that of the YOLOv8 and YOLO11 under different detection quantities. Especially when the detection quantity reached 1000, the reasoning time was reduced by 50.58% and 22.73%, respectively, indicating higher accuracy than the rest. As such, both parameters and calculations were reduced for the real-time accuracy. The finding can provide the technical reference to develop intelligent equipment for the quality detection of litchi fruits.

基于改进YOLO11的荔枝果实品种实时精准识别算法

Real-time accurate recognition algorithm for litchi fruit varieties based on improved YOLO11