基于YOLOv11s-RLDP的番茄穴盘苗分级检测与定位

郑安臣; 江丹; 饶元; 王坦; 徐先宝; 金秀; 李成宇

doi:10.11975/j.issn.1002-6819.202510222

摘要: 针对穴盘苗不同程度叶片越界现象导致的识别与定位困难问题，该研究提出一种基于改进YOLOv11s-Seg的轻量化实例分割模型YOLOv11s-RLDP。首先，使用RDSConv替换网络中的基础卷积层，提升特征提取能力并降低模型计算复杂度；其次，基于大型可分离核注意力机制（large separable kernel attention）重新设计C3k2模块，扩大模型感受野，强化幼苗关键特征感知能力；然后，提出融合门控卷积（Gating Conv）的DSGCF模块，替换原主干网络中的C2PSA模块，增强特征选择能力；最后，采用LAMP剪枝策略进行模型轻量化。试验结果表明，YOLOv11s-RLDP平均精度均值和平均交并比分别达到89.4%和88.7%，较原模型均提高1.4个百分点，检测速度提升至128.1帧/s；同时，模型参数量和模型大小较原模型分别减少34.0%和32.5%。YOLOv11s-RLDP平均交并比MIoU为88.7%，mAP_50-90达到70.0%，模型大小仅为12.0MB，参数量为6.0M。改进模型YOLOv11s-RLDP在有效提升综合分割性能与定位能力的同时，降低了计算资源需求，为番茄穴盘苗分级检测与定位任务的轻量化和实际应用提供了算法参考。

Abstract: Seedling transplanting is one of the most important components in modern tray seedling. Non-uniform substrate distribution and fluctuating environmental parameters contribute to weak seedling development and empty cell formation during seedling cultivation. Moreover, manual grading cannot fully meet the requirements of a large-scale nursery, due to the low efficiency, low quality, and high cost. Furthermore, tomato seedlings exhibit complex and variable growth states, with leaf outgrowth directions highly random in real cultivation environments. Consequently, it is often required to accurately identify seedling categories with the positional information during mechanical grading. In this study, a lightweight instance segmentation model, YOLOv11s-RLDP, was proposed using the YOLOv11s-Seg model. Firstly, RDSConv (reinforced depthwise separable conv) replaced standard convolutional layers in the network, thus enhancing feature extraction to reduce computational complexity. Secondly, the C3k2 module was redesigned using the Large Separable Kernel Attention (LSKA) mechanism to expand the model's receptive field and strengthen the perception of key seedling features. Subsequently, the DSGCF (dual stream gating cross fusion) module with Gating Convolution was introduced to replace the original C2PSA module in the backbone, thus augmenting feature selection. Finally, the LAMP (layer-adaptive sparsity for the magnitude-based pruning) strategy was employed for model lightweighting. Ablation experiments were conducted to validate the effectiveness of each module for high segmentation performance, according to the computational resource. Experimental results demonstrate that YOLOv11s-RLDP achieved the accuracy of strong seedling, recall, and mean average precision (mAP) of 91.2%, 95.1%, and 89.4%, respectively, with the improvement of 1.0, 0.7, and 1.4 percentage points over the original YOLOv11s-Seg model. The mAP50-90 increased by 1.8 percentage points, indicating significantly enhanced robustness when processing seedlings under complex growth conditions. Concurrently, the parameter count and model size were reduced by 34.0% and 32.5%, respectively, compared with the original model, thus facilitating future deployment on edge devices. Comparative experiments were performed on different models. The improved model performed best over the two-stage instance segmentation Mask R-CNN, in terms of accuracy and lightness. Compared with one-stage instance segmentation networks like YOLOv5s-Seg, YOLOv8s-Seg, YOLOv9s-Seg, YOLOv10s-Seg, YOLOv11s-Seg, YOLOv12s-Seg, and YOLACT, the mAP of YOLOv11s-RLDP was improved by 1.6, 1.3, 2.6, 1.5, 1.4, 1.5 and 8.0 percentage points, respectively, while the model size was simultaneously reduced by 7.0, 10.8, 5.2, 5.9, 5.8, 6.7, and 177.9MB, respectively. In conclusion, the YOLOv11s-RLDP model effectively enhanced overall segmentation performance with low computational resources. The finding can provide a strong reference for the lightweight and practical application in tomato plug seedling grading and localization. Therefore, tomato plug seedlings can be collected with growth extending beyond cell boundaries for their high robustness and generalization.

基于YOLOv11s-RLDP的番茄穴盘苗分级检测与定位

Grading detection and localization for tomato tray seedlings using YOLOv11s-RLDP