高级检索+

基于改进YOLOv8-seg和半截锥模型的草莓尺寸估计方法

Strawberry size estimation based on improved YOLOv8-seg and semi-truncated cone models

  • 摘要: 针对垄作草莓果实的在线三维尺寸估计,本文提出一种基于单视角RGB-D的“改进YOLOv8-seg+半截锥模型”轻量级框架。YOLOv8-seg主干嵌入C2f_Faster_EMA模块,在参数量仅增加0.3M、模型大小提升不足3%的代价下,将mAP@0.5由94.1%提升至96.9%,成熟果分割精度达到98.3%,推理速度仍保持159 FPS。根据获取的点云数据,经点云匹配-滤波-下采样-RANSAC-PCA级联处理,完成主轴线及质心估计。进而以纵径、顶径、底径构建半截锥模型,通过优化实现点云-模型亚毫米匹配。试验表明,不同视角下纵径、顶径、底径估计与实测差异分别为1.2%、0.7%、1.7%;不同高度下差异1.8%、1.2%、4.8%;多样本试验总体平均估计误差为0.22%~1.34%;全流程平均耗时3.3s/帧,满足田间实时尺寸估计需求。研究为类锥形果蔬低成本、高精度三维表型解析提供可复用方案。

     

    Abstract: In order to realize the online 3D size estimation of strawberries grown on monopolies under natural growing conditions, this paper proposed a lightweight framework based on the improved YOLOv8-seg and semi-truncated cone model. By integrating the C2f_Faster_EMA module into the YOLOv8-seg backbone, the network elevated mAP@0.5 from 94.1% to 96.9% and achieved a ripe-fruit segmentation accuracy of 98.3% while retaining an inference speed of 159 FPS, at the marginal cost of merely 0.3M additional parameters and less than 3% increase in model size. Secondly, the point cloud data were acquired by RealSense D435i, and the main axis and the centroid estimation were completed by ‘point cloud matching-statistical filtering-voxel down-sampling-RANSAC-PCA cascade’ processing. Then, a semi-truncated cone model was constructed with three parameters, namely, longitudinal diameter, top diameter and bottom diameter. Sub-millimetre alignment between the model and the observed point cloud was achieved through L-BFGS-B optimization constrained by a bidirectional chamfer distance. Field experiments demonstrated that, across different capture angles, the relative deviations between estimated and measured longitudinal, top, and bottom diameters were 1.2%, 0.7%, and 1.7%, with standard deviations of 0.64 mm, 0.34 mm, and 0.17 mm, respectively. Under different capture heights, the corresponding deviations were 1.8%, 1.2%, and 4.8%, all exhibiting standard deviations were below 0.6 mm. In multi-sample experiment, the mean estimation error ranges from 0.22% to 1.34%, with standard deviations of 2.16 mm, 1.67 mm, and 1.40 mm, and maximum ranges remaining under 2 mm. The complete processing pipeline averaged 3.3s per frame, fulfilling the real-time requirements for in-field dimensional estimation. Accordingly, the proposed methodology delivered a transferable, resource-efficient, and high-accuracy paradigm for three-dimensional phenotypic characterization of conical fruit and vegetable crops.

     

/

返回文章
返回