高级检索+

基于YOLO-SDCG和椭圆傅里叶描述子的番茄苗表型检测

Phenotypic detection of tomato seedlings based on YOLO-SDCG and elliptic Fourier descriptors

  • 摘要: 番茄苗的表型特征是判断其是否适宜移栽的重要依据,为快速准确识别番茄苗表型特征,该研究提出了一种基于YOLO-SDCG和椭圆傅里叶描述子的番茄苗表型检测方法。针对在穴盘育苗期,番茄穴盘苗(35~40 d)生长密集、遮挡情况复杂、难以检测其直径和直立度等表型参数的问题,首先搭建番茄苗图像采集系统,融合正视与侧视视角获取图像数据;其次,改进YOLOv8s-seg模型为YOLO-SDCG,将动态蛇卷积(dynamic snake convolution, DySConv)模块添加到C2f模块(cross-stage partial-connection with 2 convolutions)以增强茎秆分割能力;采用内容感知特征重组模块(content-aware reassembly of features, CARAFE)替代原有的卷积上采样模块以提升特征重建与融合;在骨干网络和颈部网络中加入幻影卷积(grouped hybrid one-shot tensor, GHOST)以减少模型参数量和计算量。最后,融合图像分割、椭圆傅里叶描述子(elliptic Fourier descriptors, EFDs)、最大内切圆法、弦弧比与分段拟合法,实现番茄苗茎秆直径和直立度等表型参数的检测。结果表明,YOLO-SDCG在自建番茄苗数据集上掩码水平的精确率、召回率和平均精度均值分别为93.1%、93.9%、94.9%,较原始YOLOv8s-seg模型分别提高了4.6、2.7和2.4个百分点,参数量与运算时间小幅增加0.32 M和0.4 ms,但满足部署要求。最大内切圆法在正视图、侧视图下茎秆直径的平均绝对误差均为0.03 mm,平均绝对百分比误差均为1.04%;弦弧比与分段拟合法在正视图、侧视图下直立度的平均绝对误差分别为1.60°、1.80°,平均绝对百分比误差分别为2.00%、2.14%;决定系数均大于0.96,验证了该方法可有效估测番茄苗表型参数。该研究可为其他穴盘苗表型特征检测提供方法参考。

     

    Abstract: To address the challenges in detecting phenotypic parameters such as diameter and erectness due to dense growth and complex occlusion of mature tomato seedlings (35-40 d) during the plug seedling stage, a dual-view detection method combining a variable-pitch stepped manipulator and the YOLO-SDCG model is proposed. The innovations of this work lie in three aspects: hardware design, improvements to the YOLOv8s-seg model, and phenotypic parameter extraction. First, in terms of hardware design, an image acquisition system for tomato seedlings was constructed, integrating top-view and side-view perspectives to capture image data. Second, regarding the vision algorithm, the YOLOv8s-seg model was enhanced to YOLO-SDCG by introducing dynamic snake convolution (DySConv) to focus on the slender tubular features of the main stem during the initial feature extraction stage. The content-aware reassembly of features (CARAFE) module was adopted to enhance the resolution of high-level semantic features of tomato seedling stems extracted from deep layers of the backbone network through content-adaptive upsampling. The resulting high-resolution stem feature maps were then concatenated or weighted-fused with high-resolution detailed features from shallow layers of the backbone network, achieving complementary integration of stem semantic information and seedling spatial details. Grouped hybrid one-shot tensor (GHOST) convolution was incorporated to extract intrinsic features using a small number of standard convolutions and generate complementary ghost features through linear transformation, thereby reducing model parameters and computational cost. Finally, for parameter extraction, based on stem segmentation results, image segmentation, elliptic Fourier descriptors (EFDs), the maximum inscribed circle method, chord-to-arc ratio, and piecewise fitting were integrated to detect phenotypic parameters such as stem diameter and erectness. Experimental results showed that in hardware design, the variable-pitch stepped manipulator enabled a stepped arrangement of multiple plants, effectively avoiding seedling occlusion from different perspectives, thereby providing a clear and stable input basis for subsequent image acquisition and model detection. In terms of the vision algorithm, compared to YOLOv8-seg, the YOLO-SDCG model achieved precision, recall, and mean average precision of 93.1%, 93.9%, and 94.9%, representing improvements of 4.6, 2.7, and 2.4 percentage points, respectively, with parameter count and inference time of 3.58 M and 3.0 ms, effectively segmenting the overall contour structure of stems while maintaining a good balance between accuracy and computational efficiency. For parameter extraction, the contour of the main stem of tomato seedlings is a closed tubular curve. Using elliptic Fourier descriptors with a harmonic order n=16 enabled high-quality reconstruction. The proposed maximum inscribed circle method yielded mean absolute errors of 0.03 mm for stem diameter in both top and side views, with mean absolute percentage errors of 1.04%. The chord-to-arc ratio and piecewise fitting method achieved mean absolute errors for erectness of 1.60° and 1.80° in top and side views, respectively, with mean absolute percentage errors of 2.00% and 2.14%, and coefficients of determination all exceeding 0.96. Transplanting experiments demonstrated that at a picking frequency of 120 seedlings per minute, the success rates for clamping seedlings from 72-cell and 105-cell trays were 93.13% and 92.50%, respectively, indicating that the transplanting system ensured operational efficiency while maintaining reliable seedling screening and grasping capabilities. This study has achieved preliminary results in tomato seedling phenotypic detection, though only two phenotypic traits—diameter and erectness—were included. Future work will expand the dataset to incorporate more phenotypic features and focus on model lightweighting to enable more comprehensive and efficient detection of tomato seedling phenotypes.

     

/

返回文章
返回