高级检索+

基于KPLNet网络的端到端农机导航线检测方法

End-to-end agricultural machinery navigation line detection method based on KPLNet network

  • 摘要: 导航线检测是视觉导航方法中的关键步骤,其目的是从农田图像中实时、准确地提取出指导农机行进的方向或路径。目前,该任务的主流解决方案多基于实例分割技术,先提取作物行边缘,再通过复杂后处理拟合导航线。这类方法流程繁琐,计算开销较大。为此,该研究提出一种端到端的导航线回归模型——KPLNet(keypoint plant line network),旨在实现精度与效率兼顾的农机导航线检测。KPLNet将导航线表示为关键点进行直接预测,无需任何后处理流程,从而在保证精度的前提下提升推理速度。在KPLNet的构建中,首先引入几何先验思想,构建中心注意力机制(central attention mechanism, CAM),CAM专注于关键位置特征的上下文整合,可提高关键特征的提取能力和效率。其次,针对导航线检测任务中目标尺度变化小且仅需进行单目标检测的特点,提出精简特征融合结构(single feature fusion, SFF),SFF结构在保证强大导航线特征提取能力的同时提升网络的实时性。此外,在模型训练阶段引入改进的双优化粒子群优化算法(dual-optimization particle swarm optimization, D-PSO)进行超参数优化,在不影响推理速度的前提下进一步提升模型精度。在玉米以及花生-大豆间作套种代表性场景下的试验结果表明,KPLNet在检测精度和实时性方面均优于YOLOv8n-seg、YOLO11n-seg等主流实例分割方法。KPLNet在玉米田和花生大豆套种田中的导航线端点检测精度( \textmAP_\textpose^50-95 )均达到了99.5%,航向角偏差分别低至0.79°和0.73°。在推理效率方面,KPLNet拥有更低的平均推理时间且彻底消除了后处理时间,最高帧率达119.2 帧/s,消融试验进一步验证了CAM、SFF和D-PSO对模型性能提升的具体贡献。试验结果验证了所提端到端范式在农业导航线检测任务中的优越性,为开发高效、可靠的农机自动导航系统提供了技术支撑。

     

    Abstract: avigation line detection can serve as a pivotal procedure in the visual navigation of agricultural machinery. Driving direction or path guiding of agricultural machinery can be accurately extracted from field images in real time. Instance segmentation has been mostly used for this task at present. Crop row edges can be extracted to fit navigation lines after complex post-processing. However, cumbersome workflows and large computational overhead have severely restricted their application under real-time agricultural navigation. It is often needed to have high efficiency and instant response. This study aims to propose an end-to-end navigation line regression model, called Keypoint Plant Line Network (KPLNet). Navigation line detection of agricultural machinery was realized with superior precision and efficiency. Different from conventional instance segmentation, the KPLNet framework directly represented navigation lines in the form of keypoints for prediction. Post-processing was removed to improve inference speed and detection accuracy. Firstly, the Central Attention Mechanism (CAM) was constructed in the architecture of KPLNet using a geometric prior. Contextual integration of features was concentrated at key positions. Accurate keypoint positioning of navigation lines was realized to effectively enhance the extraction and efficiency of critical features. Secondly, a streamlined feature fusion structure, named Single Feature Fusion (SFF) was proposed, according to the small target scale variation and the demand for single-target detection only in the navigation line detection task. SFF structure effectively extracted from navigation line features. While the network structure was improved, the real-time performance of the model was. Redundant calculation and parameter costs were reduced for the edge deployment requirements of agricultural machinery. Finally, an improved Dual-Optimization Particle Swarm Optimization (D-PSO) was introduced for hyperparameter optimization during training. The high accuracy of the model was obtained at the inference speed. A better balance between performance and computational efficiency was achieved after optimization. Experiments were conducted on the maize and peanut-soybean intercropping field. The KPLNet was also compared with mainstream instance segmentation, including YOLOv8n-seg and YOLO11n-seg. The experimental results show that KPLNet performed the best in terms of detection accuracy and real-time performance. Concretely, the navigation line endpoint detection accuracy (mAP_pose^(50-95)) of KPLNet reached 99.5% in maize and peanut-soybean intercropping fields, and the heading angle deviations were as low as 0.79° and 0.73°, respectively, indicating extremely high detection precision. In terms of inference efficiency, the lower average inference time of KPLNet was obtained to eliminate the post-processing time consumption, with the maximum frame rate of 119.2 frames per second (fps), fully meeting the real-time navigation requirements of agricultural machinery. Ablation tests verified that the CAM, SFF, and D-PSO module contributed to the accuracy and inference speed. The experiments validated that the end-to-end navigation was superior to the simple workflows and had low computational overhead in the task of navigation line detection, compared with the conventional instance segmentation. This finding can provide reliable technical support to develop efficient and stable navigation for agricultural machinery. A technical reference can also offer guidance for the intelligent and precise operation of agricultural equipment in complex planting environments in smart agriculture.

     

/

返回文章
返回