高级检索+

基于KPLNet网络的端到端农机导航线检测方法

End-to-end agricultural machinery navigation line detection method based on KPLNet network

  • 摘要: 导航线检测是视觉导航方法中的关键步骤,其目的是从农田图像中实时、准确地提取出指导农机行进的方向或路径。目前,该任务的主流解决方案多基于实例分割技术,先提取作物行边缘,再通过复杂后处理拟合导航线。这类方法流程繁琐,计算开销较大。为此,该研究提出一种端到端的导航线回归模型——KPLNet(keypoint plant line network),旨在实现精度与效率兼顾的农机导航线检测。KPLNet将导航线表示为关键点进行直接预测,无需任何后处理流程,从而在保证精度的前提下提升推理速度。在KPLNet的构建中,首先引入几何先验思想,构建中心注意力机制(central attention mechanism, CAM),CAM专注于关键位置特征的上下文整合,可提高关键特征的提取能力和效率。其次,针对导航线检测任务中目标尺度变化小且仅需进行单目标检测的特点,提出精简特征融合结构(single feature fusion, SFF),SFF结构在保证强大导航线特征提取能力的同时提升网络的实时性。此外,在模型训练阶段引入改进的双优化粒子群优化算法(dual-optimization particle swarm optimization, D-PSO)进行超参数优化,在不影响推理速度的前提下进一步提升模型精度。在玉米以及花生-大豆间作套种代表性场景下的试验结果表明,KPLNet在检测精度和实时性方面均优于YOLOv8n-seg、YOLO11n-seg等主流实例分割方法。KPLNet在玉米田和花生大豆套种田中的导航线端点检测精度( \textmAP_\textpose^50-95 )均达到了99.5%,航向角偏差分别低至0.79°和0.73°。在推理效率方面,KPLNet拥有更低的平均推理时间且彻底消除了后处理时间,最高帧率达119.2 帧/s,消融试验进一步验证了CAM、SFF和D-PSO对模型性能提升的具体贡献。试验结果验证了所提端到端范式在农业导航线检测任务中的优越性,为开发高效、可靠的农机自动导航系统提供了技术支撑。

     

    Abstract: Navigation line detection serves as a pivotal procedure in visual navigation approaches for agricultural machinery, and its core purpose is to extract the driving direction or path guiding agricultural machinery operation from field images in real time and with high accuracy. At present, mainstream solutions for this task are mostly based on instance segmentation technology, which first extracts crop row edges and then fits navigation lines through complex post-processing operations. Such methods are featured with cumbersome workflows and large computational overhead, which severely restrict their application in real-time agricultural navigation scenarios requiring high efficiency and instant response. To tackle these problems, this study proposes an end-to-end navigation line regression model called Keypoint Plant Line Network (KPLNet), aiming to realize agricultural machinery navigation line detection with both superior precision and efficiency. Different from traditional instance segmentation-based frameworks, KPLNet directly represents navigation lines in the form of keypoints for prediction without any post-processing procedures, thereby improving inference speed on the premise of ensuring detection accuracy.In the architectural design of KPLNet, the idea of geometric prior is firstly introduced to construct a Central Attention Mechanism (CAM). CAM concentrates on the contextual integration of features at key positions, which can effectively enhance the extraction capability and efficiency of critical features related to navigation lines, and lay a solid foundation for accurate keypoint positioning of navigation lines. Secondly, in view of the characteristics of small target scale variation and the demand for single-target detection only in navigation line detection tasks, a streamlined feature fusion structure named Single Feature Fusion (SFF) is proposed. The SFF structure maintains powerful feature extraction ability for navigation line features while optimizing the network structure to improve the real-time performance of the model, reducing redundant calculation and parameter costs to adapt to the edge deployment requirements of agricultural machinery. In addition, an improved Dual-Optimization Particle Swarm Optimization (D-PSO) algorithm is introduced for hyperparameter optimization in the model training stage. This optimization strategy further boosts the detection accuracy of the model without affecting inference speed, achieving a better balance between model performance and computational efficiency.Experiments are carried out on two representative field scenarios: maize fields and peanut-soybean intercropping fields, and the proposed KPLNet is compared with mainstream instance segmentation methods including YOLOv8n-seg and YOLO11n-seg. The experimental results show that KPLNet outperforms the above comparative methods in both detection accuracy and real-time performance. Concretely, the navigation line endpoint detection accuracy ( \textmAP_\textpose^50-95 ) of KPLNet reaches 99.5% in both maize fields and peanut-soybean intercropping fields, and the heading angle deviations are as low as 0.79° and 0.73° respectively, reflecting extremely high navigation line detection precision. In terms of inference efficiency, KPLNet has a lower average inference time and completely eliminates post-processing time consumption, with a maximum frame rate of 119.2 frames per second (fps), which fully meets the real-time operation requirements of agricultural automatic navigation. Ablation experiments further verify the specific contribution of CAM, SFF and D-PSO to the performance improvement of the model, confirming that each proposed module plays a positive role in enhancing detection accuracy and inference speed respectively.The experimental results validate the superiority of the proposed end-to-end paradigm in the task of agricultural navigation line detection, breaking through the bottlenecks of traditional instance segmentation methods in terms of complex workflows and high computational overhead. This research provides reliable technical support for the development of efficient and stable automatic navigation systems for agricultural machinery, and offers a novel technical reference for the intelligent and precise operation of agricultural equipment in complex field planting environments, which is of great significance for promoting the intelligent upgrading of agricultural production and the application of smart agricultural equipment.

     

/

返回文章
返回