基于半监督学习的农机田-路轨迹分割方法

张晓强; 权雷; 李欣悦; 魏谦润; 吴才聪; 陈瑛

doi:10.11975/j.issn.1002-6819.202410153

摘要: 农机田-路轨迹分割是监测农机作业动态的关键一环，需要判断每个轨迹点的位置（在农田中作业或在道路上行驶），是农机应急调度、作业补贴、实施精准作业的重要依据。但传统田-路轨迹分割方法存在人工标注成本高、分割精度低等问题，为此，该研究提出一种基于半监督学习的农机田-路轨迹分割方法，利用海量的无标注轨迹数据提升田-路轨迹分割模型的性能。首先，预训练一个基于小规模、人工标注的原始训练数据构建的多视图特征融合的田-路轨迹分割模型，该模型通过统计分析从轨迹序列中提取农机运动特征，并运用Attention U-Net网络从轨迹图中提取视觉特征，然后通过BiLSTM（bidirectional long short-term memory）网络实现多视图特征融合。然后，使用预训练模型对大量未标注的轨迹数据进行自动田-路轨迹分割，生成伪标签样本，结合轮廓系数（silhouette coefficient, Sil）与戴维斯-博尔丁指数（davies-bouldin, DBI），筛选出高质量伪标签样本。最后，采用半监督学习中的自训练策略将筛选出的高质量样本迭代纳入到原始训练数据集中，重新训练田-路轨迹分割预训练模型。试验结果表明，该方法在小麦和水稻收割作业农机轨迹数据集上的分割准确率分别达到91.89%和84.19%，明显优于传统方法，可为农机作业动态监测解决方案提供参考。

Abstract: Field-road trajectory segmentation can serve as an essential task to calculate the work area and efficiency of agricultural machinery. The GNSS points during machinery movement can be generated within a field or on a road, without considering the field boundary information. However, the previous research has been focused mainly on the optimization of the model structures. It is still lacking in the large amount of unlabeled trajectory data. The field-road trajectory segmentation can also be severely limited to the operation of the agricultural machinery. Therefore, this study aims to enhance the performance of the field-road trajectory segmentation using the unlabeled trajectory data of the agricultural machinery. Firstly, a pre-trained model was employed for the field-road trajectory segmentation. The multi-view feature fusion was also used to extract the visual features from the trajectory images, semantic segmentation, and motion features from the trajectory data using statistical analysis. Subsequently, a BiLSTM (bidirectional long short-term memory) network was used to fuse the motion and visual features, in order to predict the field-road segmentation for the unlabeled trajectories. Secondly, a systematic evaluation was proposed using metric davies-bouldin silhouette-coefficient (DB_sil). The davies-bouldin index (DBI) and the silhouette coefficient (Sil) were also combined to consider the multiple key aspects of the trajectory label quality, such as the compactness and separation of the predicted labels, and the relative distance of data points to cluster centers. Therefore, the high-quality pseudo-label samples were obtained using semi-supervised learning. Finally, the labeled trajectory dataset was utilized to retrain the field-road trajectory segmentation. The upper and lower bounds were also established to regulate the training set size on the number of pseudo-label samples in the labeled dataset. The pseudo-label samples were integrated to positively maximize the model learning. Furthermore, the trajectory datasets of the agricultural machinery were then selected to validate the effectiveness of the improved model for wheat and paddy harvesting. The performance of the field-road trajectory segmentation was significantly enhanced in the generalization and training efficiency. The accuracies were achieved in 91.89% and 84.19% on the two datasets, respectively. The accuracy of the proposed method improved 7.24, 10.52, and 5.38 percentage points on the annotated wheat test dataset compared to DBSCAN+rules(DR), decision tree(DT), and improved 12.59, 12.79, and 7.03percentage points on the annotated rice test dataset, respectively. Additionally, ablation experiments verified the effectiveness of the DB_sil pseudo-labels metric after sample selection. The DB_sil approach significantly improved the precision, recall, F1 score, and accuracy, compared with the random sampling, DBI, and silhouette coefficient. This finding can provide valuable insights into the high performance of the field-road segmentation, in terms of the operation efficiency, area calculation, and monitoring of agricultural machinery.

基于半监督学习的农机田-路轨迹分割方法

Field-road trajectory segmentation method for agricultural machinery based on semi-supervised learning