高级检索+

基于IDCA-Unet的无人机影像绿豆花分割及花期量化

Segmentation and flowering period quantification of mung bean flowers in unmanned aerial vehicle images using IDCA-Unet

  • 摘要: 绿豆(Vigna radiata L.)植株花期性状的精准量化对于作物表型分析和育种研究具有重要意义,其中花期同期性的定量表征尤为关键,直接影响群体性状整齐度及品种成熟期一致性,进而关系到育种材料的有效筛选与品种适应性的准确评价。实现花的精准分割是进行量化分析的基础,然而,由于绿豆植株存在花序结构复杂、与背景的对比度较低等问题,使得传统语义分割模型难以有效应对这些挑战。该研究基于无人机遥感平台获取绿豆植株花荚期图像,提出了一种融合双通道注意力机制(dual channel attention, DCA)和实例归一化(instance normalization, IN)的IDCA-UNet模型。该模型通过DCA动态聚合全局平均池化与最大池化特征,实现通道维度的自适应特征增强,提升模型对微小花朵的敏感度;同时引入实例归一化层,解决了小批量训练中统计量估计不稳定问题,增强模型在复杂田间场景中的泛化能力。试验结果表明,IDCA-Unet在绿豆花分割任务中性能显著优于基准模型,平均交并比(mean intersection over union, mIoU)、平均精度均值(mean average precision, mAP)和F1值(F1 score)分别达到88.95%、93.47%和93.79%,相较于基准模型(VGG16-UNet with focal loss)分别提升了1.62、1.48和0.72个百分点。基于分割结果进一步量化分析绿豆植株花荚期的动态生长规律,发现其生长曲线呈现明显的“Λ”型,表明群体开花同步性较强。该研究提出的IDCA-UNet模型通过DCA与IN的协同优化,有效提升了复杂场景下绿豆植株花的分割精度,为绿豆植株花期量化分析提供了可靠的技术支持。为作物器官级表型分析提供了具备实际跨平台应用价值的解决方案,能够为精准农业中作物生长状态的准确评估、管理决策的科学制定等提供参考。

     

    Abstract: Mung bean (Vigna radiata L.) is one of the most favorite crop plants worldwide. It is often required for the precise quantification of the flowering traits during crop phenotyping and breeding. Particularly, the flowering synchrony can directly influence the population trait uniformity and varietal maturity consistency. Thereby the breeding materials can be refined to evaluate the varietal adaptability. However, the conventional semantic segmentation of flowers cannot fully meet the large-scale quantitative analysis in recent years. Some challenges remain in the low contrast with the background, such as the complex inflorescence structure of the mung bean plants, the minute size of the floral organs, their similar coloration to leaves and stems, coupled with the intense lighting variations, extensive occlusions, and overlapping in field environments. Alternatively, deep learning can be expected for the image segmentation of small targets in complex agricultural scenarios. Therefore, it is very necessary for sufficient feature extraction and model generalization to avoid the spatial information loss. In this study, an IDCA-UNet model was proposed to integrate a dual channel attention (DCA) mechanism and instance normalization (IN). The high-resolution images of the mung bean plants were also captured at the flowering and pod-setting stage using an unmanned aerial vehicle (UAV) remote sensing platform. According to the classic U-Net architecture, the DCA module was employed to dynamically aggregate some features from the global average pooling and max pooling, in order to realize the adaptive feature enhancement in the channel dimension. The high-resolution features effectively improved the sensitivity and recognition for the tiny flowers. Simultaneously, the Instance Normalization layers were introduced to replace the batch normalization layers. The unstable statistical estimation was avoided to reduce the internal covariate shift during small-batch training. The robustness and generalization were enhanced under complex and variable field conditions. Experimental results demonstrate that the IDCA-UNet significantly outperformed the various models, including the ResNet-UNet, YOLOv11, and DeepLabV3+ in the mung bean flower segmentation task, with the mIoU, mAP, and F1 scores of 88.95%, 93.47%, and 93.79%, respectively. There were the improvements of 1.62, 1.48, and 0.72 percentage points, compared with the VGG16-UNet benchmark model with the Focal loss. The high-precision segmentation was further quantitatively analyzed to study the dynamic growth patterns of the mung bean plants at the flowering and pod-setting stage. The flowering dynamics exhibited the "Λ"-shaped growth curve with a concentrated peak flowering period after daily flower count statistics and fitting, indicating the strong flowering synchrony within the population. Therefore, important evidence can be provided to assess the varietal maturity, consistency, and suitability for mechanical harvesting. The mung bean flower segmentation can also offer a practical cross-platform application for the organ-level crop phenotyping. Multi-source remote sensing data and temporal analysis can be integrated for environmental forecasting in the future. Thereby, the finding can serve as the robust theoretical and technical support to accurately assess the crop growth status, particularly for the high yield and efficiency breeding in precision agriculture.

     

/

返回文章
返回