Abstract:
As forest resource management increasingly emphasizes precision and digitization, unmanned aerial vehicle (UAV) technology has emerged as a promising tool for intelligent, automated forest inventory. However, challenges such as imprecise crown segmentation, limited accuracy in single-tree volume estimation, and the high cost of high-precision Light Detection and Ranging (LiDAR) point cloud data have hindered its broad adoption. To address these limitations, this study developed a method for single-tree volume estimation from UAV-derived visible-light imagery and low-density point cloud data, focusing on enhancing crown segmentation precision and improving volume estimation through multi-source feature integration. A crown segmentation network, CrownSeg, was introduced; it operates on UAV visible-light imagery and is built on the YOLOv11 framework with several specialized modules. The ScaleEdgeExtractor (SEE) module employed a three-stage mechanism (shallow filtering, edge enhancement, and cross-layer fusion), combining directional Sobel convolution, multi-scale downsampling, and adaptive edge-feature fusion to preserve and enhance crown boundary information. The Gated Feature Pyramid Network (GatedFPN) adopted a bi-directional hierarchical structure with spatial-channel dual-attention gating, enabling closed-loop multi-scale optimization and more refined crown segmentation across different canopy densities. The C2BRA module introduced bi-level routing attention and a channel-spatial dual-attention mechanism to enhance boundary perception while suppressing background interference from complex forest environments. Meanwhile, the DilatedFusion (DF) module leveraged parallel dilated convolutions with shared kernels to extract multi-granularity contextual information, improving adaptability to trees of varying shapes and sizes.
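As a rough illustration of the edge-enhancement idea behind the SEE module, the sketch below applies directional Sobel convolutions to a single-channel image and combines the horizontal and vertical gradients into an edge map. The kernels, the naive convolution loop, and the toy "crown" patch are standard illustrative assumptions, not the paper's implementation.

```python
import numpy as np

# Standard directional Sobel kernels (horizontal and vertical gradients).
SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)
SOBEL_Y = SOBEL_X.T

def conv2d(image, kernel):
    """Valid-mode 2-D correlation of a single-channel image with a 3x3 kernel."""
    h, w = image.shape
    out = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            out[i, j] = np.sum(image[i:i + 3, j:j + 3] * kernel)
    return out

def edge_magnitude(image):
    """Combine directional gradient responses into a single edge map."""
    gx = conv2d(image, SOBEL_X)
    gy = conv2d(image, SOBEL_Y)
    return np.hypot(gx, gy)

# Toy example: a bright disc (a stand-in for a crown) on a dark background.
yy, xx = np.mgrid[:32, :32]
patch = ((yy - 16) ** 2 + (xx - 16) ** 2 < 100).astype(float)
edges = edge_magnitude(patch)
# The response peaks on the crown boundary and vanishes in the flat interior,
# which is what makes such features useful for delineating crown outlines.
print(edges.shape, edges[15, 15], edges.max() > 0)
```

In the actual network these responses would be learned features fused back into the backbone rather than a fixed filter, but the principle of amplifying boundary gradients is the same.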
These modules worked collaboratively to enhance spatial detail retention and semantic feature abstraction, yielding high-quality segmentation outputs. For volume estimation, a model was developed that combines crown morphological, spectral, and textural features extracted from UAV imagery with tree height data from low-density LiDAR point clouds. A progressive feature-combination strategy and a weighted ensemble learning technique were employed to integrate these multi-source inputs for robust prediction. The CrownSeg network achieved an Average Precision at an Intersection over Union threshold of 0.5 (AP50) of 94.9% and an AP50-95 of 66.2%, surpassing the baseline model by 1.5 and 3.8 percentage points, respectively, owing to enhanced boundary delineation and multi-scale feature representation. The weighted ensemble model for volume estimation yielded a coefficient of determination (R²) of 0.9215, a mean absolute error (MAE) of 0.0228 m³, and a mean absolute percentage error (MAPE) of 17.00%, outperforming standalone models. Comparative analyses showed that integrating morphological, spectral, and textural features significantly reduced estimation errors, with the ensemble model demonstrating superior stability and generalization across diverse forest conditions. These findings were validated with experimental data from 749 individual trees in a plantation forest, where error metrics were consistently lower than those of individual algorithms such as Random Forest or neural networks. Visual inspections confirmed CrownSeg's strength in handling complex canopy structures and minimizing segmentation errors in dense or heterogeneous stands. Together, the high-precision crown segmentation network and the accurate single-tree volume estimation model leverage UAV-based data to offer a cost-effective, efficient alternative to traditional ground-based surveys, providing a practical technical framework for UAV remote sensing in precision forestry. Future work should explore multi-modal data integration, such as combining LiDAR and optical imagery, to further improve segmentation and estimation accuracy in varied forest environments.
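The weighted-ensemble step can be sketched as follows: each base learner receives a weight inversely proportional to its validation error, and the final volume prediction is the weighted average of the base predictions. The inverse-MAE weighting rule and the toy numbers here are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def ensemble_weights(val_maes):
    """Inverse-error weights, normalised to sum to 1: better models count more."""
    inv = 1.0 / np.asarray(val_maes, dtype=float)
    return inv / inv.sum()

def ensemble_predict(base_preds, weights):
    """Weighted average over base models; base_preds has shape (models, samples)."""
    return np.average(np.asarray(base_preds, dtype=float), axis=0, weights=weights)

# Toy example: three base regressors with validation MAEs of 0.02, 0.03, 0.05 m^3
# predicting the volumes of two trees.
w = ensemble_weights([0.02, 0.03, 0.05])
preds = [[0.10, 0.20],   # model 1
         [0.12, 0.18],   # model 2
         [0.09, 0.25]]   # model 3
volume = ensemble_predict(preds, w)
print(np.round(w, 3), np.round(volume, 4))
```

Such a scheme keeps any single weak learner from dominating the prediction, which is consistent with the ensemble's reported stability across forest conditions.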