基于语义-结构协同的盆栽植物自修复高斯重建

丁维龙; 桑沄; 张吉光; 徐利锋; 陈亮

doi:10.11975/j.issn.1002-6819.202508155

基于语义-结构协同的盆栽植物自修复高斯重建

Semantic-Structure Synergistic Self-Repair Gaussian Reconstruction for Potted Plants

摘要

摘要: 针对盆栽植物三维重建中形态复杂、器官遮挡和结构不连续等问题，为提升模型的几何完整性并增强语义表达的清晰度，该研究提出一种基于语义-结构协同的自修复高斯重建方法（4SPP）。该方法通过构建高斯语义划分与渐进式结构补全两个模块，实现多视角语义信息向三维空间的映射，并基于渲染误差对结构缺失区域进行动态修复，从而提升模型的几何完整性与语义一致性。从而修复遮挡与弱纹理导致的结构不连续问题。试验结果表明，在9种真实盆栽植物数据上，所提方法相较于3DGS（3D Gaussian splatting），平均峰值信噪比（peak signal to noise ratio，PSNR）提升0.33 dB，在结构相似性（structural similarity，SSIM）与学习感知图像块相似度（learned perceptual image patch similarity，LPIPS）上亦表现出提升。在公开植物数据集上，平均PSNR达到23.884 dB，较3DGS提升0.75 dB，并优于其他对比方法。综上所述，所提方法能够有效提升盆栽植物三维重建的结构完整性与表达能力，为数字农业中的植物建模、表型分析与产量预测等任务提供可靠的技术支撑。

Abstract: Three-dimensional(3D) reconstruction of potted plants remained challenging due to complex morphology and self-occlusion, while existing reconstruction methods often failed to accurately recover fine structures and maintain geometric completeness in such scenarios. These limitations primarily arose from insufficient structural representation and lack of semantic constraints in current approaches. We proposed a Gaussian-based reconstruction framework named Semantic-Structural Synergy-based Self-repairing gaussian reconstruction for potted plants (4SPP). The method comprised two core modules. First, a gaussian semantic dividing stage transferred 2D organ-level segmentation results into the 3D gaussian space, assigning semantic labels to each Gaussian by aggregating multi-view mask responses. Uncertain points were refined through a skeleton-guided proximity check and K-nearest neighbour propagation to ensure local semantic consistency. Second, a progressive structure repair stage detected persistent high-error regions via temporal rendering error analysis. For each detected region, candidate positions for new gaussians were estimated from the local neighbourhood depth, then refined toward the spatial centre of semantically consistent neighbours. The new gaussians inherited averaged geometric and appearance attributes from their neighbours, ensuring smooth integration into the existing structure and reducing the risk of artifacts. This study conducted experiments on nine varieties of potted plants from the real world, each with distinct structural characteristics. Results showed that the proposed method achieved an average improvement of 0.33 dB in peak signal-to-noise ratio(PSNR) compared with standard 3d gaussian splatting(3DGS), along with consistent gains in structural similarity index measure(SSIM) and reductions in learned perceptual image patch similarity(LPIPS). On a public plant dataset, the method achieved an average PSNR of 23.884 dB, outperforming multiple baseline approaches with improvements ranging from 0.50 to 0.75 dB. Visual comparisons demonstrated that the proposed method effectively alleviated reconstruction artifacts such as broken branches, structural gaps, and floating noise, particularly in regions affected by occlusion and weak texture. The reconstructed models exhibited improved structural continuity and more coherent organ-level distribution across different plant types. The efficiency test shows that after introducing of the repair module increased training time by approximately 14.9 percent and memory consumption by about 10.4 percent, which remained acceptable relative to the achieved improvements. Ablation studies further verified that both the semantic dividing module and the structure repair module contributed significantly to the overall performance, and their integration produced the most robust results. The proposed framework effectively improves the geometric completeness and semantic consistency of potted plant reconstruction. By incorporating multi-view semantic information into the 3D Gaussian representation and integrating a rendering error–driven progressive structure repair mechanism, the method alleviates geometric deficiencies in regions such as branch connections and occluded areas caused by insufficient structure from motion(SfM) initialization. Meanwhile, the introduction of organ-level semantic constraints helps regulate the distribution of gaussians, thereby reducing structural artifacts and improving overall reconstruction fidelity. This work provides a reliable technical solution for high-fidelity plant reconstruction and demonstrates promising potential in applications such as digital agriculture, plant phenotyping, and ecological analysis. Future work will explore the integration of generative models with botanical constraints to improve structure recovery in severely occluded regions, aiming to further enhance reconstruction completeness.

HTML全文

参考文献(37)

施引文献

资源附件(0)