高级检索+

基于语义-结构协同的盆栽植物自修复高斯重建

Self-repair Gaussian reconstruction for potted plants using semantic-structure synergy

  • 摘要: 针对盆栽植物三维重建中形态复杂、器官遮挡和结构不连续等问题,为提升模型的几何完整性并增强语义表达的清晰度,该研究提出一种基于语义-结构协同的自修复高斯重建方法(4SPP)。该方法通过构建高斯语义划分与渐进式结构补全两个模块,实现多视角语义信息向三维空间的映射,并基于渲染误差对结构缺失区域进行动态修复,从而提升模型的几何完整性与语义一致性。从而修复遮挡与弱纹理导致的结构不连续问题。试验结果表明,在9种真实盆栽植物数据上,所提方法相较3DGS(3D Gaussian splatting)平均峰值信噪比(peak signal to noise ratio,PSNR)提升0.33分贝(decibel,dB),在结构相似性(structural similarity,SSIM)与学习感知图像块相似度(learned perceptual image patch similarity,LPIPS)上亦表现出提升。在公开植物数据集上,平均PSNR达到23.884 dB,较3DGS提升0.75 dB,并优于其他对比方法。综上所述,所提方法能够有效提升盆栽植物三维重建的结构完整性与表达能力,为数字农业中的植物建模、表型分析与产量预测等任务提供可靠的技术支撑。

     

    Abstract: Three-dimensional (3D) reconstruction of potted plants remained challenging, due to complex morphology and self-occlusion. Existing reconstruction cannot accurately recover fine structures and geometric completeness in such scenarios. It is often required for sufficient structural representation and semantic constraints. In this study, a self-Repair Gaussian reconstruction was proposed, named Semantic-Structural Synergy-based Self-repairing gaussian reconstruction for potted plants (4SPP). Two modules included: 1) The Gaussian semantic assignment stage projected 2D organ-level segmentation into the 3D Gaussian space and labeled each Gaussian by aggregating multi-view mask responses. Uncertain points were refined via a skeleton-guided proximity check and K-nearest neighbor propagation, particularly for local semantic consistency. 2) A progressive structure repair stage detected persistent high-error regions via temporal rendering error analysis. Candidate positions for new Gaussians were estimated from the local neighborhood depth in each detected region, then refined toward the spatial center of semantically consistent neighbors. The new gaussians inherited average geometric and appearance attributes from their neighbors, leading to smooth integration into the existing structure with less risk of artifacts. A series of experiments was conducted on nine varieties of potted plants from the real world, each of which shared structural characteristics. Results showed that the improved model achieved an average improvement of 0.33 decibel (dB) in the peak signal-to-noise ratio (PSNR), compared with standard 3d gaussian splatting (3DGS), along with consistent gains in structural similarity index measure (SSIM) and reductions in learned perceptual image patch similarity (LPIPS). An average PSNR of 23.884 dB was obtained on a public plant dataset, which was improved from 0.50 to 0.75 dB, compared with the multiple baselines. Visual comparisons demonstrated that the reconstruction artifacts were effectively alleviated under occlusion and weak texture, such as broken branches, structural gaps, and floating noise. Different plant types shared the structural continuity and a more coherent organ-level distribution after reconstruction. The efficiency test showed that the training time increased by 14.9 percent in the repair module, while memory consumption increased by about 10.4 percent, which remained acceptable relative to the improvements. Ablation studies further verified that both the semantic assignment and the structure repair module contributed significantly to the overall performance. Their integration also produced the most robust performance. The framework effectively improved the geometric completeness and semantic consistency of potted plant reconstruction. Multi-view semantic information was incorporated into the 3D Gaussian representation, with the rendering error–driven progressive structure repair mechanism. Geometric deficiencies caused by insufficient structure from motion (SfM) initialization were alleviated in the branch connections and occluded areas. Meanwhile, the organ-level semantic constraints were introduced to regulate the distribution of Gaussians, thereby reducing structural artifacts for the overall reconstruction fidelity. This work can provide a reliable technical solution for high-fidelity plant reconstruction, particularly for promising potential applications, such as digital agriculture, plant phenotyping, and ecological analysis. Reconstruction completeness can also be enhanced to integrate generative models with botanical constraints for the structure recovery in severely occluded regions.

     

/

返回文章
返回