一种基于多重注意力机制的点云深度学习解码器

陈炜哲; 高金金; 李潞洋

一种基于多重注意力机制的点云深度学习解码器

A Decoder Based on Multiple Attention Mechanisms for Deep Learning on Point Cloud

摘要

摘要: 针对点云深度学习模型中解码器特征上采样存在的空间与特征结构损失的问题，提出一种基于注意力机制的特征插值上采样解码器方法.该解码器主要由空间注意力与特征注意力插值特征上采样组成.首先，根据点的空间位置特征之间的注意力关系，通过空间注意力插值计算上采样点的空间插值特征，然后利用空间插值特征与下采样点之间的特征注意力关系计算特征插值，最终将插值特征与编码特征跳跃连接，并进行特征混合后输出解码特征.在实验中，将本文提出的解码器嵌入到多种点云深度学习模型中，并在ShapeNet与ModelNet两种数据集上验证解码器的有效性.试验结果表明，与传统的三点线性插值模型相比，本文提出的方法在零件分割任务中平均重叠度提高0.5%,法向量估计任务中平均余弦距离误差降低13%,这充分验证了解码器的有效性.

Abstract: Aiming at the problem of spatial and feature structure loss in the feature upsampling of decoder for deep learning model on point cloud, an attention-based decoder for feature interpolation upsampling was proposed. The decoder mainly consists of spatial attention and feature attention feature interpolation upsampling. First, the spatial interpolated features of the upsampled points were calculated by spatial attention interpolation based on the attention relationship between the spatial location features of the points, then the feature interpolation was calculated from the feature attention relationship between the spatial interpolated features and the subsampling points, finally the interpolated features were skip connected with the encoder features and the decoder features were output after feature mix. In the experiments, the proposed decoder was embedded into various deep learning models on point clouds, and the effectiveness of the decoder was verified on both ShapeNet and ModelNet datasets. The experimental results show that the proposed method improved the average overlap by 0.5% in the part segmentation task and reduced the average cosine distance error by 13% in the normal vector estimation task compared with the traditional three-point linear interpolation model, which fully verified the effectiveness of the decoder.

HTML全文

参考文献(16)

施引文献

资源附件(0)