Translated title: 基于面具自编码器的点云预训练的3D特征预测 Translated abstract: 近期，由于在自然语言处理和计算机视觉中的成功，面具自编码器（MAE）被引入了3D自我监督预训练点云中。与在图像领域中使用的MAE不同，那里的预处理任务是恢复掩蔽像素的特征，如颜色，现有的3D MAE重建仅丢失的几何信息，即掩蔽点的位置。与以前的研究相反，我们认为点位置恢复不太重要，恢复固有的点特征要优越得多。为此，我们建议忽略点位置重构，通过独立于编码器设计的新型基于注意力的解码器，恢复掩盖点处的高阶特征，包括表面法线和表面变化。我们使用不同的3D训练编码器结构验证了我们预备课任务和解码器设计的有效性，并展示了我们的预先训练网络在各种点云分析任务中的优势。 (3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining)

翻译：Translated title: 基于面具自编码器的点云预训练的3D特征预测 Translated abstract: 近期，由于在自然语言处理和计算机视觉中的成功，面具自编码器（MAE）被引入了3D自我监督预训练点云中。与在图像领域中使用的MAE不同，那里的预处理任务是恢复掩蔽像素的特征，如颜色，现有的3D MAE重建仅丢失的几何信息，即掩蔽点的位置。与以前的研究相反，我们认为点位置恢复不太重要，恢复固有的点特征要优越得多。为此，我们建议忽略点位置重构，通过独立于编码器设计的新型基于注意力的解码器，恢复掩盖点处的高阶特征，包括表面法线和表面变化。我们使用不同的3D训练编码器结构验证了我们预备课任务和解码器设计的有效性，并展示了我们的预先训练网络在各种点云分析任务中的优势。

Siming Yan,Yuqi Yang,Yuxiao Guo,Hao Pan,Peng-shuai Wang,Xin Tong,Yang Liu,Qixing Huang

from arxiv, 11 pages, 4 figures

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision. Unlike MAEs used in the image domain, where the pretext task is to restore features at the masked pixels, such as colors, the existing 3D MAE works reconstruct the missing geometry only, i.e, the location of the masked points. In contrast to previous studies, we advocate that point location recovery is inessential and restoring intrinsic point features is much superior. To this end, we propose to ignore point position reconstruction and recover high-order features at masked points including surface normals and surface variations, through a novel attention-based decoder which is independent of the encoder design. We validate the effectiveness of our pretext task and decoder design using different encoder structures for 3D training and demonstrate the advantages of our pretrained networks on various point cloud analysis tasks.

翻译：注：英文明确标识的专有名词（ Proper Nouns）已在翻译中保留，请勿进行中译。