We propose MATE, the first Test-Time-Training (TTT) method designed for 3D data. It makes deep networks trained in point cloud classification robust to distribution shifts occurring in test data, which could not be anticipated during training. Like existing TTT methods, which focused on classifying 2D images in the presence of distribution shifts at test-time, MATE also leverages test data for adaptation. Its test-time objective is that of a Masked Autoencoder: Each test point cloud has a large portion of its points removed before it is fed to the network, tasked with reconstructing the full point cloud. Once the network is updated, it is used to classify the point cloud. We test MATE on several 3D object classification datasets and show that it significantly improves robustness of deep networks to several types of corruptions commonly occurring in 3D point clouds. Further, we show that MATE is very efficient in terms of the fraction of points it needs for the adaptation. It can effectively adapt given as few as 5% of tokens of each test sample, which reduces its memory footprint and makes it lightweight. We also highlight that MATE achieves competitive performance by adapting sparingly on the test data, which further reduces its computational overhead, making it ideal for real-time applications.
翻译:我们建议使用为 3D 数据设计的第一个测试时间培训方法MATE。 它使深网络在点云分类方面经过培训,对测试数据中分布变化具有很强的强度,而测试数据是无法预料的。 与现有的TTT方法一样,在测试时分布变化时,侧重于对二维图像进行分类,MATE还利用测试数据进行调适。 它的测试时间目标是蒙面自动编码器:每个测试点云在输入到网络之前有很大一部分点被移除,任务是重建全点云。 一旦网络更新,它被用来对点云进行分类。 我们用数个3D 对象分类数据集测试MATE, 并表明它大大改进深网络的强度,使其适应在3D 点云中常见的若干类型的腐败。 此外,我们证明MATE在调适量所需的点分数方面非常有效。 它可以有效地适应每个测试样品的5%的标志,从而减少其记忆足迹,使其变得轻重。 我们还强调,通过不断调整,使实际的MATETE进行高空测试,从而降低其真实的测试。