Test-time adaptation (TTA) aims to adapt a trained classifier using online unlabeled test data only, without any information related to the training procedure. Most existing TTA methods adapt the trained classifier using the classifier's prediction on the test data as pseudo-label. However, under test-time domain shift, accuracy of the pseudo labels cannot be guaranteed, and thus the TTA methods often encounter performance degradation at the adapted classifier. To overcome this limitation, we propose a novel test-time adaptation method, called Test-time Adaptation via Self-Training with nearest neighbor information (TAST), which is composed of the following procedures: (1) adds trainable adaptation modules on top of the trained feature extractor; (2) newly defines a pseudo-label distribution for the test data by using the nearest neighbor information; (3) trains these modules only a few times during test time to match the nearest neighbor-based pseudo label distribution and a prototype-based class distribution for the test data; and (4) predicts the label of test data using the average predicted class distribution from these modules. The pseudo-label generation is based on the basic intuition that a test data and its nearest neighbor in the embedding space are likely to share the same label under the domain shift. By utilizing multiple randomly initialized adaptation modules, TAST extracts useful information for the classification of the test data under the domain shift, using the nearest neighbor information. TAST showed better performance than the state-of-the-art TTA methods on two standard benchmark tasks, domain generalization, namely VLCS, PACS, OfficeHome, and TerraIncognita, and image corruption, particularly CIFAR-10/100C.
翻译:测试时间适应(TTA)旨在仅使用在线未贴标签的测试数据调整经过培训的分类器,而没有与培训程序相关的任何信息。大多数现有的TTA方法使用分类器对测试数据所作的伪标签预测对经过培训的分类器进行调整。然而,在测试时间的域变换中,假标签的准确性得不到保证,因此,TTA方法往往在调整的分类器上遇到性能退化。为了克服这一限制,我们建议一种新型测试时间适应方法,即:用最接近的临近信息进行自我培训的测试时间适应(TAST),该方法由以下程序组成:(1) 在经过培训的功能提取器的顶部增加可培训的调整模块;(2) 使用最近的邻居信息,新定义测试数据的假标签分配;(3) 在测试时间内仅培训这些模块的几度,与最近的邻居的假标签分配值相匹配,因此测试数据的原型等级分配。 假标签生成基于测试办公室测试数据的基本直觉,即使用最接近的图像转换的TA-RFA 服务器的初始域域域域域域中的随机数据, 可能用SAL 共享数据。</s>