This paper proposes a novel batch normalization strategy for test-time adaptation. Recent test-time adaptation methods heavily rely on the modified batch normalization, i.e., transductive batch normalization (TBN), which calculates the mean and the variance from the current test batch rather than using the running mean and variance obtained from the source data, i.e., conventional batch normalization (CBN). Adopting TBN that employs test batch statistics mitigates the performance degradation caused by the domain shift. However, re-estimating normalization statistics using test data depends on impractical assumptions that a test batch should be large enough and be drawn from i.i.d. stream, and we observed that the previous methods with TBN show critical performance drop without the assumptions. In this paper, we identify that CBN and TBN are in a trade-off relationship and present a new test-time normalization (TTN) method that interpolates the statistics by adjusting the importance between CBN and TBN according to the domain-shift sensitivity of each BN layer. Our proposed TTN improves model robustness to shifted domains across a wide range of batch sizes and in various realistic evaluation scenarios. TTN is widely applicable to other test-time adaptation methods that rely on updating model parameters via backpropagation. We demonstrate that adopting TTN further improves their performance and achieves state-of-the-art performance in various standard benchmarks.
翻译:本文提出了用于测试时间适应的新型批次正常化战略。最近的测试时间适应方法在很大程度上依赖经过修改的批次正常化,即转式批次正常化,这种方法计算出与当前测试批次的平均值和差异,而不是使用从源数据获得的运行平均值和差异,即常规批次正常化(CBN)。采用TBN,采用测试批次统计减轻了域变换造成的性能退化。然而,使用测试数据重新估计正常化统计数据取决于不切实际的假设,即测试批次应该足够大,从i.d.流中抽取,我们发现,以前采用TBN的方法显示,在没有假设的情况下,关键性业绩下降。在本文件中,我们确定CBN和TBN处于交易关系中,提出了一种新的测试-时间正常化(TTN)方法,通过调整CBN和TBN之间的重要性,以适应每个BN层的地位变敏感度。我们提议的TTN改进模型的坚固性能模型,在广泛的批次模型中和各种现实的TN更新基准中,我们广泛地采用了其他的绩效测试方法。