This paper compares and ranks 8 UDA validation methods. Validators estimate model accuracy, which makes them an essential component of any UDA train-test pipeline. We rank these validators to indicate which of them are most useful for the purpose of selecting optimal model checkpoints and hyperparameters. To the best of our knowledge, this large-scale benchmark study is the first of its kind in the UDA field. In addition, we propose three new validators that outperform all the existing checkpoint-based validators that we were able to find in the existing literature. Code is available at https://www.github.com/KevinMusgrave/powerful-benchmarker.
翻译:本文件比较了8个UDA验证方法,并将其排在第8位。验证人估计了模型准确性,使其成为UDA任何测试输油管的基本组成部分。我们对这些验证人进行排位,以表明其中哪些对选择最佳示范检查站和超参数最为有用。据我们所知,这一大规模基准研究是UDA领域同类研究中的首个。此外,我们提议了3个新的验证人,这些验证人比我们在现有文献中能找到的基于检查站的现有验证人都好。代码见https://www.github.com/KevinMusgrave/powerful-benchmarkerer。