Existing long-tailed recognition methods, aiming to train class-balanced models from long-tailed data, generally assume the models would be evaluated on the uniform test class distribution. However, practical test class distributions often violate this assumption (e.g., being long-tailed or even inversely long-tailed), which would lead existing methods to fail in real-world applications. In this work, we study a more practical task setting, called test-agnostic long-tailed recognition, where the training class distribution is long-tailed while the test class distribution is unknown and can be skewed arbitrarily. In addition to the issue of class imbalance, this task poses another challenge: the class distribution shift between the training and test samples is unidentified. To handle this task, we propose a new method, called Test-time Aggregating Diverse Experts, that presents two solution strategies: (1) a new skill-diverse expert learning strategy that trains diverse experts to excel at handling different class distributions from a single long-tailed training distribution; (2) a novel test-time expert aggregation strategy that leverages self-supervision to aggregate multiple experts for handling various unknown test distributions. We theoretically show that our method has a provable ability to simulate the test class distribution. Extensive experiments verify that our method achieves new state-of-the-art performance on both vanilla and test-agnostic long-tailed recognition, where only three experts are sufficient to handle arbitrarily varied test class distributions. Code is available at https://github.com/Vanint/TADE-AgnosticLT.
翻译:在这项工作中,我们研究了一个更实际的任务设置,即:测试班分配长,而测试班分配不尽人意,而且可以任意扭曲。除了班级不平衡问题之外,这个任务还构成另一个挑战:培训和测试样品之间的班级分配变化不明。为了完成这项任务,我们提出了一个新的方法,即称为试验时间多样性专家,它提出了两种解决方案战略:(1) 新的技能多样化专家学习战略,它培训各种专家精于处理单种长程培训分配的不同班级分配,而单种长程培训分配,(2) 新颖的测试时间专家汇总战略,利用自我监督的多类专家来处理各种不为人所知的班级/考试分配能力。我们从理论上讲,在测试方法上,我们既能进行不为人知的班级/考试分配,又能进行不为人知的。