Long-tailed classification poses a challenge due to its heavy imbalance in class probabilities and tail-sensitivity risks with asymmetric misprediction costs. Recent attempts have used re-balancing loss and ensemble methods, but they are largely heuristic and depend heavily on empirical results, lacking theoretical explanation. Furthermore, existing methods overlook the decision loss, which characterizes different costs associated with tailed classes. This paper presents a general and principled framework from a Bayesian-decision-theory perspective, which unifies existing techniques including re-balancing and ensemble methods, and provides theoretical justifications for their effectiveness. From this perspective, we derive a novel objective based on the integrated risk and a Bayesian deep-ensemble approach to improve the accuracy of all classes, especially the ``tail". Besides, our framework allows for task-adaptive decision loss which provides provably optimal decisions in varying task scenarios, along with the capability to quantify uncertainty. Finally, We conduct comprehensive experiments, including standard classification, tail-sensitive classification with a new False Head Rate metric, calibration, and ablation studies. Our framework significantly improves the current SOTA even on large-scale real-world datasets like ImageNet.
翻译:长尾分类因其在等级概率和对尾敏感风险方面的严重不平衡以及非对称性错误成本,构成了一项挑战。最近的一些尝试使用了重新平衡损失和混合方法,但大多是累赘,严重依赖经验结果,缺乏理论解释。此外,现有方法忽略了决定损失,而这种损失是尾尾细类不同成本的特点。本文从巴耶斯人决策理论的角度提出了一个一般性和原则性的框架,它统一了现有技术,包括重新平衡和混合方法,并为这些技术的有效性提供了理论上的理由。从这个角度出发,我们根据综合风险和巴耶斯人深思熟虑的方法提出了一个新的目标,以提高所有类别的准确性,特别是“尾尾巴”的准确性。此外,我们的框架允许任务调整性决定损失,在各种任务假设中提供可认为最佳的决定,同时具备量化不确定性的能力。最后,我们进行了全面实验,包括标准分类、对尾巴敏感分类,并采用新的假头指数、校准和缩缩略图研究。我们的框架大大改进了当前SOTAS数据库,甚至像现实世界。</s>