Our work focuses on tackling the challenging but natural visual recognition task of long-tailed data distribution (\ie, a few classes occupy most of the data, while most classes have rarely few samples). In the literature, class re-balancing strategies (\eg, re-weighting and re-sampling) are the prominent and effective methods proposed to alleviate the extreme imbalance for dealing with long-tailed problems. In this paper, we firstly discover that these re-balancing methods achieving satisfactory recognition accuracy owe to that they could significantly promote the classifier learning of deep networks. However, at the same time, they will unexpectedly damage the representative ability of the learned deep features to some extent. Therefore, we propose a unified Bilateral-Branch Network (BBN) to take care of both representation learning and classifier learning simultaneously, where each branch does perform its own duty separately. In particular, our BBN model is further equipped with a novel cumulative learning strategy, which is designed to first learn the universal patterns and then pay attention to the tail data gradually. Extensive experiments on four benchmark datasets, including the large-scale iNaturalist ones, justify that the proposed BBN can significantly outperform state-of-the-art methods. Furthermore, validation experiments can demonstrate both our preliminary discovery and effectiveness of tailored designs in BBN for long-tailed problems. Our method won the first place in the iNaturalist 2019 large scale species classification competition, and our code is open-source and available at https://github.com/Megvii-Nanjing/BBN
翻译:我们的工作重点是应对长期数据分发的具有挑战性但自然的视觉识别任务(在大多数数据中,少数类占大多数数据,而大多数类则很少有样本 ) 。 在文献中,为缓解处理长期问题极端不平衡现象而提出的突出而有效的方法,是班级重新平衡战略(如,重新加权和再抽样 ) 。 在本文中,我们首先发现,这些重新平衡方法能够实现令人满意的认知准确度,因为它们可以大大促进分类者对深层网络的学习。但与此同时,它们会意外地损害所学到的深度特征的代表性能力。 因此,我们提议建立一个统一的双边-布拉奇网络(BBN),以同时兼顾代表性学习和分类方法(例如,重新加权和再抽样 ) 。 特别是,我们的BBN模型进一步配备了一个新的累积学习战略,目的是首先了解普遍模式,然后逐渐注意尾端数据。 广泛试验四个基准数据集,包括大规模内部分类,在大规模内部分类中,我们提出的大规模内部分类/内部分类方法, 证明我们提出的BBL的大规模测试方法能够大大地在20年的升级方法中证明我们B级的升级的升级。