Partition-based methods are increasingly-used in extreme multi-label classification (XMC) problems due to their scalability to large output spaces (e.g., millions or more). However, existing methods partition the large label space into mutually exclusive clusters, which is sub-optimal when labels have multi-modality and rich semantics. For instance, the label "Apple" can be the fruit or the brand name, which leads to the following research question: can we disentangle these multi-modal labels with non-exclusive clustering tailored for downstream XMC tasks? In this paper, we show that the label assignment problem in partition-based XMC can be formulated as an optimization problem, with the objective of maximizing precision rates. This leads to an efficient algorithm to form flexible and overlapped label clusters, and a method that can alternatively optimizes the cluster assignments and the model parameters for partition-based XMC. Experimental results on synthetic and real datasets show that our method can successfully disentangle multi-modal labels, leading to state-of-the-art (SOTA) results on four XMC benchmarks.
翻译:在极端多标签分类(XMC)问题上,基于分区的分区标签分配问题由于可伸缩到大型产出空间(例如,数百万或更多)而日益使用基于分区的方法。然而,现有的方法将大标签空间分割成相互排斥的集群,如果标签具有多式和丰富的语义,这是次最佳的。例如,标签“Apple”可以是水果或品牌名称,这会导致以下研究问题:我们能否将这些多模式标签与为下游XMC任务量身定制的非排他性集群分解开来?在本文件中,我们表明基于分区的XMC的标签分配问题可以作为一种优化问题,目的是最大限度地提高精确率。这导致一种高效的算法,形成灵活和重叠的标签集群,以及一种可以优化基于分区的XMC的集群分配和模型参数的方法。合成和真实数据集的实验结果显示,我们的方法可以成功地分离出多模式标签,导致四个XMC基准的状态(SOTA)结果。