Deep extreme classification (XC) seeks to train deep architectures that can tag a data point with its most relevant subset of labels from an extremely large label set. The core utility of XC comes from predicting labels that are rarely seen during training. Such rare labels hold the key to personalized recommendations that can delight and surprise a user. However, the large number of rare labels and small amount of training data per rare label offer significant statistical and computational challenges. State-of-the-art deep XC methods attempt to remedy this by incorporating textual descriptions of labels but do not adequately address the problem. This paper presents ECLARE, a scalable deep learning architecture that incorporates not only label text, but also label correlations, to offer accurate real-time predictions within a few milliseconds. Core contributions of ECLARE include a frugal architecture and scalable techniques to train deep models along with label correlation graphs at the scale of millions of labels. In particular, ECLARE offers predictions that are 2 to 14% more accurate on both publicly available benchmark datasets as well as proprietary datasets for a related products recommendation task sourced from the Bing search engine. Code for ECLARE is available at https://github.com/Extreme-classification/ECLARE.
翻译:深极分类 (XC) 旨在训练深层结构,这些结构可以标记一个数据点,其最相关的标签子组来自一个非常大的标签组。 XC的核心效用来自对培训中很少见的标签的预测。这种稀有的标签是个人化建议的关键,可以给用户带来喜悦和惊喜。然而,大量稀有标签和少量的培训数据每个稀有标签提供了重大的统计和计算挑战。最先进的深层XC方法试图通过纳入标签的文字描述来纠正这一点,但没有充分解决问题。本文展示了ECLARRE,这是一个可缩放的深层学习结构,不仅包含标签文本,而且还包含标签关联性,在几毫秒内提供准确的实时预测。ELCRE的核心贡献包括一个节制架构和可缩放的技术,以在百万个标签规模的标签相关图表中培训深层模型。特别是,ECLARE提供两种公开的基准数据集的预测更准确度为2至14 %,以及用于相关产品ERE数据组。Exgreal CRA/Exgistrical 的搜索数据库/Exgistrateal redustration for Exgistration for redustration for ex