Automatic annotation of short-text data to a large number of target labels, referred to as Short Text Extreme Classification, has found numerous applications including prediction of related searches and product recommendation tasks. In this paper, we propose a convolutional architecture InceptionXML which is light-weight, yet powerful, and robust to the inherent lack of word-order in short-text queries encountered in search and recommendation tasks. We demonstrate the efficacy of applying convolutions by recasting the operation along the embedding dimension instead of the word dimension as applied in conventional CNNs for text classification. Towards scaling our model to datasets with millions of labels, we also propose InceptionXML+ framework which improves upon the shortcomings of the recently proposed dynamic hard-negative mining technique for label shortlisting by synchronizing the label-shortlister and extreme classifier. InceptionXML+ not only reduces the inference time to half but is also an order of magnitude smaller than previous state-of-the-art Astec in terms of model size. Through our proposed models, we outperform all existing approaches on popular benchmark datasets.
翻译:将短文本数据自动加注到大量目标标签(简称短文本极端分类)中,发现了许多应用,包括相关搜索和产品建议任务的预测。在本文件中,我们提议了一个进化结构的感知XML,它轻量、但有力,能够应对在搜索和建议任务中遇到的短文本询问中内在的缺乏单词顺序的情况。我们通过按照嵌入的尺寸而不是在常规CNN文本分类中应用的单词尺寸来显示应用变动的功效。为了用数百万个标签将我们的模型缩到数据集中,我们还提议了 " 感知XML+ " 框架,该框架通过同步标签光滑动器和极端分类器来改进最近提议的动态硬反式采矿技术的缺点。 " 感知XML+ " 不仅将推导时间降低到一半,而且比以往在模型大小方面最先进的Astec状态小得多。我们通过拟议的模型,在流行基准数据集上超越了所有现有方法。