Multi-label text classification (MLTC) aims to annotate documents with the most relevant labels from a number of candidate labels. In real applications, the distribution of label frequency often exhibits a long tail, i.e., a few labels are associated with a large number of documents (a.k.a. head labels), while a large fraction of labels are associated with a small number of documents (a.k.a. tail labels). To address the challenge of insufficient training data on tail label classification, we propose a Head-to-Tail Network (HTTN) to transfer the meta-knowledge from the data-rich head labels to data-poor tail labels. The meta-knowledge is the mapping from few-shot network parameters to many-shot network parameters, which aims to promote the generalizability of tail classifiers. Extensive experimental results on three benchmark datasets demonstrate that HTTN consistently outperforms the state-of-the-art methods. The code and hyper-parameter settings are released for reproducibility
翻译:多标签文本分类(MLTC)的目的是用一些候选标签中最相关的标签来说明文件。在实际应用中,标签频率的分布往往显示长尾,即少数标签与大量文件(a.k.a.头标签)相关,而大部分标签与少量文件(a.k.a.尾标签)相关。为了应对尾标签分类培训数据不足的挑战,我们提议建立一个头对尾标签网络(HTTN),将富数据头标签的元知识转移到缺数据尾标签。元知识是从几发网络参数到多发网络参数的映射,目的是促进尾夹分类器的通用性。三个基准数据集的广泛实验结果显示,HTTN始终超越了最新方法。代码和超参数设置被发布,以便进行重新校正。