Multi-label text classification aims to extract all the related labels from a sentence, which can be viewed as a sequence generation problem. However, the labels in training dataset are unordered. We propose to treat it as a direct set prediction problem and don't need to consider the order of labels. Besides, in order to model the correlation between labels, the adjacency matrix is constructed through the statistical relations between labels and GCN is employed to learn the label information. Based on the learned label information, the set prediction networks can both utilize the sentence information and label information for multi-label text classification simultaneously. Furthermore, the Bhattacharyya distance is imposed on the output probability distributions of the set prediction networks to increase the recall ability. Experimental results on four multi-label datasets show the effectiveness of the proposed method and it outperforms previous method a substantial margin.
翻译:多标签文本分类旨在从一个句子中提取所有相关的标签,这可以视为序列生成问题。然而,训练数据集中的标签是无序的。我们建议将其视为直接的集合预测问题,无需考虑标签的顺序。此外,为了建模标签之间的相关性,通过标签之间的统计关系构建邻接矩阵,并使用GCN学习标签信息。基于学习到的标签信息,集合预测网络可以同时利用句子信息和标签信息进行多标签文本分类。此外,对集合预测网络输出的概率分布施加巴氏距离以增加召回能力。在四个多标签数据集上的实验结果表明了所提出方法的有效性,其表现优于之前的方法。