This paper describes our submission for the SemEval-2019 Suggestion Mining task. A simple Convolutional Neural Network (CNN) classifier with contextual word representations from a pre-trained language model was used for sentence classification. The model is trained using tri-training, a semi-supervised bootstrapping mechanism for labelling unseen data. Tri-training proved to be an effective technique to accommodate domain shift for cross-domain suggestion mining (Subtask B) where there is no hand labelled training data. For in-domain evaluation (Subtask A), we use the same technique to augment the training set. Our system ranks thirteenth in Subtask A with an $F_1$-score of 68.07 and third in Subtask B with an $F_1$-score of 81.94.
翻译:本文介绍了我们对SemEval-2019建议采矿任务的呈件。在量刑分类中使用了一个简单的进化神经网络分类器,该分类器具有来自预先培训的语言模式的背景文字说明。该模型使用三重训练(一种半监督的隐性数据标签跟踪机制)进行培训。三重训练被证明是一种有效的技术,用于在没有手标培训数据的情况下进行跨界建议采矿(Subtask B)的域转移。在现场评估(Subtask A)中,我们使用同样的技术来扩大培训内容。我们的系统在Subtask A中排名第十三,在Subtask B中排名第13位,在Suptask中排名68.07美元,在Subtask B中排名第三,在81.94美元中排名F_1美元。