Stack Overflow is one of the most popular Programming Community-based Question Answering (PCQA) websites that has attracted more and more users in recent years. When users raise or inquire questions in Stack Overflow, providing related questions can help them solve problems. Although there are many approaches based on deep learning that can automatically predict the relatedness between questions, those approaches are limited since interaction information between two questions may be lost. In this paper, we adopt the deep learning technique, propose an Attention-based Sentence pair Interaction Model (ASIM) to predict the relatedness between questions on Stack Overflow automatically. We adopt the attention mechanism to capture the semantic interaction information between the questions. Besides, we have pre-trained and released word embeddings specific to the software engineering domain for this task, which may also help other related tasks. The experiment results demonstrate that ASIM has made significant improvement over the baseline approaches in Precision, Recall, and Micro-F1 evaluation metrics, achieving state-of-the-art performance in this task. Our model also performs well in the duplicate question detection task of AskUbuntu, which is a similar but different task, proving its generalization and robustness.
翻译:用户在Stack overproduction中提出或查询问题时,提供相关问题可以帮助他们解决问题。虽然有许多基于深层学习的方法可以自动预测问题之间的关联性,但这些方法是有限的,因为两个问题之间的相互作用信息可能会丢失。在本文件中,我们采用了深层次的学习技术,建议采用基于关注的判刑对口互动模型(ASIM)来自动预测Stack overproflow上的问题之间的关系。我们采用了关注机制来捕捉问题之间的语义互动信息。此外,我们还预先培训和发布了用于这项工作软件工程领域的文字嵌入,这也可能有助于其他相关任务。实验结果表明,ASIM在预感、回召和微F1评估的基线方法上取得了显著的改进,从而实现了这项工作中的最新业绩。我们的模式还很好地完成了 " skUbuntu " 的重复问题探测任务,该任务十分健全,但性质不同。