There has a been a significant rise in the use of Community Question Answering sites (CQAs) over the last decade owing primarily to their ability to leverage the wisdom of the crowd. Duplicate questions have a crippling effect on the quality of these sites. Tackling duplicate questions is therefore an important step towards improving quality of CQAs. In this regard, we propose two neural network based architectures for duplicate question detection on Stack Overflow. We also propose explicitly modeling the code present in questions to achieve results that surpass the state of the art.
翻译:过去十年来,使用社区问答网站的情况显著增加,主要是因为这些网站有能力利用人群的智慧,重复问题对这些网站的质量产生了严重影响,因此,处理重复问题是提高社区问答网站质量的一个重要步骤,在这方面,我们提议建立两个神经网络结构,用于在堆积溢流中重复探测问题,我们还提议在问题中明确模拟现有的代码,以取得超过最新水平的成果。