Community question answering and discussion platforms such as Reddit, Yahoo! answers or Quora provide users the flexibility of asking open ended questions to a large audience, and replies to such questions maybe useful both to the user and the community on certain topics such as health, sports or finance. Given the recent events around COVID-19, some of these platforms have attracted 2000+ questions from users about several aspects associated with the disease. Given the impact of this disease on general public, in this work we investigate ways to improve the ranking of user generated answers on COVID-19. We specifically explore the utility of external technical sources of side information (such as CDC guidelines or WHO FAQs) in improving answer ranking on such platforms. We found that ranking user answers based on question-answer similarity is not sufficient, and existing models cannot effectively exploit external (side) information. In this work, we demonstrate the effectiveness of different attention based neural models that can directly exploit side information available in technical documents or verified forums (e.g., research publications on COVID-19 or WHO website). Augmented with a temperature mechanism, the attention based neural models can selectively determine the relevance of side information for a given user question, while ranking answers.
翻译:社区问题解答和讨论平台,如Reddit、Yahoo!答案或Quora等,为用户提供了向广大受众提出开放式问题的灵活度,对这些问题的答复对用户和社区在卫生、体育或金融等某些专题上的答案都可能有用。鉴于最近围绕COVID-19的事件,其中一些平台吸引了用户对与该疾病有关的若干方面的2000+问题。鉴于这一疾病对公众的影响,我们在这项工作中调查了如何改进用户对COVID-19的答复的排序。我们特别探讨了外部附带信息技术来源(如CDC准则或WHO FAQs)在改进这类平台的答案排名方面的效用。我们发现,根据问答的相似性排列用户答案是不够的,现有模型无法有效地利用外部(侧面)信息。在这项工作中,我们展示了基于神经模型的不同关注的有效性,这些模型可以直接利用技术文件或经核实的论坛(例如COVID-19或WHO网站的研究出版物)中可获得的侧面信息。我们用温度机制强化了外部技术来源(如CDC准则或WHO FAQs)的注意度,而基于神经模型可以有选择地决定侧信息在用户答案排名上的相关性。