The usage and amount of information available on the internet increase over the past decade. This digitization leads to the need for automated answering system to extract fruitful information from redundant and transitional knowledge sources. Such systems are designed to cater the most prominent answer from this giant knowledge source to the user query using natural language understanding (NLU) and thus eminently depends on the Question-answering(QA) field. Question answering involves but not limited to the steps like mapping of user question to pertinent query, retrieval of relevant information, finding the best suitable answer from the retrieved information etc. The current improvement of deep learning models evince compelling performance improvement in all these tasks. In this review work, the research directions of QA field are analyzed based on the type of question, answer type, source of evidence-answer, and modeling approach. This detailing followed by open challenges of the field like automatic question generation, similarity detection and, low resource availability for a language. In the end, a survey of available datasets and evaluation measures is presented.
翻译:过去十年来,互联网上现有信息的使用和数量在增加。这种数字化导致需要自动回答系统,以便从多余和过渡性知识来源中提取富有成效的信息。这些系统的设计是为了利用自然语言理解(NLU),从这个巨大的知识来源中,对用户查询提供最突出的答案,因此明显地取决于问答领域。回答问题涉及但不限于诸如用户问题绘图等步骤,以进行相关查询、检索相关信息、从检索的信息中找到最合适的答案等。当前深层次学习模式的改进表明所有这些任务的业绩都有了令人信服的改进。在这次审查工作中,根据问题类型、回答类型、证据回答来源和建模方法分析了质量评估领域的研究方向。随后详细介绍了实地的公开挑战,如自动生成问题、相似性探测和一种语言的资源匮乏。最后,介绍了对现有数据集和评估措施的调查。