Conversational Machine Comprehension (CMC) is a research track in conversational AI which expects the machine to understand an open-domain text and thereafter engage in a multi-turn conversation to answer questions related to the text. While most of the research in Machine Reading Comprehension (MRC) revolves around single-turn question answering, multi-turn CMC has recently gained prominence, thanks to the advancement in natural language understanding via neural language models like BERT and the introduction of large-scale conversational datasets like CoQA and QuAC. The rise in interest has, however, led to a flurry of concurrent publications, each with a different yet structurally similar modeling approach and an inconsistent view of the surrounding literature. With the volume of model submissions to conversational datasets increasing every year, there exists a need to consolidate the scattered knowledge in this domain to streamline future research. This literature review, therefore, is a first-of-its-kind attempt at providing a holistic overview of CMC, with an emphasis on the common trends across recently published models, specifically in their approach to tackling conversational history. It focuses on synthesizing a generic framework for CMC models, rather than describing the models individually. The review is intended to serve as a compendium for future researchers in this domain.
翻译:虽然机器阅读综合(MRC)的研究大多围绕单点答题,但由于通过BERT等神经语言模型和采用CQA和QuAC等大规模谈话数据集等自然语言理解的进步,多点计算机综合体(CMC)最近越来越突出。但是,由于兴趣的上升,同时出版的出版物纷繁多,每种出版物都具有不同的结构相似的建模方法,对周围文献的看法也不一致。随着对对话数据集的示范提交量逐年增加,有必要整合该领域的分散知识,以简化未来的研究。因此,这一文献审查是首次尝试对CMC进行整体的概述,重点是最近出版的模型的共同趋势,特别是处理对话历史的方法。它侧重于将一个通用框架作为个人化的模型,而不是作为未来研究者的一种领域化框架。它侧重于将一个通用框架用于个人化的CMC,而不是将一个域化的模型用于描述未来的模型。