使用部分语音标签对语音搜索进行简单而有效的评估 (POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling)

Conversational search systems, such as Google Assistant and Microsoft Cortana, provide a new search paradigm where users are allowed, via natural language dialogues, to communicate with search systems. Evaluating such systems is very challenging since search results are presented in the format of natural language sentences. Given the unlimited number of possible responses, collecting relevance assessments for all the possible responses is infeasible. In this paper, we propose POSSCORE, a simple yet effective automatic evaluation method for conversational search. The proposed embedding-based metric takes the influence of part of speech (POS) of the terms in the response into account. To the best knowledge, our work is the first to systematically demonstrate the importance of incorporating syntactic information, such as POS labels, for conversational search evaluation. Experimental results demonstrate that our metrics can correlate with human preference, achieving significant improvements over state-of-the-art baseline metrics.

翻译：谷歌助理和微软科尔塔纳等连通搜索系统提供了一个新的搜索模式,允许用户通过自然语言对话与搜索系统沟通。评估这些系统非常具有挑战性,因为搜索结果以自然语言句的形式出现。鉴于可能的答复数量有限,收集所有可能答复的关联性评估是不可行的。在本文中,我们提出POSCORE,这是一个简单而有效的对话搜索自动评价方法。提议的嵌入基度指标在回应中考虑到语言术语部分的影响。在最先进的知识中,我们的工作是首先系统地表明将合成信息(如POS标签)纳入谈话搜索评估的重要性。实验结果表明,我们的衡量标准可以与人类偏好相关,大大改进了最先进的基线衡量标准。

相关内容

词性标注

关注 389

词性（part-of-speech）是词汇基本的语法属性，通常也称为词类。词性标注就是在给定句子中判定每个词的语法范畴，确定其词性并加以标注的过程，是中文信息处理面临的重要基础性问题。在语料库语言学中，词性标注（POS标注或PoS标注或POST），也称为语法标注，是将文本（语料库）中的单词标注为与特定词性相对应的过程，[1] 基于其定义和上下文。

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【KDD2020】基于知识图谱的语义融合改进会话推荐系统，Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion

专知会员服务

90+阅读 · 2020年7月9日