The rapidly growing market demand for automatic dialogue agents capable of goal-oriented behavior has caused many tech-industry leaders to invest considerable efforts into task-oriented dialog systems. The success of these systems is highly dependent on the accuracy of their intent identification -- the process of deducing the goal or meaning of the user's request and mapping it to one of the known intents for further processing. Gaining insights into unrecognized utterances -- user requests the systems fail to attribute to a known intent -- is therefore a key process in continuous improvement of goal-oriented dialog systems. We present an end-to-end pipeline for processing unrecognized user utterances, deployed in a real-world, commercial task-oriented dialog system, including a specifically-tailored clustering algorithm, a novel approach to cluster representative extraction, and cluster naming. We evaluated the proposed components, demonstrating their benefits in the analysis of unrecognized user requests.
翻译:市场对能够以目标为导向的行为的自动对话代理商的需求迅速增长,导致许多技术行业领导人对面向任务的对话系统投入大量精力。这些系统的成功在很大程度上取决于其意图识别的准确性 -- -- 将用户请求的目标或含义引申为已知的进一步处理意图之一的过程。因此,深入了解未经承认的言论 -- -- 用户要求系统不能归因于已知的意图 -- -- 是不断改进面向目标的对话系统的一个关键过程。我们为处理未经确认的用户话提出了一个端到端的管道,安装在现实世界、商业面向任务的对话系统中,包括一个专门定制的集群组合算法,这是对集群代表性提取和集群命名的一种新办法。我们评价了拟议的组成部分,在分析未经确认的用户请求时展示了它们的好处。