Through a data-oriented question and answering system, users can directly "ask" the system for the answers to their analytical questions about the input tabular data. This process greatly improves user experience and lowers the technical barriers of data analysis. Existing techniques focus on providing a concrete query for users or untangling the ambiguities in a specific question so that the system could better understand questions and provide more correct and precise answers. However, when users have little knowledge about the data, it is difficult for them to ask concrete questions. Instead, high-level questions are frequently asked, which cannot be easily solved with the existing techniques. To address the issue, in this paper, we introduce Talk2Data, a data-oriented online question and answering system that supports answering both low-level and high-level questions. It leverages a novel deep-learning model to resolve high-level questions into a series of low-level questions that can be answered by data facts. These low-level questions could be used to gradually elaborate the users' requirements. We design a set of annotated and captioned visualizations to represent the answers in a form that supports interpretation and narration. We evaluate the effectiveness of the Talk2Data system via a series of evaluations including case studies, performance validation, and a controlled user study. The results show the power of the system.
翻译:通过以数据为导向的问题和回答系统,用户可以直接“询问”系统来回答关于输入表格数据的分析问题。这一过程极大地改进了用户的经验,降低了数据分析的技术障碍。现有技术侧重于为用户提供具体查询,或者在一个具体问题中解开模糊不清之处,以便系统能够更好地理解问题,提供更正确和准确的答案。然而,当用户对数据缺乏了解时,他们很难提出具体问题。相反,经常提出高层次的问题,而这些问题无法以现有技术轻易地解决。为了解决这个问题,我们在本文件中引入了“Talk2Data”,这是一个面向数据的在线问题和回答系统,支持回答低层次和高层次的问题。现有技术侧重于为用户提供具体查询或解析具体问题的模糊不清之处,以便系统能够更好理解问题,提供更准确和准确的答案。但是,当用户对数据缺乏了解时,这些低层次的问题就难以提出具体的问题。我们设计了一套附加说明和说明的直观数据,以支持解释和解说的方式代表答案。我们通过一个支持解说和解说的表格,我们评估了“Tal2”用户对结果进行一系列的论证的有效性,我们通过案例研究来评估。我们评估的系统,通过一个用户对结果的系统进行测试。