Natural language interfaces (NLIs) for data visualization are becoming increasingly popular both in academic research and in commercial software. Yet, there is a lack of empirical understanding of how people specify visualizations through natural language. To bridge this gap, we conducted an online study with 102 participants. We showed participants a series of ten visualizations for a given dataset and asked them to provide utterances they would pose to generate the displayed charts. The curated list of utterances generated from the study is provided below. This corpus of utterances can be used to evaluate existing NLIs for data visualization as well as for creating new systems and models to generate visualizations from natural language utterances.
翻译:用于数据可视化的自然语言界面(NLIs)在学术研究和商业软件中越来越受欢迎。然而,对于人们如何通过自然语言指定可视化,缺乏经验上的理解。为了缩小这一差距,我们与102名参与者进行了在线研究。我们向参与者展示了给定数据集的10种可视化系列,请他们提供他们为生成显示的图表而将构成的语句。下文提供了研究产生的语句汇编清单。这些语句可用于评估现有的可用自然语言进行可视化的NLIs,以及创建新的系统和模型,从自然语言语句中产生可视化。