Visualization recommendation systems simplify exploratory data analysis (EDA) and make understanding data more accessible to users of all skill levels by automatically generating visualizations for users to explore. However, most existing visualization recommendation systems focus on ranking all visualizations into a single list or set of groups based on particular attributes or encodings. This global ranking makes it difficult and time-consuming for users to find the most interesting or relevant insights. To address these limitations, we introduce a novel class of visualization recommendation systems that automatically rank and recommend both groups of related insights as well as the most important insights within each group. Our proposed approach combines results from many different learning-based methods to discover insights automatically. A key advantage is that this approach generalizes to a wide variety of attribute types such as categorical, numerical, and temporal, as well as complex non-trivial combinations of these different attribute types. To evaluate the effectiveness of our approach, we implemented a new insight-centric visualization recommendation system, SpotLight, which generates and ranks annotated visualizations to explain each insight. We conducted a user study with 12 participants and two datasets which showed that users are able to quickly understand and find relevant insights in unfamiliar data.
翻译:可视化建议系统简化探索性数据分析(EDA),并使所有技能水平的用户更容易理解数据,办法是自动生成可视化,供用户进行探索。然而,大多数现有的可视化建议系统都侧重于根据特定属性或编码将所有可视化排列成单一清单或一组群体。这种全球排名使用户难以找到最有趣或最相关的洞察力,而且耗费时间。为了解决这些局限性,我们引入了新型的可视化建议系统类别,自动排列和推荐两组相关洞察力以及每个组内最重要的洞察力。我们建议的方法将许多不同的基于学习的方法结合起来,以自动发现洞察力。一个关键的优势是,这种方法将所有可视化方法笼统地概括到各种各样的属性类型,如绝对、数字和时间,以及这些不同属性类型的复杂的非三角组合。为了评估我们的方法的有效性,我们采用了一个新的直观性直观可视化建议系统,即SpotLightight,生成和排序注释性可视化,以解释每一项洞察力。我们与12名参与者和两套数据集进行了一项用户研究,表明用户能够迅速理解和找到相关数据。