Natural language descriptions sometimes accompany visualizations to better communicate and contextualize their insights, and to improve their accessibility for readers with disabilities. However, it is difficult to evaluate the usefulness of these descriptions, and how effectively they improve access to meaningful information, because we have little understanding of the semantic content they convey, and how different readers receive this content. In response, we introduce a conceptual model for the semantic content conveyed by natural language descriptions of visualizations. Developed through a grounded theory analysis of 2,147 sentences, our model spans four levels of semantic content: enumerating visualization construction properties (e.g., marks and encodings); reporting statistical concepts and relations (e.g., extrema and correlations); identifying perceptual and cognitive phenomena (e.g., complex trends and patterns); and elucidating domain-specific insights (e.g., social and political context). To demonstrate how our model can be applied to evaluate the effectiveness of visualization descriptions, we conduct a mixed-methods evaluation with 30 blind and 90 sighted readers, and find that these reader groups differ significantly on which semantic content they rank as most useful. Together, our model and findings suggest that access to meaningful information is strongly reader-specific, and that research in automatic visualization captioning should orient toward descriptions that more richly communicate overall trends and statistics, sensitive to reader preferences. Our work further opens a space of research on natural language as a data interface coequal with visualization.
翻译:自然语言描述的自然自然语言描述有时伴随着视觉化,以更好地沟通和使其洞察力背景化,并改善其对残疾读者的无障碍程度。然而,很难评估这些描述的有用性,以及这些描述如何有效地改善对有意义的信息的获取,因为我们对其所传达的语义内容了解甚少,以及不同的读者如何收到这种内容。作为回应,我们为自然语言描述视觉化所传达的语义内容引入了一个概念模型。通过对2,147个句子进行基于理论的分析,我们的模式跨越了语义学内容的四种层次:列举视觉化建筑的属性(例如,标记和编码);报告统计概念和关系(例如,极端和关联);报告统计概念和关系(例如,极端和关联);报告感知和认知现象(例如,复杂的趋势和模式);阐明具体领域的洞察力(例如,社会和政治背景)。为了展示我们的模型如何应用来评价视觉化描述的效果,我们进行了一种混合的方言义评估,有30个盲人和90个视力阅读者,并发现这些读者群体在哪些语义学模型上有很大的差异,在何种语言定义上,它们作为最有用的研究,它们作为最有用的工具,在哪些精密的读者上,它们应该成为我们最有用的研究中,一个更精确的、最有用的统计。