Mining the distribution of features and sorting items by combined attributes are two common tasks in exploring and understanding multi-attribute (or multivariate) data. Up to now, few have pointed out the possibility of merging these two tasks into a united exploration context and the potential benefits of doing so. In this paper, we present SemanticAxis, a technique that achieves this goal by enabling analysts to build a semantic vector in two-dimensional space interactively. Essentially, the semantic vector is a linear combination of the original attributes. It can be used to represent and explain abstract concepts implied in local (outliers, clusters) or global (general pattern) features of reduced space, as well as serving as a ranking metric for its defined concepts. In order to validate the significance of combining the above two tasks in multi-attribute data analysis, we design and implement a visual analysis system, in which several interactive components cooperate with SemanticAxis seamlessly and expand its capacity to handle complex scenarios. We prove the effectiveness of our system and the SemanticAxis technique via two practical cases.
翻译:通过合并属性对地物和分类项目的分布进行采矿是探索和理解多属性(或多变量)数据的两个共同任务。到目前为止,很少有人指出将这两个任务合并成一个统一的勘探背景的可能性以及这样做的潜在好处。在本文中,我们介绍SemisticAxis,这是一种通过使分析家能够在两维空间互动的基础上构建一个语义矢量来实现这一目标的技术。基本上,语义矢量是原始属性的线性组合。它可以用来代表并解释空间缩小的地方(外部层、集群)或全球(一般模式)特征中隐含的抽象概念,并且作为确定概念的分级衡量标准。为了验证将以上两项任务合并在多属性数据分析中的重要性,我们设计和实施了一个视觉分析系统,其中几个互动组成部分与斯曼尼特Axis进行无缝合作,并扩大其处理复杂情景的能力。我们通过两个实际案例证明了我们的系统和SmantiAxis技术的有效性。