Principal component analysis is a long-standing go-to method for exploring multivariate data. The principal components are linear combinations of the original variables, ordered by descending variance. The first few components typically provide a good visual summary of the data. Tours also make linear projections of the original variables but offer many different views, like examining the data from different directions. The grand tour shows a smooth sequence of projections as an animation following interpolations between random target bases. The manual radial tour rotates the selected variable's contribution into and out of a projection. This allows the importance of the variable to structure in the projection to be assessed. This work describes a mixed-design user study evaluating the radial tour's efficacy compared with principal component analysis and the grand tour. A supervised classification task is assigned to participants who evaluate variable attribution of the separation between two classes. Their accuracy in assigning the variable importance is measured across various factors. Data were collected from 108 crowdsourced participants, who performed two trials with each visual for 648 trials in total. Mixed model regression finds strong evidence that the radial tour results in a large increase in accuracy over the alternatives. Participants also reported a preference for the radial tour in comparison to the other two methods.
翻译:主要组成部分分析是探索多变量数据的长期方法,主要组成部分是原始变量的线性组合,按降序排列。前几个组成部分通常提供数据的良好视觉摘要。导游还对原始变量进行线性预测,但提供许多不同的观点,例如审查不同方向的数据。大巡演显示,在随机目标基地之间的插图中,预测是一个顺利的动画序列。人工无线电巡演将选定变量的贡献旋转成或调出投影。这使得变量对预测中的结构的重要性得到评估。这项工作描述的是一个混合设计用户研究,与主要组成部分分析和大巡演相比,评估辐射巡演的效力。监督的分类任务分配给了评估两个类别之间差异属性的参与者,对变量重要性的准确性进行了不同因素的测量。从108名来自人群的参与者收集了数据,他们分别进行了两次试验,总共为648次试验。混合模型回归发现强有力的证据表明,辐射巡演的结果比其他选项的精确度大得多。参与者还报告了在巡回巡演中的优先度。