A plethora of dimensionality reduction techniques have emerged over the past decades, leaving researchers and analysts with a wide variety of choices for reducing their data, all the more so given some techniques come with additional parametrization (e.g. t-SNE, UMAP, etc.). Recent studies are showing that people often use dimensionality reduction as a black-box regardless of the specific properties the method itself preserves. Hence, evaluating and comparing 2D projections is usually qualitatively decided, by setting projections side-by-side and letting human judgment decide which projection is the best. In this work, we propose a quantitative way of evaluating projections, that nonetheless places human perception at the center. We run a comparative study, where we ask people to select 'good' and 'misleading' views between scatterplots of low-level projections of image datasets, simulating the way people usually select projections. We use the study data as labels for a set of quality metrics whose purpose is to discover and quantify what exactly people are looking for when deciding between projections. With this proxy for human judgments, we use it to rank projections on new datasets, explain why they are relevant, and quantify the degree of subjectivity in projections selected.
翻译:在过去几十年中,出现了大量减少维度的技术,使研究人员和分析人员在减少数据方面有了各种各样的选择,特别是考虑到有些技术带来了额外的超光化(例如 t-SNE、UMAP等)。最近的研究显示,人们经常使用减少维度作为黑盒,而不管方法本身所保存的具体特性如何。因此,评估和比较2D预测通常是在质量上决定的,方法是设置预测,并让人类判断决定哪些预测是最佳的。在这项工作中,我们提出了一个量化的预测评价方法,将人类的感知置于中心位置。我们进行了一项比较研究,要求人们选择“良好”和“误导”的观点,在图像数据集低度预测的散落之间选择“良好”和“误导”的观点,模拟人们通常选择预测的方式。我们使用研究数据作为一套质量指标的标签,目的是发现和量化人们在决定两种预测时所期待的准确内容。我们用这个指标来作为人类判断的代号,我们用它来在新的数据集中进行排序预测,解释它们为何具有相关程度和量化。