Finding the similarities and differences between groups of datasets is a fundamental analysis task. For high-dimensional data, dimensionality reduction (DR) methods are often used to find the characteristics of each group. However, existing DR methods provide limited capability and flexibility for such comparative analysis as each method is designed only for a narrow analysis target, such as identifying factors that most differentiate groups. This paper presents an interactive DR framework where we integrate our new DR method, called ULCA (unified linear comparative analysis), with an interactive visual interface. ULCA unifies two DR schemes, discriminant analysis and contrastive learning, to support various comparative analysis tasks. To provide flexibility for comparative analysis, we develop an optimization algorithm that enables analysts to interactively refine ULCA results. Additionally, the interactive visualization interface facilitates interpretation and refinement of the ULCA results. We evaluate ULCA and the optimization algorithm to show their efficiency as well as present multiple case studies using real-world datasets to demonstrate the usefulness of this framework.
翻译:查找各组数据集之间的相似和差异是一项基本的分析任务。对于高维数据而言,通常使用维度减少(DR)方法来查找每一组的特征。然而,现有的DR方法为比较分析提供了有限的能力和灵活性,因为每一种方法的设计都是为狭隘的分析目标而设计的,例如确定最不同组别的因素。本文提出了一个互动的DR框架,将我们的新DR方法(称为ULCA(统一线性比较分析))与互动的视觉界面结合起来。ULCA将两种DR计划(辨别性分析和对比性学习)统一起来,以支持各种比较分析任务。为了提供比较分析的灵活性,我们开发了一种优化算法,使分析人员能够互动地完善ULCA的结果。此外,交互式可视化界面有助于解释和完善ULCA的结果。我们评估ULCA和优化算法,以展示其效率,并利用真实世界数据集展示其效用,并介绍多个案例研究。