Finding the similarities and differences between two or more groups of datasets is a fundamental analysis task. For high-dimensional data, dimensionality reduction (DR) methods are often used to find the characteristics of each group. However, existing DR methods provide limited capability and flexibility for such comparative analysis as each method is designed only for a narrow analysis target, such as identifying factors that most differentiate groups. In this work, we introduce an interactive DR framework where we integrate our new DR method, called ULCA (unified linear comparative analysis), with an interactive visual interface. ULCA unifies two DR schemes, discriminant analysis and contrastive learning, to support various comparative analysis tasks. To provide flexibility for comparative analysis, we develop an optimization algorithm that enables analysts to interactively refine ULCA results. Additionally, we provide an interactive visualization interface to examine ULCA results with a rich set of analysis libraries. We evaluate ULCA and the optimization algorithm to show their efficiency as well as present multiple case studies using real-world datasets to demonstrate the usefulness of our framework.
翻译:查找两组或更多组数据集之间的相似和差异是一项基本的分析任务。对于高维数据,通常使用维度减少(DR)方法来查找每一组的特征。然而,现有的DR方法为比较分析提供了有限的能力和灵活性,因为每一种方法的设计都是为了狭义的分析目标,例如确定最不同组别的因素。在这项工作中,我们引入了一个互动的DR框架,将我们的新DR方法(称为ULCA(统一线性比较分析))与交互式视觉界面结合起来。 ULCA统一了两种DR计划,即差异分析和对比性学习,以支持各种比较分析任务。为了提供比较分析的灵活性,我们开发了一种优化算法,使分析家能够交互地完善ULCA的结果。此外,我们提供了一个互动的可视化界面,与一套丰富的分析库一起审查ULCA的结果。我们用真实世界的数据集来评估ULCA和优化算法,以显示其效率,并展示多种案例研究。