In ordinary Dimensionality Reduction (DR), each data instance in a high dimensional space (original space), or on a distance matrix denoting original space distances, is mapped to (projected onto) one point in a low dimensional space (visual space), building a layout of projected points trying to preserve as much as possible some property of data such as distances, neighbourhood relationships, and/or topology structures, with the ultimate goal of approximating semantic properties of data with preserved geometric properties or topology structures in visual space. In this paper, the concept of Multi-point Dimensionality Reduction is elaborated on where each data instance can be mapped to (projected onto) possibly more than one point in visual space by providing the first general solution (algorithm) for it as a move in the direction of improving reliablity, usability and interpretability of dimensionality reduction. Furthermore by allowing the points in visual space to be split into two layers while maintaining the possibility of having more than one projection (mapping) per data instance , the benefit of separating more reliable points from less reliable points is dicussed notwithstanding the effort to improve less reliable points. The proposed solution (algorithm) in this paper, named Layered Vertex Splitting Data Embedding (LVSDE), is built upon and extends a combination of ordinary DR and graph drawing techniques. On the empirical side, this paper shows that the particular proposed algorithm (LVSDE) practically outperforms popular ordinary DR methods visually (semantics, group separation, subgroup detection or combinational group detection) in a way that is easily explainable and performs in close proximity to top on most of the data sets studied in the paper in a quantitative analysis based on KNN classification accuracy. [Abstract truncated for length]
翻译:在普通尺寸减少(DR)中,在高维空间(原始空间)或显示原始空间距离的距离矩阵中,每个数据实例都被映射到(投射到)低维空间(视觉空间)中一个点(投射到)可能超过一个点(投射到)低维空间(视觉空间),构建一个预测点的布局,试图尽可能保存数据的某些属性,如距离、邻里关系和/或地形结构,最终目标是在视觉空间中将数据与保存的几何属性或地形结构相匹配的语义特性相匹配。在本文中,多点尺寸减少度减少概念的概念被详细描述到每个数据实例可以被映射到(投射到)一个较不可靠的端点(直径直径直径直径),在S 直径直径直径直径直径分析中,在直径直径直的直径直方位数分析中,在直径直径直径直径直的直方位数分析中,在直径直径直的方位数分析中,在直径直径直径直的直方位数分析中,在直径直方位数分析中,在直的直的直方位的直方向上,在直方位上,在推进的直方位数分析中,在直的直的直的直方向上,在直方位数分析中,在推进的直方向上,在直方位数分析中,在直方向上,在直方向上,在推进的直方向上,在推进的直方向上,在推进的直方位数分析中,在推进的轨道上,在直径直,在推进的轨道上,在推进的轨道上,在推进的深度分析中,在推进的轨道上,在推进的直方向上,在推进的轨道上,在推进的轨道上,在推进的轨道上,在推进的轨道上,在推进的轨道上,在上,在推进图图图图上,在推进的轨道上,在推进的推上,在进行推,在推进的轨道上,在推进的轨道上,在上,在推进图上,在方向上,在推进的深度分析中,在方向上,在方向上,在方向上,在方向上,在直方,在