A novel method, named Curvature-Augmented Manifold Embedding and Learning (CAMEL), is proposed for high dimensional data classification, dimension reduction, and visualization. CAMEL utilizes a topology metric defined on the Riemannian manifold, and a unique Riemannian metric for both distance and curvature to enhance its expressibility. The method also employs a smooth partition of unity operator on the Riemannian manifold to convert localized orthogonal projection to global embedding, which captures both the overall topological structure and local similarity simultaneously. The local orthogonal vectors provide a physical interpretation of the significant characteristics of clusters. Therefore, CAMEL not only provides a low-dimensional embedding but also interprets the physics behind this embedding. CAMEL has been evaluated on various benchmark datasets and has shown to outperform state-of-the-art methods, especially for high-dimensional datasets. The method's distinct benefits are its high expressibility, interpretability, and scalability. The paper provides a detailed discussion on Riemannian distance and curvature metrics, physical interpretability, hyperparameter effect, manifold stability, and computational efficiency for a holistic understanding of CAMEL. Finally, the paper presents the limitations and future work of CAMEL along with key conclusions.
翻译:为高维数据分类、维度减少和可视化,建议采用名为Curvature-Auged Manigide 嵌入和学习(CAMEL)的新方法。CAMEL使用在里曼多管上定义的地形测量仪,以及独特的里曼多管的距离和曲线测量仪,以提高其可见性。该方法还使用里曼多管上的统一操作员的平稳分隔法,将本地或远方投影转换为全球嵌入,同时捕捉总体表层结构和地方相似性。本地或远方矢量对各组的重要特征进行物理解释。因此,CAMEL不仅提供低维嵌入,而且还对嵌入后的物理原理进行解释。CAML在各种基准数据集上进行了评估,并显示,特别是高维数据集,该方法的独特好处是高清晰度、可解释性和可缩缩缩放性。该文件详细讨论了里曼多面距离和曲线的物理测算法,并详细讨论了里曼多面、Climal-Climimal 度、Climimalalalalalal commalalalalal commal commal exalal commalisalalalal exbalislisldal exfalisal exstralal exfal exsalsalsal exmal exmal exslupalisalisalisal 。</s>