AVIDA: 可视化和集成数据的交替方法 (AVIDA: Alternating method for Visualizing and Integrating Data) - 专知论文

会员服务 ·

0

降维 · 数据集 · 多模 · 多模态 · 模态 ·

2023 年 4 月 7 日

AVIDA: Alternating method for Visualizing and Integrating Data

翻译：AVIDA: 可视化和集成数据的交替方法

Kathryn Dover,Zixuan Cang,Anna Ma,Qing Nie,Roman Vershynin

from arxiv, To appear in Journal of Computational Science (Accepted, 2023)

High-dimensional multimodal data arises in many scientific fields. The integration of multimodal data becomes challenging when there is no known correspondence between the samples and the features of different datasets. To tackle this challenge, we introduce AVIDA, a framework for simultaneously performing data alignment and dimension reduction. In the numerical experiments, Gromov-Wasserstein optimal transport and t-distributed stochastic neighbor embedding are used as the alignment and dimension reduction modules respectively. We show that AVIDA correctly aligns high-dimensional datasets without common features with four synthesized datasets and two real multimodal single-cell datasets. Compared to several existing methods, we demonstrate that AVIDA better preserves structures of individual datasets, especially distinct local structures in the joint low-dimensional visualization, while achieving comparable alignment performance. Such a property is important in multimodal single-cell data analysis as some biological processes are uniquely captured by one of the datasets. In general applications, other methods can be used for the alignment and dimension reduction modules.

翻译：高维多模态数据在科学领域中经常出现。当不同数据集之间没有已知的样本和特征对应关系时，多模态数据的整合变得具有挑战性。为了解决这个问题，我们引入了 AVIDA 框架，用于同时进行数据对齐和降维。在数值实验中，采用 Gromov-Wasserstein 最优传输和 t-distributed stochastic neighbor embedding 作为对齐和降维模块。我们证明 AVIDA 能够正确地对齐没有共同特征的高维数据集，包括四个合成数据集和两个真实的多模态单细胞数据集。与几种现有方法相比，我们展示了 AVIDA 更好地保留了每个数据集的结构，特别是联合低维可视化中的独特局部结构，同时实现了可比的对齐性能。这种性质在多模态单细胞数据分析中非常重要，因为某些生物过程只能由其中一个数据集独特捕捉。在一般应用中，可以使用其他方法作为对齐和降维模块。

0

相关内容

降维是将数据从高维空间转换为低维空间，以便低维表示保留原始数据的某些有意义的属性，理想情况下接近其固有维。降维在处理大量观察和/或大量变量的领域很常见，例如信号处理，语音识别，神经信息学和生物信息学。

Patterns | scMMGAN: 单细胞多模态GAN揭示三阴性乳腺癌单细胞数据中的空间模式

Patterns | scMMGAN: 单细胞多模态GAN揭示三阴性乳腺癌单细胞数据中的空间模式

专知会员服务

13+阅读 · 2022年9月12日

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

专知会员服务

24+阅读 · 2022年3月15日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【斯坦福大学博士论文】大规模和高维统计学习方法和算法，147页pdf， Large-scale and high-dimensional statistical learning methods and algorithms

专知会员服务

26+阅读 · 2020年6月13日

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

专知会员服务

53+阅读 · 2020年6月7日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

面向图像网状结构体的蚁群分割算法

国家自然科学基金

0+阅读 · 2017年12月31日

面向在线检索的医学影像多特征降维方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

非局部Schrödinger方程的高效守恒算法

国家自然科学基金

0+阅读 · 2015年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

动态纹理建模与应用的张量方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

恶性肺结节的分类不确定性信息可视化传递函数研究

国家自然科学基金

0+阅读 · 2013年12月31日

遥感影像大范围地表信息缺失区域的修复理论与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

计算电磁学积分方程的数值精度研究与改进

国家自然科学基金

0+阅读 · 2012年12月31日

基于PCA与二代Curvelet变换的多模态医学图像融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于空间化模拟方法修复遥感地表温度图像的数据缺失

国家自然科学基金

0+阅读 · 2012年12月31日

A noise based novel strategy for faster SNN training

Arxiv

0+阅读 · 2023年5月29日

StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation

Arxiv

0+阅读 · 2023年5月28日

Physics-Guided Discovery of Highly Nonlinear Parametric Partial Differential Equations

Arxiv

0+阅读 · 2023年5月26日

Automated Data Denoising for Recommendation

Arxiv

0+阅读 · 2023年5月26日

Graph-Based Model-Agnostic Data Subsampling for Recommendation Systems

Arxiv

0+阅读 · 2023年5月25日

Federated Multi-organ Segmentation with Inconsistent Labels

Arxiv

0+阅读 · 2023年5月25日

Symplectic model reduction of Hamiltonian systems using data-driven quadratic manifolds

Arxiv

0+阅读 · 2023年5月24日

Exploring and Exploiting Data Heterogeneity in Recommendation

Arxiv

0+阅读 · 2023年5月21日

Deep Meta-learning in Recommendation Systems: A Survey

Arxiv

13+阅读 · 2022年6月9日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

VIP会员

文章信息

相关主题

相关VIP内容

Patterns | scMMGAN: 单细胞多模态GAN揭示三阴性乳腺癌单细胞数据中的空间模式

Patterns | scMMGAN: 单细胞多模态GAN揭示三阴性乳腺癌单细胞数据中的空间模式

专知会员服务

13+阅读 · 2022年9月12日

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

专知会员服务

24+阅读 · 2022年3月15日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【斯坦福大学博士论文】大规模和高维统计学习方法和算法，147页pdf， Large-scale and high-dimensional statistical learning methods and algorithms

专知会员服务

26+阅读 · 2020年6月13日

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

专知会员服务

53+阅读 · 2020年6月7日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

A noise based novel strategy for faster SNN training

Arxiv

0+阅读 · 2023年5月29日

StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation

Arxiv

0+阅读 · 2023年5月28日

Physics-Guided Discovery of Highly Nonlinear Parametric Partial Differential Equations

Arxiv

0+阅读 · 2023年5月26日

Automated Data Denoising for Recommendation

Arxiv

0+阅读 · 2023年5月26日

Graph-Based Model-Agnostic Data Subsampling for Recommendation Systems

Arxiv

0+阅读 · 2023年5月25日

Federated Multi-organ Segmentation with Inconsistent Labels

Arxiv

0+阅读 · 2023年5月25日

Symplectic model reduction of Hamiltonian systems using data-driven quadratic manifolds

Arxiv

0+阅读 · 2023年5月24日

Exploring and Exploiting Data Heterogeneity in Recommendation

Arxiv

0+阅读 · 2023年5月21日

Deep Meta-learning in Recommendation Systems: A Survey

Arxiv

13+阅读 · 2022年6月9日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

相关基金

面向图像网状结构体的蚁群分割算法

国家自然科学基金

0+阅读 · 2017年12月31日

面向在线检索的医学影像多特征降维方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

非局部Schrödinger方程的高效守恒算法

国家自然科学基金

0+阅读 · 2015年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

动态纹理建模与应用的张量方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

恶性肺结节的分类不确定性信息可视化传递函数研究

国家自然科学基金

0+阅读 · 2013年12月31日

遥感影像大范围地表信息缺失区域的修复理论与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

计算电磁学积分方程的数值精度研究与改进

国家自然科学基金

0+阅读 · 2012年12月31日

基于PCA与二代Curvelet变换的多模态医学图像融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于空间化模拟方法修复遥感地表温度图像的数据缺失

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员