处理数据核心圈双模样测试 (Kernel Two-Sample Tests for Manifold Data) - 专知论文

会员服务 ·

0

流形 · 核化 · 最大平均偏差 · 缩放 · Pair ·

2021 年 10 月 17 日

Kernel Two-Sample Tests for Manifold Data

翻译：处理数据核心圈双模样测试

Xiuyuan Cheng,Yao Xie

We present a study of kernel based two-sample test statistic, which is related to the Maximum Mean Discrepancy (MMD), in the manifold data setting, assuming that high-dimensional observations are close to a low-dimensional manifold. We characterize the test level and power in relation to the kernel bandwidth, the number of samples, and the intrinsic dimensionality of the manifold. Specifically, we show that when data densities are supported on a $d$-dimensional sub-manifold $\mathcal{M}$ embedded in an $m$-dimensional space, the kernel two-sample test for data sampled from a pair of distributions $(p, q)$ that are H\"older with order $\beta$ is consistent and powerful when the number of samples $n$ is greater than $\delta_2(p,q)^{-2-d/\beta}$ up to certain constant, where $\delta_2$ is the squared $\ell_2$-divergence between two distributions on manifold. Moreover, to achieve testing consistency under this scaling of $n$, our theory suggests that the kernel bandwidth $\gamma$ scales with $n^{-1/(d+2\beta)}$. These results indicate that the kernel two-sample test does not have a curse-of-dimensionality when the data lie on a low-dimensional manifold. We demonstrate the validity of our theory and the property of the kernel test for manifold data using several numerical experiments.

翻译：具体地说,我们提出一个基于内核的双模量测试统计研究,它与多元数据设置中的最大平均值差异值(MMD)有关,假设高维观测接近一个低维的元体。我们用内核带宽、样本数量和多元的内在维度来描述试验水平和能量。我们显示,当数据密度支持在以美元维基次维值为单位的1美元维基值下值$\mathcal{M}(MD)中嵌入的1美元维空间中时,假设高维度观测接近于一个低维度的多维值。当数据密度为美元大于$delta_p,q) ⁇ 2-d/d/beta}到一定的恒定值时,数据密度为$=2美元基值的平方元双基值测试。此外,如果使用这一数值测试的数值的数值测试,我们两个基值的数值的数值的数值值值值值值值值值值,则显示我们两个公式的基值的数值值值值值值。

0

相关内容

【数据科学导论书】Introduction to Datascience，253页pdf

【数据科学导论书】Introduction to Datascience，253页pdf

专知会员服务

50+阅读 · 2021年11月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】概率统计导论第五版，730页pdf

【经典书】概率统计导论第五版，730页pdf

专知会员服务

249+阅读 · 2020年7月28日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

开放知识图谱

5+阅读 · 2019年4月16日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

Learning curves of generic features maps for realistic datasets with a teacher-student model

Arxiv

0+阅读 · 2021年12月14日

Properties of the After Kernel

Arxiv

0+阅读 · 2021年12月13日

A Homotopy Algorithm for Optimal Transport

Arxiv

0+阅读 · 2021年12月13日

Estimating customer impatience in a service system with unobserved balking

Arxiv

0+阅读 · 2021年12月12日

Understanding Layer-wise Contributions in Deep Neural Networks through Spectral Analysis

Arxiv

0+阅读 · 2021年12月12日

A Game-Theoretic Analysis of Cross-Ledger Swaps with Packetized Payments

Arxiv

0+阅读 · 2021年12月11日

Numerical methods for Mean field Games based on Gaussian Processes and Fourier Features

Arxiv

0+阅读 · 2021年12月10日

Mixing convergence of LSE for supercritical Gaussian AR(2) processes using random scaling

Arxiv

0+阅读 · 2021年12月10日

Hyperspherical Variational Auto-Encoders

Hyperspherical Variational Auto-Encoders

Arxiv

4+阅读 · 2018年9月26日

Quickshift++: Provably Good Initializations for Sample-Based Mean Shift

Arxiv

4+阅读 · 2018年5月21日

VIP会员

文章信息

相关主题

最大平均偏差

相关VIP内容

【数据科学导论书】Introduction to Datascience，253页pdf

【数据科学导论书】Introduction to Datascience，253页pdf

专知会员服务

50+阅读 · 2021年11月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】概率统计导论第五版，730页pdf

【经典书】概率统计导论第五版，730页pdf

专知会员服务

249+阅读 · 2020年7月28日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

开放知识图谱

5+阅读 · 2019年4月16日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

相关论文

Learning curves of generic features maps for realistic datasets with a teacher-student model

Arxiv

0+阅读 · 2021年12月14日

Properties of the After Kernel

Arxiv

0+阅读 · 2021年12月13日

A Homotopy Algorithm for Optimal Transport

Arxiv

0+阅读 · 2021年12月13日

Estimating customer impatience in a service system with unobserved balking

Arxiv

0+阅读 · 2021年12月12日

Understanding Layer-wise Contributions in Deep Neural Networks through Spectral Analysis

Arxiv

0+阅读 · 2021年12月12日

A Game-Theoretic Analysis of Cross-Ledger Swaps with Packetized Payments

Arxiv

0+阅读 · 2021年12月11日

Numerical methods for Mean field Games based on Gaussian Processes and Fourier Features

Arxiv

0+阅读 · 2021年12月10日

Mixing convergence of LSE for supercritical Gaussian AR(2) processes using random scaling

Arxiv

0+阅读 · 2021年12月10日

Hyperspherical Variational Auto-Encoders

Hyperspherical Variational Auto-Encoders

Arxiv

4+阅读 · 2018年9月26日

Quickshift++: Provably Good Initializations for Sample-Based Mean Shift

Arxiv

4+阅读 · 2018年5月21日

微信扫码咨询专知VIP会员