Variable Selection for Kernel Two-Sample Tests - Zhuanzhi Papers


February 15, 2023

Variable Selection for Kernel Two-Sample Tests


Jie Wang, Santanu S. Dey, Yao Xie

From arXiv; 30 pages, 5 figures

We consider variable selection for two-sample tests: selecting the most informative features to best distinguish samples from two groups. We propose a kernel maximum mean discrepancy (MMD) framework for this problem and derive equivalent mixed-integer programming formulations for linear, quadratic, and Gaussian kernel functions. The framework offers both computational efficiency and favorable statistical properties. (i) For the linear kernel, we provide a closed-form solution. For the quadratic kernel, the problem is NP-hard, but we provide an exact mixed-integer semidefinite programming formulation, which further motivates the development of exact and approximation algorithms. For the Gaussian kernel, we propose a convex-concave procedure that finds critical points. (ii) We provide non-asymptotic uncertainty quantification of the proposed formulation under the null and alternative scenarios. Experimental results demonstrate good performance of the framework.
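To make the linear-kernel case concrete, here is a minimal sketch under simplifying assumptions that are not taken from the paper: variables are selected by a binary mask of exactly `k` coordinates, and the biased empirical MMD is used. Under the linear kernel k(x, y) = xᵀy, the biased squared MMD reduces to the squared norm of the difference of sample means, so on the selected coordinates it is a sum of per-coordinate squared mean differences and the maximizing subset is simply the top `k` of them. The function names `linear_mmd2` and `select_top_k` and the synthetic data are hypothetical; the paper's actual formulation and closed-form solution may differ.

```python
import numpy as np


def linear_mmd2(X, Y, mask=None):
    """Biased squared MMD under the linear kernel k(x, y) = x @ y.

    For this kernel the statistic equals ||mean(X) - mean(Y)||^2,
    optionally restricted to the coordinates listed in `mask`.
    """
    diff = X.mean(axis=0) - Y.mean(axis=0)
    if mask is not None:
        diff = diff[mask]
    return float(diff @ diff)


def select_top_k(X, Y, k):
    """Return the k coordinates with the largest squared mean difference.

    Assumption (illustrative only): selection is a hard choice of exactly
    k coordinates, so the restricted MMD^2 is a sum of per-coordinate terms
    and picking the k largest terms maximizes it.
    """
    per_coord = (X.mean(axis=0) - Y.mean(axis=0)) ** 2
    top = np.argsort(per_coord)[::-1][:k]
    return np.sort(top)


# Synthetic example: the two groups differ only in coordinates 2 and 7.
rng = np.random.default_rng(0)
n, d = 500, 10
X = rng.normal(size=(n, d))
Y = rng.normal(size=(n, d))
Y[:, [2, 7]] += 1.0

selected = select_top_k(X, Y, k=2)
print("selected coordinates:", selected)
print("MMD^2 on selected:", linear_mmd2(X, Y, selected))
print("MMD^2 on all coordinates:", linear_mmd2(X, Y))
```

The quadratic and Gaussian kernels do not decompose coordinate by coordinate in this way, which is consistent with the abstract's use of mixed-integer and convex-concave methods for those cases.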

