VC Dimension and Distribution-Free Sample-Based Testing

PAC learning · PAC learning theory · Samples · Sample complexity · Classes

December 7, 2020

Eric Blais, Renato Ferreira Pinto Jr., Nathaniel Harms

From arXiv, 44 pages

We consider the problem of determining which classes of functions can be tested more efficiently than they can be learned, in the distribution-free sample-based model that corresponds to the standard PAC learning setting. Our main result shows that while VC dimension by itself does not always provide tight bounds on the number of samples required to test a class of functions in this model, it can be combined with a closely-related variant that we call "lower VC" (or LVC) dimension to obtain strong lower bounds on this sample complexity. We use this result to obtain strong and in many cases nearly optimal lower bounds on the sample complexity for testing unions of intervals, halfspaces, intersections of halfspaces, polynomial threshold functions, and decision trees. Conversely, we show that two natural classes of functions, juntas and monotone functions, can be tested with a number of samples that is polynomially smaller than the number of samples required for PAC learning. Finally, we also use the connection between VC dimension and property testing to establish new lower bounds for testing radius clusterability and testing feasibility of linear constraint systems.
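As a rough illustration of the VC dimension that the abstract relies on (not of the paper's LVC dimension or of its lower-bound argument), the following sketch computes the VC dimension of a small hypothesis class by brute force. The finite domain, the helper names `is_shattered`, `vc_dimension`, `interval` and `union_of_two`, and the choice of intervals and unions of two intervals as example classes are all assumptions made for illustration.

```python
from itertools import combinations


def is_shattered(points, hypotheses):
    """True if every 0/1 labeling of `points` is realized by some hypothesis."""
    patterns = {tuple(h(x) for x in points) for h in hypotheses}
    return len(patterns) == 2 ** len(points)


def vc_dimension(domain, hypotheses):
    """Size of the largest subset of `domain` shattered by `hypotheses`."""
    best = 0
    for d in range(1, len(domain) + 1):
        if any(is_shattered(s, hypotheses) for s in combinations(domain, d)):
            best = d
        else:
            # Subsets of shattered sets are shattered, so if no d-subset
            # is shattered then no larger subset can be either.
            break
    return best


def interval(a, b):
    """Indicator function of the closed interval [a, b]."""
    return lambda x: int(a <= x <= b)


def union_of_two(f, g):
    """Indicator function of the union of the sets described by f and g."""
    return lambda x: f(x) | g(x)


# Hypothetical toy domain: the integers 0..5.
domain = list(range(6))

# All single intervals with endpoints in the domain, plus the empty set.
single_intervals = [interval(a, b) for a in domain for b in domain if a <= b]
single_intervals.append(lambda x: 0)

# All unions of two such intervals (this includes single intervals).
two_intervals = [union_of_two(f, g)
                 for f in single_intervals for g in single_intervals]

print(vc_dimension(domain, single_intervals))  # 2
print(vc_dimension(domain, two_intervals))     # 4
```

On this toy domain, single intervals shatter two points but no three (the labeling 1, 0, 1 is unreachable), and unions of two intervals shatter four points but no five. The brute-force search is exponential in the domain size, so it is only meant for small examples.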
