Hypothesis Selection is a fundamental distribution learning problem: given a comparator class $Q=\{q_1,\ldots, q_n\}$ of distributions and sample access to an unknown target distribution $p$, the goal is to output a distribution $q$ such that $\mathsf{TV}(p,q)$ is close to $opt$, where $opt = \min_i\{\mathsf{TV}(p,q_i)\}$ and $\mathsf{TV}(\cdot, \cdot)$ denotes the total variation distance. Although this problem has been studied since the 19th century, its complexity in terms of basic resources, such as the number of samples and the approximation guarantee, remains unsettled (as discussed, e.g., in the charming book by Devroye and Lugosi `00). This is in stark contrast with other (younger) learning settings, such as PAC learning, for which these complexities are well understood. We derive an optimal $2$-approximation learning strategy for the Hypothesis Selection problem, outputting $q$ such that $\mathsf{TV}(p,q) \leq 2 \cdot opt + \epsilon$, with a (nearly) optimal sample complexity of~$\tilde O(\log n/\epsilon^2)$. This is the first algorithm that simultaneously achieves the best approximation factor and sample complexity: previously, Bousquet, Kane, and Moran (COLT `19) gave a learner achieving the optimal $2$-approximation, but with an exponentially worse sample complexity of $\tilde O(\sqrt{n}/\epsilon^{2.5})$, and Yatracos~(Annals of Statistics `85) gave a learner with the optimal sample complexity of $O(\log n/\epsilon^2)$ but with a sub-optimal approximation factor of $3$.