Sequential estimation of quantiles with applications to A/B-testing and best-arm identification - 专知 (Zhuanzhi) Papers

Estimation/Estimators · Confidence · Empirical distribution · Sample complexity · Bandits

April 21, 2021

Sequential estimation of quantiles with applications to A/B-testing and best-arm identification


Steven R. Howard, Aaditya Ramdas

From arXiv, 35 pages, 8 figures

We propose confidence sequences -- sequences of confidence intervals which are valid uniformly over time -- for quantiles of any distribution over a complete, fully-ordered set, based on a stream of i.i.d. observations. We give methods both for tracking a fixed quantile and for tracking all quantiles simultaneously. Specifically, we provide explicit expressions with small constants for intervals whose widths shrink at the fastest possible $\sqrt{t^{-1} \log\log t}$ rate, along with a non-asymptotic concentration inequality for the empirical distribution function which holds uniformly over time with the same rate. The latter strengthens Smirnov's empirical process law of the iterated logarithm and extends the Dvoretzky-Kiefer-Wolfowitz inequality to hold uniformly over time. We give a new algorithm and sample complexity bound for selecting an arm with an approximately best quantile in a multi-armed bandit framework. In simulations, our method requires fewer samples than existing methods by a factor of five to fifty.
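To make the idea of a time-uniform quantile interval concrete, here is a minimal Python sketch. It is not the construction from the paper: it applies the fixed-time Dvoretzky-Kiefer-Wolfowitz inequality, $P(\sup_x |F_t(x) - F(x)| > \varepsilon) \le 2e^{-2t\varepsilon^2}$, at each time $t$ with error budget $\delta_t = \delta / (t(t+1))$, and relies on a union bound over $t$ (the budgets sum to $\delta$). This simpler approach only achieves a $\sqrt{t^{-1} \log t}$ width, slower than the $\sqrt{t^{-1} \log\log t}$ rate the abstract describes. The function names `dkw_radius` and `quantile_interval` are illustrative assumptions, not names from the paper.

```python
import math
import random


def dkw_radius(t, delta):
    """Radius eps_t with P(exists t: sup_x |F_t - F| > eps_t) <= delta.

    Uses the DKW inequality at each t with budget delta / (t * (t + 1));
    these budgets sum to delta over t = 1, 2, ... (union bound).
    """
    delta_t = delta / (t * (t + 1))
    return math.sqrt(math.log(2.0 / delta_t) / (2.0 * t))


def quantile_interval(sample, p, delta):
    """Interval for the p-quantile from the first t = len(sample) observations.

    On the event that the empirical CDF stays within eps_t of the true CDF,
    the p-quantile lies between the empirical (p - eps_t)- and
    (p + eps_t)-quantiles. Bounds are infinite when those levels
    fall outside (0, 1).
    """
    xs = sorted(sample)
    t = len(xs)
    eps = dkw_radius(t, delta)
    m = math.ceil(t * (p - eps))   # 1-indexed order statistic for the lower end
    k = math.floor(t * (p + eps))  # 0-indexed order statistic for the upper end
    lower = xs[m - 1] if m >= 1 else -math.inf
    upper = xs[k] if k < t else math.inf
    return lower, upper


if __name__ == "__main__":
    # Stream i.i.d. Uniform(0, 1) draws and report the median interval
    # at a few checkpoints; the true median is 0.5.
    random.seed(0)
    stream = []
    for t in range(1, 5001):
        stream.append(random.random())
        if t in (10, 100, 1000, 5000):
            print(t, quantile_interval(stream, p=0.5, delta=0.05))
```

Because the guarantee holds simultaneously for all $t$, the interval can be checked after every observation, which is what makes it usable as a stopping rule in an A/B test. The paper's own bounds are tighter than this union-bound sketch.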

