无需替换的取样的可信度序列 (Confidence sequences for sampling without replacement) - 专知论文

会员服务 ·

0

置信度 · 频率主义学派 · TOOLS · 样本 · 计算机科学 ·

2021 年 1 月 8 日

Confidence sequences for sampling without replacement

翻译：无需替换的取样的可信度序列

Ian Waudby-Smith,Aaditya Ramdas

Many practical tasks involve sampling sequentially without replacement (WoR) from a finite population of size $N$, in an attempt to estimate some parameter $\theta^\star$. Accurately quantifying uncertainty throughout this process is a nontrivial task, but is necessary because it often determines when we stop collecting samples and confidently report a result. We present a suite of tools for designing confidence sequences (CS) for $\theta^\star$. A CS is a sequence of confidence sets $(C_n)_{n=1}^N$, that shrink in size, and all contain $\theta^\star$ simultaneously with high probability. We present a generic approach to constructing a frequentist CS using Bayesian tools, based on the fact that the ratio of a prior to the posterior at the ground truth is a martingale. We then present Hoeffding- and empirical-Bernstein-type time-uniform CSs and fixed-time confidence intervals for sampling WoR, which improve on previous bounds in the literature and explicitly quantify the benefit of WoR sampling.

翻译：许多实际任务涉及连续取样,而不从一定规模的美元中替换(WoR),以试图估算某些参数$\theta ⁇ star$。准确量化整个过程中的不确定性是一项非三重任务,但之所以有必要,是因为它常常确定当我们停止采集样本时,并有信心地报告结果。我们为$theta ⁇ star$提供了一套设计信任序列的工具(CS)。 CS是一套(C_n)n=1N$的置信套件序列,其规模缩小,所有都包含$\theta ⁇ star$,同时具有很高的概率。我们提出了一个使用Bayesian工具来构建常客式 CS的通用方法,其依据是,在地面的外星之前的比例是martingale。然后我们提出一套用于设计信任序列(CS)的Hoffding-和实证-Bernstein型时间统一 CS和固定时间信任间隔,用于取样WoR的样本,这些套件在文献的以往界限上有所改进,并明确量化WoR取样的好处。

0

相关内容

置信度

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

118+阅读 · 2020年2月3日

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

专知会员服务

5+阅读 · 2019年12月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Bayesian posterior repartitioning for nested sampling

Arxiv

0+阅读 · 2021年3月8日

Cross-validation based adaptive sampling for Gaussian process models

Arxiv

0+阅读 · 2021年3月6日

Efficient Learning in Non-Stationary Linear Markov Decision Processes

Arxiv

0+阅读 · 2021年3月5日

Is Simple Uniform Sampling Efficient for Center-Based Clustering With Outliers: When and Why?

Arxiv

0+阅读 · 2021年3月5日

Density ratio model with data-adaptive basis function

Arxiv

0+阅读 · 2021年3月5日

Cost-sensitive Selection of Variables by Ensemble of Model Sequences

Arxiv

0+阅读 · 2021年3月5日

Time-dependent stochastic basis adaptation for uncertainty quantification

Arxiv

0+阅读 · 2021年3月4日

Small Sample Spaces for Gaussian Processes

Arxiv

0+阅读 · 2021年3月4日

Construction of approximate $C^1$ bases for isogeometric analysis on two-patch domains

Arxiv

0+阅读 · 2021年3月4日

Minimax Risk and Uniform Convergence Rates for Nonparametric Dyadic Regression

Arxiv

0+阅读 · 2021年3月4日

VIP会员

文章信息

相关主题

频率主义学派

计算机科学

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

118+阅读 · 2020年2月3日

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

专知会员服务

5+阅读 · 2019年12月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《陆军战斗操练中的关键事件诊断》

《自适应训练辅助概念及其在空战管理员加速训练中的应用导论》最新126页

军事通信市场七大趋势概述

《抗干扰无人机蜂群行为的遗传算法方法》

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Bayesian posterior repartitioning for nested sampling

Arxiv

0+阅读 · 2021年3月8日

Cross-validation based adaptive sampling for Gaussian process models

Arxiv

0+阅读 · 2021年3月6日

Efficient Learning in Non-Stationary Linear Markov Decision Processes

Arxiv

0+阅读 · 2021年3月5日

Is Simple Uniform Sampling Efficient for Center-Based Clustering With Outliers: When and Why?

Arxiv

0+阅读 · 2021年3月5日

Density ratio model with data-adaptive basis function

Arxiv

0+阅读 · 2021年3月5日

Cost-sensitive Selection of Variables by Ensemble of Model Sequences

Arxiv

0+阅读 · 2021年3月5日

Time-dependent stochastic basis adaptation for uncertainty quantification

Arxiv

0+阅读 · 2021年3月4日

Small Sample Spaces for Gaussian Processes

Arxiv

0+阅读 · 2021年3月4日

Construction of approximate $C^1$ bases for isogeometric analysis on two-patch domains

Arxiv

0+阅读 · 2021年3月4日

Minimax Risk and Uniform Convergence Rates for Nonparametric Dyadic Regression

Arxiv

0+阅读 · 2021年3月4日

微信扫码咨询专知VIP会员