Non-linear state-space models, also known as general hidden Markov models, are ubiquitous in statistical machine learning, being the most classical generative models for serial data and sequences in general. The particle-based, rapid incremental smoother (PaRIS) is a sequential Monte Carlo (SMC) technique allowing for efficient online approximation of expectations of additive functionals under the smoothing distribution in these models. Such expectations appear naturally in several learning contexts, such as maximum-likelihood estimation (MLE) and Markov score climbing (MSC). PaRIS has linear computational complexity, limited memory requirements, and comes with non-asymptotic bounds, convergence results, and stability guarantees. Still, being based on self-normalised importance sampling, the PaRIS estimator is biased. Our first contribution is to design a novel additive smoothing algorithm, the Parisian particle Gibbs (PPG) sampler, which can be viewed as a PaRIS algorithm driven by conditional SMC moves, resulting in bias-reduced estimates of the targeted quantities. We substantiate the PPG algorithm with theoretical results, including new bounds on bias and variance as well as deviation inequalities. Our second contribution is to apply PPG in a learning framework, covering MLE and MSC as special cases. In this context, we establish, under standard assumptions, non-asymptotic bounds highlighting the value of bias reduction and the implicit Rao--Blackwellization of PPG. These are the first non-asymptotic results of this kind in this setting. We illustrate our theoretical results with numerical experiments supporting our claims.
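To make the online smoothing idea concrete, the following is a minimal, self-contained Python sketch of a PaRIS-style recursion inside a bootstrap particle filter. All model parameters (`phi`, `sigma_x`, `sigma_y`), the linear-Gaussian toy model, and the additive functional h = sum_t X_t are illustrative assumptions, not taken from the paper; the backward indices are also drawn by an exact categorical sample (O(N) per draw), whereas PaRIS uses an accept-reject scheme to retain overall linear complexity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear-Gaussian state-space model (illustrative parameters):
#   X_t = phi * X_{t-1} + sigma_x * eps_t,   Y_t = X_t + sigma_y * eta_t
phi, sigma_x, sigma_y = 0.9, 1.0, 1.0
T, N, N_tilde = 50, 500, 2  # time steps, particles, backward samples per particle

# Simulate a synthetic observation record y_{0:T-1}.
xs, ys = np.zeros(T), np.zeros(T)
xs[0] = rng.normal(0.0, sigma_x)
ys[0] = xs[0] + rng.normal(0.0, sigma_y)
for t in range(1, T):
    xs[t] = phi * xs[t - 1] + rng.normal(0.0, sigma_x)
    ys[t] = xs[t] + rng.normal(0.0, sigma_y)

def log_m(x_prev, x_curr):
    """Log transition density log m(x_prev, x_curr), up to an additive constant."""
    return -0.5 * ((x_curr - phi * x_prev) / sigma_x) ** 2

# Bootstrap particle filter carrying PaRIS statistics tau_i, which track the
# additive functional along particle histories without storing full paths.
part = rng.normal(0.0, sigma_x, N)              # particles at t = 0
tau = part.copy()                               # h_0(x_0) = x_0
logw = -0.5 * ((ys[0] - part) / sigma_y) ** 2   # bootstrap log-weights

for t in range(1, T):
    w = np.exp(logw - logw.max()); w /= w.sum()
    anc = rng.choice(N, size=N, p=w)            # multinomial resampling
    new = phi * part[anc] + rng.normal(0.0, sigma_x, N)
    new_tau = np.empty(N)
    for i in range(N):
        # Draw N_tilde backward indices J from the backward kernel
        # (exact categorical draw here, for clarity).
        lb = np.log(w) + log_m(part, new[i])
        b = np.exp(lb - lb.max()); b /= b.sum()
        J = rng.choice(N, size=N_tilde, p=b)
        # PaRIS update: tau_i <- mean_j [ tau_{J(j)} + h_t(x_{t-1}^{J(j)}, x_t^i) ],
        # with h_t(x_prev, x) = x for the functional sum_t X_t.
        new_tau[i] = np.mean(tau[J]) + new[i]
    part, tau = new, new_tau
    logw = -0.5 * ((ys[t] - part) / sigma_y) ** 2

w = np.exp(logw - logw.max()); w /= w.sum()
est = float(np.sum(w * tau))  # online estimate of E[sum_t X_t | Y_{0:T-1}]
print(est)
```

Because this estimator is built on self-normalised importance sampling, it carries the bias discussed above; the PPG sampler wraps such a pass in conditional SMC moves to reduce that bias.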