比较贴贴现和平均成本的Markov决定程序:统计意义视角 (Comparing discounted and average-cost Markov Decision Processes: a statistical significance perspective) - 专知论文

会员服务 ·

0

可辨认的 · 优化器 · 统计量 · 经验分布 · t检验 ·

2021 年 12 月 1 日

Comparing discounted and average-cost Markov Decision Processes: a statistical significance perspective

翻译：比较贴贴现和平均成本的Markov决定程序:统计意义视角

Optimal Markov Decision Process policies for problems with finite state and action space are identified through a partial ordering by comparing the value function across states. This is referred to as state-based optimality. This paper identifies when such optimality guarantees some form of system-based optimality as measured by a scalar. Four such system-based metrics are introduced. Uni-variate empirical distributions of these metrics are obtained through simulation as to assess whether theoretically optimal policies provide a statistically significant advantage. This has been conducted using a Student's t-test, Welch's $t$-test and a Mann-Whitney $U$-test. The proposed method is applied to a common problem in queuing theory: admission control.

翻译：关于有限状态和行动空间问题的最佳Markov 决策程序政策通过比较各州的值函数,通过部分排序确定,称为基于国家的最佳性。本文件确定这种最佳性何时能保证某种以卡路里测量的基于系统的最佳性。采用了四种基于系统的计量标准。这些计量标准的单变经验分布是通过模拟获得的,通过模拟来评估理论上的最佳政策是否提供了统计上的重大优势。这是使用学生的t-test、Welch's $t-tat-test和Man-Whitney $U$-test的测试进行的。拟议方法适用于排队理论中常见的问题:录入控制。

0

相关内容

可辨认的

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

专知会员服务

134+阅读 · 2020年4月14日

【新书】贝叶斯网络进展与新应用，附全书下载

【新书】贝叶斯网络进展与新应用，附全书下载

专知会员服务

122+阅读 · 2019年12月9日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知

6+阅读 · 2020年1月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Statistical inference in factor analysis for diffusion processes from discrete observations

Arxiv

0+阅读 · 2022年2月3日

Error analysis of a fully discrete scheme for the Cahn-Hilliard-Magneto-hydrodynamics problem

Arxiv

0+阅读 · 2022年2月3日

Robust approach for comparing two dependent normal populations through Wald-type tests based on Rényi's pseudodistance estimators

Arxiv

0+阅读 · 2022年2月2日

Independence in Infinite Probabilistic Databases

Arxiv

0+阅读 · 2022年2月1日

Black-box Bayesian inference for economic agent-based models

Black-box Bayesian inference for economic agent-based models

Arxiv

0+阅读 · 2022年2月1日

Spectral Clustering, Spanning Forest, and Bayesian Forest Process

Spectral Clustering, Spanning Forest, and Bayesian Forest Process

Arxiv

0+阅读 · 2022年2月1日

Evaluating Feature Attribution: An Information-Theoretic Perspective

Arxiv

0+阅读 · 2022年2月1日

On a formula for moments of the multivariate normal distribution generalizing Stein's lemma and Isserlis theorem

Arxiv

0+阅读 · 2022年2月1日

Approximate Bayesian Computation via Classification

Arxiv

0+阅读 · 2022年1月31日

The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

Arxiv

5+阅读 · 2020年12月21日

VIP会员

文章信息

相关主题

相关VIP内容

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

专知会员服务

134+阅读 · 2020年4月14日

【新书】贝叶斯网络进展与新应用，附全书下载

【新书】贝叶斯网络进展与新应用，附全书下载

专知会员服务

122+阅读 · 2019年12月9日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型时代的文档智能：综述

蜂窝通信是否是无人机与无人地面战车主宰战场的关键？

文档视觉问答简述

最新新Agent综述！76页327篇论文梳理，北交大桑基韬教授团队发布《迈向模型原生智能体式人工智能的范式转变综述》

相关资讯

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知

6+阅读 · 2020年1月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Statistical inference in factor analysis for diffusion processes from discrete observations

Arxiv

0+阅读 · 2022年2月3日

Error analysis of a fully discrete scheme for the Cahn-Hilliard-Magneto-hydrodynamics problem

Arxiv

0+阅读 · 2022年2月3日

Robust approach for comparing two dependent normal populations through Wald-type tests based on Rényi's pseudodistance estimators

Arxiv

0+阅读 · 2022年2月2日

Independence in Infinite Probabilistic Databases

Arxiv

0+阅读 · 2022年2月1日

Black-box Bayesian inference for economic agent-based models

Black-box Bayesian inference for economic agent-based models

Arxiv

0+阅读 · 2022年2月1日

Spectral Clustering, Spanning Forest, and Bayesian Forest Process

Spectral Clustering, Spanning Forest, and Bayesian Forest Process

Arxiv

0+阅读 · 2022年2月1日

Evaluating Feature Attribution: An Information-Theoretic Perspective

Arxiv

0+阅读 · 2022年2月1日

On a formula for moments of the multivariate normal distribution generalizing Stein's lemma and Isserlis theorem

Arxiv

0+阅读 · 2022年2月1日

Approximate Bayesian Computation via Classification

Arxiv

0+阅读 · 2022年1月31日

The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

Arxiv

5+阅读 · 2020年12月21日

微信扫码咨询专知VIP会员