Life is Random, Time is Not: Markov Decision Processes with Window Objectives


December 10, 2020


Thomas Brihaye, Florent Delgrange, Youssouf Oualhadj, Mickael Randour

The window mechanism was introduced by Chatterjee et al. to strengthen classical game objectives with time bounds. It allows the synthesis of system controllers that exhibit acceptable behaviors within a configurable time frame, all along their infinite execution, in contrast to traditional objectives, which only require correct behavior in the limit. The window concept has proved useful in a variety of two-player zero-sum games, both because it enables reasoning about such time bounds in system specifications and because of the increased tractability that it usually yields. In this work, we extend the window framework to stochastic environments by considering Markov decision processes. A fundamental problem in this context is the threshold probability problem: given an objective, it aims to synthesize strategies that guarantee satisfying runs with a given probability. We solve it for the usual variants of window objectives, where either the time frame is set as a parameter or we ask whether such a time frame exists. We develop a generic approach for window-based objectives and instantiate it for the classical mean-payoff and parity objectives, already considered in games. Our work paves the way to a wide use of the window mechanism in stochastic models.
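To make the window idea concrete, here is a minimal sketch of a window mean-payoff check on a finite prefix of a run. It is an illustration only and not the paper's algorithm. It assumes a run is given as a list of integer weights, that the mean-payoff threshold is 0, and that a window "closes" as soon as the running sum of weights becomes non-negative within the time bound. The function names `window_closes` and `direct_fixed_window` are hypothetical.

```python
from typing import Sequence


def window_closes(weights: Sequence[int], start: int, bound: int) -> bool:
    """Return True if the window opened at `start` closes within `bound` steps.

    Assumption: the window closes at the first step where the sum of the
    weights seen since `start` is non-negative (threshold 0).
    """
    total = 0
    for step in range(start, min(start + bound, len(weights))):
        total += weights[step]
        if total >= 0:
            return True
    return False


def direct_fixed_window(weights: Sequence[int], bound: int) -> bool:
    """Check that every window that fits entirely in the prefix closes in time.

    Positions whose full window would extend past the end of the prefix
    are skipped, since the prefix alone cannot decide them.
    """
    last_start = len(weights) - bound + 1
    return all(window_closes(weights, i, bound) for i in range(last_start))


if __name__ == "__main__":
    prefix = [-1, 1, -1, 1]
    print(direct_fixed_window(prefix, 2))  # every dip is repaid within 2 steps
    print(direct_fixed_window(prefix, 1))  # a single negative step already fails
```

The sketch covers only the fixed time-frame variant on one given run. The threshold probability problem described in the abstract asks for a strategy in a Markov decision process under which such runs occur with at least a given probability, which this code does not attempt to compute.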

