使用算法信息理论的智慧和不雄心壮志 (Intelligence and Unambitiousness Using Algorithmic Information Theory) - 专知论文

会员服务 ·

0

INFORMS · 信息理论 · 通用智能 · 学成 · 易处理的 ·

2021 年 5 月 13 日

Intelligence and Unambitiousness Using Algorithmic Information Theory

翻译：使用算法信息理论的智慧和不雄心壮志

Michael K. Cohen,Badri Vellambi,Marcus Hutter

from arxiv, 13 pages, 6 figures, 5-page appendix. arXiv admin note: text overlap with arXiv:1905.12186

Algorithmic Information Theory has inspired intractable constructions of general intelligence (AGI), and undiscovered tractable approximations are likely feasible. Reinforcement Learning (RL), the dominant paradigm by which an agent might learn to solve arbitrary solvable problems, gives an agent a dangerous incentive: to gain arbitrary "power" in order to intervene in the provision of their own reward. We review the arguments that generally intelligent algorithmic-information-theoretic reinforcement learners such as Hutter's (2005) AIXI would seek arbitrary power, including over us. Then, using an information-theoretic exploration schedule, and a setup inspired by causal influence theory, we present a variant of AIXI which learns to not seek arbitrary power; we call it "unambitious". We show that our agent learns to accrue reward at least as well as a human mentor, while relying on that mentor with diminishing probability. And given a formal assumption that we probe empirically, we show that eventually, the agent's world-model incorporates the following true fact: intervening in the "outside world" will have no effect on reward acquisition; hence, it has no incentive to shape the outside world.

翻译：解析信息理论启发了一般情报(AGI)的棘手构思,而未发现的可移植近似可能是可行的。强化学习(RL)是代理人学习解决任意可溶问题的主导范式,它给代理人一种危险的激励:获得任意的“权力”以干预提供自己的奖赏。我们审视了一般智能算法-信息理论强化学习者(如Hutter's(2005年) AIXI)会寻求任意权力(包括对我们)的论点。然后,利用信息理论探索时间表和由因果关系理论启发的构思,我们展示了AXI的变式,学会不寻求任意权力;我们称它为“不雄心勃勃 ” 。我们表明我们的代理人学会了至少和人类导师的奖赏,同时依赖导师的概率越来越小。我们正式假设我们用经验来探究,我们最终显示,该代理人的世界模型包含以下事实:在“外部世界”的干预不会对奖赏获取产生任何影响;因此,它没有激励机制来塑造外部世界。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

148+阅读 · 2020年4月20日

【课程推荐】人工智能导论：Introduction to Articial Intelligence

【课程推荐】人工智能导论：Introduction to Articial Intelligence

专知会员服务

103+阅读 · 2019年12月20日

【纽约大学-AI研讨会】现代人工智能（Modern Artificial Intelligence）

【纽约大学-AI研讨会】现代人工智能（Modern Artificial Intelligence）

专知会员服务

26+阅读 · 2019年11月10日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Arxiv

0+阅读 · 2021年7月3日

Game Theory to Study Interactions between Mobility Stakeholders

Arxiv

0+阅读 · 2021年7月3日

Constructive Decision Theory

Arxiv

0+阅读 · 2021年7月2日

Artificial Intelligence in the Creative Industries: A Review

Arxiv

0+阅读 · 2021年7月2日

Meta-learning in natural and artificial intelligence

Arxiv

10+阅读 · 2020年11月26日

Interference and Generalization in Temporal Difference Learning

Arxiv

8+阅读 · 2020年3月13日

What Can Neural Networks Reason About?

Arxiv

10+阅读 · 2020年2月15日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

7+阅读 · 2019年11月5日

Generalization and Regularization in DQN

Generalization and Regularization in DQN

Arxiv

6+阅读 · 2019年1月30日

Physical Primitive Decomposition

Physical Primitive Decomposition

Arxiv

4+阅读 · 2018年9月13日

VIP会员

文章信息

相关主题

相关VIP内容

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

148+阅读 · 2020年4月20日

【课程推荐】人工智能导论：Introduction to Articial Intelligence

【课程推荐】人工智能导论：Introduction to Articial Intelligence

专知会员服务

103+阅读 · 2019年12月20日

【纽约大学-AI研讨会】现代人工智能（Modern Artificial Intelligence）

【纽约大学-AI研讨会】现代人工智能（Modern Artificial Intelligence）

专知会员服务

26+阅读 · 2019年11月10日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《多域空战指挥体系：驾驭复杂性的艺术》

构建军事人工智能信任体系始于破除黑盒机制

《生态建模密码破译：建模与编程实践》美陆军最新报告

《战争形态演变：合成兵种防御主导模式探析》48页slides

相关资讯

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Arxiv

0+阅读 · 2021年7月3日

Game Theory to Study Interactions between Mobility Stakeholders

Arxiv

0+阅读 · 2021年7月3日

Constructive Decision Theory

Arxiv

0+阅读 · 2021年7月2日

Artificial Intelligence in the Creative Industries: A Review

Arxiv

0+阅读 · 2021年7月2日

Meta-learning in natural and artificial intelligence

Arxiv

10+阅读 · 2020年11月26日

Interference and Generalization in Temporal Difference Learning

Arxiv

8+阅读 · 2020年3月13日

What Can Neural Networks Reason About?

Arxiv

10+阅读 · 2020年2月15日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

7+阅读 · 2019年11月5日

Generalization and Regularization in DQN

Generalization and Regularization in DQN

Arxiv

6+阅读 · 2019年1月30日

Physical Primitive Decomposition

Physical Primitive Decomposition

Arxiv

4+阅读 · 2018年9月13日

微信扫码咨询专知VIP会员