Habits and goals in synergy: a variational Bayesian framework for behavior - 专知 (Zhuanzhi) Papers


Bayesian framework · variational Bayes · Bayesian · variational · synergy

April 11, 2023

Habits and goals in synergy: a variational Bayesian framework for behavior


Dongqi Han,Kenji Doya,Dongsheng Li,Jun Tani

How to behave efficiently and flexibly is a central problem for understanding biological agents and creating intelligent embodied AI. It is well known that behavior can be classified into two types: reward-maximizing habitual behavior, which is fast but inflexible, and goal-directed behavior, which is flexible but slow. Conventionally, habitual and goal-directed behaviors are considered to be handled by two distinct systems in the brain. Here, we propose to bridge the gap between the two behaviors, drawing on the principles of variational Bayesian theory. We incorporate both behaviors in one framework by introducing a Bayesian latent variable called "intention". Habitual behavior is generated using the prior distribution of intention, which is goal-less, and goal-directed behavior is generated using the posterior distribution of intention, which is conditioned on the goal. Building on this idea, we present a novel Bayesian framework for modeling behaviors. Our proposed framework enables skill sharing between the two kinds of behaviors, and, by leveraging the idea of predictive coding, it enables an agent to seamlessly generalize from habitual to goal-directed behavior without requiring additional training. The proposed framework suggests a fresh perspective for cognitive science and embodied AI, highlighting the potential for greater integration between habitual and goal-directed behaviors.
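The split between a goal-less prior and a goal-conditioned posterior over "intention" can be illustrated with a toy sketch. This is not the authors' implementation: the linear maps, the dimensions, the fixed standard deviations, and all function names below are hypothetical, and the weights are random rather than learned. The KL term is included only because it is a standard ingredient of variational objectives; the abstract does not state the actual training objective.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
STATE_DIM, GOAL_DIM, Z_DIM, ACTION_DIM = 4, 2, 3, 2

# Hypothetical linear "networks"; a real agent would learn these weights.
W_prior = rng.normal(scale=0.1, size=(Z_DIM, STATE_DIM))
W_post = rng.normal(scale=0.1, size=(Z_DIM, STATE_DIM + GOAL_DIM))
W_policy = rng.normal(scale=0.1, size=(ACTION_DIM, STATE_DIM + Z_DIM))


def prior_intention(state):
    """Goal-less prior p(z | s): mean and std of a diagonal Gaussian."""
    return W_prior @ state, np.ones(Z_DIM)


def posterior_intention(state, goal):
    """Goal-conditioned posterior q(z | s, g) as a diagonal Gaussian."""
    mean = W_post @ np.concatenate([state, goal])
    return mean, 0.5 * np.ones(Z_DIM)


def sample(mean, std):
    """Draw one intention sample z = mean + std * noise."""
    return mean + std * rng.standard_normal(mean.shape)


def act(state, z):
    """Shared policy: both behaviors decode actions with the same weights."""
    return np.tanh(W_policy @ np.concatenate([state, z]))


def kl_diag_gauss(mu_q, std_q, mu_p, std_p):
    """KL(q || p) between two diagonal Gaussians."""
    return float(np.sum(
        np.log(std_p / std_q)
        + (std_q**2 + (mu_q - mu_p) ** 2) / (2 * std_p**2)
        - 0.5
    ))


state = rng.normal(size=STATE_DIM)
goal = rng.normal(size=GOAL_DIM)

# Habitual behavior: intention sampled from the prior, no goal involved.
habitual_action = act(state, sample(*prior_intention(state)))

# Goal-directed behavior: intention sampled from the goal-conditioned posterior.
goal_directed_action = act(state, sample(*posterior_intention(state, goal)))

# Divergence between the two intention distributions for this state and goal.
kl = kl_diag_gauss(*posterior_intention(state, goal), *prior_intention(state))
```

Because `act` is shared, anything the policy learns under one intention distribution is available under the other, which is one plausible reading of the "skill sharing" the abstract describes.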

