This paper proposes a paradigm shift for affective computing by viewing the affect modeling task as a reinforcement learning process. According to our proposed framework, the context (environment) and the actions of an agent define the common representation that interweaves behavior and affect. To realise this framework, we build on recent advances in reinforcement learning and use a modified version of the Go-Explore algorithm, which has showcased state-of-the-art performance in hard-exploration tasks. In this initial study, we test our framework in an arcade game by training Go-Explore agents to both play optimally and mimic human demonstrations of arousal. We vary the degree of importance between optimal play and arousal imitation, and create agents that can effectively display a palette of affective and behavioral patterns. Our Go-Explore implementation not only introduces a new paradigm for affect modeling; it also empowers believable AI-based game testing by providing agents that can blend and express a multitude of behavioral and affective patterns.
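As a rough sketch of the blending described above (the notation here is illustrative and not the paper's exact formulation), the agent's objective can be viewed as a weighted combination of a behavior term and an arousal-imitation term,
$$R_\lambda = \lambda \, R_{\text{behavior}} + (1 - \lambda) \, R_{\text{arousal}}, \qquad \lambda \in [0, 1],$$
where $R_{\text{behavior}}$ rewards optimal play (e.g., game score), $R_{\text{arousal}}$ rewards similarity to the human arousal demonstrations, and $\lambda$ sets the relative importance of the two: $\lambda = 1$ yields a purely score-maximizing agent, while $\lambda = 0$ yields a pure arousal imitator.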