语义空间中新出现差异通信 (Emergent Discrete Communication in Semantic Spaces) - 专知论文

会员服务 ·

0

离散化 · 独热 · 词元分析器 · 学成 · 独热向量 ·

2021 年 11 月 4 日

Emergent Discrete Communication in Semantic Spaces

翻译：语义空间中新出现差异通信

Mycal Tucker,Huao Li,Siddharth Agrawal,Dana Hughes,Katia Sycara,Michael Lewis,Julie Shah

Neural agents trained in reinforcement learning settings can learn to communicate among themselves via discrete tokens, accomplishing as a team what agents would be unable to do alone. However, the current standard of using one-hot vectors as discrete communication tokens prevents agents from acquiring more desirable aspects of communication such as zero-shot understanding. Inspired by word embedding techniques from natural language processing, we propose neural agent architectures that enables them to communicate via discrete tokens derived from a learned, continuous space. We show in a decision theoretic framework that our technique optimizes communication over a wide range of scenarios, whereas one-hot tokens are only optimal under restrictive assumptions. In self-play experiments, we validate that our trained agents learn to cluster tokens in semantically-meaningful ways, allowing them communicate in noisy environments where other techniques fail. Lastly, we demonstrate both that agents using our method can effectively respond to novel human communication and that humans can understand unlabeled emergent agent communication, outperforming the use of one-hot communication.

翻译：在强化学习设置方面受过训练的神经代理商可以学习通过离散的象征物相互交流,作为一个团队完成什么是不能单独做到的。然而,目前使用单热矢量作为离散的通信象征物的标准使代理商无法获得更可取的通信方面,例如零射线理解。在自然语言处理过程中的文字嵌入技术的启发下,我们提议神经代理物结构,使他们能够通过从一个有知识的连续空间产生的离散象征物进行交流。我们在一个决定性框架中显示,我们的技术在广泛的情景中优化了通信,而单热象征物只是在限制性假设下是最佳的。在自我玩耍实验中,我们证实我们受过训练的代理商学会了以语义上有意义的方式组合标志物,允许他们在其他技术失败的吵闹环境中进行交流。最后,我们证明,使用我们的方法可以有效地应对人类新通信,人类能够理解无标签的新兴代理物的通信,而人类能够理解无标签的新兴代理物的通信,比使用单热通信好。

0

相关内容

离散化

【上海交大】<操作系统> 2021课程，附课件

【上海交大】<操作系统> 2021课程，附课件

专知会员服务

42+阅读 · 2021年4月3日

【Yoshua Bengio】因果表示学习，附视频与72页ppt

【Yoshua Bengio】因果表示学习，附视频与72页ppt

专知会员服务

76+阅读 · 2021年1月7日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

华为发布《自动驾驶网络解决方案白皮书》

华为发布《自动驾驶网络解决方案白皮书》

专知会员服务

130+阅读 · 2020年5月22日

【推荐】用于解缠学习的半监督StyleGAN，Semi-Supervised StyleGAN for Disentanglement Learning

【推荐】用于解缠学习的半监督StyleGAN，Semi-Supervised StyleGAN for Disentanglement Learning

专知会员服务

36+阅读 · 2020年3月13日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Emergent Communication of Generalizations

Arxiv

0+阅读 · 2022年1月9日

Distributed Nash Equilibrium Seeking over Time-Varying Directed Communication Networks

Arxiv

0+阅读 · 2022年1月7日

Mixture of basis for interpretable continual learning with distribution shifts

Arxiv

0+阅读 · 2022年1月5日

Discovering Diverse Nearly Optimal Policies with Successor Features

Arxiv

0+阅读 · 2022年1月4日

On the Expressivity of Markov Reward

Arxiv

3+阅读 · 2021年11月1日

Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation

Arxiv

9+阅读 · 2021年6月16日

Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

Arxiv

7+阅读 · 2021年4月14日

AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

Arxiv

4+阅读 · 2019年1月17日

Emergent Translation in Multi-Agent Communication

Arxiv

3+阅读 · 2018年4月11日

Parameter Space Noise for Exploration

Arxiv

3+阅读 · 2018年1月31日

VIP会员

文章信息

相关主题

词元分析器

相关VIP内容

【上海交大】<操作系统> 2021课程，附课件

【上海交大】<操作系统> 2021课程，附课件

专知会员服务

42+阅读 · 2021年4月3日

【Yoshua Bengio】因果表示学习，附视频与72页ppt

【Yoshua Bengio】因果表示学习，附视频与72页ppt

专知会员服务

76+阅读 · 2021年1月7日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

华为发布《自动驾驶网络解决方案白皮书》

华为发布《自动驾驶网络解决方案白皮书》

专知会员服务

130+阅读 · 2020年5月22日

【推荐】用于解缠学习的半监督StyleGAN，Semi-Supervised StyleGAN for Disentanglement Learning

【推荐】用于解缠学习的半监督StyleGAN，Semi-Supervised StyleGAN for Disentanglement Learning

专知会员服务

36+阅读 · 2020年3月13日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】面向企业的图学习扩展：生产级图学习与推理，485页pdf

AI智能体编程：技术、挑战与机遇综述

【国家标准】数据安全技术数据安全风险评估方法

【CMU博士论文】交互式学习的进展：替代性反馈机制与自适应因果推理

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Emergent Communication of Generalizations

Arxiv

0+阅读 · 2022年1月9日

Distributed Nash Equilibrium Seeking over Time-Varying Directed Communication Networks

Arxiv

0+阅读 · 2022年1月7日

Mixture of basis for interpretable continual learning with distribution shifts

Arxiv

0+阅读 · 2022年1月5日

Discovering Diverse Nearly Optimal Policies with Successor Features

Arxiv

0+阅读 · 2022年1月4日

On the Expressivity of Markov Reward

Arxiv

3+阅读 · 2021年11月1日

Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation

Arxiv

9+阅读 · 2021年6月16日

Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

Arxiv

7+阅读 · 2021年4月14日

AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

Arxiv

4+阅读 · 2019年1月17日

Emergent Translation in Multi-Agent Communication

Arxiv

3+阅读 · 2018年4月11日

Parameter Space Noise for Exploration

Arxiv

3+阅读 · 2018年1月31日

微信扫码咨询专知VIP会员