baller2vec+++:模拟协调剂外观多实体变异器 (baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents) - 专知论文

会员服务 ·

0

统计量 · MoDELS · 时间步 · 变换 · 学成 ·

2021 年 4 月 24 日

baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents

翻译：baller2vec+++:模拟协调剂外观多实体变异器

Michael A. Alcorn,Anh Nguyen

In many multi-agent spatiotemporal systems, the agents are under the influence of shared, unobserved variables (e.g., the play a team is executing in a game of basketball). As a result, the trajectories of the agents are often statistically dependent at any given time step; however, almost universally, multi-agent models implicitly assume the agents' trajectories are statistically independent at each time step. In this paper, we introduce baller2vec++, a multi-entity Transformer that can effectively model coordinated agents. Specifically, baller2vec++ applies a specially designed self-attention mask to a mixture of location and "look-ahead" trajectory sequences to learn the distributions of statistically dependent agent trajectories. We show that, unlike baller2vec (baller2vec++'s predecessor), baller2vec++ can learn to emulate the behavior of perfectly coordinated agents in a simulated toy dataset. Additionally, when modeling the trajectories of professional basketball players, baller2vec++ outperforms baller2vec by a wide margin.

翻译：在许多多试剂时空系统中, 代理器受到共享且不受观察的变量的影响( 例如, 一个团队在篮球游戏中执行的游戏 ) 。结果, 代理器的轨迹在任何特定时间步骤中通常在统计上取决于任何特定时间步骤; 但是, 几乎普遍地, 多试样模型暗含地假定, 代理器的轨迹在统计上每个步骤都是独立的。在本文中, 我们引入一个能够有效模拟协调代理器的多实体变异器。具体地说, Baller2vec++ 将一个专门设计的自我注意遮罩应用到一个位置和“ 外观” 轨迹序列来学习统计依赖性代理器轨迹的分布。我们表明, 与 Baller2vec ( Baller2vec++' 的前身) 不同的是, Baller2vec++ 可以学习在模拟的玩具数据集中模仿完全协调的代理器的行为。此外, 当模拟专业篮球员的轨迹时, Baller2vec+ 超越了球盘的模。

0

相关内容

统计量

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【ICML2020】用于强化学习的对比无监督表示嵌入

【ICML2020】用于强化学习的对比无监督表示嵌入

专知会员服务

28+阅读 · 2020年7月6日

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

专知会员服务

14+阅读 · 2020年6月18日

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

专知会员服务

17+阅读 · 2020年5月19日

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

专知会员服务

61+阅读 · 2020年5月15日

【UCLA-微软-WWW2020】异构图Transformer，Heterogeneous Graph Transformer

【UCLA-微软-WWW2020】异构图Transformer，Heterogeneous Graph Transformer

专知会员服务

137+阅读 · 2020年3月8日

【AAAI 2020 |接收论文】使用屏蔽层次Transformer进行会话结构建模，Conversation Structure Modeling Using Masked Hierarchical Transformer，波士顿大学

【AAAI 2020 |接收论文】使用屏蔽层次Transformer进行会话结构建模，Conversation Structure Modeling Using Masked Hierarchical Transformer，波士顿大学

专知会员服务

5+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

误差反向传播——RNN

误差反向传播——RNN

统计学习与视觉计算组

18+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

Grounding Spatio-Temporal Language with Transformers

Grounding Spatio-Temporal Language with Transformers

Arxiv

0+阅读 · 2021年6月16日

Scene Transformer: A unified multi-task model for behavior prediction and planning

Arxiv

0+阅读 · 2021年6月15日

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Arxiv

0+阅读 · 2021年6月15日

A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年6月14日

Exploring Sparse Expert Models and Beyond

Exploring Sparse Expert Models and Beyond

Arxiv

0+阅读 · 2021年6月14日

Verified Synthesis of Optimal Safety Controllers for Human-Robot Collaboration

Arxiv

0+阅读 · 2021年6月11日

Transformers for Modeling Physical Systems

Arxiv

0+阅读 · 2021年6月8日

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

Arxiv

0+阅读 · 2021年6月8日

Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling

Arxiv

1+阅读 · 2021年6月4日

A Neural Influence Diffusion Model for Social Recommendation

Arxiv

4+阅读 · 2019年4月20日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【ICML2020】用于强化学习的对比无监督表示嵌入

【ICML2020】用于强化学习的对比无监督表示嵌入

专知会员服务

28+阅读 · 2020年7月6日

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

专知会员服务

14+阅读 · 2020年6月18日

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

专知会员服务

17+阅读 · 2020年5月19日

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

专知会员服务

61+阅读 · 2020年5月15日

【UCLA-微软-WWW2020】异构图Transformer，Heterogeneous Graph Transformer

【UCLA-微软-WWW2020】异构图Transformer，Heterogeneous Graph Transformer

专知会员服务

137+阅读 · 2020年3月8日

【AAAI 2020 |接收论文】使用屏蔽层次Transformer进行会话结构建模，Conversation Structure Modeling Using Masked Hierarchical Transformer，波士顿大学

【AAAI 2020 |接收论文】使用屏蔽层次Transformer进行会话结构建模，Conversation Structure Modeling Using Masked Hierarchical Transformer，波士顿大学

专知会员服务

5+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

误差反向传播——RNN

误差反向传播——RNN

统计学习与视觉计算组

18+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

相关论文

Grounding Spatio-Temporal Language with Transformers

Grounding Spatio-Temporal Language with Transformers

Arxiv

0+阅读 · 2021年6月16日

Scene Transformer: A unified multi-task model for behavior prediction and planning

Arxiv

0+阅读 · 2021年6月15日

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Arxiv

0+阅读 · 2021年6月15日

A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年6月14日

Exploring Sparse Expert Models and Beyond

Exploring Sparse Expert Models and Beyond

Arxiv

0+阅读 · 2021年6月14日

Verified Synthesis of Optimal Safety Controllers for Human-Robot Collaboration

Arxiv

0+阅读 · 2021年6月11日

Transformers for Modeling Physical Systems

Arxiv

0+阅读 · 2021年6月8日

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

Arxiv

0+阅读 · 2021年6月8日

Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling

Arxiv

1+阅读 · 2021年6月4日

A Neural Influence Diffusion Model for Social Recommendation

Arxiv

4+阅读 · 2019年4月20日

微信扫码咨询专知VIP会员