为基于重复语言的劝说游戏设计自动代理 (Designing an Automatic Agent for Repeated Language based Persuasion Games) - 专知论文

会员服务 ·

0

INFORMS · 蒙特卡洛树搜索 · AIM · 蒙特卡罗 · Next ·

2021 年 5 月 11 日

Designing an Automatic Agent for Repeated Language based Persuasion Games

翻译：为基于重复语言的劝说游戏设计自动代理

Maya Raifer,Guy Rotman,Reut Apel,Moshe Tennenholtz,Roi Reichart

Persuasion games are fundamental in economics and AI research and serve as the basis for important applications. However, work on this setup assumes communication with stylized messages that do not consist of rich human language. In this paper we consider a repeated sender (expert) -- receiver (decision maker) game, where the sender is fully informed about the state of the world and aims to persuade the receiver to accept a deal by sending one of several possible natural language reviews. We design an automatic expert that plays this repeated game, aiming to achieve the maximal payoff. Our expert is implemented within the Monte Carlo Tree Search (MCTS) algorithm, with deep learning models that exploit behavioral and linguistic signals in order to predict the next action of the decision maker, and the future payoff of the expert given the state of the game and a candidate review. We demonstrate the superiority of our expert over strong baselines, its adaptability to different decision makers, and that its selected reviews are nicely adapted to the proposed deal.

翻译：劝导游戏在经济学和AI研究中具有根本意义,是重要应用的基础。然而,这一设置的工作假设了与不包含丰富人文的系统化信息进行沟通。在本文件中,我们考虑的是反复发送者(专家) -- -- 接收者(决策者)游戏,发送者充分了解世界状况,目的是说服接收者接受一项交易,发送若干可能的自然语言评论之一。我们设计了一名自动专家,玩这个重复游戏,以达到最大效益。我们的专家是在蒙特卡洛树搜索算法(MCTS)中执行的,采用利用行为和语言信号的深层学习模型,以预测决策者的下一步行动,以及考虑到游戏状况和候选人审查的专家的未来报酬。我们展示了我们的专家优于强的基线、对不同决策者的适应性,其选定的审查与拟议的交易相适应。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

85+阅读 · 2020年12月22日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

96+阅读 · 2020年6月21日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

105+阅读 · 2020年6月10日

【Google】微型化机器学习教程，17页ppt，Getting Started with TinyML

【Google】微型化机器学习教程，17页ppt，Getting Started with TinyML

专知会员服务

69+阅读 · 2020年3月28日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

112+阅读 · 2019年11月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

167+阅读 · 2019年10月11日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

186+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

52+阅读 · 2019年9月29日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

8+阅读 · 2017年11月25日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

An Ambient Intelligence-Based Human Behavior Monitoring Framework for Ubiquitous Environments

Arxiv

0+阅读 · 2021年6月29日

AI-Augmented Behavior Analysis for Children with Developmental Disabilities: Building Towards Precision Treatment

AI-Augmented Behavior Analysis for Children with Developmental Disabilities: Building Towards Precision Treatment

Arxiv

0+阅读 · 2021年6月28日

Model-Advantage Optimization for Model-Based Reinforcement Learning

Arxiv

0+阅读 · 2021年6月26日

Discovering Generalizable Skills via Automated Generation of Diverse Tasks

Arxiv

0+阅读 · 2021年6月26日

A hybrid model-based and learning-based approach for classification using limited number of training samples

Arxiv

0+阅读 · 2021年6月25日

Repeated Quantum Games and Strategic Efficiency

Arxiv

0+阅读 · 2021年6月25日

Learning Compositional Representations for Few-Shot Recognition

Learning Compositional Representations for Few-Shot Recognition

Arxiv

5+阅读 · 2018年12月21日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

Emergent Translation in Multi-Agent Communication

Arxiv

3+阅读 · 2018年4月11日

Analyzing Language Learned by an Active Question Answering Agent

Arxiv

6+阅读 · 2018年1月23日

VIP会员

文章信息

相关主题

蒙特卡洛树搜索

相关VIP内容

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

85+阅读 · 2020年12月22日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

96+阅读 · 2020年6月21日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

105+阅读 · 2020年6月10日

【Google】微型化机器学习教程，17页ppt，Getting Started with TinyML

【Google】微型化机器学习教程，17页ppt，Getting Started with TinyML

专知会员服务

69+阅读 · 2020年3月28日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

112+阅读 · 2019年11月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

167+阅读 · 2019年10月11日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

186+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

52+阅读 · 2019年9月29日

热门VIP内容

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

8+阅读 · 2017年11月25日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

An Ambient Intelligence-Based Human Behavior Monitoring Framework for Ubiquitous Environments

Arxiv

0+阅读 · 2021年6月29日

AI-Augmented Behavior Analysis for Children with Developmental Disabilities: Building Towards Precision Treatment

AI-Augmented Behavior Analysis for Children with Developmental Disabilities: Building Towards Precision Treatment

Arxiv

0+阅读 · 2021年6月28日

Model-Advantage Optimization for Model-Based Reinforcement Learning

Arxiv

0+阅读 · 2021年6月26日

Discovering Generalizable Skills via Automated Generation of Diverse Tasks

Arxiv

0+阅读 · 2021年6月26日

A hybrid model-based and learning-based approach for classification using limited number of training samples

Arxiv

0+阅读 · 2021年6月25日

Repeated Quantum Games and Strategic Efficiency

Arxiv

0+阅读 · 2021年6月25日

Learning Compositional Representations for Few-Shot Recognition

Learning Compositional Representations for Few-Shot Recognition

Arxiv

5+阅读 · 2018年12月21日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

Emergent Translation in Multi-Agent Communication

Arxiv

3+阅读 · 2018年4月11日

Analyzing Language Learned by an Active Question Answering Agent

Arxiv

6+阅读 · 2018年1月23日

微信扫码咨询专知VIP会员