Bargaining can be used to resolve mixed-motive games in multi-agent systems. Although there is an abundance of negotiation strategies implemented in automated negotiating agents, most agents are based on single fixed strategies, while it is widely acknowledged that there is no single best-performing strategy for all negotiation settings. In this paper, we focus on bargaining settings where opponents are repeatedly encountered, but the bargaining problems change. We introduce a novel method that automatically creates and deploys a portfolio of complementary negotiation strategies using a training set and optimise pay-off in never-before-seen bargaining settings through per-setting strategy selection. Our method relies on the following contributions. We introduce a feature representation that captures characteristics for both the opponent and the bargaining problem. We model the behaviour of an opponent during a negotiation based on its actions, which is indicative of its negotiation strategy, in order to be more effective in future encounters. Our combination of feature-based methods generalises to new negotiation settings, as in practice, over time, it selects effective counter strategies in future encounters. Our approach is tested in an ANAC-like tournament, and we show that we are capable of winning such a tournament with a 5.6% increase in pay-off compared to the runner-up agent.
翻译:在多试办系统中,可以采用一种新颖的方法,利用一套培训,自动创造和部署一套互补的谈判战略,通过选择各种战略,在从未见过的谈判环境中优化收费。我们的方法依靠的是以下贡献。我们采用一种特征代表制,既捕捉对手的特点,又捕捉谈判问题。我们根据对手的行动来模拟对手在谈判中的表现,这说明谈判战略,以便在今后的交火中更加有效。我们采用基于特征的方法与新的谈判环境相结合,在实践中,随着时间的推移,我们选择了未来遇到的有效的对抗战略。我们的方法在类似ANAC的比赛中测试了我们的方法,我们显示我们有能力在类似ANAC的比赛中赢得这种竞争对手和讨价还价问题的特点。我们根据对手的行动来模拟对手在谈判中的行为,这是对谈判战略的提示,以便在今后的交价中更加有效。我们把基于特征的方法与新的谈判环境结合起来,在实践中,在将来遇到的交价中选择有效的反策略。我们的方法在类似ANAC的竞赛中测试,我们展示了我们的方法,并且我们展示了我们能够以5.6比得高的支付率比。