Recent work on promoting cooperation in multi-agent learning has resulted in many methods which successfully promote cooperation at the cost of becoming more vulnerable to exploitation by malicious actors. We show that this is an unavoidable trade-off and propose an objective which balances these concerns, promoting both safety and long-term cooperation. Moreover, the trade-off between safety and cooperation is not severe, and you can receive exponentially large returns through cooperation from a small amount of risk. We study both an exact solution method and propose a method for training policies that targets this objective, Accumulating Risk Capital Through Investing in Cooperation (ARCTIC), and evaluate them in iterated Prisoner's Dilemma and Stag Hunt.
翻译:最近促进多试剂学习合作的工作产生了许多方法,成功地促进了合作,但代价是更易受到恶意行为者的剥削。我们表明,这是一个不可避免的权衡,提出了平衡这些关切的目标,既促进安全,又促进长期合作。此外,安全与合作之间的权衡并不严重,通过少量风险的合作,你可以得到巨大的回报。我们研究了一种确切的解决办法,并提出了针对这一目标的培训政策方法,即通过合作投资积累风险资本,并在循环式囚犯的Dilemma和Stag Hunt中加以评估。