Multi-agent artificial intelligence research promises a path to develop intelligent technologies that are more human-like and more human-compatible than those produced by "solipsistic" approaches, which do not consider interactions between agents. Melting Pot is a research tool developed to facilitate work on multi-agent artificial intelligence, and provides an evaluation protocol that measures generalization to novel social partners in a set of canonical test scenarios. Each scenario pairs a physical environment (a "substrate") with a reference set of co-players (a "background population"), to create a social situation with substantial interdependence between the individuals involved. For instance, some scenarios were inspired by institutional-economics-based accounts of natural resource management and public-good-provision dilemmas. Others were inspired by considerations from evolutionary biology, game theory, and artificial life. Melting Pot aims to cover a maximally diverse set of interdependencies and incentives. It includes the commonly-studied extreme cases of perfectly-competitive (zero-sum) motivations and perfectly-cooperative (shared-reward) motivations, but does not stop with them. As in real-life, a clear majority of scenarios in Melting Pot have mixed incentives. They are neither purely competitive nor purely cooperative and thus demand successful agents be able to navigate the resulting ambiguity. Here we describe Melting Pot 2.0, which revises and expands on Melting Pot. We also introduce support for scenarios with asymmetric roles, and explain how to integrate them into the evaluation protocol. This report also contains: (1) details of all substrates and scenarios; (2) a complete description of all baseline algorithms and results. Our intention is for it to serve as a reference for researchers using Melting Pot 2.0.
翻译:多剂人工智能研究为开发智能技术开辟了一条道路,这些技术比“单质”方法产生的技术更人性化、更人性兼容,而“单质”方法并不考虑代理人之间的互动。Melting Pot是为促进多剂人工智能工作而开发的一个研究工具,它提供了一种评估协议,衡量在一系列罐头测试情景中向新型社会伙伴推广的范式。每种情景都配对一种物理环境(一种“基底” ), 参照一组共同参与者(一种“基底人口 ” ), 创造一种在所涉个人之间高度相互依存的社会状况。例如,一些情景的灵感来自基于机构经济的自然资源管理和公益性难题的描述。 其他一些情景的灵感来自进化生物学、游戏理论和人工生命的考量。 熔化波特旨在覆盖各种各样的相互依存和激励因素。 它包括常见的完美竞争(零总) 动机和完美操作性(共享的) 动机,但并不止止止止于它们。 在真实生活中,这种真实的模型中, 导致多数的激励因素成为了我们 。