Evidence that visual communication preceded written language and provided a basis for it goes back to prehistory, in forms such as cave and rock paintings depicting traces of our distant ancestors. Emergent communication research has sought to explore how agents can learn to communicate in order to collaboratively solve tasks. Existing research has focused on language, with a learned communication channel transmitting sequences of discrete tokens between the agents. In this work, we explore a visual communication channel between agents that are allowed to draw with simple strokes. Our agents are parameterised by deep neural networks, and the drawing procedure is differentiable, allowing for end-to-end training. In the framework of a referential communication game, we demonstrate that agents can not only successfully learn to communicate by drawing, but with appropriate inductive biases, can do so in a fashion that humans can interpret. We hope to encourage future research to consider visual communication as a more flexible and directly interpretable alternative of training collaborative agents.
翻译:有证据表明视觉通信先于书面语言,为它提供了可追溯到史前的证据,其形式包括洞穴和岩石绘画,描绘我们远古祖先的痕迹。新兴通信研究试图探索代理人如何学会交流,以便合作解决任务。现有研究侧重于语言,通过一个学习的通讯渠道传递代理人之间的离散象征序列。在这项工作中,我们探索允许用简单划线绘制的代理人之间的视觉通信渠道。我们的代理人被深层神经网络所参照,绘图程序是不同的,允许进行端到端的培训。在优惠通信游戏的框架内,我们证明代理人不仅能够成功地学会通过绘图进行交流,而且具有适当的感性偏见,能够以人类能够解释的方式进行交流。我们希望鼓励未来的研究将视觉通信视为培训协作代理人的一种更灵活和直接可解释的替代办法。