社会监督的代议制学习:主观性在学习效率代表制中的作用 (Socially Supervised Representation Learning: the Role of Subjectivity in Learning Efficient Representations)

Despite its rise as a prominent solution to the data inefficiency of today's machine learning models, self-supervised learning has yet to be studied from a purely multi-agent perspective. In this work, we propose that aligning internal subjective representations, which naturally arise in a multi-agent setup where agents receive partial observations of the same underlying environmental state, can lead to more data-efficient representations. We propose that multi-agent environments, where agents do not have access to the observations of others but can communicate within a limited range, guarantees a common context that can be leveraged in individual representation learning. The reason is that subjective observations necessarily refer to the same subset of the underlying environmental states and that communication about these states can freely offer a supervised signal. To highlight the importance of communication, we refer to our setting as \textit{socially supervised representation learning}. We present a minimal architecture comprised of a population of autoencoders, where we define loss functions, capturing different aspects of effective communication, and examine their effect on the learned representations. We show that our proposed architecture allows the emergence of aligned representations. The subjectivity introduced by presenting agents with distinct perspectives of the environment state contributes to learning abstract representations that outperform those learned by a single autoencoder and a population of autoencoders, presented with identical perspectives of the environment state. Altogether, our results demonstrate how communication from subjective perspectives can lead to the acquisition of more abstract representations in multi-agent systems, opening promising perspectives for future research at the intersection of representation learning and emergent communication.

翻译：尽管自我监督的学习是当今机器学习模式数据效率低下的一个突出解决办法,但自我监督的学习尚未从纯多剂的角度加以研究。在这项工作中,我们提议,将内部主观表述(在多剂结构中,代理人接受对同一基本环境状况的部分观察自然会产生),可能导致数据效率更高的表述。我们提议,多剂环境(在这种环境中,代理人无法接触他人的观察,但能够在有限的范围内进行交流)保证一种共同的环境,在个人代表性学习中可以加以利用。原因是,主观观测必然是指基础环境状态的同一部分,而关于这些状态的沟通可以自由提供一个受监督的信号。为了强调通信的重要性,我们提到我们的设置是作为对同一基本环境状况进行局部观察的。我们提出一个最小的结构,即由一组自动算数者组成,我们界定损失功能,收集有效通信的不同方面,并研究其对所学到的表述的影响。我们提议的架构允许出现统一的表述。由具有不同深度的传播观点的代理人介绍环境的同一部分,通过在深度视角介绍环境状况下进行主题性介绍,而从相同的购买结果的深度分析,从单一数据结构中学习了我们未来的人口结构。