How can we acquire world models that veridically represent the outside world both in terms of what is there and in terms of how our actions affect it? Can we acquire such models by interacting with the world, and can we state mathematical desiderata for their relationship with a hypothetical reality existing outside our heads? As machine learning is moving towards representations containing not just observational but also interventional knowledge, we study these problems using tools from representation learning and group theory. Under the assumption that our actuators act upon the world, we propose methods to learn internal representations of not just sensory information but also of actions that modify our sensory representations in a way that is consistent with the actions and transitions in the world. We use an autoencoder equipped with a group representation linearly acting on its latent space, trained on 2-step reconstruction such as to enforce a suitable homomorphism property on the group representation. Compared to existing work, our approach makes fewer assumptions on the group representation and on which transformations the agent can sample from the group. We motivate our method theoretically, and demonstrate empirically that it can learn the correct representation of the groups and the topology of the environment. We also compare its performance in trajectory prediction with previous methods.
翻译:我们怎样才能获得世界模型,这些模型既代表着外部世界,也代表着我们的行动?我们能否通过与世界互动而获得这些模型?我们能否通过与世界互动而获得这些模型?我们能否说明数学偏斜,因为它们与我们头脑外存在的假设现实存在关系?随着机器学习转向不仅包含观察知识,而且包含干预知识的表述,我们如何利用代表性学习和群体理论的工具来研究这些问题。根据我们的表演者对世界采取行动的假设,我们提出方法,以了解不仅感官信息的内部表达方式,而且以与世界上的行动和转变相一致的方式改变我们的感官表现方式的行动的内部表述方式。我们使用一个自动编码器,配有在潜在空间上进行线性活动的团体代表,受过两步制重建培训,如在团体代表上实施适当的同质性财产。与现有工作相比,我们的方法对团体代表性和代理人可从团体中抽样的转变假设较少。我们从理论上激励我们的方法,并用经验证明它能够了解团体的正确表述方式和环境的表层学。我们还将其业绩与以往的轨迹对比。