Extensive work has demonstrated that equivariant neural networks can significantly improve sample efficiency and generalization by enforcing an inductive bias in the network architecture. These applications typically assume that the domain symmetry is fully described by explicit transformations of the model inputs and outputs. However, many real-world applications contain only latent or partial symmetries that cannot be easily described by simple transformations of the input. In these cases, the symmetry must be learned from the environment rather than imposed mathematically on the network architecture. Surprisingly, we discover that imposing equivariance constraints that do not exactly match the domain symmetry is very helpful for learning the true symmetry in the environment. We distinguish between extrinsic and incorrect symmetry constraints and show that while imposing an incorrect symmetry can impede the model's performance, imposing an extrinsic symmetry can actually improve it. We demonstrate that an equivariant model can significantly outperform non-equivariant methods on domains with latent symmetries, both in supervised learning and in reinforcement learning for robotic manipulation and control problems.
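(The abstract does not specify an implementation; as a hedged illustration of what "enforcing an inductive bias in the network architecture" can mean, the sketch below implements a C4 (90-degree rotation) lifting convolution in PyTorch, in the style of group-equivariant CNNs (Cohen & Welling, 2016). The class name `C4LiftingConv` and all hyperparameters are illustrative choices, not code from this paper.)

```python
# A minimal sketch of an architecturally imposed rotation-equivariance
# constraint: a C4-equivariant "lifting" convolution built by weight
# sharing across the four 90-degree rotations of each filter.
import torch
import torch.nn as nn
import torch.nn.functional as F


class C4LiftingConv(nn.Module):
    """Lifts a 2D image to a feature map over C4 x Z^2.

    The output carries one plane per filter rotation, so rotating the
    input by 90 degrees rotates the output spatially and cyclically
    shifts the rotation axis, rather than producing unrelated features.
    """

    def __init__(self, in_channels, out_channels, kernel_size=3, padding=1):
        super().__init__()
        self.weight = nn.Parameter(
            torch.randn(out_channels, in_channels, kernel_size, kernel_size) * 0.1
        )
        self.padding = padding

    def forward(self, x):
        # Stack the filter bank and its three 90-degree rotations: the
        # constraint is enforced by construction, not learned from data.
        filters = torch.cat(
            [torch.rot90(self.weight, k, dims=(2, 3)) for k in range(4)], dim=0
        )
        out = F.conv2d(x, filters, padding=self.padding)
        b, _, h, w = out.shape
        # Reshape to (batch, 4 rotations, out_channels, height, width).
        return out.view(b, 4, -1, h, w)


if __name__ == "__main__":
    layer = C4LiftingConv(3, 8)
    x = torch.randn(1, 3, 32, 32)           # square input, "same" padding
    y = layer(x)                             # (1, 4, 8, 32, 32)
    y_rot = layer(torch.rot90(x, 1, dims=(2, 3)))
    # Equivariance check: rotating the input rotates each output map
    # and shifts the rotation index by one.
    expected = torch.rot90(torch.roll(y, shifts=1, dims=1), 1, dims=(3, 4))
    assert torch.allclose(y_rot, expected, atol=1e-5)
```

Stacking such layers (with group convolutions over C4 x Z^2) yields a fully rotation-equivariant network; libraries such as e2cnn/escnn implement this construction for more general symmetry groups. The paper's point is that a constraint of this kind can help even when the true domain symmetry is only latent and does not exactly match it.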