Capsule networks were proposed as an alternative approach to Convolutional Neural Networks (CNNs) for learning object-centric representations, which can be leveraged for improved generalization and sample complexity. Unlike CNNs, capsule networks are designed to explicitly model part-whole hierarchical relationships by using groups of neurons to encode visual entities, and learn the relationships between those entities. Promising early results achieved by capsule networks have motivated the deep learning community to continue trying to improve their performance and scalability across several application areas. However, a major hurdle for capsule network research has been the lack of a reliable point of reference for understanding their foundational ideas and motivations. The aim of this survey is to provide a comprehensive overview of the capsule network research landscape, which will serve as a valuable resource for the community going forward. To that end, we start with an introduction to the fundamental concepts and motivations behind capsule networks, such as equivariant inference in computer vision. We then cover the technical advances in the capsule routing mechanisms and the various formulations of capsule networks, e.g. generative and geometric. Additionally, we provide a detailed explanation of how capsule networks relate to the popular attention mechanism in Transformers, and highlight non-trivial conceptual similarities between them in the context of representation learning. Afterwards, we explore the extensive applications of capsule networks in computer vision, video and motion, graph representation learning, natural language processing, medical imaging and many others. To conclude, we provide an in-depth discussion regarding the main hurdles in capsule network research, and highlight promising research directions for future work.
翻译:提议建立胶囊网络,作为革命神经网络的替代方法,以学习以物体为中心的表达方式,这可以用来改进一般化和抽样复杂性。与CNN不同,胶囊网络设计的目的是通过使用神经神经组群对视觉实体进行编码,并学习这些实体之间的关系,来明确模拟全层的等级关系。胶囊网络的早期结果有望推动深层学习界继续设法提高它们在若干应用领域的性能和可缩缩放性。然而,胶囊网络研究的一个主要障碍是缺乏了解其基本想法和动机的可靠参照点。这次调查的目的是提供一个对胶囊网络研究前景的全面概览,这将为社区向前发展提供宝贵的资源。为此,我们首先介绍胶囊网络的基本概念和动机,例如计算机视觉的变异性推断。我们接着介绍胶囊路机制的技术进步以及胶囊网络的未来前景,例如,基因化和几何测量。此外,我们详细解释胶囊网络的主要结构网络是如何在模型处理中,我们从模型学的深度研究中,我们从模型学的深度研究中,从模型研究中,从模型研究中,从深度研究中,从深度研究其他网络的深度研究中,从深度研究中,我们从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从深度研究中,从中,从深度研究中,从深度研究中,从深度研究中,到深度研究,从深度研究,从深度研究中,从深度研究中,从深度研究,从深度研究,到深度研究,从深度研究,从深度研究,从深度研究,从深度研究,到深度研究,从深度研究,到深度研究,到深度研究,从深度研究,从深度研究中,从深度研究中,到深度研究中,到深度研究,从深度研究,从深度研究,从深度研究中,从深度研究中,从深度研究,从深度研究中,从深度研究中,从深度研究中,从深度研究中,到深度研究中,到深度研究,从深度研究,从深度研究,从深度研究,从深度研究,从