Spherical signals are useful mathematical models for data arising in many 3-D applications such as LIDAR images, panorama cameras, and optical scanners. Successful processing of spherical signals entails architectures capable of exploiting their inherent data structure. In particular, spherical convolutional neural networks (Spherical CNNs) have shown promising performance in shape analysis and object recognition. In this paper, we focus on analyzing the properties that Spherical CNNs exhibit as they pertain to the rotational structure present in spherical signals. More specifically, we prove that they are equivariant to rotations and stable to rotation diffeomorphisms. These two properties illustrate how Spherical CNNs exploit the rotational structure of spherical signals, thus offering good generalization and faster learning. We corroborate these properties through controlled numerical experiments.
翻译:球形信号是许多三维应用中产生的数据的有用的数学模型,如LIDAR图像、全景照相机和光学扫描仪。球状信号的成功处理需要能够利用其固有数据结构的架构。特别是球状神经网络(球状CNN)在形状分析和物体识别方面表现良好。在本文中,我们侧重于分析球状CNN所展示的与球状信号所存在的旋转结构有关的特性。更具体地说,我们证明它们与旋转具有等同性,稳定于旋转的二面形。这两个特性说明了球状CNN如何利用球状信号的旋转结构,从而提供了良好的概括和更快的学习。我们通过受控的数字实验来验证这些特性。