Equivariance has emerged as a desirable property of representations of objects subject to identity-preserving transformations that constitute a group, such as translations and rotations. However, the expressivity of a representation constrained by group equivariance is still not fully understood. We address this gap by providing a generalization of Cover's Function Counting Theorem that quantifies the number of linearly separable and group-invariant binary dichotomies that can be assigned to equivariant representations of objects. We find that the fraction of separable dichotomies is determined by the dimension of the space that is fixed by the group action. We show how this relation extends to operations such as convolutions, element-wise nonlinearities, and global and local pooling. While other operations do not change the fraction of separable dichotomies, local pooling decreases the fraction, despite being a highly nonlinear operation. Finally, we test our theory on intermediate representations of randomly initialized and fully trained convolutional neural networks and find perfect agreement.
翻译:受身份保留变化影响的物体构成一个群体,例如翻译和旋转,其表达形式已作为一种可取的属性出现。然而,由于群体差异而受群体差异限制的表达形式仍不完全理解。我们通过对封面函数数的概括化理论来弥补这一差距,该理论将线性分解和组性异性二进制二进制数量量化,可分配给对象的对等表达方式。我们发现,分立二进制的分母部分是由群体行动所确定的空间的维度决定的。我们展示这种关系如何延伸到共变、元素非线性以及全球和本地集合等业务。虽然其他业务并不改变分解分解的分解分数,但局部集合会减少碎片,尽管其作用高度非线性。最后,我们检验我们关于随机初始化和经过充分训练的神经网络的中间表达方式的理论,并找到完全一致。