对相似性和群体平等代表制的差别性学习和群体平等代表制 (Discriminative Learning of Similarity and Group Equivariant Representations)

from arxiv, PhD thesis, September 2018 [Previous version had a compile error that was spotted recently, which is fixed. The uploaded version is the final thesis that was submitted in September 2018]

One of the most fundamental problems in machine learning is to compare examples: Given a pair of objects we want to return a value which indicates degree of (dis)similarity. Similarity is often task specific, and pre-defined distances can perform poorly, leading to work in metric learning. However, being able to learn a similarity-sensitive distance function also presupposes access to a rich, discriminative representation for the objects at hand. In this dissertation we present contributions towards both ends. In the first part of the thesis, assuming good representations for the data, we present a formulation for metric learning that makes a more direct attempt to optimize for the k-NN accuracy as compared to prior work. We also present extensions of this formulation to metric learning for kNN regression, asymmetric similarity learning and discriminative learning of Hamming distance. In the second part, we consider a situation where we are on a limited computational budget i.e. optimizing over a space of possible metrics would be infeasible, but access to a label aware distance metric is still desirable. We present a simple, and computationally inexpensive approach for estimating a well motivated metric that relies only on gradient estimates, discussing theoretical and experimental results. In the final part, we address representational issues, considering group equivariant convolutional neural networks (GCNNs). Equivariance to symmetry transformations is explicitly encoded in GCNNs; a classical CNN being the simplest example. In particular, we present a SO(3)-equivariant neural network architecture for spherical data, that operates entirely in Fourier space, while also providing a formalism for the design of fully Fourier neural networks that are equivariant to the action of any continuous compact group.

翻译：机器学习中最根本的问题之一是比较实例: 我们想要返回一对表明(不同)差异程度的物体。相似性往往是任务特有的, 且预设的距离可能表现不佳, 导致计量学习。但是, 能够学习相似性敏感的距离功能, 也取决于对手边物体使用丰富的、歧视性的表达方式。在这份论文中, 我们为两端都展示了贡献。在论文的第一部分, 假设数据具有良好的表达方式, 我们提出了一个衡量学习的公式, 从而更直接地试图优化 k- NN 的精确度, 与先前的工作相比。相似性通常是任务特有的, 且预设的距离效果差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差差。但是, 在论文的第一部分中, 优化可能的测量空间空间的空间空间空间是完全不可行的, 但简单的测距度指标仍然是可取的。我们提出一个简单、计算成本的方法, 来估计一个具有良好动机的网络的货币变距性指标,, 仅基于变压结构的计算结构。