The dictionary learning problem concerns the task of representing data as sparse linear sums drawn from a smaller collection of basic building blocks. In application domains where such techniques are deployed, we frequently encounter datasets where some form of symmetry or invariance is present. Motivated by this observation, we develop a framework for learning dictionaries for data under the constraint that the collection of basic building blocks remains invariant under such symmetries. Our procedure for learning such dictionaries relies on representing the symmetry as the action of a matrix group acting on the data, and subsequently introducing a convex penalty function so as to induce sparsity with respect to the collection of matrix group elements. Our framework specializes to the convolutional dictionary learning problem when we consider integer shifts. Using properties of positive semidefinite Hermitian Toeplitz matrices, we develop an extension that learns dictionaries that are invariant under continuous shifts. Our numerical experiments on synthetic data and ECG data show that the incorporation of such symmetries as priors are most valuable when the dataset has few data-points, or when the full range of symmetries is inadequately expressed in the dataset.
翻译:字典学习问题涉及将数据作为从一个较小的基本构件库收集的微小的线性总和来代表数据的任务。在使用这类技术的应用领域,我们经常遇到存在某种形式的对称或变异的数据集。根据这一观察,我们开发了一个数据学习词典的框架,其制约是,基本构件的收集在这种对称之下始终是无变的。我们学习这种词典的程序依赖于将对称作为根据数据采取行动的矩阵组的动作,并随后引入 convex惩罚功能,以诱导对矩阵组元素的收集产生松散。我们的框架在考虑整变时专门研究进字典学习问题。我们使用正半成半成的Hermitian Toeplitz矩阵的属性,我们开发一个扩展功能,以学习在连续变换中具有变异性的词典。我们关于合成数据和ECG数据的数字实验显示,在数据集仅有少量数据点时,或者在完整显示的数据集不完全范围时,将这种对等的配最有价值。