An effective unsupervised hashing algorithm leads to compact binary codes preserving the neighborhood structure of data as much as possible. One of the most established schemes for unsupervised hashing is to reduce the dimensionality of data and then find a rigid (neighbourhood-preserving) transformation that reduces the quantization error. Although employing rigid transformations is effective, we may not reduce quantization loss to the ultimate limits. As well, reducing dimensionality and quantization loss in two separate steps seems to be sub-optimal. Motivated by these shortcomings, we propose to employ both rigid and non-rigid transformations to reduce quantization error and dimensionality simultaneously. We relax the orthogonality constraint on the projection in a PCA-formulation and regularize this by a quantization term. We show that both the non-rigid projection matrix and rotation matrix contribute towards minimizing quantization loss but in different ways. A scalable nested coordinate descent approach is proposed to optimize this mixed-integer optimization problem. We evaluate the proposed method on five public benchmark datasets providing almost half a million images. Comparative results indicate that the proposed method mostly outperforms state-of-art linear methods and competes with end-to-end deep solutions.
翻译:有效的、不受监督的散列算法导致尽可能保护数据周围结构的简单二进制代码。最老化的未经监督的散列法方案之一是减少数据的维度,然后发现僵硬的(邻里保留)变形,以减少量化错误。虽然采用僵硬的变形是有效的,但我们可能不会将量化损失降低到最终极限。同样,在两个不同的步骤中降低维度和量化损失似乎也是次最佳的。受这些缺点的驱动,我们提议同时采用硬性和非硬性变形,以减少量化错误和维度。我们提议采用硬性变形和非硬性变形法,以同时减少数据的维度。我们放松对五氯苯甲醚成形的投影的或定性限制,并通过四分化术语来规范这一变形。我们表明,非硬性预测矩阵和旋转矩阵都有助于尽量减少量化损失,但以不同的方式。提议一个可缩缩放的嵌套式血模型,以优化这一混合内位优化问题。我们提议采用五个公共基准数据集的拟议方法来减少定量和非硬性变式的变形方法,提供近一半的直线式图像。比较显示,以近半的升级方法将产生的结果。