Online deep clustering refers to the joint use of a feature extraction network and a clustering model to assign cluster labels to each new data point or batch as it is processed. While faster and more versatile than offline methods, online clustering can easily collapse to a degenerate solution in which the encoder maps all inputs to the same point and every sample is assigned to a single cluster. Successful existing models employ various techniques to avoid this problem, most of which either require data augmentation or aim to make the average soft assignment across the dataset uniform over clusters. We propose a method that requires no data augmentation and that, unlike existing methods, regularizes the hard assignments. Using a Bayesian framework, we derive an intuitive optimization objective that can be straightforwardly incorporated into the training of the encoder network. Tested on four image datasets, our method avoids collapse more robustly than other methods and leads to more accurate clustering. We also conduct further experiments and analyses justifying our choice to regularize the hard cluster assignments.