Embracing the deep learning techniques for representation learning in clustering research has attracted broad attention in recent years, yielding a newly developed clustering paradigm, viz. the deep clustering (DC). Typically, the DC models capitalize on autoencoders to learn the intrinsic features which facilitate the clustering process in consequence. Nowadays, a generative model named variational autoencoder (VAE) has got wide acceptance in DC studies. Nevertheless, the plain VAE is insufficient to perceive the comprehensive latent features, leading to the deteriorative clustering performance. In this paper, a novel DC method is proposed to address this issue. Specifically, the generative adversarial network and VAE are coalesced into a new autoencoder called fusion autoencoder (FAE) for discerning more discriminative representation that benefits the downstream clustering task. Besides, the FAE is implemented with the deep residual network architecture which further enhances the representation learning ability. Finally, the latent space of the FAE is transformed to an embedding space shaped by a deep dense neural network for pulling away different clusters from each other and collapsing data points within individual clusters. Experiment conducted on several image datasets demonstrate the effectiveness of the proposed DC model against the baseline methods.
翻译:近年来,在集群研究中深入学习代表性学习技术已引起广泛关注,产生了新开发的集群模式,即深度集群(DC)。通常,DC模型利用自动编码器学习有助于集群进程的内在特征。如今,一个名为变式自动编码器(VAE)的基因化模型在DC的研究中得到广泛接受。然而,普通VAE不足以看到综合潜伏特征,从而导致不良集群的性能。在本文中,提出了一个新的DC方法来解决这一问题。具体地说,基因式对称网络和VAE被合并成一个新的自动编码器,称为聚变自动编码器(FAE),以识别有利于下游集群工作的更具有歧视性的代表性。此外,FAE在采用深残余网络结构,以进一步提高代表性学习能力。最后,FAE的潜在空间被转换为嵌入空间,由一个深密度的神经网络形成,用以将不同的集群从不同的集群中拉走,并在单个集群中断裂数据点。对几个图像集系进行了实验,以显示拟议的DC基线方法的有效性。