A deep clustering network is desired for data streams because of its aptitude in extracting natural features thus bypassing the laborious feature engineering step. While automatic construction of the deep networks in streaming environments remains an open issue, it is also hindered by the expensive labeling cost of data streams rendering the increasing demand for unsupervised approaches. This paper presents an unsupervised approach of deep clustering network construction on the fly via simultaneous deep learning and clustering termed Autonomous Deep Clustering Network (ADCN). It combines the feature extraction layer and autonomous fully connected layer in which both network width and depth are self-evolved from data streams based on the bias-variance decomposition of reconstruction loss. The self-clustering mechanism is performed in the deep embedding space of every fully connected layer while the final output is inferred via the summation of cluster prediction score. Further, a latent-based regularization is incorporated to resolve the catastrophic forgetting issue. A rigorous numerical study has shown that ADCN produces better performance compared to its counterparts while offering fully autonomous construction of ADCN structure in streaming environments with the absence of any labeled samples for model updates. To support the reproducible research initiative, codes, supplementary material, and raw results of ADCN are made available in \url{https://tinyurl.com/AutonomousDCN}.
翻译:虽然在流环境中自动建造深网络仍然是一个尚未解决的问题,但自我集束机制是在每一个完全连接层的深嵌空间内实施的,而最终产出则通过组合预测得分的加和推算得出。此外,基于潜伏的规范化也被纳入解决灾难性的遗忘问题。一项严格的数字研究表明,ADCN与对应方相比产生更好的性能,同时提供在流环境中完全自主地建造ADCN结构,同时没有任何标记的CN样本更新模型。