Deep clustering aims to learn a clustering representation through deep architectures. Most of the existing methods usually conduct clustering with the unique goal of maximizing clustering performance, that ignores the personalized demand of clustering tasks.% and results in unguided clustering solutions. However, in real scenarios, oracles may tend to cluster unlabeled data by exploiting distinct criteria, such as distinct semantics (background, color, object, etc.), and then put forward personalized clustering tasks. To achieve task-aware clustering results, in this study, Oracle-guided Contrastive Clustering(OCC) is then proposed to cluster by interactively making pairwise ``same-cluster" queries to oracles with distinctive demands. Specifically, inspired by active learning, some informative instance pairs are queried, and evaluated by oracles whether the pairs are in the same cluster according to their desired orientation. And then these queried same-cluster pairs extend the set of positive instance pairs for contrastive learning, guiding OCC to extract orientation-aware feature representation. Accordingly, the query results, guided by oracles with distinctive demands, may drive the OCC's clustering results in a desired orientation. Theoretically, the clustering risk in an active learning manner is given with a tighter upper bound, that guarantees active queries to oracles do mitigate the clustering risk. Experimentally, extensive results verify that OCC can cluster accurately along the specific orientation and it substantially outperforms the SOTA clustering methods as well. To the best of our knowledge, it is the first deep framework to perform personalized clustering.
翻译:深度集聚的目的是通过深层结构学习集群代表。 大多数现有方法通常以最大集聚性业绩的独特目标进行集群组合,忽视组合任务的个人化需求。% 并导致非引导的群集解决方案。 但是,在真实的情景下,甲骨文可能会通过使用不同的语义(背景、颜色、对象等)等不同标准,例如不同的语义(背景、颜色、对象等),然后提出个性化组合任务。为了实现任务识别组合结果,本研究中,甲骨文引导的深度对比集群(OCC)随后建议通过互动,对有独特需求的甲骨文进行配对的询问。具体地说,在积极学习的启发下,一些信息性实例配对可能会通过使用不同的标准(例如不同的语义(背景、颜色、对象等)将数据组合组合组合组合集中在一起,然后提出个个性化的组合任务。为了实现任务识别组合结果,指导奥骨质引导的深度群集群集(OCC)通过互动的对质化需求或触摸底的查询结果, 组织核心核心核心的分组,可以使组织更精确地、更精确的群集、更精确地核查。