Unsupervised representation learning (URL) that learns compact embeddings of high-dimensional data without supervision has achieved remarkable progress recently. Although the ultimate goal of URLs is similar across various scenarios, the related algorithms differ widely in different tasks because they were separately designed according to a specific URL task or data. For example, dimension reduction methods, t-SNE, and UMAP, optimize pair-wise data relationships by preserving the global geometric structure, while self-supervised learning, SimCLR, and BYOL, focus on mining the local statistics of instances under specific augmentations. From a general perspective, we summarize and propose a unified similarity-based URL framework, GenURL, which can adapt to various URL tasks smoothly and efficiently. Based on the manifold assumption, we regard URL tasks as different implicit constraints on the data geometric structure or content that help to seek an optimal low-dimensional representation for the high-dimensional data. Therefore, our method has two key steps to learning task-agnostic representation in URL: (1) data structural modeling and (2) low-dimensional transformation. Specifically, (1) provides a simple yet effective graph-based submodule to model data structures adaptively with predefined or constructed graphs; and based on data-specific pretext tasks, (2) learns compact low-dimensional embeddings. Moreover, (1) and (2) are successfully connected and benefit mutually through our novel objective function. Our comprehensive experiments demonstrate that GenURL achieves consistent state-of-the-art performance in self-supervised visual representation learning, unsupervised knowledge distillation, graph embeddings, and dimension reduction.
翻译:虽然URL的最终目的在不同情景中相似,但相关的算法在不同的任务中差异很大,因为它们是根据具体的URL任务或数据分别设计的。例如,维度减少方法、t-SNE和UMAP,通过维护全球几何结构优化双向数据关系,而自我监督的学习,SimCLR和BYOL,则侧重于挖掘特定增强系统下实例的当地统计数据。从一般角度看,我们总结和提议一个统一的类似URL框架,GenURL, 它可以顺利和有效地适应各种URL任务。根据多方面假设,我们认为,URL任务是数据几何结构或内容的不同隐含限制,有助于为高维数据寻求最佳的低维代表。因此,我们的方法有两个关键步骤,可以学习URstill Study-nority 代表 URlational, 数据结构中的数据结构建模和低维度变换。具体地,我们提供了一个简单、但有效的以自制的基于图形的直径直径直径代表制的缩缩缩缩缩缩缩缩缩缩缩缩缩缩缩缩略图。