This work addresses scaling up the sketch classification task into a large number of categories. Collecting sketches for training is a slow and tedious process that has so far precluded any attempts to large-scale sketch recognition. We overcome the lack of training sketch data by exploiting labeled collections of natural images that are easier to obtain. To bridge the domain gap we present a novel augmentation technique that is tailored to the task of learning sketch recognition from a training set of natural images. Randomization is introduced in the parameters of edge detection and edge selection. Natural images are translated to a pseudo-novel domain called "randomized Binary Thin Edges" (rBTE), which is used as a training domain instead of natural images. The ability to scale up is demonstrated by training CNN-based sketch recognition of more than 2.5 times larger number of categories than used previously. For this purpose, a dataset of natural images from 874 categories is constructed by combining a number of popular computer vision datasets. The categories are selected to be suitable for sketch recognition. To estimate the performance, a subset of 393 categories with sketches is also collected.
翻译:这项工作旨在将草图分类任务扩大为大量类别。 收集用于培训的草图是一个缓慢和烦琐的过程,迄今为止,它排除了任何大规模草图识别的尝试。 我们通过利用标签的自然图像收藏来克服缺乏培训草图数据的问题, 利用标签的自然图像收藏更容易获得。 为了缩小领域差距, 我们展示了一种新的增强技术, 用于学习从一组培训自然图像中获取草图识别的任务。 在边缘探测和边缘选择参数中引入随机化。 自然图像被翻译成一个假小说域, 称为“ 随机化的Bin- Thin Edges ” (rBTE), 用作培训领域, 而不是自然图像。 通过培训CNN的草图识别比以前使用的类别多2.5倍以上, 展示了扩展能力。 为此, 将一系列流行的计算机视觉数据集组合起来, 将自然图像的数据集构建为874 。 选择的类别适合用于素描识别。 为了估计性, 也收集了393个类别和草图。