With the widespread use of touch-screen devices, it is more and more convenient for people to draw sketches on screen. This results in the demand for automatically understanding the sketches. Thus, the sketch recognition task becomes more significant than before. To accomplish this task, it is necessary to solve the critical issue of improving the distinction of the sketch features. To this end, we have made efforts in three aspects. First, a novel multi-scale residual block is designed. Compared with the conventional basic residual block, it can better perceive multi-scale information and reduce the number of parameters during training. Second, a hierarchical residual structure is built by stacking multi-scale residual blocks in a specific way. In contrast with the single-level residual structure, the learned features from this structure are more sufficient. Last but not least, the compact triplet-center loss is proposed specifically for the sketch recognition task. It can solve the problem that the triplet-center loss does not fully consider too large intra-class space and too small inter-class space in sketch field. By studying the above modules, a hierarchical residual network as a whole is proposed for sketch recognition and evaluated on Tu-Berlin benchmark thoroughly. The experimental results show that the proposed network outperforms most of baseline methods and it is excellent among non-sequential models at present.
翻译:通过广泛使用触摸屏障设备,人们越来越方便地在屏幕上绘制草图。 这导致对草图的自动理解需求。 因此, 草图识别任务比以前更加重要。 为了完成这项任务, 有必要解决改进草图特征区别的关键问题。 为此, 我们已在三个方面作出努力。 首先, 设计了一个新的多尺度的多尺度残留块。 与传统的基本残留块相比, 它可以更好地看到多尺度的信息, 并减少培训中的参数数量。 其次, 通过以特定方式堆叠多尺度残余块来建立等级残余结构。 与单级残余结构相比, 从这一结构中学习到的特征更为充分。 最后但并非最不重要的是, 缩略三点损失是专门为素描识别任务提议的。 它可以解决三重中位损失没有充分考虑过大内部空间和在素描领域过小的等级间空间的问题。 通过研究上述模块, 将等级残余网络作为一个整体, 提议在图贝林最基本网络上进行素描的素描识别和评估。