Curriculum learning is a bio-inspired training technique that is widely adopted to machine learning for improved optimization and better training of neural networks regarding the convergence rate or obtained accuracy. The main concept in curriculum learning is to start the training with simpler tasks and gradually increase the level of difficulty. Therefore, a natural question is how to determine or generate these simpler tasks. In this work, we take inspiration from Spatial Transformer Networks (STNs) in order to form an easy-to-hard curriculum. As STNs have been proven to be capable of removing the clutter from the input images and obtaining higher accuracy in image classification tasks, we hypothesize that images processed by STNs can be seen as easier tasks and utilized in the interest of curriculum learning. To this end, we study multiple strategies developed for shaping the training curriculum, using the data generated by STNs. We perform various experiments on cluttered MNIST and Fashion-MNIST datasets, where on the former, we obtain an improvement of $3.8$pp in classification accuracy compared to the baseline.
翻译:课程学习是一种生物激励型培训技术,广泛用于机器学习,以便改善神经网络在趋同率方面的优化和更好培训,或获得准确性。课程学习的主要概念是开始培训,执行更简单的任务,逐步增加难度。因此,一个自然的问题是如何确定或产生这些更简单的任务。在这项工作中,我们从空间变换网络(STNs)中汲取灵感,以便形成易于操作的课程。由于科技网已证明能够消除输入图像的杂乱和图像分类任务的更准确性,我们低估了科技网所处理的图像可以被视为较容易的任务,并用于课程学习。为此目的,我们研究利用科技网生成的数据制定培训课程的多种战略。我们进行了各种关于结实的MNIST和Fashason-MNIST数据集的实验,在前者中,我们在分类精度方面比基线提高了3.8pp。