使用多任务学习的多手写数字识别 (Multi-script Handwritten Digit Recognition Using Multi-task Learning)

Handwritten digit recognition is one of the extensively studied area in machine learning. Apart from the wider research on handwritten digit recognition on MNIST dataset, there are many other research works on various script recognition. However, it is not very common for multi-script digit recognition which encourage the development of robust and multipurpose systems. Additionally working on multi-script digit recognition enables multi-task learning, considering the script classification as a related task for instance. It is evident that multi-task learning improves model performance through inductive transfer using the information contained in related tasks. Therefore, in this study multi-script handwritten digit recognition using multi-task learning will be investigated. As a specific case of demonstrating the solution to the problem, Amharic handwritten character recognition will also be experimented. The handwritten digits of three scripts including Latin, Arabic and Kannada are studied to show that multi-task models with reformulation of the individual tasks have shown promising results. In this study a novel way of using the individual tasks predictions was proposed to help classification performance and regularize the different loss for the purpose of the main task. This finding has outperformed the baseline and the conventional multi-task learning models. More importantly, it avoided the need for weighting the different losses of the tasks, which is one of the challenges in multi-task learning.

翻译：手写数字识别是机器学习中广泛研究的领域之一。除了对MNIST数据集手写数字识别进行更广泛的研究外,还有许多其他关于各种脚本识别的研究。然而,对于鼓励发展强大和多功能系统的多手写数字识别并不常见。此外,多手写数字识别工作有助于多任务学习,将脚本分类视为一项相关任务。很显然,多任务学习通过使用相关任务中所含的信息进行感化传输,提高了模型性能。因此,将调查这项研究中使用多任务学习的多手写数字识别的多手写数字。作为展示问题解决方案的一个具体实例,Amharic手写字符识别也将进行实验。对包括拉丁、阿拉伯语和Kannada在内的三种脚本的手写数字进行了研究,以显示重塑各项任务的多任务多任务多任务模式显示了有希望的结果。在本研究中,提出了一种使用个人任务预测来帮助对业绩进行分类并规范不同损失,用于主要任务的目的。这一发现,在展示问题的解决方案中,“最大程度”的基线和“多任务”中,“最重的学习需要的重的多任务。