Despite being studied extensively for a few decades, handwritten character recognition (HCR) is still considered a challenging learning problem in pattern recognition, and there is very limited research on script independent models. This is mainly because of similarity in structure of characters, different handwriting styles, noisy datasets, diversity of scripts, focus of the conventional research on handcrafted feature extraction techniques, and unavailability of public datasets and code-repositories to reproduce the results. On the other hand, deep learning has witnessed huge success in different areas of pattern recognition, including HCR, and provides an end-to-end learning. However, deep learning techniques are computationally expensive, need large amount of data for training and have been developed for specific scripts only. To address the above limitations, we have proposed a novel generic deep learning architecture for script independent handwritten character recognition, called HCR-Net. HCR-Net is based on a novel transfer learning approach for HCR, which partly utilizes feature extraction layers of a pre-trained network. Due to transfer learning and image-augmentation, HCR-Net provides faster and computationally efficient training, better performance and better generalizations, and can work with small datasets. HCR-Net is extensively evaluated on 40 publicly available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages, and established 26 new benchmark results while performed close to the best results in the rest cases. HCR-Net showed performance improvements up to 11% against the existing results and achieved a fast convergence rate showing up to 99% of final performance in the very first epoch. HCR-Net significantly outperformed the state-of-the-art transfer learning techniques...
翻译:尽管经过几十年的广泛研究,手写字符识别(HCR)仍被视为模式识别中一个具有挑战性的学习问题,而且对脚本独立模型的研究非常有限。这主要是由于字符结构相似、笔迹风格不同、数据集噪音、脚本多样性、传统研究侧重于手写特征提取技术、没有公共数据集和代码存储器来复制结果。另一方面,深层次学习在模式识别的不同领域,包括HCR在内,仍被视为一个具有挑战性的学习问题,在模式识别方面是一个具有挑战性的学习问题,并且提供了端到端学习。然而,深层次学习技术在计算上非常昂贵,需要大量数据用于培训,并且仅针对特定的脚本。为了解决上述局限性,我们提出了一个新的通用深层次学习架构,用于手写特征提取技术,称为 HCR-Net。 HCRR-Net基于一种新型的传输学习方法,部分地利用了26级前阿拉伯网络的最终基准提取层。由于将学习和图像升级, HCR-Net提供快速和精确的学习,而现有的40级学习结果显示快速和高层次的学习,而快速的成绩和高层次数据在HCRRDRDR,在快速数据和普遍评估中可以展示现有数据。