Cursive handwritten text recognition is a challenging research problem in the domain of pattern recognition. The current state-of-the-art approaches include models based on convolutional recurrent neural networks and multi-dimensional long short-term memory recurrent neural networks techniques. These methods are highly computationally extensive as well model is complex at design level. In recent studies, combination of convolutional neural network and gated convolutional neural networks based models demonstrated less number of parameters in comparison to convolutional recurrent neural networks based models. In the direction to reduced the total number of parameters to be trained, in this work, we have used depthwise convolution in place of standard convolutions with a combination of gated-convolutional neural network and bidirectional gated recurrent unit to reduce the total number of parameters to be trained. Additionally, we have also included a lexicon based word beam search decoder at testing step. It also helps in improving the the overall accuracy of the model. We have obtained 3.84% character error rate and 9.40% word error rate on IAM dataset; 4.88% character error rate and 14.56% word error rate in George Washington dataset, respectively.
翻译:手写曲线文字识别是模式识别领域一个具有挑战性的研究问题。当前最先进的方法包括基于超常神经网络和多维长期内存中枢神经网络技术的模型。这些方法在计算上非常广泛,在设计层面也非常复杂。在最近的研究中,以超动神经网络和闭门神经神经网络模型为基础的模型组合显示的参数数少于以动态经常性神经网络为基础的模型。为了减少需要培训的参数总数,在这项工作中,我们使用深度共变模型,以取代标准共变模型,同时结合了门式共变神经网络和双向型内存中继单元,以减少需要培训的参数总数。此外,我们还在测试阶段纳入了基于单词的单词波搜索解码。还有助于提高模型的总体准确性。我们获得了3.84%的字符错误率和9.40%的单词错误率,在IAM数据集中,我们获得了4.88%的字符错误率和14.56%的华盛顿数据中,分别采用了14.56 %的字差率。