笔记脚本的消分和分解 (Denoising and Segmentation of Epigraphical Scripts)

This paper is a presentation of a new method for denoising images using Haralick features and further segmenting the characters using artificial neural networks. The image is divided into kernels, each of which is converted to a GLCM (Gray Level Co-Occurrence Matrix) on which a Haralick Feature generation function is called, the result of which is an array with fourteen elements corresponding to fourteen features The Haralick values and the corresponding noise/text classification form a dictionary, which is then used to de-noise the image through kernel comparison. Segmentation is the process of extracting characters from a document and can be used when letters are separated by white space, which is an explicit boundary marker. Segmentation is the first step in many Natural Language Processing problems. This paper explores the process of segmentation using Neural Networks. While there have been numerous methods to segment characters of a document, this paper is only concerned with the accuracy of doing so using neural networks. It is imperative that the characters be segmented correctly, for failing to do so will lead to incorrect recognition by Natural language processing tools. Artificial Neural Networks was used to attain accuracy of upto 89%. This method is suitable for languages where the characters are delimited by white space. However, this method will fail to provide acceptable results when the language heavily uses connected letters. An example would be the Devanagari script, which is predominantly used in northern India.

翻译：本文展示了使用 Haralick 特性解密图像的新方法, 以及使用人工神经网络进一步分割字符的新方法。图像分为内核, 每一个内核都转换为 GLCM (Gray level Co- Occurence 矩阵), 其中每个内核都转换为 GLCM (Gray level Co- Occurence 矩阵), 并使用 Haralick 特性生成功能, 其结果是一组14 个元素的阵列, 与14 个特性相对应 Haralick 值和相应的噪音/ 文本分类组成了字典, 用于通过内核比较来解密图像。分解是一个从文档中提取字符的过程, 由白色空间空间分隔工具将字符分离出来, 由白色空间分隔为明确标记。分解是许多自然语言的北页格式, 也就是用于北页格式的平整式格式, 用于北页格式的平坦度格式, 也就是用于北页缩式格式的平坦式格式, 。北面空间网络的平面格式是用于北端格式的平整式格式的平整式格式方法, 。此方法是可接受的平整式的平整式格式, 。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

Google-EfficientNet v2来了！更快，更小，更强！

专知会员服务

19+阅读 · 2021年4月4日

【经典书】图理论与应用，270页pdf

专知会员服务

86+阅读 · 2020年12月5日

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

专知会员服务

46+阅读 · 2020年7月29日