手写文本图像中以词汇为基础的关键词点点的概率框架 (A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images)

Query by String Keyword Spotting (KWS) is here considered as a key technology for indexing large collections of handwritten text images to allow fast textual access to the contents of these collections. Under this perspective, a probabilistic framework for lexicon-based KWS in text images is presented. The presentation aims at providing a tutorial view that helps to understand the relations between classical statements of KWS and the relative challenges entailed by these statements. More specifically, the development of the proposed framework makes it self-evident that word recognition or classification implicitly or explicitly underlies any formulation of KWS. Moreover, it clearly suggests that the same statistical models and training methods successfully used for handwriting text recognition can advantageously be used also for KWS, even though KWS does not generally require or rely on any kind of previously produced image transcripts. These ideas are developed into a specific, probabilistically sound approach for segmentation-free, lexicon-based, query-by-string KWS. Experiments carried out using this approach are presented, which support the consistency and general interest of the proposed framework. Several datasets, traditionally used for KWS benchmarking are considered, with results significantly better than those previously published for these datasets. In addition, results on two new, larger handwritten text image datasets are reported, showing the great potential of the methods proposed in this paper for indexing and textual search in large collections of handwritten documents.

翻译：由字符串关键字 Spoteting (KWS) 查询此处被视为一种关键技术,用于对手写文本图像的大量收集进行索引化,以便快速获取这些收藏的内容。从这个角度来看,在文本图像中为基于词汇的 KWS 提供了一种概率框架。演示的目的是提供一种指导性观点,帮助理解KWS 传统语语句与这些发言带来的相对挑战之间的关系。更具体地说,拟议框架的制定使使用文字识别或分类隐含或明确地为 KWS 的任何配方提供了隐含或明确的基础。此外,它明确表明,为笔迹文本识别成功使用的相同的统计模式和培训方法也可以用于KWSS, 即使KWS通常不要求或依赖以前制作的图像记录的任何类型。这些想法被发展成一种具体的、概率化的可靠方法, 用于无分类、以词汇为基础、逐字框键 KWS 。使用这种方法进行的实验可以支持拟议框架的一致性和一般兴趣。此外, 某些数据集, 传统上用于KWS 的大型图像基准,, 用于先前报告的大型图表的结果, 被被被认为是两种格式格式。