Irregularly shaped text poses challenges for Scene Text Detection (STD). Although existing approaches based on contour point sequences achieve competitive performance, they fail to cover some highly curved, ribbon-like text lines. This limits their text-fitting ability and the applicability of STD techniques. To address this problem, we combine the geometric characteristics of text with bionics to design a natural leaf vein-based text representation method (LVT). Concretely, we observe that leaf venation is, in general, a directed graph that can easily cover various geometries. Inspired by this, we treat the text contour as a leaf margin and represent it through main, lateral, and thin veins. We further construct a detection framework based on LVT, namely LeafText. In the text reconstruction stage, LeafText simulates the leaf growth process to rebuild the text contour. It first grows the main vein in Cartesian coordinates to locate the text roughly. Lateral and thin veins are then generated along the main vein growth direction in polar coordinates; they are responsible for generating a coarse contour and refining it, respectively. Because the lateral and thin veins depend heavily on the main vein, we propose the Multi-Oriented Smoother (MOS) to enhance the robustness of the main vein and ensure a reliable detection result. Additionally, we propose a global incentive loss to accelerate the prediction of lateral and thin veins. Ablation experiments demonstrate that LVT is able to depict arbitrary-shaped text precisely and verify the effectiveness of MOS and the global incentive loss. Comparisons show that LeafText is superior to existing state-of-the-art (SOTA) methods on the MSRA-TD500, CTW1500, Total-Text, and ICDAR2015 datasets.
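The reconstruction stage described in the abstract (a main vein in Cartesian coordinates, then lateral and thin veins expressed in polar coordinates relative to the main vein growth direction) can be illustrated with a small geometric sketch. This is not the LeafText implementation: the function names, the input layout, the finite-difference estimate of the growth direction, and the choice of placing one lateral vein at +90° and one at -90° from that direction are all assumptions made for illustration. The abstract does not specify how many veins are used or at which angles.

```python
import math


def growth_directions(main_vein):
    """Estimate the growth direction (radians) at each main-vein point.

    Assumption: central finite differences between neighbouring points,
    falling back to one-sided differences at the two ends.
    Requires at least two points.
    """
    n = len(main_vein)
    angles = []
    for i in range(n):
        x0, y0 = main_vein[max(i - 1, 0)]
        x1, y1 = main_vein[min(i + 1, n - 1)]
        angles.append(math.atan2(y1 - y0, x1 - x0))
    return angles


def rebuild_contour(main_vein, lateral, thin=None):
    """Rebuild a closed contour polygon from a hypothetical vein encoding.

    main_vein: list of (x, y) points in Cartesian coordinates.
    lateral:   list of (r_left, r_right) polar radii per main-vein point,
               giving the coarse contour.
    thin:      optional list of (d_left, d_right) radius residuals that
               refine the coarse contour (assumed additive here).
    """
    if thin is None:
        thin = [(0.0, 0.0)] * len(main_vein)

    left, right = [], []
    directions = growth_directions(main_vein)
    for (x, y), phi, (rl, rr), (dl, dr) in zip(main_vein, directions, lateral, thin):
        rl, rr = rl + dl, rr + dr  # coarse radius plus refinement
        # Polar-to-Cartesian conversion, with angles measured relative
        # to the local growth direction phi (assumed +/- 90 degrees).
        left.append((x + rl * math.cos(phi + math.pi / 2),
                     y + rl * math.sin(phi + math.pi / 2)))
        right.append((x + rr * math.cos(phi - math.pi / 2),
                      y + rr * math.sin(phi - math.pi / 2)))

    # Walk along the left side, then back along the right side.
    return left + right[::-1]


if __name__ == "__main__":
    vein = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
    radii = [(1.0, 1.0)] * 3
    for point in rebuild_contour(vein, radii):
        print(point)
```

Expressing the contour as radii relative to the local growth direction means the same encoding applies however the main vein bends, which is one plausible reading of why the abstract pairs a Cartesian main vein with polar lateral and thin veins.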