通过形式攻击对基于变压器形式的实地抽取器进行强力评价 (Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks) - 专知论文

会员服务 ·

0

稳健性 · 光学字符识别 · 得分 · Performer · state-of-the-art ·

2021 年 10 月 8 日

Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks

翻译：通过形式攻击对基于变压器形式的实地抽取器进行强力评价

Le Xue,Mingfei Gao,Zeyuan Chen,Caiming Xiong,Ran Xu

We propose a novel framework to evaluate the robustness of transformer-based form field extraction methods via form attacks. We introduce 14 novel form transformations to evaluate the vulnerability of the state-of-the-art field extractors against form attacks from both OCR level and form level, including OCR location/order rearrangement, form background manipulation and form field-value augmentation. We conduct robustness evaluation using real invoices and receipts, and perform comprehensive research analysis. Experimental results suggest that the evaluated models are very susceptible to form perturbations such as the variation of field-values (~15% drop in F1 score), the disarrangement of input text order(~15% drop in F1 score) and the disruption of the neighboring words of field-values(~10% drop in F1 score). Guided by the analysis, we make recommendations to improve the design of field extractors and the process of data collection.

翻译：我们提出一个新的框架,以评价以变压器为基础的以形式为基础的实地抽取方法通过形式攻击的稳健性。我们引入了14种新形式变换,以评价最先进的实地抽取器在从OCR级别和形式级别(包括OCR位置/顺序重新安排、背景操作和外地价值增殖形式)受到形式攻击的脆弱性。我们利用真实的发票和收据进行稳健性评价,并进行全面的研究分析。实验结果表明,经评估的模型非常容易形成干扰,例如外地价值的变化(F1分下降15%)、输入文本顺序脱序(F1分下降15%)和外地价值相邻词的中断(F1分下降10% ) 。根据分析,我们提出了改进外地提取器设计和数据收集过程的建议。

0

相关内容

稳健性

近期必读的6篇顶会CVPR 2021【对抗攻击】相关论文和代码

专知会员服务

51+阅读 · 2021年7月10日

【CVPR2021】基于结构保持的弱监督目标定位

专知会员服务

16+阅读 · 2021年6月6日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

近期必读的六篇AAAI 2021【对抗攻击（Adversarial Attack）】相关论文和代码

专知会员服务

55+阅读 · 2021年2月17日

【AAAI2021】属性引导对抗训练的自然扰动鲁棒性

专知会员服务

26+阅读 · 2021年1月21日

最新《人脸识别对抗攻击》综述 | Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey

最新《人脸识别对抗攻击》综述 | Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey

专知会员服务

26+阅读 · 2020年7月24日

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

专知会员服务

7+阅读 · 2020年5月4日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

细粒度情感分析任务（ABSA）的最新进展

细粒度情感分析任务（ABSA）的最新进展

PaperWeekly

18+阅读 · 2020年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

stackGAN通过文字描述生成图片的V2项目

stackGAN通过文字描述生成图片的V2项目

CreateAMind

3+阅读 · 2018年1月1日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

基于位置注意力机制模型和带标签数据来提升槽填充（EMNLP outstanding paper）

基于位置注意力机制模型和带标签数据来提升槽填充（EMNLP outstanding paper）

科技创新与创业

17+阅读 · 2017年11月17日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Does Summary Evaluation Survive Translation to Other Languages?

Arxiv

0+阅读 · 2021年12月8日

Presentation Attack Detection Methods based on Gaze Tracking and Pupil Dynamic: A Comprehensive Survey

Arxiv

0+阅读 · 2021年12月7日

Adversarial Example Detection for DNN Models: A Review and Experimental Comparison

Arxiv

0+阅读 · 2021年12月6日

Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Arxiv

0+阅读 · 2021年12月5日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

ResT: An Efficient Transformer for Visual Recognition

Arxiv

3+阅读 · 2021年10月14日

ReNAS:Relativistic Evaluation of Neural Architecture Search

Arxiv

11+阅读 · 2021年3月10日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Pretrained Transformers Improve Out-of-Distribution Robustness

Arxiv

5+阅读 · 2020年4月13日

Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings

Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings

Arxiv

5+阅读 · 2019年9月11日

VIP会员

文章信息

相关主题

光学字符识别

state-of-the-art

相关VIP内容

近期必读的6篇顶会CVPR 2021【对抗攻击】相关论文和代码

专知会员服务

51+阅读 · 2021年7月10日

【CVPR2021】基于结构保持的弱监督目标定位

专知会员服务

16+阅读 · 2021年6月6日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

近期必读的六篇AAAI 2021【对抗攻击（Adversarial Attack）】相关论文和代码

专知会员服务

55+阅读 · 2021年2月17日

【AAAI2021】属性引导对抗训练的自然扰动鲁棒性

专知会员服务

26+阅读 · 2021年1月21日

最新《人脸识别对抗攻击》综述 | Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey

最新《人脸识别对抗攻击》综述 | Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey

专知会员服务

26+阅读 · 2020年7月24日

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

专知会员服务

7+阅读 · 2020年5月4日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

细粒度情感分析任务（ABSA）的最新进展

细粒度情感分析任务（ABSA）的最新进展

PaperWeekly

18+阅读 · 2020年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

stackGAN通过文字描述生成图片的V2项目

stackGAN通过文字描述生成图片的V2项目

CreateAMind

3+阅读 · 2018年1月1日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

基于位置注意力机制模型和带标签数据来提升槽填充（EMNLP outstanding paper）

基于位置注意力机制模型和带标签数据来提升槽填充（EMNLP outstanding paper）

科技创新与创业

17+阅读 · 2017年11月17日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Does Summary Evaluation Survive Translation to Other Languages?

Arxiv

0+阅读 · 2021年12月8日

Presentation Attack Detection Methods based on Gaze Tracking and Pupil Dynamic: A Comprehensive Survey

Arxiv

0+阅读 · 2021年12月7日

Adversarial Example Detection for DNN Models: A Review and Experimental Comparison

Arxiv

0+阅读 · 2021年12月6日

Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Arxiv

0+阅读 · 2021年12月5日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

ResT: An Efficient Transformer for Visual Recognition

Arxiv

3+阅读 · 2021年10月14日

ReNAS:Relativistic Evaluation of Neural Architecture Search

Arxiv

11+阅读 · 2021年3月10日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Pretrained Transformers Improve Out-of-Distribution Robustness

Arxiv

5+阅读 · 2020年4月13日

Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings

Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings

Arxiv

5+阅读 · 2019年9月11日

微信扫码咨询专知VIP会员