更仔细地看一看TDFA (A closer look at TDFA) - 专知论文

会员服务 ·

0

正则化项 · 正则表达式 · 讲稿 · 相互独立的 · 优化器 ·

2022 年 6 月 3 日

A closer look at TDFA

翻译：更仔细地看一看TDFA

Angelo Borsotti,Ulya Trafimovich

from arxiv, 26 pages, 11 figures

We present an algorithm for regular expression parsing and submatch extraction based on tagged deterministic finite automata. The algorithm works with different disambiguation policies. We give detailed pseudocode for the algorithm, covering important practical optimizations. All transformations from a regular expression to an optimized automaton are explained on a step-by-step example. We consider both ahead-of-time and just-in-time determinization and describe variants of the algorithm suited to each setting. We provide benchmarks showing that the algorithm is very fast in practice. Our research is based on two independent implementations: an open-source lexer generator RE2C and an experimental Java library.

翻译：我们提出了一个基于有标记的确定性有限自动数据进行正常表达式分析的算法和亚匹配提取法。算法与不同的模糊化政策起作用。我们给算法提供详细的假码, 包括重要的实际优化。所有从正常表达式转换到优化自动图的转换都是通过一个逐步的示例来解释的。我们考虑先期和即时确定性, 并描述适合每个设置的算法变量。我们提供基准, 表明算法在实际操作中非常快速。我们的研究基于两个独立的实施: 开放源源码生成 RE2C 和一个实验性 Java 图书馆。

0

相关内容

正则化项

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

吡咯-咪唑聚酰胺多肽抑制程序性死亡受体1基因转录机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

线粒体自噬-Warburg效应介导apelin促血管平滑肌细胞增殖

国家自然科学基金

0+阅读 · 2014年12月31日

L-BM诱导的血流动力学改变对慢性心衰中自噬的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

强磁场下低活化钢中合金碳化物(Fe,Cr)xCy析出的热力学机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Toll样受体4信号通路在角膜新生血管形成机制中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

hsBAFF上调胞内钙离子激活B淋巴细胞的信号转导网络免疫调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

B细胞刺激因子受体BAFF-R、BCMA和TACI介导信号和相互关系在胶原性关节炎发病中作用及受体抑制剂对其的影响

国家自然科学基金

0+阅读 · 2011年12月31日

几类非自伴微分方程周期解的存在性及渐近性态

国家自然科学基金

0+阅读 · 2008年12月31日

Learn-Morph-Infer: a new way of solving the inverse problem for brain tumor modeling

Arxiv

0+阅读 · 2022年7月18日

A Reachability Index for Recursive Label-Concatenated Graph Queries

Arxiv

0+阅读 · 2022年7月18日

The Importance Markov Chain

Arxiv

0+阅读 · 2022年7月17日

Imaginary Zeroth-Order Optimization

Arxiv

0+阅读 · 2022年7月17日

Threat Model-Agnostic Adversarial Defense using Diffusion Models

Arxiv

0+阅读 · 2022年7月17日

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Arxiv

0+阅读 · 2022年7月15日

ReFormer: The Relational Transformer for Image Captioning

Arxiv

0+阅读 · 2022年7月14日

Your "Labrador" is My "Dog": Fine-Grained, or Not

Arxiv

27+阅读 · 2021年2月17日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

VIP会员

文章信息

相关主题

正则表达式

相互独立的

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《代码、指挥与冲突：描绘军事人工智能的未来》报告

【斯坦福博士论文】面向地理空间数据的多模态与多尺度建模：时空生成式人工智能

美国启动“自有军事人工智能计划”：采用谷歌Gemini以推动全军人工智能应用

《创新与适应性作为军事成功的关键因素：来自俄乌战争的战略洞见》报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Learn-Morph-Infer: a new way of solving the inverse problem for brain tumor modeling

Arxiv

0+阅读 · 2022年7月18日

A Reachability Index for Recursive Label-Concatenated Graph Queries

Arxiv

0+阅读 · 2022年7月18日

The Importance Markov Chain

Arxiv

0+阅读 · 2022年7月17日

Imaginary Zeroth-Order Optimization

Arxiv

0+阅读 · 2022年7月17日

Threat Model-Agnostic Adversarial Defense using Diffusion Models

Arxiv

0+阅读 · 2022年7月17日

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Arxiv

0+阅读 · 2022年7月15日

ReFormer: The Relational Transformer for Image Captioning

Arxiv

0+阅读 · 2022年7月14日

Your "Labrador" is My "Dog": Fine-Grained, or Not

Arxiv

27+阅读 · 2021年2月17日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

相关基金

吡咯-咪唑聚酰胺多肽抑制程序性死亡受体1基因转录机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

线粒体自噬-Warburg效应介导apelin促血管平滑肌细胞增殖

国家自然科学基金

0+阅读 · 2014年12月31日

L-BM诱导的血流动力学改变对慢性心衰中自噬的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

强磁场下低活化钢中合金碳化物(Fe,Cr)xCy析出的热力学机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Toll样受体4信号通路在角膜新生血管形成机制中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

hsBAFF上调胞内钙离子激活B淋巴细胞的信号转导网络免疫调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

B细胞刺激因子受体BAFF-R、BCMA和TACI介导信号和相互关系在胶原性关节炎发病中作用及受体抑制剂对其的影响

国家自然科学基金

0+阅读 · 2011年12月31日

几类非自伴微分方程周期解的存在性及渐近性态

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员