This paper analyzes three formal models of Transformer encoders that differ in the form of their self-attention mechanism: unique hard attention (UHAT); generalized unique hard attention (GUHAT), which generalizes UHAT; and averaging hard attention (AHAT). We show that UHAT and GUHAT Transformers, viewed as string acceptors, can only recognize formal languages in the complexity class AC$^0$, the class of languages recognizable by families of Boolean circuits of constant depth and polynomial size. This upper bound subsumes Hahn's (2020) results that GUHAT cannot recognize the DYCK languages or the PARITY language, since those languages are outside AC$^0$ (Furst et al., 1984). In contrast, the non-AC$^0$ languages MAJORITY and DYCK-1 are recognizable by AHAT networks, implying that AHAT can recognize languages that UHAT and GUHAT cannot.
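To make the contrast concrete, here is a minimal illustrative sketch (not the paper's formal construction) of why averaging hard attention suffices for MAJORITY: with a value embedding of +1 for "1" and -1 for "0" (an assumption for illustration), and all attention scores tied so that averaging hard attention pools uniformly over every position, the pooled value is positive exactly when 1s outnumber 0s.

```python
import numpy as np

def ahat_majority(word: str) -> bool:
    """Illustrative sketch of one averaging-hard-attention step deciding
    MAJORITY over {0, 1}*.

    Every position attends with equal (tied) score to all positions, so
    averaging hard attention returns the mean of the value vectors. The
    value of '1' is +1 and of '0' is -1, hence the mean is positive
    exactly when the 1s form a strict majority.
    """
    values = np.array([1.0 if c == "1" else -1.0 for c in word])
    pooled = values.mean()      # uniform averaging over the tied maxima
    return pooled > 0.0         # accept iff strictly more 1s than 0s

# Example usage
assert ahat_majority("11010")       # three 1s vs. two 0s: accepted
assert not ahat_majority("1100")    # a tie is rejected
```

Unique hard attention, by contrast, selects a single position per head, which is the intuition behind the AC$^0$ upper bound for UHAT and GUHAT.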