Models designed for intelligent process automation must be able to ground user interface elements. This UI element grounding task centres on linking natural language instructions to their target referents. Although BERT and similar pre-trained language models have excelled in several NLP tasks, their use in the UI grounding domain remains largely unexplored. This work concentrates on testing and probing the grounding abilities of three transformer-based models: BERT, RoBERTa and LayoutLM. Our primary focus is on these models' spatial reasoning skills, given their importance in this domain. We observe that LayoutLM, although originally created for a different purpose (representing scanned documents), holds a promising advantage for applications in this domain: its learned spatial features appear to transfer to the UI grounding setting, in particular demonstrating the ability to discriminate between target directions in natural language instructions.
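As a concrete illustration of how UI elements can be prepared for a LayoutLM-style model, the sketch below normalizes pixel-space bounding boxes into the 0–1000 coordinate grid that LayoutLM expects as positional input. The screen dimensions and the "Submit" button are hypothetical examples, not data from this work.

```python
def normalize_box(box, width, height):
    """Scale a pixel-space bounding box (x0, y0, x1, y1) into
    LayoutLM's 0-1000 normalized coordinate space."""
    x0, y0, x1, y1 = box
    return (
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    )

# Hypothetical example: a 1920x1080 screenshot containing a "Submit"
# button at pixels (860, 900) to (1060, 960).
button_box = normalize_box((860, 900, 1060, 960), 1920, 1080)
print(button_box)  # normalized (x0, y0, x1, y1) in the 0-1000 grid
```

Each UI element's normalized box would then accompany its text tokens as model input, letting the model attend jointly to the instruction and the elements' on-screen positions.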