How to Query Language Models?

Large pre-trained language models (LMs) can recover not only linguistic knowledge but also factual and commonsense knowledge. To access the knowledge stored in mask-based LMs, we can pose cloze-style questions and let the model fill in the blank. This flexibility, an advantage over structured knowledge bases, comes with a drawback: finding the right query for a given information need. Inspired by how humans disambiguate a question, we propose to query LMs by example. To clarify the ambiguous question "Who does Neuer play for?", a successful strategy is to demonstrate the relation using another subject, e.g., "Ronaldo plays for Portugal. Who does Neuer play for?". We apply this query-by-example approach to the LAMA probe and obtain substantial improvements of up to 37.8% for BERT-large on the T-REx data when providing only 10 demonstrations, even outperforming a baseline that queries the model with up to 40 paraphrases of the question. The examples are provided through the model's context and thus require neither fine-tuning nor an additional forward pass. This suggests that LMs contain more factual and commonsense knowledge than previously assumed, provided we query the model in the right way.
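To make the query-by-example idea concrete, here is a minimal sketch of how such a prompt might be assembled for a mask-based LM. It is not the authors' implementation: the function names `fill` and `build_query`, the `[X]`/`[Y]` placeholder convention, and the declarative template "[X] plays for [Y]." are assumptions chosen for illustration, and `[MASK]` is BERT's mask token (other models may use a different one).

```python
from typing import List, Tuple

# BERT's mask token; an assumption here, other mask-based LMs may differ.
MASK = "[MASK]"


def fill(template: str, subject: str, obj: str) -> str:
    """Instantiate a relation template by substituting subject and object."""
    return template.replace("[X]", subject).replace("[Y]", obj)


def build_query(
    template: str,
    demonstrations: List[Tuple[str, str]],
    subject: str,
) -> str:
    """Prepend filled-in demonstrations to a cloze statement for `subject`.

    The demonstrations share the model's input context with the cloze
    statement, so the whole query is a single input string.
    """
    demos = [fill(template, s, o) for s, o in demonstrations]
    cloze = fill(template, subject, MASK)
    return " ".join(demos + [cloze])


# Hypothetical template and demonstration, mirroring the example in the text.
query = build_query("[X] plays for [Y].", [("Ronaldo", "Portugal")], "Neuer")
print(query)  # Ronaldo plays for Portugal. Neuer plays for [MASK].
```

The resulting string could then be passed to any masked LM, for instance through a fill-mask interface such as the one in Hugging Face Transformers, and the prediction read off at the mask position. Because the demonstrations are only extra context, no model weights change.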

