To what extent do language models (LMs) build "mental models" of a scene when answering situated questions (e.g., questions about a specific ethical dilemma)? While cognitive science has shown that mental models play a fundamental role in human problem-solving, it is unclear whether the high question-answering performance of existing LMs is backed by similar model building, and if not, whether that can explain their well-known catastrophic failures. We observe that Macaw, an existing T5-based LM, when probed provides somewhat useful but inadequate mental models for situational questions (estimated accuracy=43%, usefulness=21%, consistency=42%). We propose DREAM, a model that takes a situational question as input and produces a mental model elaborating the situation, without any additional task-specific training data for mental models. It inherits its social commonsense through distant supervision from existing NLP resources. Our analysis shows that DREAM produces significantly better mental models (estimated accuracy=67%, usefulness=37%, consistency=71%) than Macaw. Finally, mental models generated by DREAM can be used as additional context for situational QA tasks. This additional context improves the answer accuracy of a zero-shot Macaw model by between +1% and +4% (absolute) on three different datasets.