Despite increasingly fluent, relevant, and coherent language generation, major gaps remain between how humans and machines use language. We argue that a key dimension missing from our understanding of language models (LMs) is the model's ability to interpret and generate expressions of uncertainty. Whether it is a weatherperson announcing a chance of rain or a doctor giving a diagnosis, information is often not black-and-white, and expressions of uncertainty provide the nuance needed to support human decision-making. The increasing deployment of LMs in the wild motivates us to investigate whether LMs are capable of interpreting expressions of uncertainty and how LMs' behaviors change when learning to emit their own expressions of uncertainty. When injecting expressions of uncertainty into prompts (e.g., "I think the answer is..."), we discover that GPT3's generations vary by upwards of 80% in accuracy depending on the expression used. We analyze the linguistic characteristics of these expressions and find a drop in accuracy when naturalistic expressions of certainty are present. We find similar effects when teaching models to emit their own expressions of uncertainty: model calibration suffers when models are taught to emit certainty rather than uncertainty. Together, these results highlight the challenges of building LMs that interpret and generate trustworthy expressions of uncertainty.
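To make the prompt-injection setup concrete, the sketch below shows one way such an evaluation could be wired up. This is a minimal illustration, not the paper's protocol: `query_model` is a hypothetical stub standing in for a real LM call (e.g., to GPT3), and the expressions and QA items are examples rather than the actual stimulus set.

```python
# Illustrative sketch: prepend an epistemic expression to a QA prompt and
# score the model's completion. `query_model` is a hypothetical placeholder
# for a real LM call; expressions and questions are examples only.

UNCERTAINTY_EXPRESSIONS = [
    "I think the answer is",       # hedged / uncertain
    "I'm 90% sure the answer is",  # numerical confidence
    "I am certain the answer is",  # strong certainty
]

QA_ITEMS = [
    {"question": "What is the capital of France?", "answer": "Paris"},
    {"question": "How many legs does a spider have?", "answer": "eight"},
]


def query_model(prompt: str) -> str:
    """Hypothetical stand-in for an LM completion call; replace with a real API."""
    return "Paris."  # dummy completion so the sketch runs end-to-end


def accuracy_with_expression(expression: str) -> float:
    """Score completions when `expression` is injected before the answer slot."""
    correct = 0
    for item in QA_ITEMS:
        prompt = f"Q: {item['question']}\nA: {expression}"
        completion = query_model(prompt)
        correct += int(item["answer"].lower() in completion.lower())
    return correct / len(QA_ITEMS)


for expr in UNCERTAINTY_EXPRESSIONS:
    print(f"{expr!r}: accuracy = {accuracy_with_expression(expr):.2f}")
```

Under this kind of setup, comparing accuracy across expressions is what reveals the large swings the abstract reports.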