用于对对话框检索模型的 Pólya-Gamma 增强校准和不确定性模型</s> (On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models)

Deep neural retrieval models have amply demonstrated their power but estimating the reliability of their predictions remains challenging. Most dialog response retrieval models output a single score for a response on how relevant it is to a given question. However, the bad calibration of deep neural network results in various uncertainty for the single score such that the unreliable predictions always misinform user decisions. To investigate these issues, we present an efficient calibration and uncertainty estimation framework PG-DRR for dialog response retrieval models which adds a Gaussian Process layer to a deterministic deep neural network and recovers conjugacy for tractable posterior inference by P\'{o}lya-Gamma augmentation. Finally, PG-DRR achieves the lowest empirical calibration error (ECE) in the in-domain datasets and the distributional shift task while keeping $R_{10}@1$ and MAP performance.

翻译：深神经检索模型已经充分展示了它们的力量,但估计其预测的可靠性仍然具有挑战性。大多数对话框响应检索模型都输出一个分数来回答它与某个问题的相关性。然而,深神经网络的错误校准导致单一分数的各种不确定性,因此不可靠的预测总是错误地反映用户的决定。为了调查这些问题,我们提出了一个高效的校准和不确定性估计框架PG-DRR,用于对话响应检索模型,该模型将高斯进程层添加到一个确定性的深神经网络中,并恢复了P\'{o}lya-Gamma 递增可移动的后传推力的相似性。最后,PG-DRR在内部数据集和分布转移任务中实现了最低的实验性校准错误(欧洲经委会),同时保持$R<unk> 1美元和MAP的性能。</s>

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日