探讨自然语言推论假设-唯一模式中的逻辑异常情况 (Exploring Lexical Irregularities in Hypothesis-OnlyModels of Natural Language Inference)

Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE) is the task of predicting the entailment relation between a pair of sentences (premise and hypothesis). This task has been described as a valuable testing ground for the development of semantic representations, and is a key component in natural language understanding evaluation benchmarks. Models that understand entailment should encode both, the premise and the hypothesis. However, experiments by Poliak et al. revealed a strong preference of these models towards patterns observed only in the hypothesis, based on a 10 dataset comparison. Their results indicated the existence of statistical irregularities present in the hypothesis that bias the model into performing competitively with the state of the art. While recast datasets provide large scale generation of NLI instances due to minimal human intervention, the papers that generate them do not provide fine-grained analysis of the potential statistical patterns that can bias NLI models. In this work, we analyze hypothesis-only models trained on one of the recast datasets provided in Poliak et al. for word-level patterns. Our results indicate the existence of potential lexical biases that could contribute to inflating the model performance.

翻译：自然语言推断(NLI)或确认文本细节(RTE)是预测一对判决(假设和假设)之间必然存在的关系的任务。这项任务被描述为发展语义表达的一种宝贵的试验场,是自然语言理解评价基准的一个关键组成部分。理解要求的模型应该将前提和假设都编码起来。然而,Poliak等人的实验显示,这些模型非常倾向于只根据10个数据集比较而假设所观察到的模式。其结果表明,假设中存在的统计违规现象使模型偏向于与艺术状态竞争。重新构建的数据集由于人类的干预程度最小,提供了大规模生成国家语言表达实例,但产生这些数据集的文件并没有对可能偏向国家语言分类模式的潜在统计模式进行精确分析。在这项工作中,我们分析了在Poliak等人为文字层次模式提供的重编数据集中经过培训的单一假设模型。我们的结果表明,存在潜在的词法偏见,可能助长模型的形成。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日