As pre-trained language models (LMs) continue to dominate NLP, it is increasingly important that we understand the depth of language capabilities in these models. In this paper, we target pre-trained LMs' competence in pragmatics, with a focus on pragmatics relating to discourse connectives. We formulate cloze-style tests using a combination of naturally-occurring data and controlled inputs drawn from psycholinguistics. We focus on testing models' ability to use pragmatic cues to predict discourse connectives, models' ability to understand implicatures relating to connectives, and the extent to which models show humanlike preferences regarding temporal dynamics of connectives. We find that although models predict connectives reasonably well in the context of naturally-occurring data, when we control contexts to isolate high-level pragmatic cues, model sensitivity is much lower. Models also do not show substantial humanlike temporal preferences. Overall, the findings suggest that at present, dominant pre-training paradigms do not result in substantial pragmatic competence in our models.