This paper seeks to develop a deeper understanding of the fundamental properties of neural text generation models. The study of artifacts that emerge in machine-generated text as a result of modeling choices is a nascent research area. Previously, the extent to which these artifacts surface in generated text had not been well studied. To better understand generative text models and their artifacts, we propose the new task of distinguishing which of several variants of a given model generated a piece of text, and we conduct an extensive suite of diagnostic tests to observe whether modeling choices (e.g., sampling method, top-$k$ probabilities, model architecture) leave detectable artifacts in the text they generate. Our key finding, backed by a rigorous set of experiments, is that such artifacts are present and that different modeling choices can be inferred by observing the generated text alone. This suggests that neural text generators may be more sensitive to various modeling choices than previously thought.