语言模式适应的视觉比较 (Visual Comparison of Language Model Adaptation)

Neural language models are widely used; however, their model parameters often need to be adapted to the specific domains and tasks of an application, which is time- and resource-consuming. Thus, adapters have recently been introduced as a lightweight alternative for model adaptation. They consist of a small set of task-specific parameters with a reduced training time and simple parameter composition. The simplicity of adapter training and composition comes along with new challenges, such as maintaining an overview of adapter properties and effectively comparing their produced embedding spaces. To help developers overcome these challenges, we provide a twofold contribution. First, in close collaboration with NLP researchers, we conducted a requirement analysis for an approach supporting adapter evaluation and detected, among others, the need for both intrinsic (i.e., embedding similarity-based) and extrinsic (i.e., prediction-based) explanation methods. Second, motivated by the gathered requirements, we designed a flexible visual analytics workspace that enables the comparison of adapter properties. In this paper, we discuss several design iterations and alternatives for interactive, comparative visual explanation methods. Our comparative visualizations show the differences in the adapted embedding vectors and prediction outcomes for diverse human-interpretable concepts (e.g., person names, human qualities). We evaluate our workspace through case studies and show that, for instance, an adapter trained on the language debiasing task according to context-0 (decontextualized) embeddings introduces a new type of bias where words (even gender-independent words such as countries) become more similar to female than male pronouns. We demonstrate that these are artifacts of context-0 embeddings.

翻译：神经语言模型被广泛使用;然而,它们的模型参数往往需要适应应用的具体领域和任务,因为应用需要花费时间和资源。因此,最近引入了适应器,作为模型适应的轻量替代方法。它们包括一小套任务特有参数,培训时间减少,参数构成简单。适应器培训和构成的简单性伴随着新的挑战,例如保持对适应器特性的概览并有效地比较其生成的嵌入空间。为了帮助开发者克服这些挑战,我们提供了双重贡献。首先,我们与国家实验室研究人员密切合作,对支持适应器类型评估的方法进行了需求分析,并发现,除其他外,对内在(即嵌入相似性基)和外部(即基于预测的)解释方法的需要。第二,根据所收集的要求,我们设计了一个灵活的视觉分析空间,以便能够比较适应适应器特性。在本文中,我们讨论了一些用于互动、比较视觉解释方法的版本和替代品设计。我们进行比较的可视化背景背景背景背景分析,显示了内在(即嵌入式的性别背景分析,展示了人类变异性变的变量,通过案例研究展示了人类变异性变的变量变异性分析结果)。