NLP researchers regularly invoke abstract concepts like "interpretability," "bias," "reasoning," and "stereotypes," without defining them. Each subfield has a shared understanding or conceptualization of what these terms mean and how we should treat them, and this shared understanding is the basis on which operational decisions are made: Datasets are built to evaluate these concepts, metrics are proposed to quantify them, and claims are made about systems. But what do they mean, what should they mean, and how should we measure them? I outline a seminar I created for students to explore these questions of conceptualization and operationalization, with an interdisciplinary reading list and an emphasis on discussion and critique.
翻译:自然语言处理研究者常援引"可解释性"、"偏见"、"推理"、"刻板印象"等抽象概念却未加明确定义。每个子领域对这些术语的含义及处理方式存在共享的理解或概念化,这种共识构成了操作决策的基础:为评估这些概念而构建数据集,为量化它们而提出度量指标,并对系统性能作出论断。但这些概念究竟意指什么?它们应当具有何种内涵?我们又该如何进行测量?本文概述了我为学生设计的专题研讨课程,通过跨学科阅读材料与强调讨论批判的教学方式,引导学生深入探究概念化与操作化的核心问题。