用于自动评估的性别偏见和普遍替代性对显性错误校正系统进行反弹攻击 (Gender Bias and Universal Substitution Adversarial Attacks on Grammatical Error Correction Systems for Automated Assessment)

Grammatical Error Correction (GEC) systems perform a sequence-to-sequence task, where an input word sequence containing grammatical errors, is corrected for these errors by the GEC system to output a grammatically correct word sequence. With the advent of deep learning methods, automated GEC systems have become increasingly popular. For example, GEC systems are often used on speech transcriptions of English learners as a form of assessment and feedback - these powerful GEC systems can be used to automatically measure an aspect of a candidate's fluency. The count of \textit{edits} from a candidate's input sentence (or essay) to a GEC system's grammatically corrected output sentence is indicative of a candidate's language ability, where fewer edits suggest better fluency. The count of edits can thus be viewed as a \textit{fluency score} with zero implying perfect fluency. However, although deep learning based GEC systems are extremely powerful and accurate, they are susceptible to adversarial attacks: an adversary can introduce a small, specific change at the input of a system that causes a large, undesired change at the output. When considering the application of GEC systems to automated language assessment, the aim of an adversary could be to cheat by making a small change to a grammatically incorrect input sentence that conceals the errors from a GEC system, such that no edits are found and the candidate is unjustly awarded a perfect fluency score. This work examines a simple universal substitution adversarial attack that non-native speakers of English could realistically employ to deceive GEC systems used for assessment.

翻译：语法错误校正( GEC) 系统通常用于英语学生的语音校正, 作为一种评估和反馈形式。这些强大的 GEC 系统可以自动测量候选人的不透明性。从候选人的输入句子( 或作文) 到 GEC 系统校正后的产出句子, 显示候选人的语言能力, 较少的编辑显示更流利。因此, 编辑的计数可以被视为一种写字( textit{ 流利分数 ), 零意味着完全流畅。但是, 尽管基于深层次学习的 GEC 系统非常强大和准确, 它们很容易受到对抗性攻击。从候选人的输入句( 或作文) 到 GEC 系统校正后输出句子的计算, 显示候选人的语言能力, 显示他的语言能力, 较少的编辑显示更流利。编辑的计分数可以被视为一种纯正的 GEC 系统, 其结果被理解为一种不准确的系统。当一个简单的系统输入时, 当一个不精确的系统被应用到一个不精确的GEC 。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日