RECAST: Enabling User Recourse and Interpretability of Toxicity Detection Models with Interactive Visualization

With the widespread use of toxic language online, platforms are increasingly using automated systems that leverage advances in natural language processing to flag and remove toxic comments. However, most automated systems, when detecting and moderating toxic language, give their users no feedback, let alone an avenue of recourse through which users can make actionable changes. We present RECAST, an interactive, open-source web tool that visualizes these models' toxicity predictions and suggests alternatives for flagged toxic language, giving users a new path of recourse when they encounter automated moderation tools. RECAST highlights the text responsible for a toxicity classification and lets users interactively substitute potentially toxic phrases with neutral alternatives. We examined the effect of RECAST in two large-scale user evaluations and found that it was highly effective at helping users reduce toxicity as detected by the model. Users also gained a stronger understanding of the underlying toxicity criterion used by black-box models, enabling transparency and recourse. In addition, we found that when users focus on optimizing their language for these models rather than relying on their own judgment (which is the implied incentive and goal of deploying automated models), the models cease to be effective classifiers of toxicity when compared with human annotations. This opens a discussion of how toxicity detection models work and should work, and of their effect on the future of online discourse.
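
To make the highlight-and-substitute interaction concrete, the sketch below shows one way such a loop could be structured. It is not RECAST's implementation: the scorer is a hypothetical toy lexicon standing in for a real toxicity model, the attribution uses simple leave-one-out scoring (an assumption, since the abstract does not say how responsible text is identified), and every name (`TOY_WEIGHTS`, `TOY_ALTERNATIVES`, `toy_score`, `attribute`, `suggest`) is invented for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy lexicon standing in for a real toxicity model.
TOY_WEIGHTS: Dict[str, float] = {"idiot": 0.6, "stupid": 0.5, "hate": 0.4}

# Hypothetical neutral alternatives for flagged words.
TOY_ALTERNATIVES: Dict[str, List[str]] = {
    "idiot": ["person"],
    "stupid": ["mistaken", "unclear"],
    "hate": ["dislike"],
}

Scorer = Callable[[List[str]], float]


def toy_score(tokens: List[str]) -> float:
    """Return a toxicity score in [0, 1] as the clipped sum of word weights."""
    return min(1.0, sum(TOY_WEIGHTS.get(t.lower(), 0.0) for t in tokens))


def attribute(tokens: List[str], score: Scorer = toy_score) -> List[float]:
    """Leave-one-out attribution: how much the score drops without each token."""
    base = score(tokens)
    return [base - score(tokens[:i] + tokens[i + 1:]) for i in range(len(tokens))]


def suggest(tokens: List[str], index: int,
            score: Scorer = toy_score) -> List[Tuple[str, float]]:
    """Rank alternatives for tokens[index] by the score of the edited sentence."""
    options = TOY_ALTERNATIVES.get(tokens[index].lower(), [])
    scored = [
        (alt, score(tokens[:index] + [alt] + tokens[index + 1:]))
        for alt in options
    ]
    return sorted(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    words = "I hate this stupid idea".split()
    print("score:", toy_score(words))
    for word, weight in zip(words, attribute(words)):
        print(f"{word:>8}  {weight:.2f}")  # larger value = stronger highlight
    print("alternatives for 'stupid':", suggest(words, 3))
```

Because the loop only ever minimizes the model's own score, it also illustrates the caveat raised in the abstract: text edited to satisfy the model is not guaranteed to read as non-toxic to a human annotator.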

