解决自然语言系统中不显眼的不安全文本问题 (Mitigating Covertly Unsafe Text within Natural Language Systems) - 专知论文

会员服务 ·

0

INFORMS · 声明 · 论文 · 人工智能 · 自然语言处理 ·

2023 年 3 月 20 日

Mitigating Covertly Unsafe Text within Natural Language Systems

翻译：解决自然语言系统中不显眼的不安全文本问题

Alex Mei,Anisha Kabir,Sharon Levy,Melanie Subbiah,Emily Allaway,John Judge,Desmond Patton,Bruce Bimber,Kathleen McKeown,William Yang Wang

from arxiv, In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing

An increasingly prevalent problem for intelligent technologies is text safety, as uncontrolled systems may generate recommendations to their users that lead to injury or life-threatening consequences. However, the degree of explicitness of a generated statement that can cause physical harm varies. In this paper, we distinguish types of text that can lead to physical harm and establish one particularly underexplored category: covertly unsafe text. Then, we further break down this category with respect to the system's information and discuss solutions to mitigate the generation of text in each of these subcategories. Ultimately, our work defines the problem of covertly unsafe language that causes physical harm and argues that this subtle yet dangerous issue needs to be prioritized by stakeholders and regulators. We highlight mitigation strategies to inspire future researchers to tackle this challenging problem and help improve safety within smart systems.

翻译：智能科技面临的越来越普遍的问题是文本安全，因为不受控制的系统可能为其用户产生建议，其中包含可能导致伤害或危及生命的内容。然而，能够导致身体伤害的陈述的明确程度各不相同。在本文中，我们区分了可能导致身体伤害的文本类型，并建立了一个特别未被充分探索的类型：隐蔽的不安全文本。然后，我们进一步分析了与系统信息相关的这一类别，并讨论了减少生成每个子类别中的文本的解决方案。最终，我们的工作定义了会导致身体危害的隐蔽语言问题，并认为这个微妙而危险的问题需要得到利益相关者和监管机构的优先考虑。我们强调缓解策略，以激发未来研究人员解决这一具有挑战性的问题并帮助改善智能系统的安全性。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

不可错过! CMU CMU《高级自然语言处理》结课了，附课件与视频

不可错过! CMU CMU《高级自然语言处理》结课了，附课件与视频

专知会员服务

73+阅读 · 2021年10月4日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【InterSpeech2020】混合语音识别系统中的词汇扩展技术，Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

【InterSpeech2020】混合语音识别系统中的词汇扩展技术，Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

专知会员服务

17+阅读 · 2020年3月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

基于格值逻辑的语言真值α-群锁语义归结自动推理研究

国家自然科学基金

0+阅读 · 2015年12月31日

ESR1经SDF-1/CXCR4轴介导的BMSCs归巢与分化在薄型子宫内膜发病中的作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

铁过载损伤机制及Hmox1基因的条件控制表达技术在脑出血后神经干细胞移植中的研究

国家自然科学基金

0+阅读 · 2014年12月31日

lncRNAs和miR-592的相互作用对mESC向神经元分化的影响

国家自然科学基金

0+阅读 · 2012年12月31日

藏文字符排序研究

国家自然科学基金

0+阅读 · 2009年12月31日

Uncovering ChatGPT's Capabilities in Recommender Systems

Arxiv

0+阅读 · 2023年5月11日

Auditing Cross-Cultural Consistency of Human-Annotated Labels for Recommendation Systems

Arxiv

0+阅读 · 2023年5月10日

Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions

Arxiv

48+阅读 · 2022年9月7日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

VIP会员

文章信息

相关主题

自然语言处理

相关VIP内容

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

不可错过! CMU CMU《高级自然语言处理》结课了，附课件与视频

不可错过! CMU CMU《高级自然语言处理》结课了，附课件与视频

专知会员服务

73+阅读 · 2021年10月4日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【InterSpeech2020】混合语音识别系统中的词汇扩展技术，Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

【InterSpeech2020】混合语音识别系统中的词汇扩展技术，Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

专知会员服务

17+阅读 · 2020年3月23日

热门VIP内容

开通专知VIP会员享更多权益服务

因果强化学习的统一框架：综述、分类体系、算法与应用

《无人机系统 - 反无人机系统：测试方法》364页

【MIT博士论文】语言模型的推理时学习算法

美军低成本无人作战攻击系统（LUCAS）：扩大无人机战争规模

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Uncovering ChatGPT's Capabilities in Recommender Systems

Arxiv

0+阅读 · 2023年5月11日

Auditing Cross-Cultural Consistency of Human-Annotated Labels for Recommendation Systems

Arxiv

0+阅读 · 2023年5月10日

Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions

Arxiv

48+阅读 · 2022年9月7日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

相关基金

基于格值逻辑的语言真值α-群锁语义归结自动推理研究

国家自然科学基金

0+阅读 · 2015年12月31日

ESR1经SDF-1/CXCR4轴介导的BMSCs归巢与分化在薄型子宫内膜发病中的作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

铁过载损伤机制及Hmox1基因的条件控制表达技术在脑出血后神经干细胞移植中的研究

国家自然科学基金

0+阅读 · 2014年12月31日

lncRNAs和miR-592的相互作用对mESC向神经元分化的影响

国家自然科学基金

0+阅读 · 2012年12月31日

藏文字符排序研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员