从多重子任务到最优化的临界值的可靠决定: (Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild) - 专知论文

会员服务 ·

0

阈值 · 优化器 · 最优化 · MoDELS · Automator ·

2022 年 11 月 2 日

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

翻译：从多重子任务到最优化的临界值的可靠决定:

Donghyun Son,Byounggyu Lew,Kwanghee Choi,Yongsu Baek,Seungwoo Choi,Beomjun Shin,Sungjoo Ha,Buru Chang

from arxiv, WSDM 2023

Social media platforms struggle to protect users from harmful content through content moderation. These platforms have recently leveraged machine learning models to cope with the vast amount of user-generated content daily. Since moderation policies vary depending on countries and types of products, it is common to train and deploy the models per policy. However, this approach is highly inefficient, especially when the policies change, requiring dataset re-labeling and model re-training on the shifted data distribution. To alleviate this cost inefficiency, social media platforms often employ third-party content moderation services that provide prediction scores of multiple subtasks, such as predicting the existence of underage personnel, rude gestures, or weapons, instead of directly providing final moderation decisions. However, making a reliable automated moderation decision from the prediction scores of the multiple subtasks for a specific target policy has not been widely explored yet. In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way. Extensive experiments demonstrate that our approach shows better performance in content moderation compared to existing threshold optimization methods and heuristics.

翻译：社交媒体平台努力通过内容调适来保护用户免受有害内容的伤害。这些平台最近利用了机器学习模式来应对每天大量用户生成的内容。由于温适政策因国家和产品类型而异,因此通常按政策培训和部署模式。然而,这种做法效率极低,特别是在政策变化要求数据集重新标签和对转移的数据分发进行模式再培训的情况下。为减轻这种成本低效率,社交媒体平台经常使用第三方内容调控服务,提供多种子任务分数的预测分数,如预测是否存在未成年人、粗鲁手势或武器,而不是直接提供最终的温和决定。然而,从多个子任务对具体目标政策的预测分数中做出可靠的自动调和决定,尚未得到广泛探讨。在本研究中,我们制定了真实而有效的内容调适情景,并引入简单而有效的门槛优化方法,以寻找多个子任务的最佳阈值,以具有成本效益的方式做出可靠的温和决定。广泛的实验表明,我们的方法在内容调适度方面比现有的阈值优化方法和超感力力。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Stokes/Darcy 耦合问题的数值方法及预处理技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

气道上皮细胞RUNX1调控急性肺损伤肺部炎症机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

有理函数非旋转Fatou域与不连通Julia集的结构

国家自然科学基金

0+阅读 · 2014年12月31日

具有类年龄结构和空间异质性的传染病动力学的建模与研究

国家自然科学基金

0+阅读 · 2012年12月31日

急性肺损伤中颗粒蛋白前体的microRNA调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

粤西海域CTW（Coastal Trapped Wave）特征分析与数值模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

Tribble3基因调控MAPK信号通路在表皮增殖及银屑病皮损形成中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

AlGaN基PIN太阳光盲雪崩探测器研究

国家自然科学基金

0+阅读 · 2008年12月31日

Yes We Care! -- Certification for Machine Learning Methods through the Care Label Framework

Arxiv

0+阅读 · 2022年12月22日

Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data

Arxiv

0+阅读 · 2022年12月21日

Uncertainty quantification for sparse spectral variational approximations in Gaussian process regression

Arxiv

0+阅读 · 2022年12月21日

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

Arxiv

0+阅读 · 2022年12月21日

Scheduling with Predictions

Scheduling with Predictions

Arxiv

0+阅读 · 2022年12月20日

RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structure Prediction

Arxiv

0+阅读 · 2022年12月20日

Multiple Testing in Genome-Wide Association Studies via Hierarchical Hidden Markov Models

Arxiv

0+阅读 · 2022年12月20日

Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations

Arxiv

0+阅读 · 2022年12月19日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Yes We Care! -- Certification for Machine Learning Methods through the Care Label Framework

Arxiv

0+阅读 · 2022年12月22日

Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data

Arxiv

0+阅读 · 2022年12月21日

Uncertainty quantification for sparse spectral variational approximations in Gaussian process regression

Arxiv

0+阅读 · 2022年12月21日

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

Arxiv

0+阅读 · 2022年12月21日

Scheduling with Predictions

Scheduling with Predictions

Arxiv

0+阅读 · 2022年12月20日

RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structure Prediction

Arxiv

0+阅读 · 2022年12月20日

Multiple Testing in Genome-Wide Association Studies via Hierarchical Hidden Markov Models

Arxiv

0+阅读 · 2022年12月20日

Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations

Arxiv

0+阅读 · 2022年12月19日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

相关基金

Stokes/Darcy 耦合问题的数值方法及预处理技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

气道上皮细胞RUNX1调控急性肺损伤肺部炎症机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

有理函数非旋转Fatou域与不连通Julia集的结构

国家自然科学基金

0+阅读 · 2014年12月31日

具有类年龄结构和空间异质性的传染病动力学的建模与研究

国家自然科学基金

0+阅读 · 2012年12月31日

急性肺损伤中颗粒蛋白前体的microRNA调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

粤西海域CTW（Coastal Trapped Wave）特征分析与数值模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

Tribble3基因调控MAPK信号通路在表皮增殖及银屑病皮损形成中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

AlGaN基PIN太阳光盲雪崩探测器研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员