关于集体强力反对数据中毒问题 (On Collective Robustness of Bagging Against Data Poisoning) - 专知论文

会员服务 ·

0

Bagging · 稳健性 · Extensibility · 引导聚合 · 绝对多数投票 ·

2022 年 5 月 26 日

On Collective Robustness of Bagging Against Data Poisoning

翻译：关于集体强力反对数据中毒问题

Ruoxin Chen,Zenan Li,Jie Li,Chentao Wu,Junchi Yan

from arxiv, Accepted by ICML22, code: https://github.com/Emiyalzn/ICML22-CRB

Bootstrap aggregating (bagging) is an effective ensemble protocol, which is believed can enhance robustness by its majority voting mechanism. Recent works further prove the sample-wise robustness certificates for certain forms of bagging (e.g. partition aggregation). Beyond these particular forms, in this paper, \emph{we propose the first collective certification for general bagging to compute the tight robustness against the global poisoning attack}. Specifically, we compute the maximum number of simultaneously changed predictions via solving a binary integer linear programming (BILP) problem. Then we analyze the robustness of vanilla bagging and give the upper bound of the tolerable poison budget. Based on this analysis, \emph{we propose hash bagging} to improve the robustness of vanilla bagging almost for free. This is achieved by modifying the random subsampling in vanilla bagging to a hash-based deterministic subsampling, as a way of controlling the influence scope for each poisoning sample universally. Our extensive experiments show the notable advantage in terms of applicability and robustness.

翻译：捆绑集( 捆绑) 是一个有效的组合协议, 据信它能通过多数投票机制增强稳健性。最近的工作进一步证明了某些包装形式的样本智能强健性证书( 例如分区汇总 ) 。除这些特定形式外, 本文中我们建议对普通包装进行首次集体认证, 以计算全球中毒袭击的紧固性。具体地说, 我们通过解决二元整线性编程( BILP) 问题来计算同时修改的预测的最大数量。然后我们分析香草包装的稳健性, 并给出可耐性毒预算的上限。基于此分析, \ emph{ we 提议 hash baging} 来提高香草包装几乎免费的稳健性。这是通过修改香草包装中的随机子取样方法实现的, 以此来控制每种中毒样本的影响范围。我们的广泛实验显示了在适用性和稳健性方面的显著优势。

0

相关内容

Bagging

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

SiC纳米线增韧C/C-ZrB2-ZrC-SiC超高温陶瓷基复合材料高温抗氧化耐烧蚀性能与机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

A,B位双掺杂及LaAlO3复合钙钛矿LnFeO3的载流子输运特性与介电损耗机理研究

国家自然科学基金

1+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

纳米晶金属卸载塑性变形行为及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

非平稳时间序列的非参数预测回归

国家自然科学基金

7+阅读 · 2012年12月31日

轴对称的Navier-Stokes方程

国家自然科学基金

1+阅读 · 2011年12月31日

丙酮醛在胰岛素抵抗相关脂质代谢异常和动脉粥样硬化形成中的作用与机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微结构辐射特性及其调控机理

国家自然科学基金

0+阅读 · 2008年12月31日

Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation

Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation

Arxiv

0+阅读 · 2022年7月14日

Uncertainty quantification for predictions of atomistic neural networks

Arxiv

0+阅读 · 2022年7月14日

Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation

Arxiv

0+阅读 · 2022年7月14日

Susceptibility of Continual Learning Against Adversarial Attacks

Arxiv

0+阅读 · 2022年7月13日

Constraint-Based Causal Structure Learning from Undersampled Graphs

Arxiv

0+阅读 · 2022年7月13日

Uncertainty-Aware Learning Against Label Noise on Imbalanced Datasets

Arxiv

0+阅读 · 2022年7月12日

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model

Arxiv

0+阅读 · 2022年7月12日

Guiding the retraining of convolutional neural networks against adversarial inputs

Arxiv

0+阅读 · 2022年7月12日

Attention and Self-Attention in Random Forests

Arxiv

0+阅读 · 2022年7月9日

Active Learning for Contextual Search with Binary Feedbacks

Arxiv

0+阅读 · 2022年7月9日

VIP会员

文章信息

相关主题

绝对多数投票

相关VIP内容

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation

Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation

Arxiv

0+阅读 · 2022年7月14日

Uncertainty quantification for predictions of atomistic neural networks

Arxiv

0+阅读 · 2022年7月14日

Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation

Arxiv

0+阅读 · 2022年7月14日

Susceptibility of Continual Learning Against Adversarial Attacks

Arxiv

0+阅读 · 2022年7月13日

Constraint-Based Causal Structure Learning from Undersampled Graphs

Arxiv

0+阅读 · 2022年7月13日

Uncertainty-Aware Learning Against Label Noise on Imbalanced Datasets

Arxiv

0+阅读 · 2022年7月12日

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model

Arxiv

0+阅读 · 2022年7月12日

Guiding the retraining of convolutional neural networks against adversarial inputs

Arxiv

0+阅读 · 2022年7月12日

Attention and Self-Attention in Random Forests

Arxiv

0+阅读 · 2022年7月9日

Active Learning for Contextual Search with Binary Feedbacks

Arxiv

0+阅读 · 2022年7月9日

相关基金

SiC纳米线增韧C/C-ZrB2-ZrC-SiC超高温陶瓷基复合材料高温抗氧化耐烧蚀性能与机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

A,B位双掺杂及LaAlO3复合钙钛矿LnFeO3的载流子输运特性与介电损耗机理研究

国家自然科学基金

1+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

纳米晶金属卸载塑性变形行为及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

非平稳时间序列的非参数预测回归

国家自然科学基金

7+阅读 · 2012年12月31日

轴对称的Navier-Stokes方程

国家自然科学基金

1+阅读 · 2011年12月31日

丙酮醛在胰岛素抵抗相关脂质代谢异常和动脉粥样硬化形成中的作用与机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微结构辐射特性及其调控机理

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员