As Deep Learning (DL) is increasingly adopted in safety-critical applications, concerns about its quality and reliability have grown. As in the traditional software development process, testing DL software to uncover defects at an early stage is an effective way to reduce risks after deployment. Although recent progress has been made in designing novel testing techniques for DL software, the distribution of the generated test data is not taken into consideration. It is therefore hard to judge whether the identified errors are meaningful to the DL application. We propose a new out-of-distribution (OOD)-guided testing technique that aims to generate unseen test cases relevant to the underlying DL system's task. Our results show that this technique filters out up to 55.44% of error-inducing test cases on CIFAR-10 and is 10.05% more effective at enhancing model robustness.
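The abstract does not specify which OOD score the technique uses; a minimal sketch of the filtering idea, assuming a maximum-softmax-probability (MSP) score as an illustrative stand-in, might look like the following. All names, the threshold, and the logits are hypothetical.

```python
import numpy as np

def max_softmax_score(logits):
    """Maximum softmax probability (MSP), a common OOD score:
    in-distribution inputs tend to yield higher confidence."""
    z = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    return p.max(axis=1)

def filter_in_distribution(test_logits, threshold=0.5):
    """Keep indices of generated test cases whose OOD score passes the
    threshold, discarding likely-OOD (task-irrelevant) error cases."""
    scores = max_softmax_score(test_logits)
    return np.where(scores >= threshold)[0]

# Hypothetical model logits for 3 generated test inputs over 4 classes.
logits = np.array([
    [4.0, 0.1, 0.1, 0.1],   # confident -> likely in-distribution
    [0.5, 0.4, 0.6, 0.5],   # near-uniform -> likely OOD
    [3.0, 0.2, 2.9, 0.1],   # ambiguous between two classes
])
kept = filter_in_distribution(logits, threshold=0.5)
print(kept.tolist())  # → [0]
```

Only the confidently classified input survives the filter; the two low-score inputs would be treated as OOD and excluded from the error count, which is the mechanism the filtering statistic above relies on.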