Discovering Boundary Values of Feature-based Machine Learning Classifiers through Exploratory Datamorphic Testing

Testing has been widely recognised as difficult for AI applications. This paper proposes a set of strategies for testing machine learning applications in the framework of the datamorphism testing methodology. In these strategies, testing aims at exploring the data space of a classification or clustering application to discover the boundaries between the classes that the machine learning application defines. This enables the tester to understand precisely the behaviour and function of the software under test. Three variants of exploratory strategies are presented, together with the algorithms that implement them in the automated datamorphic testing tool Morphy. The correctness of these algorithms is formally proved. Their capability and cost of discovering borders between classes are evaluated via a set of controlled experiments with manually designed subjects and a set of case studies with real machine learning models.
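To make the idea of discovering a boundary between classes concrete, here is a minimal sketch of one way such a search could work: bisecting the segment between two points that a classifier labels differently until the two end points are very close. This is only an illustration under stated assumptions. It is not Morphy's API and is not claimed to be one of the three strategies in the paper; the names `find_border_pair`, `midpoint`, `distance`, and the tolerance `tol` are hypothetical.

```python
from typing import Callable, Hashable, Tuple

Point = Tuple[float, ...]


def midpoint(a: Point, b: Point) -> Point:
    """Return the point halfway between a and b."""
    return tuple((x + y) / 2 for x, y in zip(a, b))


def distance(a: Point, b: Point) -> float:
    """Euclidean distance between a and b."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def find_border_pair(
    classify: Callable[[Point], Hashable],
    a: Point,
    b: Point,
    tol: float = 1e-3,
) -> Tuple[Point, Point]:
    """Hypothetical helper: narrow (a, b) to a pair of nearby points
    that the classifier assigns to different classes.

    Assumes classify(a) != classify(b) on entry. The loop keeps that
    invariant and halves the distance on every iteration.
    """
    class_a = classify(a)
    if class_a == classify(b):
        raise ValueError("start points must belong to different classes")
    while distance(a, b) > tol:
        m = midpoint(a, b)
        if classify(m) == class_a:
            a = m  # midpoint is still on a's side of the border
        else:
            b = m  # midpoint is in some other class; shrink from b's side
    return a, b
```

With a toy classifier such as `lambda p: int(p[0] + p[1] > 1)` and start points `(0.0, 0.0)` and `(1.0, 1.0)`, the returned pair straddles the line x + y = 1 and its two points are at most `tol` apart. A real model's `predict` function could be wrapped in the same `classify` signature.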

