Testing has been widely recognised as difficult for AI applications. This paper proposes a set of testing strategies for testing machine learning applications in the framework of the datamorphism testing methodology. In these strategies, testing aims at exploring the data space of a classification or clustering application to discover the boundaries between classes that the machine learning application defines. This enables the tester to understand precisely the behaviour and function of the software under test. In the paper, three variants of exploratory strategies are presented with the algorithms implemented in the automated datamorphic testing tool Morphy. The correctness of these algorithms are formally proved. Their capability and cost of discovering borders between classes are evaluated via a set of controlled experiments with manually designed subjects and a set of case studies with real machine learning models.
翻译:本文提出了一套在数据形态测试方法框架内测试机器学习应用的测试战略; 在这些战略中,测试旨在探索分类或集群应用的数据空间,以发现机器学习应用所定义的类别之间的界限; 使测试者能够准确理解测试中的软件的行为和功能; 本文介绍了三个探索战略的变式,并介绍了自动数据形态测试工具Morphy中采用的算法; 这些算法的正确性得到了正式证明; 其发现各班之间边界的能力和费用通过一套有控制的实验,用人工设计的科目和一套用真正的机器学习模型进行的个案研究进行评估。