Testing Machine Learning (ML) models and AI-Infused Applications (AIIAs), i.e., systems that contain ML models, is highly challenging. In addition to the challenges of testing classical software, it is acceptable and expected that statistical ML models sometimes output incorrect results. A major challenge is to determine when the level of incorrectness, e.g., the accuracy or F1 score of a classifier, is acceptable and when it is not. In addition to business requirements, which should provide a threshold, it is a best practice to require any proposed ML solution to outperform simple baseline models, such as a decision tree. We have developed complexity measures that quantify how difficult given observations are to assign to their true class label; these measures can then be used to automatically determine a baseline performance threshold. They are superior to the best-practice baseline in that, at a linear computation cost, they also quantify each observation's classification complexity in an explainable form, regardless of the classifier model used. Our experiments with both numeric synthetic data and real natural-language chatbot data demonstrate that the complexity measures effectively highlight data regions and observations that are likely to be misclassified.
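To make the idea of a per-observation complexity score concrete, here is a minimal sketch, not the paper's exact measure: it scores each observation by the fraction of its nearest neighbors that carry a different label, so high scores mark observations that any classifier is likely to misclassify. The function name `neighborhood_complexity`, the choice of k, and the 0.6 threshold are illustrative assumptions.

```python
# Illustrative sketch of an instance-level classification-complexity score
# based on nearest-neighbor label disagreement (classifier-independent).
import numpy as np
from sklearn.neighbors import NearestNeighbors

def neighborhood_complexity(X, y, k=5):
    """Fraction of each observation's k nearest neighbors whose label differs
    from its own; values near 1 flag observations that are hard to classify."""
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nn.kneighbors(X)            # idx[:, 0] is the point itself
    neighbor_labels = y[idx[:, 1:]]      # labels of the k true neighbors
    return (neighbor_labels != y[:, None]).mean(axis=1)

# Toy usage: flag observations above a complexity threshold as a baseline
# estimate of where misclassifications are expected.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + 0.5 * rng.normal(size=200) > 0).astype(int)
scores = neighborhood_complexity(X, y, k=5)
print("mean complexity:", scores.mean(), "hard observations:", (scores > 0.6).sum())
```

The per-observation scores are directly explainable (they point to the disagreeing neighbors), and computing them scales linearly in the number of observations for a fixed neighborhood size, mirroring the properties claimed for the measures above.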