为全面评价机器学习分类系统生成和复制基准 (Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiers)

Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial for determine their scope of application. Here, we introduce the DIverse and GENerative ML Benchmark (DIGEN) - a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of machine learning algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions which map continuous features to discrete endpoints for creating synthetic datasets. These 40 functions were discovered using a heuristic algorithm designed to maximize the diversity of performance among multiple popular machine learning algorithms thus providing a useful test suite for evaluating and comparing new methods. Access to the generative functions facilitates understanding of why a method performs poorly compared to other algorithms thus providing ideas for improvement. The resource with extensive documentation and analyses is open-source and available on GitHub.

翻译：了解机器学习(ML)算法的优缺点对于确定其应用范围至关重要。在这里, 我们引入了DIverse 和General ML基准(DIGEN)—— 集合成数据集,用于综合、可复制和可解释的机器学习算法基准,用于二元结果分类。 DIGEN资源由40个数学功能组成, 绘制离散端点的连续特征图, 用于创建合成数据集。这40个功能是使用一种超常算法发现的, 目的是尽量扩大多种流行机器学习算法的性能多样性, 从而为评价和比较新方法提供一个有用的测试套。使用基因化功能有助于理解为什么一种方法与其他算法相比表现不佳, 从而提供了改进的想法。拥有大量文献和分析的资源是公开的, 可在 GitHub 上查阅。

相关内容

Machine Learning

关注 2245

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日