Prioritization of machine learning projects requires estimates of both the potential ROI of the business case and the technical difficulty of building a model with the required characteristics. In this work we present a technique for estimating the minimum required performance characteristics of a predictive model given a set of information about how it will be used. This technique will result in robust, objective comparisons between potential projects. The resulting estimates will allow data scientists and managers to evaluate whether a proposed machine learning project is likely to succeed before any modelling needs to be done. The technique has been implemented into the open source application MinViME (Minimum Viable Model Estimator) which can be installed via the PyPI python package management system, or downloaded directly from the GitHub repository. Available at https://github.com/john-hawkins/MinViME
翻译:在这项工作中,我们提出了一个技术方法,用于估计预测模型的最低所需性能特点,提供一套关于如何使用该模型的信息。这一技术将导致对潜在项目进行稳健和客观的比较。由此得出的估计将使数据科学家和管理人员能够在任何建模需要完成之前评价拟议的机器学习项目是否可能成功。该技术已经应用到开放源码应用程序MinVime(最起码可行的模型模拟模拟器)中,可以通过PyPIP Python软件包管理系统安装,或直接从GitHub储存库下载。可查阅https://github.com/john-hawkins/MinViME。