Learning curves are a concept from social sciences that has been adopted in the context of machine learning to assess the performance of a learning algorithm with respect to a certain resource, e.g. the number of training examples or the number of training iterations. Learning curves have important applications in several contexts of machine learning, most importantly for the context of data acquisition, early stopping of model training and model selection. For example, by modelling the learning curves, one can assess at an early stage whether the algorithm and hyperparameter configuration have the potential to be a suitable choice, often speeding up the algorithm selection process. A variety of approaches has been proposed to use learning curves for decision making. Some models answer the binary decision question of whether a certain algorithm at a certain budget will outperform a certain reference performance, whereas more complex models predict the entire learning curve of an algorithm. We contribute a framework that categorizes learning curve approaches using three criteria: the decision situation that they address, the intrinsic learning curve question that they answer and the type of resources that they use. We survey papers from literature and classify them into this framework.
翻译:学习曲线是社会科学的概念,是在机器学习中采用的,用来评估某种资源(例如培训实例的数量或培训迭代的数量)的学习算法的性能。学习曲线在机器学习的若干方面有重要的应用,最重要的是在数据获取、模型培训的早期停止和模型选择方面。例如,通过模拟学习曲线,人们可以在早期阶段评估算法和超参数配置是否具有成为适当选择的潜力,往往加快算法选择过程。提出了各种办法,用学习曲线来进行决策。一些模型回答关于某种预算的某种算法是否会超过某种参考性能的二进制决定问题,而更为复杂的模型则预测一种算法的整个学习曲线。我们提出一个框架,用三个标准对学习曲线方法进行分类:它们处理的决定状况、它们回答的内在学习曲线问题和它们使用的资源类型。我们从文献中查看论文并将其分类到这个框架。