Plotting a learner's generalization performance against the training set size results in a so-called learning curve. This tool, providing insight in the behavior of the learner, is also practically valuable for model selection, predicting the effect of more training data, and reducing the computational complexity of training. We set out to make the (ideal) learning curve concept precise and briefly discuss the aforementioned usages of such curves. The larger part of this survey's focus, however, is on learning curves that show that more data does not necessarily leads to better generalization performance. A result that seems surprising to many researchers in the field of artificial intelligence. We point out the significance of these findings and conclude our survey with an overview and discussion of open problems in this area that warrant further theoretical and empirical investigation.
翻译:将学习者的概括性表现与培训数据集的大小相比,得出所谓的学习曲线。这一工具为学习者的行为提供洞察力,对于模型选择、预测更多培训数据的影响以及降低培训的计算复杂性也具有实际价值。我们着手使(理想的)学习曲线概念精确化,并简要讨论上述这些曲线的用法。然而,这次调查的重点大都集中在学习曲线上,这些曲线表明,更多的数据不一定导致更好的概括性表现。对许多人工智能领域的研究人员来说,这一结果似乎令人吃惊。我们指出这些发现的重要性,并在调查结束时,以对这一领域公开问题的概述和讨论结束,这需要进一步的理论和经验调查。