Lately, several benchmark studies have shown that the state of the art in some of the sub-fields of machine learning actually has not progressed despite progress being reported in the literature. The lack of progress is partly caused by the irreproducibility of many model comparison studies. Model comparison studies are conducted that do not control for many known sources of irreproducibility. This leads to results that cannot be verified by third parties. Our objective is to provide an overview of the sources of irreproducibility that are reported in the literature. We review the literature to provide an overview and a taxonomy in addition to a discussion on the identified sources of irreproducibility. Finally, we identify three lines of further inquiry.
翻译:最近,一些基准研究显示,尽管文献中报告了进展,但一些机器学习子领域的最新水平实际上没有进展,缺乏进展的部分原因是许多模型比较研究无法复制,进行模型比较研究并不控制许多已知的不可复制来源,结果无法由第三方核实。我们的目标是概述文献中报道的不可复制来源。我们审查文献,除了讨论已查明的不可复制来源外,还提供概览和分类。最后,我们确定了进一步调查的三条路线。