In this paper, we explore the challenges of automating experiments in data science. We propose an extensible experiment model as a foundation for integrating different open-source tools for running research experiments. We implement our approach in a prototype open-source software package, MLDev, and evaluate it in a series of experiments, which yield promising results. A comparison with other state-of-the-art tools highlights the novelty of our approach.