Data scientists often develop machine learning models to solve a variety of problems in the industry and academy but not without facing several challenges in terms of Model Development. The problems regarding Machine Learning Development involves the fact that such professionals do not realize that they usually perform ad-hoc practices that could be improved by the adoption of activities presented in the Software Engineering Development Lifecycle. Of course, since machine learning systems are different from traditional Software systems, some differences in their respective development processes are to be expected. In this context, this paper is an effort to investigate the challenges and practices that emerge during the development of ML models from the software engineering perspective by focusing on understanding how software developers could benefit from applying or adapting the traditional software engineering process to the Machine Learning workflow.
翻译:数据科学家往往开发机器学习模型,以解决工业和学院的各种问题,但并非没有在模型开发方面面临若干挑战。关于机器学习开发的问题涉及这样的事实:这些专业人员没有意识到他们通常会采取通过采用软件工程开发生命周期中介绍的活动而可以改进的临时性做法。当然,由于机器学习系统不同于传统的软件系统,因此预期他们各自的开发过程会有所不同。在这方面,本文件努力从软件工程的角度,通过侧重于了解软件开发者如何从应用或调整传统软件工程程序到机器学习工作流程中受益,来调查在开发ML模型过程中出现的挑战和做法。