Machine Learning (ML) techniques are becoming essential components of many software systems today, causing an increasing need to adapt traditional software engineering practices and tools to the development of ML-based software systems. This need is especially pronounced due to the challenges associated with the large-scale development and deployment of ML systems. Among the most commonly reported challenges during the development, production, and operation of ML-based systems are experiment management, dependency management, monitoring, and logging of ML assets. In recent years, we have seen several efforts to address these challenges as witnessed by an increasing number of tools for tracking and managing ML experiments and their assets. To facilitate research and practice on engineering intelligent systems, it is essential to understand the nature of the current tool support for managing ML assets. What kind of support is provided? What asset types are tracked? What operations are offered to users for managing those assets? We discuss and position ML asset management as an important discipline that provides methods and tools for ML assets as structures and the ML development activities as their operations. We present a feature-based survey of 17 tools with ML asset management support identified in a systematic search. We overview these tools' features for managing the different types of assets used for engineering ML-based systems and performing experiments. We found that most of the asset management support depends on traditional version control systems, while only a few tools support an asset granularity level that differentiates between important ML assets, such as datasets and models.
翻译:机械学习技术正在成为当今许多软件系统的基本组成部分,因此越来越需要使传统软件工程做法和工具适应以ML为基础的软件系统的发展,这种需要之所以特别突出,是因为大规模开发和部署ML系统带来挑战。在ML系统开发、生产和运行期间,最经常报告的挑战包括试验管理、依赖管理、监测和记录ML资产。近年来,我们看到为应对这些挑战作出了一些努力,例如跟踪和管理ML实验及其资产的工具越来越多,这导致越来越需要将传统软件工程工程工程工程工程技术做法和工具用于开发ML软件系统。为了便利工程智能系统的研究和实践,必须了解目前管理ML资产管理工具的性质。提供了何种支助?跟踪了哪些资产类型?向用户提供了哪些业务来管理这些资产?我们讨论ML资产管理,并将ML资产管理作为一项重要学科,为ML资产作为结构和ML开发活动提供方法和工具。我们介绍了对17个工具的基于ML资产管理支持进行基于少数系统搜索的特征调查。我们概述了这些工具在管理各种ML资产管理工具时所使用的最有区别的模型,我们只支持了ML系统所使用的模型。