Today, machine learning (ML) is widely used in industry to provide the core functionality of production systems. However, it is practically always used in production systems as part of a larger end-to-end software system that is made up of several other components in addition to the ML model. Due to production demand and time constraints, automated software engineering practices are highly applicable. The increased use of automated ML software engineering practices in industries such as manufacturing and utilities requires an automated Quality Assurance (QA) approach as an integral part of ML software. Here, QA helps reduce risk by offering an objective perspective on the software task. Although conventional software engineering has automated tools for QA data analysis for data-driven ML, the use of QA practices for ML in operation (MLOps) is lacking. This paper examines the QA challenges that arise in industrial MLOps and conceptualizes modular strategies to deal with data integrity and Data Quality (DQ). The paper is accompanied by real industrial use-cases from industrial partners. The paper also presents several challenges that may serve as a basis for future studies.
翻译:目前,在工业中广泛使用机器学习(ML),以提供生产系统的核心功能;然而,实际上,在生产系统中一直使用机器学习(ML),作为大型端对端软件系统的一部分,该系统除ML模型外,还由其他几个组成部分组成。由于生产需求和时间限制,自动化软件工程做法非常适用。在制造业和公用事业等行业中更多地使用自动ML软件工程做法,需要采用自动化质量保证(QA)方法,作为ML软件的一个组成部分。这里,QA通过对软件任务提供客观的视角,帮助减少风险。虽然常规软件工程具有数据驱动的MLQA数据分析自动化工具,但在操作中缺乏使用质量控制(MLOps)的做法。本文件审查了工业MLOPs中产生的质量评估挑战,并构想了处理数据完整性和数据质量的模块战略。本文还附有工业伙伴的实际工业使用案例。该文件还提出了若干挑战,可作为未来研究的基础。