Companies struggle to continuously develop and deploy AI models to complex production systems due to AI characteristics while assuring quality. To ease the development process, continuous pipelines for AI have become an active research area where consolidated and in-depth analysis regarding the terminology, triggers, tasks, and challenges is required. This paper includes a Multivocal Literature Review where we consolidated 151 relevant formal and informal sources. In addition, nine-semi structured interviews with participants from academia and industry verified and extended the obtained information. Based on these sources, this paper provides and compares terminologies for DevOps and CI/CD for AI, MLOps, (end-to-end) lifecycle management, and CD4ML. Furthermore, the paper provides an aggregated list of potential triggers for reiterating the pipeline, such as alert systems or schedules. In addition, this work uses a taxonomy creation strategy to present a consolidated pipeline comprising tasks regarding the continuous development of AI. This pipeline consists of four stages: Data Handling, Model Learning, Software Development and System Operations. Moreover, we map challenges regarding pipeline implementation, adaption, and usage for the continuous development of AI to these four stages.
翻译:为便利开发过程,需要就术语、触发因素、任务和挑战进行综合和深入分析。本文件包括多语言文学评论,我们综合了151个相关的正式和非正式来源。此外,与学术界和工业界参与者进行的九点半结构性访谈核实并扩大了所获得的信息。根据这些来源,本文件提供并比较了AI、MLOPs、(端至端)生命周期管理和CD4ML的DevOps和CI/CD术语。此外,本文件还列出了重申该管道的可能触发因素的综合清单,例如警报系统或时间表。此外,这项工作还采用分类创建战略,提出由持续开发AI的任务构成的综合管道。该管道由四个阶段组成:数据处理、模型学习、软件开发和系统操作。此外,我们绘制了在管道执行、调整和使用方面对AI持续开发的管道至这四个阶段的挑战。