Requirements engineering (RE) activities for machine learning (ML) are not well-established and researched in the literature. Many issues and challenges exist when specifying, designing, and developing ML-enabled systems. Adding more focus on RE for ML can help to develop more reliable ML-enabled systems. Based on insights collected from previous work and industrial experiences, we propose a catalogue of 45 concerns to be considered when specifying ML-enabled systems, covering five different perspectives we identified as relevant for such systems: objectives, user experience, infrastructure, model, and data. Examples of such concerns include the execution engine and telemetry for the infrastructure perspective, and explainability and reproducibility for the model perspective. We conducted a focus group session with eight software professionals with experience developing ML-enabled systems to validate the importance, quality and feasibility of using our catalogue. The feedback allowed us to improve the catalogue and confirmed its practical relevance. The main research contribution of this work consists in providing a validated set of concerns grouped into perspectives that can be used by requirements engineers to support the specification of ML-enabled systems.
翻译:在文献中,机器学习(ML)要求(RE)活动没有很好地确立和研究。在具体指定、设计和开发ML辅助系统时,存在许多问题和挑战。为ML增加对RE的更多关注有助于开发更可靠的ML辅助系统。根据从以往工作和工业经验中收集的见解,我们提议在指定ML辅助系统时考虑45种关切的目录,涵盖我们确定与这些系统相关的五种不同观点:目标、用户经验、基础设施、模型和数据。这些关切的例子包括基础设施视角的执行引擎和遥测,以及模型视角的可解释性和可复制性。我们与8名具有开发ML辅助系统经验的软件专业人员举行了一次重点小组会议,以验证使用我们目录的重要性、质量和可行性。反馈使我们能够改进目录,并确认其实际相关性。这项工作的主要研究贡献是提供一套经过验证的关切,这些关切可归为需要工程师支持ML辅助系统规格的视角。