Many engineering organizations are reimplementing and extending deep neural networks from the research community. We describe this process as deep learning model reengineering. Deep learning model reengineering - reusing, reproducing, adapting, and enhancing state-of-the-art deep learning approaches - is challenging for reasons including under-documented reference models, changing requirements, and the cost of implementation and testing. In addition, individual engineers may lack expertise in software engineering, yet teams must apply knowledge of software engineering and deep learning to succeed. Prior work has examined on DL systems from a "product" view, examining defects from projects regardless of the engineers' purpose. Our study is focused on reengineering activities from a "process" view, and focuses on engineers specifically engaged in the reengineering process. Our goal is to understand the characteristics and challenges of deep learning model reengineering. We conducted a case study of this phenomenon, focusing on the context of computer vision. Our results draw from two data sources: defects reported in open-source reeengineering projects, and interviews conducted with open-source project contributors and the leaders of a reengineering team. Our results describe how deep learning-based computer vision techniques are reengineered, analyze the distribution of defects in this process, and discuss challenges and practices. Integrating our quantitative and qualitative data, we proposed a novel reengineering workflow. Our findings inform several future directions, including: measuring additional unknown aspects of model reengineering; standardizing engineering practices to facilitate reengineering; and developing tools to support model reengineering and model reuse.
翻译:许多工程组织正在重新实施和扩展研究界的深层神经网络。我们把这一过程描述为深层次学习模型再造。深深层次学习模型再造――重新使用、复制、改造和加强最先进的深层学习方法――由于记录不足的参考模型、不断变化的要求以及实施和测试费用等原因具有挑战性。此外,个别工程师可能缺乏软件工程方面的专门知识,但各小组必须应用软件工程知识和深层学习才能取得成功。以前的工作是从“产品”角度对DL系统进行审查,检查项目缺陷,而不论工程师的目的如何。我们的研究侧重于从“处理”角度重新设计活动,并侧重于专门从事再设计进程的工程师。我们的目标是了解深层学习模型再造的特征和挑战。我们研究了这一现象的案例研究,重点是计算机的视野,并侧重于计算机的视野,我们从两个数据来源得出了我们的成果:公开源模型再设计项目的缺陷,与开源项目提供者和再设计小组领导人进行的访谈。我们的成果描述了如何深层次学习的计算机再造活动,并分析了我们今后在再造过程中的定性研究方向,分析了我们提出的数据结构的分布。</s>