Deep Neural Networks (DNN) are nowadays largely adopted in many application domains thanks to their human-like, or even superhuman, performance in specific tasks. However, due to unpredictable/unconsidered operating conditions, unexpected failures show up on field, making the performance of a DNN in operation very different from the one estimated prior to release. In the life cycle of DNN systems, the assessment of accuracy is typically addressed in two ways: offline, via sampling of operational inputs, or online, via pseudo-oracles. The former is considered more expensive due to the need for manual labeling of the sampled inputs. The latter is automatic but less accurate. We believe that emerging iterative industrial-strength life cycle models for Machine Learning systems, like MLOps, offer the possibility to leverage inputs observed in operation not only to provide faithful estimates of a DNN accuracy, but also to improve it through remodeling/retraining actions. We propose DAIC (DNN Assessment and Improvement Cycle), an approach which combines ''low-cost'' online pseudo-oracles and ''high-cost'' offline sampling techniques to estimate and improve the operational accuracy of a DNN in the iterations of its life cycle. Preliminary results show the benefits of combining the two approaches and integrating them in the DNN life cycle.
翻译:深神经网络(DNN)目前主要用于许多应用领域,因为其人性化,甚至超人性,在具体任务方面表现良好。然而,由于不可预测/未考虑的操作条件,外地出现出乎意料的故障,使DNN在运行中的性能与释放前估计的性能大不相同。在DNN系统的生命周期中,对准确性的评估通常以两种方式进行:通过对操作投入进行取样,或通过假眼镜进行在线评估。前者被认为更昂贵,因为需要对抽样投入进行人工标签。后者是自动的,但不太准确。我们认为,正在出现的机器学习系统的迭代工业强度生命周期模型,如MLOPs, 提供了利用在运行中观察到的投入的可能性,不仅能够提供对DNN的准确性准确性的忠实估计,而且还可以通过重新建模/再培训行动来改进。我们建议DAIC(DNNE(DN评估和改进周期)将“低成本”在线假眼镜和“高成本”的离线取样技术结合起来,从而将DNNNN的寿命周期的准确性结果整合到初步周期。</s>