The automation of condition monitoring and workpiece inspection plays an essential role in maintaining high quality as well as high throughput of the manufacturing process. To this end, the recent rise of developments in machine learning has lead to vast improvements in the area of autonomous process supervision. However, the more complex and powerful these models become, the less transparent and explainable they generally are as well. One of the main challenges is the monitoring of live deployments of these machine learning systems and raising alerts when encountering events that might impact model performance. In particular, supervised classifiers are typically build under the assumption of stationarity in the underlying data distribution. For example, a visual inspection system trained on a set of material surface defects generally does not adapt or even recognize gradual changes in the data distribution - an issue known as "data drift" - such as the emergence of new types of surface defects. This, in turn, may lead to detrimental mispredictions, e.g. samples from new defect classes being classified as non-defective. To this end, it is desirable to provide real-time tracking of a classifier's performance to inform about the putative onset of additional error classes and the necessity for manual intervention with respect to classifier re-training. Here, we propose an unsupervised framework that acts on top of a supervised classification system, thereby harnessing its internal deep feature representations as a proxy to track changes in the data distribution during deployment and, hence, to anticipate classifier performance degradation.
翻译:条件监测自动化和工作场所检查在保持高质量以及制造过程的高输送量方面发挥着基本作用。为此目的,最近机器学习的发展导致自动程序监督领域出现巨大改进。然而,这些模型越来越复杂和强大,其透明度和解释通常也较少。主要挑战之一是监测这些机器学习系统的实时部署,并在遇到可能影响模型性能的事件时发出警报。特别是,受监督的分类人员通常在假定基本数据分配的固定性下进行,例如,在一组物质表面缺陷方面受过训练的视觉检查系统一般不适应甚至承认数据分配的逐步变化——一个被称为“数据流”的问题,通常也不太透明,更清楚。这反过来又可能导致有害的错误,例如,新缺陷类别样本被归类为不完美性能。为此,最好提供对分类人员业绩的实时跟踪,以便告知在内部部署中出现更多误差的情况,甚至没有认识到数据流的逐步变化,因此,有必要对内部管理结构结构进行重新分类,从而对内部管理进行系统进行分类,从而对系统进行不精确的分类。