The notion of concept drift refers to the phenomenon that the distribution generating the observed data changes over time. If drift is present, machine learning models can become inaccurate and need adjustment. While there do exist methods to detect concept drift or to adjust models in the presence of observed drift, the question of explaining drift, i.e., describing the potentially complex and high dimensional change of distribution in a human-understandable fashion, has hardly been considered so far. This problem is of importance since it enables an inspection of the most prominent characteristics of how and where drift manifests itself. Hence, it enables human understanding of the change and it increases acceptance of life-long learning models. In this paper, we present a novel technology characterizing concept drift in terms of the characteristic change of spatial features based on various explanation techniques. To do so, we propose a methodology to reduce the explanation of concept drift to an explanation of models that are trained in a suitable way extracting relevant information regarding the drift. This way a large variety of explanation schemes is available. Thus, a suitable method can be selected for the problem of drift explanation at hand. We outline the potential of this approach and demonstrate its usefulness in several examples.
翻译:概念漂移的概念是指随着时间推移而产生观测到的数据变化的分布现象,如果流流存在,机器学习模型就会变得不准确,需要调整。虽然确实存在在观测到漂移时发现概念漂移或调整模型的方法,但解释漂移的问题,即以人类可以理解的方式描述潜在复杂和高度的分布变化的问题,迄今没有被考虑过,这个问题很重要,因为它能够检查漂移本身如何和在何处表现的最突出的特点。因此,它使人类能够了解变化,并增加对终身学习模型的接受程度。在本文件中,我们提出了一种新的技术,根据各种解释技术将概念漂移的概念定性为空间特征变化的特点。为了这样做,我们提出一种方法,将概念漂移的解释减少到以适当方式获得有关漂移信息的模型的解释。这样就可以有多种解释方法。因此,可以选择一种合适的方法来说明流流学问题。我们从几个例子中概述了这一方法的潜力,并展示其效用。</s>