This article studies how to detect and explain concept drift. Human activity recognition is used as a case study together with a online batch learning situation where the quality of the labels used in the model updating process starts to decrease. Drift detection is based on identifying a set of features having the largest relevance difference between the drifting model and a model that is known to be accurate and monitoring how the relevance of these features changes over time. As a main result of this article, it is shown that feature relevance analysis cannot only be used to detect the concept drift but also to explain the reason for the drift when a limited number of typical reasons for the concept drift are predefined. To explain the reason for the concept drift, it is studied how these predefined reasons effect to feature relevance. In fact, it is shown that each of these has an unique effect to features relevance and these can be used to explain the reason for concept drift.
翻译:本条研究如何探测和解释概念的漂移。人类活动识别与在线批次学习情况一起作为案例研究使用,模型更新过程中使用的标签的质量开始下降。漂移探测基于确定一套特征,这些特征在漂移模型和已知准确的模型之间具有最大的关联性差异,并监测这些特征随时间变化的关联性。这一条的主要结果显示,特征关联性分析不能仅用于探测概念的漂移,而且在概念漂移的典型原因有限的情况下解释漂移的原因。为解释概念漂移的原因,研究这些预设的原因如何产生关联性。事实上,这些都表明这些特征具有独特的关联性,可以用来解释概念漂移的原因。