The phenomena of concept drift refers to a change of the data distribution affecting the data stream of future samples -- such non-stationary environments are often encountered in the real world. Consequently, learning models operating on the data stream might become obsolete, and need costly and difficult adjustments such as retraining or adaptation. Existing methods to address concept drift are, typically, categorised as active or passive. The former continually adapt a model using incremental learning, while the latter perform a complete model retraining when a drift detection mechanism triggers an alarm. We depart from the traditional avenues and propose for the first time an alternative approach which "unlearns" the effects of the concept drift. Specifically, we propose an autoencoder-based method for "unlearning" the concept drift in an unsupervised manner, without having to retrain or adapt any of the learning models operating on the data.
翻译:概念漂移现象是指影响未来样品数据流的数据分配的变化 -- -- 这种非静止环境经常在现实世界中遇到,因此,在数据流上运行的学习模式可能过时,需要费用高昂和困难的调整,如再培训或适应。处理概念漂移的现有方法通常被归类为主动或被动。前者不断使用渐进学习来调整模型,而后者则在漂移探测机制触发警报时进行完全的示范再培训。我们偏离了传统途径,首次提出了“隐蔽”概念漂移的影响的替代方法。具体地说,我们提议一种基于自动编码器的方法,用不受监督的方式“不学习”概念的漂移,而无需再培训或调整在数据上运行的任何学习模式。