Active learning is widely used to reduce labeling effort and training time by repeatedly querying only the most beneficial samples from unlabeled data. In real-world problems where data cannot be stored indefinitely due to limited storage or privacy issues, the query selection and the model update should be performed as soon as a new data sample is observed. Various online active learning methods have been studied to deal with these challenges; however, there are difficulties in selecting representative query samples and updating the model efficiently without forgetting. In this study, we propose Message Passing Adaptive Resonance Theory (MPART) that learns the distribution and topology of input data online. Through message passing on the topological graph, MPART actively queries informative and representative samples, and continuously improves the classification performance using both labeled and unlabeled data. We evaluate our model in stream-based selective sampling scenarios with comparable query selection strategies, showing that MPART significantly outperforms competitive models.
翻译:积极学习被广泛用于减少标签工作和培训时间,反复查询来自未贴标签数据的最有益样本。在由于储存有限或隐私问题无法无限期储存数据的真实世界问题中,一旦观察到新的数据样本,即应进行查询选择和模式更新;已经研究各种在线积极学习方法来应对这些挑战;然而,在选择有代表性的查询样本和有效更新模型方面遇到困难,而不会忘记。在本研究中,我们提议通过信息传递适应共振理论(MPART)来学习在线输入数据的分布和结构学。通过在表层图上传递信息,MPART积极查询信息和代表性样本,并不断利用有标签和无标签的数据提高分类性能。我们用可比较的查询选择战略来评价我们基于流的选择性抽样模型,显示MPART大大优于竞争性模型。