Classifying pill categories from real-world images is crucial for various smart healthcare applications. Although existing approaches in image classification might achieve a good performance on fixed pill categories, they fail to handle novel instances of pill categories that are frequently presented to the learning algorithm. To this end, a trivial solution is to train the model with novel classes. However, this may result in a phenomenon known as catastrophic forgetting, in which the system forgets what it learned in previous classes. In this paper, we address this challenge by introducing the class incremental learning (CIL) ability to traditional pill image classification systems. Specifically, we propose a novel incremental multi-stream intermediate fusion framework enabling incorporation of an additional guidance information stream that best matches the domain of the problem into various state-of-the-art CIL methods. From this framework, we consider color-specific information of pill images as a guidance stream and devise an approach, namely "Color Guidance with Multi-stream intermediate fusion"(CG-IMIF) for solving CIL pill image classification task. We conduct comprehensive experiments on real-world incremental pill image classification dataset, namely VAIPE-PCIL, and find that the CG-IMIF consistently outperforms several state-of-the-art methods by a large margin in different task settings. Our code, data, and trained model are available at https://github.com/vinuni-vishc/CG-IMIF.
翻译:将药片从现实世界图像分类对于各种智能医疗应用至关重要。 虽然在成像分类中,现有的药片分类方法可能会在固定药片类别上取得良好表现, 但是它们无法处理经常向学习算法展示的药片类别的新案例。 为此, 一个微不足道的解决办法是用新类来培训药片的模型。 但是, 这可能会导致一种被称为灾难性的遗忘现象, 使系统忘记它在前几类中学学到的东西。 在本文中, 我们通过将类级增量学习(CIL)能力引入传统药片图像分类系统来应对这一挑战。 具体地说, 我们提出一个新型增量多流中间融合框架, 以便能够将更多最符合问题领域的指导信息流纳入到各种最先进的CIL方法中。 我们从这个框架中, 将药片图像的彩色信息视为指导流, 并设计一种方法, 即“ 多流中间输解的色彩指导” (CG-IMIF), 用于解决CIL药片图像的模型分类任务。 我们对真实世界增量图像分类数据集进行了全面实验, 即VAIPE-PCIL,, 并发现C-IFG- GLAxxxxxxxxxxx