Medical image classification is one of the most critical problems in the image recognition area. One of the major challenges in this field is the scarcity of labelled training data. Additionally, there is often class imbalance in datasets as some cases are very rare to happen. As a result, accuracy in classification task is normally low. Deep Learning models, in particular, show promising results on image segmentation and classification problems, but they require very large datasets for training. Therefore, there is a need to generate more of synthetic samples from the same distribution. Previous work has shown that feature generation is more efficient and leads to better performance than corresponding image generation. We apply this idea in the Medical Imaging domain. We use transfer learning to train a segmentation model for the small dataset for which gold-standard class annotations are available. We extracted the learnt features and use them to generate synthetic features conditioned on class labels, using Auxiliary Classifier GAN (ACGAN). We test the quality of the generated features in a downstream classification task for brain tumors according to their severity level. Experimental results show a promising result regarding the validity of these generated features and their overall contribution to balancing the data and improving the classification class-wise accuracy.
翻译:医学图像分类是图像识别领域最关键的问题之一。该领域的主要挑战之一是缺少贴标签的培训数据。此外,由于某些情况很少发生,数据集中往往存在阶级不平衡,因为有些情况很少发生。因此,分类任务的准确性通常较低。特别是深层学习模型显示图像分解和分类问题方面有希望的结果,但是它们需要非常大的数据集来进行培训。因此,需要从同样的分布中产生更多的合成样品。以前的工作表明,特性生成比相应的图像生成更有效率,并导致更好的性能。我们在医疗成像领域应用了这一想法。我们利用转移学习来训练一个小数据集的分解模型,这些小数据集有金标准类说明。我们提取了学到的特性,并用它们来生成以类标签为合成条件的合成特征,使用辅助级分类器GAN(ACGAN)来进行培训。我们测试脑肿瘤下游分类任务中生成的特征的质量,以其严重程度为标准。实验结果显示,这些特性的有效性及其对于平衡数据和改进分类的总体贡献是很有希望的。