Class imbalance is a pervasive issue in the field of disease classification from medical images. It is necessary to balance out the class distribution while training a model for decent results. However, in the case of rare medical diseases, images from affected patients are much harder to come by compared to images from non-affected patients, resulting in unwanted class imbalance. Various processes of tackling class imbalance issues have been explored so far, each having its fair share of drawbacks. In this research, we propose an outlier detection based binary medical image classification technique which can handle even the most extreme case of class imbalance. We have utilized a dataset of malaria parasitized and uninfected cells. An autoencoder model titled AnoMalNet is trained with only the uninfected cell images at the beginning and then used to classify both the affected and non-affected cell images by thresholding a loss value. We have achieved an accuracy, precision, recall, and F1 score of 98.49%, 97.07%, 100%, and 98.52% respectively, performing better than large deep learning models and other published works. As our proposed approach can provide competitive results without needing the disease-positive samples during training, it should prove to be useful in binary disease classification on imbalanced datasets.
翻译:在医疗图像的疾病分类方面,阶级不平衡是一个普遍的问题。有必要平衡班级分布,同时培训一个体面结果的模型。然而,在罕见的医疗疾病方面,与非受影响病人的图像相比,受影响病人的图像比非受影响病人的图像要难得多,导致不必要的阶级不平衡。到目前为止,已经探索了解决阶级不平衡问题的各种过程,每个过程都有其公平的缺点。在这个研究中,我们建议了一种基于外部检测的二进制医学图像分类技术,它可以处理甚至最极端的阶级不平衡案例。我们使用了一套疟疾寄生虫和未感染细胞的数据集。一个名为“AnoMalNet”的自动编码模型,最初只用未感染的细胞图像进行培训,然后通过设定损失值来对受影响的和未受影响的细胞图像进行分类。我们已经取得了准确性、精确性、回顾和F1分的98.49%、97.07%、100%和98.52%的分数,比大型深度学习模型和其他已公布的作品都好。我们提出的方法可以在不需要疾病阳性样本的情况下提供竞争性的结果。在培训中,应该证明它能够提供有用的数据分类。</s>