Wireless Capsule Endoscopy (WCE) helps physicians examine the gastrointestinal (GI) tract noninvasively. There are few studies that address pathological assessment of endoscopy images in multiclass classification and most of them are based on binary anomaly detection or aim to detect a specific type of anomaly. Multiclass anomaly detection is challenging, especially when the dataset is poorly sampled or imbalanced. Many available datasets in endoscopy field, such as KID2, suffer from an imbalance issue, which makes it difficult to train a high-performance model. Additionally, increasing the number of classes makes classification more difficult. We proposed a multiclass classification algorithm that is extensible to any number of classes and can handle an imbalance issue. The proposed method uses multiple autoencoders where each one is trained on one class to extract features with the most discrimination from other classes. The loss function of autoencoders is set based on reconstruction, compactness, distance from other classes, and Kullback-Leibler (KL) divergence. The extracted features are clustered and then classified using an ensemble of support vector data descriptors. A total of 1,778 normal, 227 inflammation, 303 vascular, and 44 polyp images from the KID2 dataset are used for evaluation. The entire algorithm ran 5 times and achieved F1-score of 96.3 +- 0.2% and 85.0 +- 0.4% on the test set for binary and multiclass anomaly detection, respectively. The impact of each step of the algorithm was investigated by various ablation studies and the results were compared with published works. The suggested approach is a competitive option for detecting multiclass anomalies in the GI field.
翻译:多级异常检测具有挑战性,特别是当数据集取样不力或不平衡时。在内镜检查领域,许多可用的数据集,如KID2, 存在不平衡问题,使得难以训练高性能模型。此外, 增加班级数量使得分类更加困难。 我们提议了一个多级分类分析对内镜分析图像进行病理评估, 并可以处理一个不平衡问题。 提议的方法使用多级解析器, 每个人在其中接受一个班级培训, 以提取来自其他班级的最大差异特征。 自动解析器的损失功能是根据重建、 压缩、 与其他班级的距离, 以及 KRillback- Leiber (KL) 检测方法的差异设置的。 提取的功能是集成的,然后用一个支持数级分析器对任何一个班级进行分类, 并可以处理一个不平衡问题。 提议的方法使用多个自动解析器, 每一个班级, 以提取与其他班级相比, 自动解算器的特性功能。 用于SBAR5 正常的 IMISL1 和多级数据, 正常的SIMLSLADA 。 使用SDLA 。