Deep learning-based approaches for content-based image retrieval (CBIR) of CT liver images is an active field of research, but suffers from some critical limitations. First, they are heavily reliant on labeled data, which can be challenging and costly to acquire. Second, they lack transparency and explainability, which limits the trustworthiness of deep CBIR systems. We address these limitations by (1) proposing a self-supervised learning framework that incorporates domain-knowledge into the training procedure and (2) providing the first representation learning explainability analysis in the context of CBIR of CT liver images. Results demonstrate improved performance compared to the standard self-supervised approach across several metrics, as well as improved generalisation across datasets. Further, we conduct the first representation learning explainability analysis in the context of CBIR, which reveals new insights into the feature extraction process. Lastly, we perform a case study with cross-examination CBIR that demonstrates the usability of our proposed framework. We believe that our proposed framework could play a vital role in creating trustworthy deep CBIR systems that can successfully take advantage of unlabeled data.
翻译:对基于内容的CT肝脏图像图像检索(CBIR)的深入学习方法是一个积极的研究领域,但有一些关键的局限性。首先,它们严重依赖标签数据,而这种数据可能具有挑战性和成本高昂。其次,它们缺乏透明度和解释性,从而限制了CBIR深层系统的信誉。我们处理这些局限性的方式是:(1) 提出一个自我监督的学习框架,将域知识纳入培训程序,(2) 在CBIR的CT肝脏图像背景下提供首次代表性学习可解释性分析。结果表明,与标准的自我监督方法相比,若干指标的性能有所改善,而且各数据集的通用性也有所改善。此外,我们在CBIR背景下进行了第一次代表性学习解释性分析,揭示了对特征提取过程的新认识。最后,我们进行了案例研究,对CBIR进行了交叉审查,以显示我们拟议框架的可用性。我们认为,我们提议的框架可以在创建可靠的深度CBIR系统方面发挥至关重要的作用,这些系统能够成功地利用未标的数据。