This paper tackles the challenge of forensic medical image matching (FMIM) using deep neural networks (DNNs). FMIM is a particular case of content-based image retrieval (CBIR). The main challenge in FMIM compared to the general case of CBIR, is that the subject to whom a query image belongs may be affected by aging and progressive degenerative disorders, making it difficult to match data on a subject level. CBIR with DNNs is generally solved by minimizing a ranking loss, such as Triplet loss (TL), computed on image representations extracted by a DNN from the original data. TL, in particular, operates on triplets: anchor, positive (similar to anchor) and negative (dissimilar to anchor). Although TL has been shown to perform well in many CBIR tasks, it still has limitations, which we identify and analyze in this work. In this paper, we introduce (i) the AdaTriplet loss -- an extension of TL whose gradients adapt to different difficulty levels of negative samples, and (ii) the AutoMargin method -- a technique to adjust hyperparameters of margin-based losses such as TL and our proposed loss dynamically. Our results are evaluated on two large-scale benchmarks for FMIM based on the Osteoarthritis Initiative and Chest X-ray-14 datasets. The codes allowing replication of this study have been made publicly available at \url{https://github.com/Oulu-IMEDS/AdaTriplet}.
翻译:本文用深层神经网络解决法医图像匹配(FMIM)的挑战。 FMIM是一个基于内容的图像检索(CBIR)的特例。与CBIR的一般案例相比,FMIM的主要挑战在于查询图像所属的主体可能受到老化和逐渐退化性失调的影响,从而难以在某一主题级别上匹配数据。 CBIR与DNNS一般通过尽量减少排序损失来解决,如Triplet损失(TL),根据DNN从原始数据中提取的图像表达方式计算。特别是,TL在三重数据上运行:锚定、正(类似于锚定)和负(类似于锚定)。尽管TL显示查询图像的主体可能受到CBIR许多任务的良好表现,但它仍然有局限性,我们在本文中查明和分析了这一点。 我们介绍了(i)AdaTriplet损失 -- -- 其梯度可适应于不同难度的样本水平的Tripllet(TL),以及(ii)AutMargin方法 -- -- 一种用于调整基于我们基于Sloral-S标准的大规模成本标准(例如对我们的Slimal-al-alalal-al)数据库数据库数据库数据库的大规模损失进行估测测测。