This contribution presents a deep learning method for the extraction and fusion of information relating to kidney stone fragments acquired from different viewpoints of the endoscope. Surface and section fragment images are jointly used during the training of the classifier to improve the discrimination power of the features by adding attention layers at the end of each convolutional block. This approach is specifically designed to mimic the morpho-constitutional analysis performed in ex-vivo by biologists to visually identify kidney stones by inspecting both views. The addition of attention mechanisms to the backbone improved the results of single view extraction backbones by 4% on average. Moreover, in comparison to the state-of-the-art, the fusion of the deep features improved the overall results up to 11% in terms of kidney stone classification accuracy.
翻译:这一贡献为提取和整合从内镜不同角度获得的肾结石碎片信息提供了深层次的学习方法;在对分类员进行培训期间,在每一卷块末端增加关注层,共同使用地表和部分碎块图像,以提高特征的区别力量;这一方法特别旨在模仿生物学家通过检查两种观点而以先行方式进行的立宪分析,以视像方式识别肾结石;对脊柱增加关注机制,使单视结晶骨脊椎平均提高了4%;此外,与最新技术相比,深层特征的融合使肾结石分类的总体结果提高到11%。