Knowing the cause of kidney stone formation is crucial to establish treatments that prevent recurrence. There are currently different approaches for determining the kidney stone type. However, the reference ex-vivo identification procedure can take up to several weeks, while an in-vivo visual recognition requires highly trained specialists. Machine learning models have been developed to provide urologists with an automated classification of kidney stones during an ureteroscopy; however, there is a general lack in terms of quality of the training data and methods. In this work, a two-step transfer learning approach is used to train the kidney stone classifier. The proposed approach transfers knowledge learned on a set of images of kidney stones acquired with a CCD camera (ex-vivo dataset) to a final model that classifies images from endoscopic images (ex-vivo dataset). The results show that learning features from different domains with similar information helps to improve the performance of a model that performs classification in real conditions (for instance, uncontrolled lighting conditions and blur). Finally, in comparison to models that are trained from scratch or by initializing ImageNet weights, the obtained results suggest that the two-step approach extracts features improving the identification of kidney stones in endoscopic images.
翻译:了解肾结石形成的原因对于确定防止肾结石的治疗至关重要。目前,在确定肾结石类型方面有不同的方法。但是,参考前验证程序可能需要几周时间,而现场的视觉识别需要训练有素的专家。已经开发了机器学习模型,以便为泌尿检查期间的动物对肾结石进行自动分类;然而,在培训数据和方法的质量方面普遍缺乏。在这项工作中,使用了两步转移学习方法来培训肾结石分类员。拟议方法将利用CCD相机(事后验证数据集)获得的一套肾结石图像的知识转移到最后模型中,将来自肾结石图像分类(前验证数据集),结果显示,从不同领域获得的类似信息有助于改进在真实条件下进行分类(例如,无控制的照明条件和模糊)的模型的性能。最后,与从抓取或通过初始图像网重量培训的模型相比,获得的结果表明,两步方法的特征是改进了在尾端识别肾结石的图像的特征。