Deep learning has shown great promise in diverse areas of computer vision, such as image classification, object detection and semantic segmentation, among many others. However, as it has been repeatedly demonstrated, deep learning methods trained on a dataset do not generalize well to datasets from other domains or even to similar datasets, due to data distribution shifts. In this work, we propose the use of a meta-learning based few-shot learning approach to alleviate these problems. In order to demonstrate its efficacy, we use two datasets of kidney stones samples acquired with different endoscopes and different acquisition conditions. The results show how such methods are indeed capable of handling domain-shifts by attaining an accuracy of 74.38% and 88.52% in the 5-way 5-shot and 5-way 20-shot settings respectively. Instead, in the same dataset, traditional Deep Learning (DL) methods attain only an accuracy of 45%.
翻译:深层学习在计算机视觉的不同领域,如图像分类、物体探测和语义分离等,都显示了巨大的希望。然而,正如人们一再证明的那样,由于数据分布的变化,在数据集方面受过训练的深层学习方法没有很好地概括到其他领域的数据集,甚至类似的数据集。在这项工作中,我们建议使用基于微小的元学习学习方法来缓解这些问题。为了展示其有效性,我们使用了用不同内镜和不同获取条件获得的两套肾结石样本数据集。结果显示,在5行5和5行20分的设置中,这些方法的精确度分别为74.38%和88.52%,从而确实能够处理域变。相反,在同一数据集中,传统的深层学习方法只达到45%的精确度。