Training deep learning models in technical domains is often accompanied by the challenge that although the task is clear, insufficient data for training is available. In this work, we propose a novel approach based on the combination of Siamese networks and radial basis function networks to perform data-efficient classification without pretraining by measuring the distance between images in semantic space in a data-efficient manner. We develop the models using three technical datasets, the NEU dataset, the BSD dataset, and the TEX dataset. In addition to the technical domain, we show the general applicability to classical datasets (cifar10 and MNIST) as well. The approach is tested against state-of-the-art models (Resnet50 and Resnet101) by stepwise reduction of the number of samples available for training. The authors show that the proposed approach outperforms the state-of-the-art models in the low data regime.
翻译:技术领域深层培训模式往往伴随着挑战,即尽管任务明确,但培训数据不足;在这项工作中,我们提议采用基于暹罗网络和辐射基功能网络相结合的新办法,通过以数据高效的方式测量语空图像之间的距离,在不事先培训的情况下进行数据高效分类;我们利用三个技术数据集,即国家电子系统数据集、BSD数据集和TEX数据集,开发模型;除了技术领域外,我们还展示了对古典数据集(cifar10和MNIST)的一般适用性;通过逐步减少可供培训的样本数量,对照最先进的模型(Resnet50和Resnet101)测试该办法;作者们显示,拟议办法超出了低数据制度中最新模型的功能。