In this paper, we study the performance of few-shot learning, specifically meta learning empowered few-shot relation networks, over classic supervised learning in the problem of sound source distance estimation(SSDE). In previous research on deep supervised SSDE, obtaining low accuracies due to the mismatch between the training data(sound from known environments) and the test data(sound from unknown environments) has almost always been the case. By performing comparative experiments on a sufficient amount of data, we show that the few-shot relation network outperform a classic CNN which is a supervised deep learning approach, and hence it is possible to calibrate a microphone-equipped system, with a few labeled examples of audio recorded in a particular unknown environment to adjust and generalize our classifier to the possible input data and gain higher accuracies.
翻译:在本文中,我们研究了几眼学习的绩效,具体来说,元学习赋予了几眼关系网,超过了在可靠源距离估计(SSDE)问题上的经典监督学习。 在以前对深入监管的SSDE的研究中,由于培训数据(来自已知环境的声学)与测试数据(来自未知环境的声学)的不匹配而获得低理解度的情况几乎总是如此。 通过对足够数量的数据进行比较实验,我们发现,微眼关系网优于经典CNN,这是一种受监管的深入学习方法,因此有可能校准一个麦克风设备系统,并配有少数在特定未知环境中录音的标签例子,以调整和普及我们的分类器,使其与可能的输入数据相适应,并获得更高理解度。