Deep learning-based image matching methods are improved significantly during the recent years. Although these methods are reported to outperform the classical techniques, the performance of the classical methods is not examined in detail. In this study, we compare classical and learning-based methods by employing mutual nearest neighbor search with ratio test and optimizing the ratio test threshold to achieve the best performance on two different performance metrics. After a fair comparison, the experimental results on HPatches dataset reveal that the performance gap between classical and learning-based methods is not that significant. Throughout the experiments, we demonstrated that SuperGlue is the state-of-the-art technique for the image matching problem on HPatches dataset. However, if a single parameter, namely ratio test threshold, is carefully optimized, a well-known traditional method SIFT performs quite close to SuperGlue and even outperforms in terms of mean matching accuracy (MMA) under 1 and 2 pixel thresholds. Moreover, a recent approach, DFM, which only uses pre-trained VGG features as descriptors and ratio test, is shown to outperform most of the well-trained learning-based methods. Therefore, we conclude that the parameters of any classical method should be analyzed carefully before comparing against a learning-based technique.
翻译:近些年来,基于深层次学习的图像匹配方法得到显著改进。虽然据报告这些方法的性能超过了古典技术,但古典方法的性能没有经过详细研究。在本研究中,我们通过使用相近邻居的相互搜索和比率测试比较古典和基于学习的方法,并优化比率测试阈值,以便在两种不同的性能尺度上达到最佳性能。经过公平的比较,HPatche数据集的实验结果显示,古典和基于学习的方法之间的性能差距并不大。在整个实验中,我们证明超级Glue是HPatche数据集中图像匹配问题的最新先进技术。然而,如果对单一参数,即比率测试阈值进行仔细优化,那么众所周知的传统SIMFT方法与超级Glue非常接近,甚至超出平均匹配精度(MMA)在1和2像素阈值以下。此外,最近的DFM(DFM)方法仅使用经过事先训练的VGG特性作为解调和比率测试。我们发现,在进行任何经过认真分析的学习方法之前,那么,我们在任何经过认真分析的方法中会作出结论。