In this paper, we propose revisited versions for two recent hotel recognition datasets: Hotels50K and Hotel-ID. The revisited versions provide evaluation setups with different levels of difficulty to better align with the intended real-world application, i.e. countering human trafficking. Real-world scenarios involve hotels and locations that are not captured in the current data sets, therefore it is important to consider evaluation settings where classes are truly unseen. We test this setup using multiple state-of-the-art image retrieval models and show that as expected, the models' performances decrease as the evaluation gets closer to the real-world unseen settings. The rankings of the best performing models also change across the different evaluation settings, which further motivates using the proposed revisited datasets.
翻译:在本文中,我们为最近的两个酒店识别数据集提出了经过重新审视的版本:Hotels50K和Hotel-ID。经过重新审视的版本提供了不同程度的评估组合,以更好地与预期的现实世界应用程序(即打击人口贩运)保持一致。现实世界情景涉及酒店和未在目前数据集中捕捉到的地点,因此,必须考虑教室真正不可见的评价设置。我们使用多种最先进的图像检索模型测试这一设置,并显示随着评价接近现实世界的隐形环境,模型的性能会下降。最佳运行模式的排名也会在不同的评价环境中发生变化,这进一步鼓励使用拟议的重新审视数据集。