Memorability measures how easily an image is to be memorized after glancing, which may contribute to designing magazine covers, tourism publicity materials, and so forth. Recent works have shed light on the visual features that make generic images, object images or face photographs memorable. However, these methods are not able to effectively predict the memorability of outdoor natural scene images. To overcome this shortcoming of previous works, in this paper, we provide an attempt to answer: "what exactly makes outdoor natural scenes memorable". To this end, we first establish a large-scale outdoor natural scene image memorability (LNSIM) database, containing 2,632 outdoor natural scene images with their ground truth memorability scores and the multi-label scene category annotations. Then, similar to previous works, we mine our database to investigate how low-, middle- and high-level handcrafted features affect the memorability of outdoor natural scenes. In particular, we find that the high-level feature of scene category is rather correlated with outdoor natural scene memorability, and the deep features learnt by deep neural network (DNN) are also effective in predicting the memorability scores. Moreover, combining the deep features with the category feature can further boost the performance of memorability prediction. Therefore, we propose an end-to-end DNN based outdoor natural scene memorability (DeepNSM) predictor, which takes advantage of the learned category-related features. Then, the experimental results validate the effectiveness of our DeepNSM model, exceeding the state-of-the-art methods. Finally, we try to understand the reason of the good performance for our DeepNSM model, and also study the cases that our DeepNSM model succeeds or fails to accurately predict the memorability of outdoor natural scenes. Code: github.com/JiaxinLu-home/Natural-Scene-Memorability-Dataset.
翻译:光滑后, 图像可以很容易地被记忆化, 这可能有助于设计杂志封面、 旅游宣传材料等等。 最近的作品揭示了视觉特征, 使得通用图像、 对象图像或脸部照片可以令人难忘。 然而, 这些方法无法有效地预测户外自然场景图像的可记忆性。 为了克服以前作品的这一缺陷, 在本文件中, 我们试图解答 : “ 是什么让户外自然场景可以令人难忘的。 为此, 我们首先建立了一个大型户外自然场景图像可感应( LNSIM) 数据库, 包含 2, 632 户外自然场面图像, 以及这些图像的地面真实性能得分和多标签图像类别。 然而, 和以往的作品一样, 我们的数据库无法有效地预测户外自然场景图像的可感应识性。 我们的高级性能与室内自然性能的模型性能( RNSM) 与室内可感应性能( DNO型) 的可感应性能, 也能够预估测到我们的直角性直径直径的直径的直径直径直径直径直径预测性功能。