With the rise of deep learning algorithms nowadays, scene image representation methods on big data (e.g., SUN-397) have achieved a significant performance boost in classification. However, the performance is still limited because the scene images are mostly complex in nature having higher intra-class dissimilarity and inter-class similarity problems. To deal with such problems, there are several methods proposed in the literature with their own advantages and limitations. A detailed study of previous works is necessary to understand their pros and cons in image representation and classification. In this paper, we review the existing scene image representation methods that are being used widely for image classification. For this, we, first, devise the taxonomy using the seminal existing methods proposed in the literature to this date. Next, we compare their performance both qualitatively (e.g., quality of outputs, pros/cons, etc.) and quantitatively (e.g., accuracy). Last, we speculate the prominent research directions in scene image representation tasks. Overall, this survey provides in-depth insights and applications of recent scene image representation methods for traditional Computer Vision (CV)-based methods, Deep Learning (DL)-based methods, and Search Engine (SE)-based methods.
翻译:随着当今深层次学习算法的崛起,海量数据(例如,SUN-397)的现场图像展示方法(例如,SUN-397)在分类方面实现了显著的绩效提升,但业绩仍然有限,因为场景图像在性质上极为复杂,具有较高的阶级内部差异和不同类别之间的相似问题。为了处理这些问题,文献中提出了几种方法,这些方法本身具有优势和局限性。最后,我们有必要详细研究以往的工作,以了解其在图像展示和分类方面的利弊。在本文件中,我们审查了目前广泛用于图像分类的现有场景图像展示方法。为此,我们首先利用文献中迄今建议的原始现有方法来设计分类学。接下来,我们比较其性能(例如,产出的质量、赞成/同意等)和定量(例如,准确性)。最后,我们推测了现场图像展示任务中突出的研究方向。总体而言,本项调查为基于计算机视觉(CV)方法、深层学习(DL)方法、基于搜索的方法和搜索(DL)方法(DSEAR-SE)的近期现场图像展示展示展示方法提供了深入的洞见和应用。