Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas -- fine-grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community.
翻译:精细图像分析(FGIA)是一个长期和根本性的计算机视觉和模式识别问题,是一系列多样化的现实应用的基础。FGIA的目标是分析亚次类的视觉物体,例如鸟类物种或汽车模型。微细图像分析所固有的小类间和大类内差异使其成为一个具有挑战性的问题。利用深层学习的进展,近年来我们看到在深层次学习的动力FGIA方面取得了显著进展。我们在本文件中介绍了对这些进展的系统调查,我们试图通过巩固两个基本精细研究领域 -- -- 细细微图像识别和精细雕刻图像检索 -- -- 来重新定义和扩大FGIA的领域。此外,我们还审查了FGIA的其他关键问题,例如公开提供的基准数据集和相关的具体领域应用。我们最后强调了一些需要社区进一步探讨的研究方向和公开问题。