Content-based fashion image retrieval (CBFIR) has been widely used in our daily life for searching fashion images or items from online platforms. In e-commerce purchasing, the CBFIR system can retrieve fashion items or products with the same or comparable features when a consumer uploads a reference image, image with text, sketch or visual stream from their daily life. This lowers the CBFIR system reliance on text and allows for a more accurate and direct searching of the desired fashion product. Considering recent developments, CBFIR still has limits when it comes to visual searching in the real world due to the simultaneous availability of multiple fashion items, occlusion of fashion products, and shape deformation. This paper focuses on CBFIR methods with the guidance of images, images with text, sketches, and videos. Accordingly, we categorized CBFIR methods into four main categories, i.e., image-guided CBFIR (with the addition of attributes and styles), image and text-guided, sketch-guided, and video-guided CBFIR methods. The baseline methodologies have been thoroughly analyzed, and the most recent developments in CBFIR over the past six years (2017 to 2022) have been thoroughly examined. Finally, key issues are highlighted for CBFIR with promising directions for future research.
翻译:时装图像内容检索(CBFIR)广泛用于我们日常生活中的在线平台,以便搜索时装图像或物品。在电子商务购买中,当消费者上传参考图像、带有文本的图像、草图或日常生活中的视觉流时,CBFIR系统可以检索具有相同或可比较特征的时装物品或产品。这降低了CBFIR系统对文本的依赖,并允许更准确和直接地搜索所需的时尚产品。考虑到最近的发展,CBFIR在现实世界中进行视觉搜索仍存在限制,因为同时提供多个时装物品、时装产品的遮挡和形状变形。本文重点介绍CBFIR方法,重点关注以图像、带有文本的图像、草图和视频为指导的CBFIR方法。因此,我们将CBFIR方法分为四类,即以图像为导向的CBFIR(添加属性和风格)、图像和文本为导向的CBFIR、草图为导向的CBFIR和视频为导向的CBFIR方法。基线方法已经仔细分析,过去六年(2017年至2022年)CBFIR的最新发展进行了彻底的研究。最后,本文突出了CBFIR的主要问题,并为未来的研究提供了有前途的方向。