Being able to forecast the popularity of new garment designs is very important in an industry as fast paced as fashion, both in terms of profitability and reducing the problem of unsold inventory. Here, we attempt to address this task in order to provide informative forecasts to fashion designers within a virtual reality designer application that will allow them to fine tune their creations based on current consumer preferences within an interactive and immersive environment. To achieve this we have to deal with the following central challenges: (1) the proposed method should not hinder the creative process and thus it has to rely only on the garment's visual characteristics, (2) the new garment lacks historical data from which to extrapolate their future popularity and (3) fashion trends in general are highly dynamical. To this end, we develop a computer vision pipeline fine tuned on fashion imagery in order to extract relevant visual features along with the category and attributes of the garment. We propose a hierarchical label sharing (HLS) pipeline for automatically capturing hierarchical relations among fashion categories and attributes. Moreover, we propose MuQAR, a Multimodal Quasi-AutoRegressive neural network that forecasts the popularity of new garments by combining their visual features and categorical features while an autoregressive neural network is modelling the popularity time series of the garment's category and attributes. Both the proposed HLS and MuQAR prove capable of surpassing the current state-of-the-art in key benchmark datasets, DeepFashion for image classification and VISUELLE for new garment sales forecasting.
翻译:能够预测新服装设计受到新服装设计欢迎的程度,对于像时装一样快速的行业非常重要,这既包括盈利性,也包括减少未售出库存问题。在这里,我们试图完成这项任务,以便在虚拟现实设计设计者应用程序中向时装设计者提供信息预报,在虚拟现实设计设计者应用程序中向时装设计者提供信息预报,使他们能够在互动和消沉的环境中根据当前的消费者偏好微调其创作作品。为了做到这一点,我们必须应对以下核心挑战:(1) 拟议方法不应阻碍创造性进程,因此它只能依赖服装的视觉特征,(2) 新服装缺乏历史数据,无法从中推断其未来的受欢迎程度,(3) 时装趋势总的来说是动态的。为此,我们开发了一台计算机视觉编程编程管道,以便根据当前消费者的喜好和服装属性来精细调其创作。我们建议一个等级标签共享(HLS)管道,以便自动捕捉时装类别和属性之间的等级关系。此外,我们建议 MuQAR,一个多式自动递增的神经网络,用来预测当前服装的深度和内装图像的精度。