Based on recent advances in realistic language modeling (GPT-3) and cross-modal representations (CLIP), Gaud\'i was developed to help designers search for inspirational images using natural language. In the early stages of the design process, with the goal of eliciting a client's preferred creative direction, designers will typically create thematic collections of inspirational images called "mood-boards". Creating a mood-board involves sequential image searches which are currently performed using keywords or images. Gaud\'i transforms this process into a conversation where the user is gradually detailing the mood-board's theme. This representation allows our AI to generate new search queries from scratch, straight from a project briefing, following a theme hypothesized by GPT-3. Compared to previous computational approaches to mood-board creation, to the best of our knowledge, ours is the first attempt to represent mood-boards as the stories that designers tell when presenting a creative direction to a client.
翻译:根据现实语言建模(GPT-3)和跨模式展示(CLIP)的最新进展,Gaud\'i是用来帮助设计者使用自然语言搜索灵感图像的。在设计过程的早期阶段,设计者通常会制作灵感图像的专题集,称为“混合板 ” 。创建情绪板需要按顺序搜索图像,目前使用关键词或图像进行搜索。Gaud\i将这一过程转化为对话,用户正在逐渐详细描述情绪板的主题。这种演示使我们的AI能够直接从项目简报中从零开始产生新的搜索查询,直接根据GPT-3的虚构主题进行。与以前对创造情绪板的计算方法相比,我们最了解的是,我们第一次尝试将情绪板作为设计者在向客户展示创造性方向时所讲述的故事来表现。