Text-to-image generative models have recently exploded in popularity and accessibility. Yet so far, use of these models in creative tasks that bridge the 2D digital world and the creation of physical artefacts has been understudied. We conduct a pilot study to investigate if and how text-to-image models can be used to assist in upstream tasks within the creative process, such as ideation and visualization, prior to a sculpture-making activity. Thirty participants selected sculpture-making materials and generated three images using the Stable Diffusion text-to-image generator, each with text prompts of their choice, with the aim of informing and then creating a physical sculpture. The majority of participants (23/30) reported that the generated images informed their sculptures, and 28/30 reported interest in using text-to-image models to help them in a creative task in the future. We identify several prompt engineering strategies and find that a participant's prompting strategy relates to their stage in the creative process. We discuss how our findings can inform support for users at different stages of the design process and for using text-to-image models for physical artefact design.
翻译:然而,迄今为止,这些模型在连接2D数字世界和创造物理文物的创造性任务中的使用一直没有得到足够的研究。我们进行了一项试点研究,以调查文字到图像模型是否以及如何在雕塑活动之前用于协助在创造性过程中的上游任务,例如构想和视觉化。30名参加者选择了雕塑材料,并用稳定传播文本到图像生成器制作了三幅图像,每个模型都有自己选择的文字提示,目的是提供信息并随后创建物理雕塑。大多数参加者(23/30)报告说,制作的图像告知了他们的雕塑,28/30人报告说有兴趣使用文字到图像模型来帮助他们在未来的创造性任务中完成。我们确定了若干迅速的工程战略,发现参与者的迅速战略与其创作过程的舞台有关。我们讨论了我们的发现如何为设计过程不同阶段的用户提供支持,以及使用文字到图像模型来进行物理雕塑设计。