Since 2021, text-to-image synthesis has taken a giant leap towards becoming a mainstream phenomenon. With text-to-image systems, anybody can create digital images and artworks. This raises the question of whether text-to-image art is creative. This paper examines the nature of the human creativity involved in text-to-image art, with a specific focus on the practice of "prompt engineering". The paper argues that the current product-centered view of creativity may fall short in the context of text-to-image generation. We provide a case exemplifying this shortcoming and highlight the importance of online communities for the creative ecosystem of text-to-image art. Drawing on Rhodes's conceptual model of creativity, we provide a high-level summary of this online ecosystem. We then discuss the challenges of evaluating the creativity of text-to-image generation, as well as opportunities for research on text-to-image art in the field of Human-Computer Interaction (HCI).