The recent progress of text-to-image generation has been recognized in architectural design. Our study is the first to investigate the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.
翻译:最近文本生成图像技术在建筑设计领域已经受到了认可。本研究首次探讨文本生成图像技术在建筑设计初期创意构想中的潜在作用。我们进行了一项实验,共有17名建筑学生使用三种流行的文本生成图像器(Midjourney、Stable Diffusion和DALL-E)开发文化中心的概念设计。通过标准化问卷和小组访谈,我们发现当设计约束条件得到认真考虑时,图像生成可以成为设计过程的有意义的部分。生成工具支持偶然的发现和想象力,丰富了设计过程。我们确认了图像生成器的几个挑战,并提供了软件开发和教育工作者应该考虑的因素来支持创造力和强调设计师富有想象力的心态。通过了解文本生成图像技术的局限性和潜力,建筑师和设计师可以在设计过程和教育中利用这项技术,促进创新和概念的有效沟通。