Generative AI models have shown an impressive ability to produce images from text prompts, which could benefit creativity in visual art and self-expression. However, it is unclear how precisely the generated images express the contexts and emotions of the input texts. We explored the emotional expressiveness of AI-generated images and developed RePrompt, an automatic method that refines text prompts toward more precise expression in the generated images. Inspired by crowdsourced editing strategies, we curated intuitive text features, such as the number and concreteness of nouns, and trained a proxy model to analyze the effects of these features on the AI-generated images. Using explanations of the proxy model, we curated a rubric for adjusting text prompts to optimize image generation for precise emotion expression. We conducted simulation and user studies, which showed that RePrompt significantly improves the emotional expressiveness of AI-generated images, especially for negative emotions.