Most image generation methods are difficult to precisely control the properties of the generated images, such as structure, scale, shape, etc., which limits its large-scale application in creative industries such as conceptual design and graphic design, and so on. Using the prompt and the sketch is a practical solution for controllability. Existing datasets lack either prompt or sketch and are not designed for the creative industry. Here is the main contribution of our work. a) This is the first dataset that covers the 4 most important areas of creative industry domains and is labeled with prompt and sketch. b) We provide multiple reference images in the test set and fine-grained scores for each reference which are useful for measurement. c) We apply two state-of-the-art models to our dataset and then find some shortcomings, such as the prompt is more highly valued than the sketch.
翻译:多数图像生成方法难以精确控制生成图像的特性,例如结构、比例、形状等,从而限制了其在概念设计和图形设计等创意产业中的大规模应用。 使用即时和草图是可控性的实际解决办法。 现有的数据集缺乏即时或草图,而且不是为创意产业设计。 这是我们工作的主要贡献。 a) 这是第一个数据集,它覆盖了创造性产业领域的四个最重要的领域,并标有迅速和草图。 b) 我们在测试组中提供了多个参考图像,每个参考组的精细分数可用于测量。 c) 我们对我们的数据集应用两种最先进的模型,然后发现一些缺点,例如快速比草图更受重视。