In this work, we study how the performance and evaluation of generative image models are impacted by the racial composition of their training datasets. By examining and controlling the racial distributions in various training datasets, we are able to observe the impacts of different training distributions on generated image quality and the racial distributions of the generated images. Our results show that the racial compositions of generated images successfully preserve that of the training data. However, we observe that truncation, a technique used to generate higher quality images during inference, exacerbates racial imbalances in the data. Lastly, when examining the relationship between image quality and race, we find that the highest perceived visual quality images of a given race come from a distribution where that race is well-represented, and that annotators consistently prefer generated images of white people over those of Black people.
翻译:在这项工作中,我们研究基因化图像模型的性能和评估如何受到其培训数据集的种族构成的影响;通过审查和控制各种培训数据集中的种族分布,我们能够观察不同培训分布对生成图像质量和生成图像的种族分布的影响;我们的结果显示,生成图像的种族构成成功地保留了培训数据的种族构成;然而,我们注意到,在推论期间用来产生更高质量图像的技术脱节加剧了数据中的种族不平衡。 最后,在审查图像质量和种族之间的关系时,我们发现,某一种族的最高可见视觉质量图像来自种族代表性强的分布,而警告者一贯倾向于产生白人形象,而不是黑人形象。