Evaluation of generative models has been an underrepresented field despite the surge of generative architectures. Most recent models are evaluated upon rather obsolete metrics which suffer from robustness issues, while being unable to assess more aspects of visual quality, such as compositionality and logic of synthesis. At the same time, the explainability of generative models remains a limited, though important, research direction with several current attempts requiring access to the inner functionalities of generative models. Contrary to prior literature, we view generative models as a black box, and we propose a framework for the evaluation and explanation of synthesized results based on concepts instead of pixels. Our framework exploits knowledge-based counterfactual edits that underline which objects or attributes should be inserted, removed, or replaced from generated images to approach their ground truth conditioning. Moreover, global explanations produced by accumulating local edits can also reveal what concepts a model cannot generate in total. The application of our framework on various models designed for the challenging tasks of Story Visualization and Scene Synthesis verifies the power of our approach in the model-agnostic setting.
翻译:尽管基因结构激增,对基因模型的评价一直是一个代表性不足的领域,尽管基因结构激增,但大多数最近的模型都是在相当过时的衡量标准上评价的,这些衡量标准都存在稳健性问题,同时无法评估视觉质量的更多方面,例如合成和合成逻辑。与此同时,基因模型的解释性仍然是一个有限但重要的研究方向,目前有若干尝试需要利用基因模型的内部功能。与以往的文献相反,我们认为基因模型是一个黑盒,我们提出了一个框架,用于根据概念而不是像素来评价和解释综合结果。我们的框架利用基于知识的反事实编辑,强调哪些物体或属性应该插入、删除或从生成的图像中替换,以接近地面的真相调节。此外,通过积累当地编辑产生的全球解释也可以揭示模型无法产生的全部概念。我们为具有挑战性的Story可视化和Scenemin综合任务设计的各种模型的框架的应用证实了我们在模型环境中的方法的力量。</s>