Have you ever looked at a painting and wondered what is the story behind it? This work presents a framework to bring art closer to people by generating comprehensive descriptions of fine-art paintings. Generating informative descriptions for artworks, however, is extremely challenging, as it requires to 1) describe multiple aspects of the image such as its style, content, or composition, and 2) provide background and contextual knowledge about the artist, their influences, or the historical period. To address these challenges, we introduce a multi-topic and knowledgeable art description framework, which modules the generated sentences according to three artistic topics and, additionally, enhances each description with external knowledge. The framework is validated through an exhaustive analysis, both quantitative and qualitative, as well as a comparative human evaluation, demonstrating outstanding results in terms of both topic diversity and information veracity.
翻译:你是否看过绘画,并想知道它背后的故事是什么? 这项工作提供了一个框架,通过对美术绘画进行全面描述,使艺术更贴近人。然而,为艺术作品制作信息化描述极为具有挑战性,因为它要求(1) 描述图像的多个方面,例如其风格、内容或组成,(2) 提供关于艺术家的背景和背景知识,以及艺术家的影响或历史时期。为了应对这些挑战,我们引入了一个多主题和知识化艺术描述框架,将生成的句子按照三个艺术专题进行,此外,用外部知识加强每个描述。框架通过定量和定性的详尽分析以及比较性人类评估得到验证,在主题多样性和信息真实性两方面都展示了突出的成果。