TISE: 用于文本到图像综合评估的一袋计量器 (TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation) - 专知论文

会员服务 ·

0

Bagging · CASE · 面向服务的架构（SOA） · 秩 · 模型评估 ·

2022 年 7 月 19 日

TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

翻译：TISE: 用于文本到图像综合评估的一袋计量器

Tan M. Dinh,Rang Nguyen,Binh-Son Hua

from arxiv, Accepted to ECCV 2022; TISE toolbox is available at https://github.com/VinAIResearch/tise-toolbox

In this paper, we conduct a study on the state-of-the-art methods for text-to-image synthesis and propose a framework to evaluate these methods. We consider syntheses where an image contains a single or multiple objects. Our study outlines several issues in the current evaluation pipeline: (i) for image quality assessment, a commonly used metric, e.g., Inception Score (IS), is often either miscalibrated for the single-object case or misused for the multi-object case; (ii) for text relevance and object accuracy assessment, there is an overfitting phenomenon in the existing R-precision (RP) and Semantic Object Accuracy (SOA) metrics, respectively; (iii) for multi-object case, many vital factors for evaluation, e.g., object fidelity, positional alignment, counting alignment, are largely dismissed; (iv) the ranking of the methods based on current metrics is highly inconsistent with real images. To overcome these issues, we propose a combined set of existing and new metrics to systematically evaluate the methods. For existing metrics, we offer an improved version of IS named IS* by using temperature scaling to calibrate the confidence of the classifier used by IS; we also propose a solution to mitigate the overfitting issues of RP and SOA. For new metrics, we develop counting alignment, positional alignment, object-centric IS, and object-centric FID metrics for evaluating the multi-object case. We show that benchmarking with our bag of metrics results in a highly consistent ranking among existing methods that is well-aligned with human evaluation. As a by-product, we create AttnGAN++, a simple but strong baseline for the benchmark by stabilizing the training of AttnGAN using spectral normalization. We also release our toolbox, so-called TISE, for advocating fair and consistent evaluation of text-to-image models.

翻译：在本文中,我们研究了文本到图像合成的最新方法,并提出了评估这些方法的框架。我们考虑了图像包含单一或多个对象的合成。我们的研究概述了当前评价管道中的若干问题:(一) 图像质量评估,即常用的度量,例如,“感知分数”(IS),往往不是为单点情况进行错误校正,就是为多点数据错误校正;(二) 文本稳定性和目标精确度评估,现有R-精确度(RP)和Semical 对象精确度(SOA)指标中存在一种超标现象;(三) 对于多点情况,许多评价的至关重要因素,例如,目标性、定位校正、校正(IIS),我们用SIS的比标定比值,我们用SIS的比标比值,我们用SIS的比值比值,我们用SIS的比值比值比值,我们用SIS的比值比值,我们用SIS的比值比值比比标的。

0

相关内容

Bagging

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

专知会员服务

17+阅读 · 2020年4月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

SUMO促进创伤性颅脑损伤神经修复的研究

国家自然科学基金

0+阅读 · 2015年12月31日

一个新的干扰素刺激基因TRIM69抑制登革病毒感染的作用与机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

扁平转子小型磁悬浮控制力矩陀螺磁轴承控制方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

TNF-α诱导鼻咽癌淋巴管生成和淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

CNP对慢性高眼压性RGCs细胞损伤神经保护作用的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于鞅理论与统计信息的仿真优化

国家自然科学基金

0+阅读 · 2012年12月31日

复形范畴中的Gorenstein同调维数

国家自然科学基金

0+阅读 · 2009年12月31日

约化群酉表示的branching law及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

球面学习理论研究

国家自然科学基金

1+阅读 · 2008年12月31日

Distribution Aware Metrics for Conditional Natural Language Generation

Arxiv

0+阅读 · 2022年9月15日

Rethinking Round-trip Translation for Automatic Machine Translation Evaluation

Arxiv

0+阅读 · 2022年9月15日

Exploring Visual Interpretability for Contrastive Language-Image Pre-training

Arxiv

0+阅读 · 2022年9月15日

vec2text with Round-Trip Translations

vec2text with Round-Trip Translations

Arxiv

0+阅读 · 2022年9月14日

Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification

Arxiv

0+阅读 · 2022年9月14日

Learning to Evaluate Performance of Multi-modal Semantic Localization

Arxiv

0+阅读 · 2022年9月14日

Meta Pattern Concern Score: A Novel Metric for Customizable Evaluation of Multi-classification

Arxiv

0+阅读 · 2022年9月14日

StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation

Arxiv

0+阅读 · 2022年9月13日

A Benchmark and a Baseline for Robust Multi-view Depth Estimation

Arxiv

0+阅读 · 2022年9月13日

Image Captioning using Deep Neural Architectures

Arxiv

20+阅读 · 2018年1月17日

VIP会员

文章信息

相关主题

面向服务的架构（SOA）

相关VIP内容

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

专知会员服务

17+阅读 · 2020年4月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据智能体综述：新兴范式还是被高估的炒作？

海底战已至：美国构思海底安全战略 | 最新报告

【ICCV2025教程】视觉异常检测中的基础模型：进展、挑战与应用

美军将无人自主等新技术融入潜艇部队以更具杀伤力

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Distribution Aware Metrics for Conditional Natural Language Generation

Arxiv

0+阅读 · 2022年9月15日

Rethinking Round-trip Translation for Automatic Machine Translation Evaluation

Arxiv

0+阅读 · 2022年9月15日

Exploring Visual Interpretability for Contrastive Language-Image Pre-training

Arxiv

0+阅读 · 2022年9月15日

vec2text with Round-Trip Translations

vec2text with Round-Trip Translations

Arxiv

0+阅读 · 2022年9月14日

Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification

Arxiv

0+阅读 · 2022年9月14日

Learning to Evaluate Performance of Multi-modal Semantic Localization

Arxiv

0+阅读 · 2022年9月14日

Meta Pattern Concern Score: A Novel Metric for Customizable Evaluation of Multi-classification

Arxiv

0+阅读 · 2022年9月14日

StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation

Arxiv

0+阅读 · 2022年9月13日

A Benchmark and a Baseline for Robust Multi-view Depth Estimation

Arxiv

0+阅读 · 2022年9月13日

Image Captioning using Deep Neural Architectures

Arxiv

20+阅读 · 2018年1月17日

相关基金

SUMO促进创伤性颅脑损伤神经修复的研究

国家自然科学基金

0+阅读 · 2015年12月31日

一个新的干扰素刺激基因TRIM69抑制登革病毒感染的作用与机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

扁平转子小型磁悬浮控制力矩陀螺磁轴承控制方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

TNF-α诱导鼻咽癌淋巴管生成和淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

CNP对慢性高眼压性RGCs细胞损伤神经保护作用的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于鞅理论与统计信息的仿真优化

国家自然科学基金

0+阅读 · 2012年12月31日

复形范畴中的Gorenstein同调维数

国家自然科学基金

0+阅读 · 2009年12月31日

约化群酉表示的branching law及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

球面学习理论研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员