How to robustly rank the aesthetic quality of given images has been a long-standing ill-posed topic. Such challenge stems mainly from the diverse subjective opinions of different observers about the varied types of content. There is a growing interest in estimating the user agreement by considering the standard deviation of the scores, instead of only predicting the mean aesthetic opinion score. Nevertheless, when comparing a pair of contents, few studies consider how confident are we regarding the difference in the aesthetic scores. In this paper, we thus propose (1) a re-adapted multi-task attention network to predict both the mean opinion score and the standard deviation in an end-to-end manner; (2) a brand-new confidence interval ranking loss that encourages the model to focus on image-pairs that are less certain about the difference of their aesthetic scores. With such loss, the model is encouraged to learn the uncertainty of the content that is relevant to the diversity of observers' opinions, i.e., user disagreement. Extensive experiments have demonstrated that the proposed multi-task aesthetic model achieves state-of-the-art performance on two different types of aesthetic datasets, i.e., AVA and TMGA.
翻译:如何对特定图像的审美质量进行稳健的排序是一个长期错误的专题,这种挑战主要来自不同观察者对不同类型内容的不同主观观点。人们越来越有兴趣通过考虑分数的标准偏差来估计用户协议,而不是仅仅预测平均审美评分。然而,在比较一对内容时,很少有研究考虑我们对审美评分差异的自信程度。因此,我们提议(1) 重新调整多任务关注网络,以预测平均评分和最终标准偏差;(2) 品牌新信任期排名损失,鼓励模型侧重于不太确定其审美评分差异的图像批量。随着这种损失,鼓励模型了解与观察者意见多样性有关的内容的不确定性,即用户的不同意见。广泛的实验表明,拟议的多任务审美模型在两种不同类型的审美数据集上取得了状态和艺术表现,即A.A.A.A.A.A.A.A.A.A.A.A.和A.A.A.A.A.A.A.A.A.A.A.A.A.A.A.A.和A.A.A.A.A.A.A.A.