Action quality assessment (AQA) from videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, action quality assessment has been widely studied in the literature. Traditionally, AQA task is treated as a regression problem to learn the underlying mappings between videos and action scores. More recently, the method of uncertainty score distribution learning (USDL) made success due to the introduction of label distribution learning (LDL). But USDL does not apply to dataset with continuous labels and needs a fixed variance in training. In this paper, to address the above problems, we further develop Distribution Auto-Encoder (DAE). DAE takes both advantages of regression algorithms and label distribution learning (LDL).Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a combined loss is constructed to accelerate the training of DAE. DAE-MT is further proposed to deal with AQA on multi-task datasets. We evaluate our DAE approach on MTL-AQA and JIGSAWS datasets. Experimental results on public datasets demonstrate that our method achieves state-of-the-arts under the Spearman's Rank Correlation: 0.9449 on MTL-AQA and 0.73 on JIGSAWS.
翻译:视频中的不确定分数质量评估(AQA)是一项具有挑战性的愿景任务,因为视频与行动分数之间的关系难以建模,因此,文献中广泛研究了行动质量评估。传统上,AQA任务被视为一个回归问题,以学习视频与行动分数之间的基本映射;最近,由于引入标签分配学习(LLDL),不确定性分数分配学习方法(USL)取得了成功。但是,DUSL不适用于具有连续标签的数据集,而且需要固定的培训差异。在本文中,为了解决上述问题,我们进一步开发了ADIGSA-Ecoder(DAE)。DAE利用回归算法和标签分配学习(LLDL)的优势。具体地说,它将视频编码为分布,并将变式自动计算器(VAE)中的重新校正法用于样本分数,从而在视频和分数之间建立更准确的映射图。与此同时,为了加快DAE的训练,我们又提议DIGSA-MTA-MT在多任务数据集上与AQA-AAAA级方法进行交易,我们用DAAAAAMA-CRA-CRAAAA的实验方法下的数据。