行动质量评估分配倒退 (Auto-Encoding Score Distribution Regression for Action Quality Assessment) - 专知论文

会员服务 ·

0

去噪自编码 · 得分 · 再参数化/重参数化 · 学成 · 标注 ·

2021 年 11 月 22 日

Auto-Encoding Score Distribution Regression for Action Quality Assessment

翻译：行动质量评估分配倒退

Boyu Zhang,Jiayuan Chen,Yinfei Xu,Hui Zhang,Xu Yang,Xin Geng

Action quality assessment (AQA) from videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, action quality assessment has been widely studied in the literature. Traditionally, AQA task is treated as a regression problem to learn the underlying mappings between videos and action scores. More recently, the method of uncertainty score distribution learning (USDL) made success due to the introduction of label distribution learning (LDL). But USDL does not apply to dataset with continuous labels and needs a fixed variance in training. In this paper, to address the above problems, we further develop Distribution Auto-Encoder (DAE). DAE takes both advantages of regression algorithms and label distribution learning (LDL).Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a combined loss is constructed to accelerate the training of DAE. DAE-MT is further proposed to deal with AQA on multi-task datasets. We evaluate our DAE approach on MTL-AQA and JIGSAWS datasets. Experimental results on public datasets demonstrate that our method achieves state-of-the-arts under the Spearman's Rank Correlation: 0.9449 on MTL-AQA and 0.73 on JIGSAWS.

翻译：视频中的不确定分数质量评估(AQA)是一项具有挑战性的愿景任务,因为视频与行动分数之间的关系难以建模,因此,文献中广泛研究了行动质量评估。传统上,AQA任务被视为一个回归问题,以学习视频与行动分数之间的基本映射;最近,由于引入标签分配学习(LLDL),不确定性分数分配学习方法(USL)取得了成功。但是,DUSL不适用于具有连续标签的数据集,而且需要固定的培训差异。在本文中,为了解决上述问题,我们进一步开发了ADIGSA-Ecoder(DAE)。DAE利用回归算法和标签分配学习(LLDL)的优势。具体地说,它将视频编码为分布,并将变式自动计算器(VAE)中的重新校正法用于样本分数,从而在视频和分数之间建立更准确的映射图。与此同时,为了加快DAE的训练,我们又提议DIGSA-MTA-MT在多任务数据集上与AQA-AAAA级方法进行交易,我们用DAAAAAMA-CRA-CRAAAA的实验方法下的数据。

0

相关内容

去噪自编码

去噪自编码

去噪自编码器背后的思想很简单. 为了迫使隐藏层单元发现更多鲁棒性好的特征, 以及阻止它学习恒等函数, 我们拿受损的输入来训练自编码器重构输入。

最新《生成式对抗网络》简介，25页ppt

最新《生成式对抗网络》简介，25页ppt

专知会员服务

175+阅读 · 2020年6月28日

【ICML2020】对比多视角表示学习

【ICML2020】对比多视角表示学习

专知会员服务

53+阅读 · 2020年6月28日

基于深度学习的表面缺陷检测方法综述

基于深度学习的表面缺陷检测方法综述

专知会员服务

94+阅读 · 2020年5月31日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

清华大学研究生教育

3+阅读 · 2018年6月30日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇行人再识别（ReID）相关论文—和谐注意力网络、时序残差学习、评估和基准、图像生成、三元组、对抗属性-图像

【论文推荐】最新六篇行人再识别（ReID）相关论文—和谐注意力网络、时序残差学习、评估和基准、图像生成、三元组、对抗属性-图像

专知

9+阅读 · 2018年3月1日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

ReLU Regression with Massart Noise

Arxiv

0+阅读 · 2022年1月26日

Score-based Generative Neural Networks for Large-Scale Optimal Transport

Arxiv

0+阅读 · 2022年1月25日

On Evaluation Metrics for Graph Generative Models

Arxiv

0+阅读 · 2022年1月24日

Neural Architecture Searching for Facial Attributes-based Depression Recognition

Neural Architecture Searching for Facial Attributes-based Depression Recognition

Arxiv

1+阅读 · 2022年1月24日

ERQA: Edge-Restoration Quality Assessment for Video Super-Resolution

Arxiv

0+阅读 · 2022年1月24日

Probability Distribution on Rooted Trees

Arxiv

0+阅读 · 2022年1月24日

Human Pose Regression with Residual Log-likelihood Estimation

Arxiv

4+阅读 · 2021年7月26日

SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance

Arxiv

13+阅读 · 2021年3月10日

Self-supervised Geometric Perception

Arxiv

24+阅读 · 2021年3月4日

A Compact Embedding for Facial Expression Similarity

A Compact Embedding for Facial Expression Similarity

Arxiv

3+阅读 · 2019年1月9日

VIP会员

文章信息

相关主题

去噪自编码

再参数化/重参数化

相关VIP内容

最新《生成式对抗网络》简介，25页ppt

最新《生成式对抗网络》简介，25页ppt

专知会员服务

175+阅读 · 2020年6月28日

【ICML2020】对比多视角表示学习

【ICML2020】对比多视角表示学习

专知会员服务

53+阅读 · 2020年6月28日

基于深度学习的表面缺陷检测方法综述

基于深度学习的表面缺陷检测方法综述

专知会员服务

94+阅读 · 2020年5月31日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

清华大学研究生教育

3+阅读 · 2018年6月30日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇行人再识别（ReID）相关论文—和谐注意力网络、时序残差学习、评估和基准、图像生成、三元组、对抗属性-图像

【论文推荐】最新六篇行人再识别（ReID）相关论文—和谐注意力网络、时序残差学习、评估和基准、图像生成、三元组、对抗属性-图像

专知

9+阅读 · 2018年3月1日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

ReLU Regression with Massart Noise

Arxiv

0+阅读 · 2022年1月26日

Score-based Generative Neural Networks for Large-Scale Optimal Transport

Arxiv

0+阅读 · 2022年1月25日

On Evaluation Metrics for Graph Generative Models

Arxiv

0+阅读 · 2022年1月24日

Neural Architecture Searching for Facial Attributes-based Depression Recognition

Neural Architecture Searching for Facial Attributes-based Depression Recognition

Arxiv

1+阅读 · 2022年1月24日

ERQA: Edge-Restoration Quality Assessment for Video Super-Resolution

Arxiv

0+阅读 · 2022年1月24日

Probability Distribution on Rooted Trees

Arxiv

0+阅读 · 2022年1月24日

Human Pose Regression with Residual Log-likelihood Estimation

Arxiv

4+阅读 · 2021年7月26日

SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance

Arxiv

13+阅读 · 2021年3月10日

Self-supervised Geometric Perception

Arxiv

24+阅读 · 2021年3月4日

A Compact Embedding for Facial Expression Similarity

A Compact Embedding for Facial Expression Similarity

Arxiv

3+阅读 · 2019年1月9日

微信扫码咨询专知VIP会员