Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While these schemes bring significant coding gains over conventional video codecs at low bitrates, their performance saturates quickly when the available bandwidth increases. In this paper, we propose a layered, hybrid coding scheme to overcome this limitation. Specifically, we extend a codec based on facial animation by adding an auxiliary stream consisting of a very low bitrate version of the video, obtained through a conventional video codec (e.g., HEVC). The animated and auxiliary videos are combined through a novel fusion module. Our results show consistent average BD-Rate gains in excess of -30% on a large dataset of video conferencing sequences, extending the operational range of bitrates of a facial animation codec alone.