Cross Modal Compression: Towards Human-comprehensible Semantic Compression - Zhuanzhi (专知) Papers


September 6, 2022

Cross Modal Compression: Towards Human-comprehensible Semantic Compression

Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao

from arxiv, 10 pages, 4 figures

Traditional image/video compression aims to reduce transmission/storage cost while keeping signal fidelity as high as possible. However, with the increasing demand for machine analysis and semantic monitoring in recent years, semantic fidelity rather than signal fidelity is becoming an emerging concern in image/video compression. Building on recent advances in cross modal translation and generation, in this paper we propose cross modal compression (CMC), a semantic compression framework for visual data that transforms highly redundant visual data (such as images and video) into a compact, human-comprehensible domain (such as text, sketches, semantic maps, or attributes) while preserving the semantics. Specifically, we first formulate the CMC problem as a rate-distortion optimization problem. Second, we investigate its relationship with traditional image/video compression and recent feature compression frameworks, showing how CMC differs from these prior frameworks. Then we propose a novel paradigm for CMC to demonstrate its effectiveness. The qualitative and quantitative results show that our proposed CMC can achieve encouraging reconstructed results at an ultrahigh compression ratio, showing better compression performance than the widely used JPEG baseline.
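
The abstract does not spell out the rate-distortion formulation, so the following is only a minimal sketch of the generic Lagrangian form, J = D + λR, applied to choosing among compact representations. The `Candidate` class, the helper names, and every number below are hypothetical illustrations, not the authors' method or results.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Candidate:
    """One possible compact representation of an image (hypothetical)."""
    name: str
    rate_bits: float   # bits needed to store the representation
    distortion: float  # semantic distortion of the reconstruction (lower is better)


def rd_cost(c: Candidate, lam: float) -> float:
    """Generic Lagrangian rate-distortion cost J = D + lambda * R."""
    return c.distortion + lam * c.rate_bits


def select(candidates: Sequence[Candidate], lam: float) -> Candidate:
    """Return the candidate with the smallest Lagrangian cost."""
    return min(candidates, key=lambda c: rd_cost(c, lam))


def compression_ratio(raw_bits: float, rate_bits: float) -> float:
    """Ratio of the uncompressed size to the compressed size."""
    return raw_bits / rate_bits


# Illustrative numbers only; they are assumptions, not values from the paper.
RAW_BITS = 256 * 256 * 3 * 8  # uncompressed 256x256 RGB image, 8 bits per channel
CANDIDATES = [
    Candidate("text caption", rate_bits=20 * 8, distortion=0.40),
    Candidate("sketch", rate_bits=4_000 * 8, distortion=0.25),
    Candidate("semantic map", rate_bits=12_000 * 8, distortion=0.15),
]

if __name__ == "__main__":
    for lam in (1e-6, 3e-6, 1e-5):
        best = select(CANDIDATES, lam)
        ratio = compression_ratio(RAW_BITS, best.rate_bits)
        print(f"lambda={lam:g}: {best.name}, compression ratio {ratio:.1f}")
```

A larger λ penalizes rate more heavily and pushes the choice toward the most compact domain (here the text caption), while a smaller λ favors representations that keep more of the semantics at a higher bit cost.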
