Compression and reconstruction of visual data have been widely studied in the computer vision community, even before the popularization of deep learning. More recently, some approaches have used deep learning to improve or refine existing pipelines, while others have proposed end-to-end approaches, including autoencoders and implicit neural representations such as SIREN and NeRV. In this work, we propose Neural Visual Representation with Content-adaptive Embedding (CNeRV), which combines the generalizability of autoencoders with the simplicity and compactness of implicit representation. We introduce a novel content-adaptive embedding that is unified, concise, and internally (within-video) generalizable, and that complements a powerful decoder with a single-layer encoder. We match the performance of NeRV, a state-of-the-art implicit neural representation, on the reconstruction task for frames seen during training, while far surpassing it on frames skipped during training (unseen images). To achieve similar reconstruction quality on unseen images, NeRV needs 120x more training time to overfit each frame due to its lack of internal generalization. With the same latent code length and a similar model size, CNeRV outperforms autoencoders on reconstruction of both seen and unseen images. We also show promising results for visual data compression. More details can be found on the project page: https://haochen-rye.github.io/CNeRV/
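To make the encoder/decoder split described above concrete, here is a minimal, hypothetical sketch in PyTorch: a single-layer encoder maps a frame to a compact content-adaptive embedding, and a larger NeRV-style upsampling decoder reconstructs the frame. The layer sizes, kernel choices, and PixelShuffle upsampling are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class TinyEncoder(nn.Module):
    """Single-layer encoder: RGB frame -> compact content-adaptive embedding."""
    def __init__(self, embed_dim=64):
        super().__init__()
        # One strided conv maps a 3xHxW frame to a small spatial embedding.
        self.conv = nn.Conv2d(3, embed_dim, kernel_size=8, stride=8)

    def forward(self, frame):
        return self.conv(frame)

class NeRVStyleDecoder(nn.Module):
    """Powerful decoder: embedding -> reconstructed frame via upsampling blocks."""
    def __init__(self, embed_dim=64, width=128):
        super().__init__()
        # Three conv + PixelShuffle blocks undo the encoder's 8x downsampling.
        self.blocks = nn.Sequential(
            nn.Conv2d(embed_dim, width * 4, 3, padding=1),
            nn.PixelShuffle(2), nn.GELU(),
            nn.Conv2d(width, width * 4, 3, padding=1),
            nn.PixelShuffle(2), nn.GELU(),
            nn.Conv2d(width, 3 * 4, 3, padding=1),
            nn.PixelShuffle(2),
        )

    def forward(self, embedding):
        return self.blocks(embedding)

# Usage: encode a frame to its embedding (the latent code), then decode it back.
encoder, decoder = TinyEncoder(), NeRVStyleDecoder()
frame = torch.rand(1, 3, 256, 256)   # dummy RGB frame
embedding = encoder(frame)           # 1 x 64 x 32 x 32 embedding
reconstruction = decoder(embedding)  # 1 x 3 x 256 x 256 output
```

Because the encoder is only a single layer, nearly all capacity sits in the decoder, which is what lets such a design keep the compactness of an implicit representation while still generalizing to frames not seen during training.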