Frame Flexible Network (Frame Flexible Network) - 专知论文

会员服务 ·

0

多频 · Networking · 存储 · 频度 · 视频识别 ·

2023 年 3 月 26 日

Frame Flexible Network

翻译：Frame Flexible Network

Yitian Zhang,Yue Bai,Chang Liu,Huan Wang,Sheng Li,Yun Fu

from arxiv, Accepted by CVPR2023

Existing video recognition algorithms always conduct different training pipelines for inputs with different frame numbers, which requires repetitive training operations and multiplying storage costs. If we evaluate the model using other frames which are not used in training, we observe the performance will drop significantly (see Fig.1), which is summarized as Temporal Frequency Deviation phenomenon. To fix this issue, we propose a general framework, named Frame Flexible Network (FFN), which not only enables the model to be evaluated at different frames to adjust its computation, but also reduces the memory costs of storing multiple models significantly. Concretely, FFN integrates several sets of training sequences, involves Multi-Frequency Alignment (MFAL) to learn temporal frequency invariant representations, and leverages Multi-Frequency Adaptation (MFAD) to further strengthen the representation abilities. Comprehensive empirical validations using various architectures and popular benchmarks solidly demonstrate the effectiveness and generalization of FFN (e.g., 7.08/5.15/2.17% performance gain at Frame 4/8/16 on Something-Something V1 dataset over Uniformer). Code is available at https://github.com/BeSpontaneous/FFN.

翻译：现有的视频识别算法往往针对不同帧数的输入进行不同的训练流程，这需要重复的训练操作和大量的存储成本。如果我们使用未用于训练的其他帧进行模型评估，我们会观察到性能会显著下降（见图1），这被总结为时间频率偏差现象。为了解决这个问题，我们提出了一个通用的框架，称为Flexible Frame Network (FFN)，它不仅能够在不同的帧上评估模型以调整其计算，而且还显著减少了存储多个模型的内存成本。具体而言，FFN集成了几个训练序列，并利用多频对齐（Multi-Frequency Alignment，MFAL）学习时间频率不变表示，利用多频度自适应（Multi-Frequency Adaptation，MFAD）进一步加强表示能力。全面的实证验证使用各种体系结构和流行的基准坚实地证明了FFN的有效性和泛化性（例如，在Something-Something V1数据集上，与Uniformer相比，Frame 4/8/16时，性能提高了7.08/5.15/2.17％）。代码可在https://github.com/BeSpontaneous/FFN上获得。

0

相关内容

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

专知会员服务

17+阅读 · 2022年3月19日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【CMU】图卷积神经网络中的池化综述，Pooling in Graph Convolutional Neural Network

【CMU】图卷积神经网络中的池化综述，Pooling in Graph Convolutional Neural Network

专知会员服务

46+阅读 · 2020年4月8日

【LITIS Lab】衔接图卷积神经网络谱域和空间域，Spectral and Spatial Domains in GNN

【LITIS Lab】衔接图卷积神经网络谱域和空间域，Spectral and Spatial Domains in GNN

专知会员服务

25+阅读 · 2020年3月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

专知会员服务

18+阅读 · 2020年2月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

混沌时间序列Volterra建模及其在语音信号处理中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

BMP2-BMSCs对胶质瘤干细胞增殖和分化的调控作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

分子间氢键诱导的表面手性组装纳米结构的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-223调控动脉粥样硬化斑块泡沫细胞形成和巨噬细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

染料敏化太阳电池中准固态陷光电解质的研究

国家自然科学基金

0+阅读 · 2012年12月31日

间充质干细胞在创伤性颞下颌关节强直发生过程中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

LFA-1介导Th17细胞活化分化在EAE发病过程中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

听神经瘤中merlin的磷酸化对p53稳定性及细胞凋亡的影响

国家自然科学基金

0+阅读 · 2008年12月31日

被子植物基部类群细辛AP3亚族基因表达、功能与进化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于量子点敏化的透明型纳米管阵列基固态太阳能电池

国家自然科学基金

0+阅读 · 2008年12月31日

One-shot neural band selection for spectral recovery

Arxiv

0+阅读 · 2023年5月16日

Learning Structure Aware Deep Spectral Embedding

Arxiv

0+阅读 · 2023年5月14日

Learning to Generalize for Cross-domain QA

Arxiv

1+阅读 · 2023年5月14日

Agile gesture recognition for capacitive sensing devices: adapting on-the-job

Agile gesture recognition for capacitive sensing devices: adapting on-the-job

Arxiv

0+阅读 · 2023年5月12日

A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition

Arxiv

0+阅读 · 2023年5月12日

Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video Anomaly Detection

Arxiv

0+阅读 · 2023年5月12日

Distribution-Flexible Subset Quantization for Post-Quantizing Super-Resolution Networks

Arxiv

0+阅读 · 2023年5月12日

Boosting the Speed of Entity Alignment 10*: Dual Attention Matching Network with Normalized Hard Sample Mining

Arxiv

10+阅读 · 2021年3月29日

Network of Tensor Time Series

Arxiv

20+阅读 · 2021年2月28日

CAN-NER: Convolutional Attention Network forChinese Named Entity Recognition

Arxiv

16+阅读 · 2019年4月3日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

专知会员服务

17+阅读 · 2022年3月19日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【CMU】图卷积神经网络中的池化综述，Pooling in Graph Convolutional Neural Network

【CMU】图卷积神经网络中的池化综述，Pooling in Graph Convolutional Neural Network

专知会员服务

46+阅读 · 2020年4月8日

【LITIS Lab】衔接图卷积神经网络谱域和空间域，Spectral and Spatial Domains in GNN

【LITIS Lab】衔接图卷积神经网络谱域和空间域，Spectral and Spatial Domains in GNN

专知会员服务

25+阅读 · 2020年3月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

专知会员服务

18+阅读 · 2020年2月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

One-shot neural band selection for spectral recovery

Arxiv

0+阅读 · 2023年5月16日

Learning Structure Aware Deep Spectral Embedding

Arxiv

0+阅读 · 2023年5月14日

Learning to Generalize for Cross-domain QA

Arxiv

1+阅读 · 2023年5月14日

Agile gesture recognition for capacitive sensing devices: adapting on-the-job

Agile gesture recognition for capacitive sensing devices: adapting on-the-job

Arxiv

0+阅读 · 2023年5月12日

A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition

Arxiv

0+阅读 · 2023年5月12日

Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video Anomaly Detection

Arxiv

0+阅读 · 2023年5月12日

Distribution-Flexible Subset Quantization for Post-Quantizing Super-Resolution Networks

Arxiv

0+阅读 · 2023年5月12日

Boosting the Speed of Entity Alignment 10*: Dual Attention Matching Network with Normalized Hard Sample Mining

Arxiv

10+阅读 · 2021年3月29日

Network of Tensor Time Series

Arxiv

20+阅读 · 2021年2月28日

CAN-NER: Convolutional Attention Network forChinese Named Entity Recognition

Arxiv

16+阅读 · 2019年4月3日

相关基金

混沌时间序列Volterra建模及其在语音信号处理中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

BMP2-BMSCs对胶质瘤干细胞增殖和分化的调控作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

分子间氢键诱导的表面手性组装纳米结构的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-223调控动脉粥样硬化斑块泡沫细胞形成和巨噬细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

染料敏化太阳电池中准固态陷光电解质的研究

国家自然科学基金

0+阅读 · 2012年12月31日

间充质干细胞在创伤性颞下颌关节强直发生过程中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

LFA-1介导Th17细胞活化分化在EAE发病过程中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

听神经瘤中merlin的磷酸化对p53稳定性及细胞凋亡的影响

国家自然科学基金

0+阅读 · 2008年12月31日

被子植物基部类群细辛AP3亚族基因表达、功能与进化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于量子点敏化的透明型纳米管阵列基固态太阳能电池

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员