行动识别ECCV 2020年行动追踪第二地方计划VIPR挑战:高效光学流动流指导框架 (2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework) - 专知论文

会员服务 ·

0

ECCV 2020 · ECCV · 流 · MoDELS · 数据集 ·

2020 年 8 月 10 日

2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework

翻译：行动识别ECCV 2020年行动追踪第二地方计划VIPR挑战:高效光学流动流指导框架

Haoyu Chen,Zitong Yu,Xin Liu,Wei Peng,Yoon Lee,Guoying Zhao

from arxiv, Technical report for ECCV 2020 VIPrior Challenge, Action Recognition Track

To address the problem of training on small datasets for action recognition tasks, most prior works are either based on a large number of training samples or require pre-trained models transferred from other large datasets to tackle overfitting problems. However, it limits the research within organizations that have strong computational abilities. In this work, we try to propose a data-efficient framework that can train the model from scratch on small datasets while achieving promising results. Specifically, by introducing a 3D central difference convolution operation, we proposed a novel C3D neural network-based two-stream (Rank Pooling RGB and Optical Flow) framework for the task. The method is validated on the action recognition track of the ECCV 2020 VIPriors challenges and got the 2nd place (88.31%). It is proved that our method can achieve a promising result even without a pre-trained model on large scale datasets. The code will be released soon.

翻译：为解决行动识别任务小型数据集培训问题,大多数先前的工作要么基于大量培训样本,要么需要事先培训的模型,从其他大型数据集中转让,以解决过于适应的问题。然而,这限制了具有强大计算能力的组织内部的研究。在这项工作中,我们试图提出一个数据高效框架,从零开始对小型数据集进行模型培训,同时取得有希望的结果。具体地说,通过引入3D中央差异变换操作,我们为这项任务提出了一个新的C3D神经网络双流(Rank Pooling RGB和光学流动)框架。该方法在ECCV 2020VIPriors挑战的行动识别轨道上得到验证,并获得了第2位(88.31%)。事实证明,即使没有大规模数据集的预先培训模型,我们的方法也能够取得大有希望的结果。该代码将很快发布。

0

相关内容

ECCV 2020

[NeurIPS 2020 oral] 基于因果干预的弱监督语义分割

专知会员服务

47+阅读 · 2020年10月5日

近期必读的六篇 IJCAI 2020【图神经网络 (GNN)+数据挖掘（DM）】相关论文

近期必读的六篇 IJCAI 2020【图神经网络 (GNN)+数据挖掘（DM）】相关论文

专知会员服务

75+阅读 · 2020年7月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Google】多模态Transformer视频检索，Multi-modal Transformer

【Google】多模态Transformer视频检索，Multi-modal Transformer

专知会员服务

103+阅读 · 2020年7月22日

【IJCAJ 2020】多通道神经网络 Multi-Channel Graph Neural Networks

【IJCAJ 2020】多通道神经网络 Multi-Channel Graph Neural Networks

专知会员服务

26+阅读 · 2020年7月19日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

[CVPR 2020 Oral-牛津] RandLA-Net:大场景三维点云语义分割新框架

[CVPR 2020 Oral-牛津] RandLA-Net:大场景三维点云语义分割新框架

专知会员服务

26+阅读 · 2020年3月15日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CVPR2019| 04-22更新19篇论文及代码（2篇oral，含物体检测、动作识别、医学影像等）

CVPR2019| 04-22更新19篇论文及代码（2篇oral，含物体检测、动作识别、医学影像等）

极市平台

13+阅读 · 2019年4月22日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】基于视频修复的时空转换网络

【泡泡一分钟】基于视频修复的时空转换网络

泡泡机器人SLAM

5+阅读 · 2018年12月30日

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

泡泡机器人SLAM

13+阅读 · 2018年3月23日

(TensorFlow)实时语义分割比较研究

(TensorFlow)实时语义分割比较研究

机器学习研究会

9+阅读 · 2018年3月12日

【干货】ICCV2017 PoseTrack challenge优异方法：基于检测和跟踪的视频中人体姿态估计

【干货】ICCV2017 PoseTrack challenge优异方法：基于检测和跟踪的视频中人体姿态估计

专知

4+阅读 · 2017年12月29日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

Gated Channel Transformation for Visual Recognition

Arxiv

4+阅读 · 2020年3月27日

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Arxiv

4+阅读 · 2019年7月4日

Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

Arxiv

5+阅读 · 2018年12月19日

Efficient semantic image segmentation with superpixel pooling

Arxiv

6+阅读 · 2018年6月7日

Object Tracking in Satellite Videos Based on a Multi-Frame Optical Flow Tracker

Arxiv

5+阅读 · 2018年4月25日

Comparative Study of ECO and CFNet Trackers in Noisy Environment

Arxiv

5+阅读 · 2018年1月29日

Depth-Adaptive Computational Policies for Efficient Visual Tracking

Arxiv

8+阅读 · 2018年1月1日

A Unified Method for First and Third Person Action Recognition

Arxiv

3+阅读 · 2017年12月30日

Long-Term Visual Object Tracking Benchmark

Arxiv

7+阅读 · 2017年12月28日

Detect-and-Track: Efficient Pose Estimation in Videos

Arxiv

7+阅读 · 2017年12月26日

VIP会员

文章信息

相关主题

相关VIP内容

[NeurIPS 2020 oral] 基于因果干预的弱监督语义分割

专知会员服务

47+阅读 · 2020年10月5日

近期必读的六篇 IJCAI 2020【图神经网络 (GNN)+数据挖掘（DM）】相关论文

近期必读的六篇 IJCAI 2020【图神经网络 (GNN)+数据挖掘（DM）】相关论文

专知会员服务

75+阅读 · 2020年7月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Google】多模态Transformer视频检索，Multi-modal Transformer

【Google】多模态Transformer视频检索，Multi-modal Transformer

专知会员服务

103+阅读 · 2020年7月22日

【IJCAJ 2020】多通道神经网络 Multi-Channel Graph Neural Networks

【IJCAJ 2020】多通道神经网络 Multi-Channel Graph Neural Networks

专知会员服务

26+阅读 · 2020年7月19日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

[CVPR 2020 Oral-牛津] RandLA-Net:大场景三维点云语义分割新框架

[CVPR 2020 Oral-牛津] RandLA-Net:大场景三维点云语义分割新框架

专知会员服务

26+阅读 · 2020年3月15日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大型语言模型遇上文本属性图：一种融合框架与应用的综述

人工智能赋能自主武器与人类控制第三部分：人类控制与系统操作员 | 35页

【博士论文】用于概率程序与生成模型的变分推断

军事指挥控制系统：2025年5种用途

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CVPR2019| 04-22更新19篇论文及代码（2篇oral，含物体检测、动作识别、医学影像等）

CVPR2019| 04-22更新19篇论文及代码（2篇oral，含物体检测、动作识别、医学影像等）

极市平台

13+阅读 · 2019年4月22日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】基于视频修复的时空转换网络

【泡泡一分钟】基于视频修复的时空转换网络

泡泡机器人SLAM

5+阅读 · 2018年12月30日

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

泡泡机器人SLAM

13+阅读 · 2018年3月23日

(TensorFlow)实时语义分割比较研究

(TensorFlow)实时语义分割比较研究

机器学习研究会

9+阅读 · 2018年3月12日

【干货】ICCV2017 PoseTrack challenge优异方法：基于检测和跟踪的视频中人体姿态估计

【干货】ICCV2017 PoseTrack challenge优异方法：基于检测和跟踪的视频中人体姿态估计

专知

4+阅读 · 2017年12月29日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Gated Channel Transformation for Visual Recognition

Arxiv

4+阅读 · 2020年3月27日

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Arxiv

4+阅读 · 2019年7月4日

Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

Arxiv

5+阅读 · 2018年12月19日

Efficient semantic image segmentation with superpixel pooling

Arxiv

6+阅读 · 2018年6月7日

Object Tracking in Satellite Videos Based on a Multi-Frame Optical Flow Tracker

Arxiv

5+阅读 · 2018年4月25日

Comparative Study of ECO and CFNet Trackers in Noisy Environment

Arxiv

5+阅读 · 2018年1月29日

Depth-Adaptive Computational Policies for Efficient Visual Tracking

Arxiv

8+阅读 · 2018年1月1日

A Unified Method for First and Third Person Action Recognition

Arxiv

3+阅读 · 2017年12月30日

Long-Term Visual Object Tracking Benchmark

Arxiv

7+阅读 · 2017年12月28日

Detect-and-Track: Efficient Pose Estimation in Videos

Arxiv

7+阅读 · 2017年12月26日

微信扫码咨询专知VIP会员