
September 26, 2021

A Video Summarization Method Using Temporal Interest Detection and Key Frame Prediction


Yubo An, Shenghui Zhao

This paper proposes a video summarization method using temporal interest detection and key frame prediction for supervised video summarization, where the task is formulated as a combination of sequence labeling and temporal interest detection problems. We first build a flexible, universal network framework that simultaneously predicts frame-level importance scores and temporal interest segments, and then combine the two components with different weights to achieve a more detailed video summarization. Extensive experiments and analysis on two benchmark datasets demonstrate the effectiveness of the method. Specifically, compared with other state-of-the-art methods, its performance is higher by at least 2.6% and 4.2% on TVSum and SumMe, respectively.
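The abstract does not say how the two predictions are fused, so the following is only a minimal sketch of one plausible reading: each frame's importance score is mixed with the score of the temporal interest segment covering it, using two weights. The function names `combine_scores` and `select_key_frames`, the `(start, end, score)` segment format, the default weights, and the top-k selection rule are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Tuple

# Hypothetical segment format: (start frame, end frame exclusive, interest score).
Segment = Tuple[int, int, float]


def combine_scores(
    frame_scores: List[float],
    segments: List[Segment],
    w_frame: float = 0.5,
    w_segment: float = 0.5,
) -> List[float]:
    """Weighted sum of per-frame importance and per-segment interest.

    Assumption: a frame covered by several segments takes the highest
    segment score; a frame covered by none takes 0.
    """
    n = len(frame_scores)
    segment_scores = [0.0] * n
    for start, end, score in segments:
        # Clip segment boundaries to the valid frame range.
        for t in range(max(start, 0), min(end, n)):
            segment_scores[t] = max(segment_scores[t], score)
    return [
        w_frame * f + w_segment * s
        for f, s in zip(frame_scores, segment_scores)
    ]


def select_key_frames(combined: List[float], budget: int) -> List[int]:
    """Return the indices of the `budget` highest-scoring frames, in time order."""
    ranked = sorted(range(len(combined)), key=lambda t: combined[t], reverse=True)
    return sorted(ranked[:budget])


# Toy inputs: six frames and one detected interest segment over frames 3..5.
frame_scores = [0.1, 0.9, 0.2, 0.4, 0.8, 0.3]
segments = [(3, 6, 1.0)]

with_segments = combine_scores(frame_scores, segments)
frames_only = combine_scores(frame_scores, segments, w_frame=1.0, w_segment=0.0)

summary_with_segments = select_key_frames(with_segments, budget=3)
summary_frames_only = select_key_frames(frames_only, budget=3)
```

In this toy input, changing the weights changes which frames are selected: with equal weights the frames inside the detected segment are favored, while setting `w_segment` to 0 falls back to ranking by frame-level importance alone.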



