Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward - Zhuanzhi Papers


September 25, 2022

Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward


Yunlong Tang, Siting Xu, Teng Wang, Qin Lin, Qinglin Lu, Feng Zheng

From arXiv; accepted by ACCV 2022

Advertisement video editing aims to automatically edit advertising videos into shorter videos while retaining coherent content and the crucial information conveyed by advertisers. It mainly consists of two stages: video segmentation and segment assemblage. The existing method performs well at the video segmentation stage but depends on extra, cumbersome models and performs poorly at the segment assemblage stage. To address these problems, we propose M-SAN (Multi-modal Segment Assemblage Network), which performs the segment assemblage task efficiently and coherently, end to end. It uses multi-modal representations extracted from the segments and follows the Encoder-Decoder Ptr-Net framework with an attention mechanism. An importance-coherence reward is designed for training M-SAN. We experiment on the Ads-1k dataset, which contains 1000+ videos covering rich ad scenarios collected from advertisers. To evaluate the methods, we propose a unified metric, Imp-Coh@Time, which jointly assesses the importance, coherence, and duration of the outputs. Experimental results show that our method outperforms both random selection and the previous method on this metric. Ablation experiments further verify that the multi-modal representation and the importance-coherence reward significantly improve performance. The Ads-1k dataset is available at: https://github.com/yunlong10/Ads-1k
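
For readers unfamiliar with the Ptr-Net framework mentioned in the abstract, the following is a minimal sketch of one decoding step of generic additive pointer attention. It is not the authors' M-SAN implementation: the function name `pointer_step`, the weight matrices `W1` and `W2`, the vector `v`, the toy inputs, and the masking of already selected segments are all assumptions made for illustration.

```python
import math


def pointer_step(enc, dec, W1, W2, v, selected):
    """One step of additive pointer attention (generic Ptr-Net form, hypothetical names).

    enc: list of encoder vectors, one per candidate segment
    dec: current decoder state vector
    W1, W2: projection matrices given as lists of rows
    v: scoring vector
    selected: set of segment indices that were already chosen
    Returns a probability for each segment; selected segments get 0.0.
    """
    if len(selected) >= len(enc):
        raise ValueError("no segment left to point at")

    def matvec(W, x):
        return [sum(w * xi for w, xi in zip(row, x)) for row in W]

    proj_dec = matvec(W2, dec)
    scores = []
    for j, e in enumerate(enc):
        if j in selected:
            # Mask segments that were already picked.
            scores.append(float("-inf"))
            continue
        proj_enc = matvec(W1, e)
        # u_j = v^T tanh(W1 e_j + W2 d)
        scores.append(
            sum(vk * math.tanh(a + b) for vk, a, b in zip(v, proj_enc, proj_dec))
        )

    # Numerically stable softmax; masked entries become exactly 0.0.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [x / z for x in exps]


# Toy example: three segments with 2-d features, segment 0 already selected.
enc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
dec = [0.5, -0.5]
identity = [[1.0, 0.0], [0.0, 1.0]]
probs = pointer_step(enc, dec, identity, identity, [1.0, 1.0], selected={0})
next_segment = max(range(len(probs)), key=probs.__getitem__)
```

In a trained model the weights would be learned, and the abstract indicates that training is driven by the importance-coherence reward; the reward itself is not specified here, so it is left out of the sketch.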

