部分相关视频获取 (Partially Relevant Video Retrieval) - 专知论文

会员服务 ·

0

矩 · Learning · Bagging · Extensibility · 相似度 ·

2022 年 8 月 26 日

Partially Relevant Video Retrieval

翻译：部分相关视频获取

Jianfeng Dong,Xianke Chen,Minsong Zhang,Xun Yang,Shujie Chen,Xirong Li,Xun Wang

from arxiv, Accepted by ACM MM 2022. The paper's homepage is http://danieljf24.github.io/prvr

Current methods for text-to-video retrieval (T2VR) are trained and tested on video-captioning oriented datasets such as MSVD, MSR-VTT and VATEX. A key property of these datasets is that videos are assumed to be temporally pre-trimmed with short duration, whilst the provided captions well describe the gist of the video content. Consequently, for a given paired video and caption, the video is supposed to be fully relevant to the caption. In reality, however, as queries are not known a priori, pre-trimmed video clips may not contain sufficient content to fully meet the query. This suggests a gap between the literature and the real world. To fill the gap, we propose in this paper a novel T2VR subtask termed Partially Relevant Video Retrieval (PRVR). An untrimmed video is considered to be partially relevant w.r.t. a given textual query if it contains a moment relevant to the query. PRVR aims to retrieve such partially relevant videos from a large collection of untrimmed videos. PRVR differs from single video moment retrieval and video corpus moment retrieval, as the latter two are to retrieve moments rather than untrimmed videos. We formulate PRVR as a multiple instance learning (MIL) problem, where a video is simultaneously viewed as a bag of video clips and a bag of video frames. Clips and frames represent video content at different time scales. We propose a Multi-Scale Similarity Learning (MS-SL) network that jointly learns clip-scale and frame-scale similarities for PRVR. Extensive experiments on three datasets (TVR, ActivityNet Captions, and Charades-STA) demonstrate the viability of the proposed method. We also show that our method can be used for improving video corpus moment retrieval.

翻译：文本到视频检索( T2VR) 的当前方法在MSVD、 MSSR- VTT 和 VATEX 等以视频为主的数据集中经过培训和测试。这些数据集的关键属性是假设视频在时间上是临时的预断,但所提供的字幕很好地描述了视频内容的格子。因此,对于配对视频和字幕来说,视频应该与标题完全相关。然而,在现实中,由于查询不为前置之人所知,预剪视频剪辑可能没有足够内容以完全满足查询。这表明文献与真实世界之间存在差距。为了填补这一差距,我们在此文件中建议一个名为“部分相关的视频Retrival(PRVR)”的新版本的T2VR 子片段。对于一个未剪辑的视频,如果它包含一个与查询相关的时刻,那么一个给的文本查询。 PRVR 旨在从一个大范围的图像库中取取出部分相关的视频。

0

相关内容

【NUS-Xavier教授】注意力神经网络，79页ppt

【NUS-Xavier教授】注意力神经网络，79页ppt

专知会员服务

66+阅读 · 2021年11月25日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

人类转录因子基因家族调控网络进化模式研究

国家自然科学基金

0+阅读 · 2015年12月31日

AG-WUS-PcG-lncRNA互作对梅多雌蕊发育的调控

国家自然科学基金

0+阅读 · 2015年12月31日

飞蝗型变可塑性的父性遗传及其表观遗传机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

PRC1在胚胎干细胞生长和分化中识别和抑制靶基因的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

金属玻璃孔洞化现象的微观机制

国家自然科学基金

0+阅读 · 2013年12月31日

石墨烯材料的共价修饰和光谱电化学研究

国家自然科学基金

0+阅读 · 2012年12月31日

激光喷丸对非晶合金室温塑性的影响机制与室温塑性成形

国家自然科学基金

0+阅读 · 2011年12月31日

FMRP通过microRNA介导调控成体神经干细胞增殖和分化的研究

国家自然科学基金

0+阅读 · 2009年12月31日

信号转导通路和表观遗传模式在双酚A神经发育毒性中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

Generative Multi-hop Retrieval

Arxiv

0+阅读 · 2022年10月16日

Bearing-based Relative Localization for Robotic Swarm with Partially Mutual Observations

Arxiv

0+阅读 · 2022年10月15日

Learning to Locate Visual Answer in Video Corpus Using Question

Arxiv

0+阅读 · 2022年10月13日

Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets

Arxiv

0+阅读 · 2022年10月13日

RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval

Arxiv

0+阅读 · 2022年10月13日

Language Agnostic Multilingual Information Retrieval with Contrastive Learning

Arxiv

0+阅读 · 2022年10月12日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

Multi-view Knowledge Graph Embedding for Entity Alignment

Arxiv

36+阅读 · 2019年6月6日

DeepSeek: Content Based Image Search & Retrieval

Arxiv

13+阅读 · 2018年1月11日

VIP会员

文章信息

相关主题

相关VIP内容

【NUS-Xavier教授】注意力神经网络，79页ppt

【NUS-Xavier教授】注意力神经网络，79页ppt

专知会员服务

66+阅读 · 2021年11月25日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

不确定环境下无人机三维路径规划研究 | 221页

远征作战军事后勤规划

大语言模型将如何改变军事指挥结构

美陆军能力集成与开发系统（ACIDS）流程指南 | 2025最新122页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Generative Multi-hop Retrieval

Arxiv

0+阅读 · 2022年10月16日

Bearing-based Relative Localization for Robotic Swarm with Partially Mutual Observations

Arxiv

0+阅读 · 2022年10月15日

Learning to Locate Visual Answer in Video Corpus Using Question

Arxiv

0+阅读 · 2022年10月13日

Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets

Arxiv

0+阅读 · 2022年10月13日

RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval

Arxiv

0+阅读 · 2022年10月13日

Language Agnostic Multilingual Information Retrieval with Contrastive Learning

Arxiv

0+阅读 · 2022年10月12日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

Multi-view Knowledge Graph Embedding for Entity Alignment

Arxiv

36+阅读 · 2019年6月6日

DeepSeek: Content Based Image Search & Retrieval

Arxiv

13+阅读 · 2018年1月11日

相关基金

人类转录因子基因家族调控网络进化模式研究

国家自然科学基金

0+阅读 · 2015年12月31日

AG-WUS-PcG-lncRNA互作对梅多雌蕊发育的调控

国家自然科学基金

0+阅读 · 2015年12月31日

飞蝗型变可塑性的父性遗传及其表观遗传机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

PRC1在胚胎干细胞生长和分化中识别和抑制靶基因的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

金属玻璃孔洞化现象的微观机制

国家自然科学基金

0+阅读 · 2013年12月31日

石墨烯材料的共价修饰和光谱电化学研究

国家自然科学基金

0+阅读 · 2012年12月31日

激光喷丸对非晶合金室温塑性的影响机制与室温塑性成形

国家自然科学基金

0+阅读 · 2011年12月31日

FMRP通过microRNA介导调控成体神经干细胞增殖和分化的研究

国家自然科学基金

0+阅读 · 2009年12月31日

信号转导通路和表观遗传模式在双酚A神经发育毒性中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员