Our goal in this paper is the adaptation of image-text models for long video retrieval. Recent works have demonstrated state-of-the-art performance in video retrieval by adopting CLIP, effectively hitchhiking on the image-text representation for video tasks. However, there has been limited success in learning temporal aggregation that outperforms mean-pooling of the image-level representations extracted per frame by CLIP. We find that the simple yet effective baseline of a query-scored weighted mean of frame embeddings is a significant improvement over mean-pooling and all prior attempts at temporal modelling. In doing so, we provide an improved baseline for others to compare to, and demonstrate the state-of-the-art performance of this simple baseline on a suite of long video retrieval benchmarks.
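To make the baseline concrete, the sketch below illustrates query-scored weighted-mean pooling under stated assumptions: per-frame CLIP image embeddings are scored against the CLIP text embedding of the query, the scores are softmax-normalised, and the resulting weights replace the uniform weights of plain mean-pooling. The function name, the temperature value, and the normalisation choices are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F


def query_scored_pooling(frame_embs: torch.Tensor,
                         text_emb: torch.Tensor,
                         temperature: float = 0.01) -> torch.Tensor:
    """Weighted mean of per-frame CLIP embeddings, weighted by each frame's
    similarity to the text query (query-scoring).

    frame_embs: (num_frames, dim) image embeddings, one per sampled frame.
    text_emb:   (dim,) text embedding of the query.
    Returns a single (dim,) video-level embedding.
    """
    # L2-normalise so that dot products are cosine similarities, as in CLIP.
    frame_embs = F.normalize(frame_embs, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)

    # Score each frame against the query; softmax turns scores into weights.
    # (temperature is a hyperparameter assumed here for illustration.)
    scores = frame_embs @ text_emb                      # (num_frames,)
    weights = torch.softmax(scores / temperature, dim=0)

    # Weighted mean of the frame embeddings; uniform weights would recover
    # the plain mean-pooling baseline.
    return (weights.unsqueeze(-1) * frame_embs).sum(dim=0)
```

As the temperature grows, the weights approach uniform and the method reduces to mean-pooling; as it shrinks, the pooled embedding concentrates on the frames most relevant to the query.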