Cross-modal person re-identification (Re-ID) is critical for modern video surveillance systems. The key challenge is to align inter-modality representations according to the semantic information present for a person while ignoring background information. In this work, we present AXM-Net, a novel CNN-based architecture designed for learning semantically aligned visual and textual representations. The underlying building block consists of multiple streams of feature maps from the visual and textual modalities together with a novel learnable context-sharing semantic alignment network. We also propose complementary intra-modal attention learning mechanisms to focus on finer-grained local details in the features, along with a cross-modal affinity loss for robust feature matching. Our design is unique in its ability to learn feature alignments implicitly from data. The entire AXM-Net can be trained in an end-to-end manner. We report results on both person search and cross-modal Re-ID tasks. Extensive experimentation validates the proposed framework and demonstrates its superiority, outperforming current state-of-the-art methods by a significant margin.
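To make the cross-modal matching objective concrete, the sketch below shows one plausible form of a cross-modal affinity loss between visual and textual embeddings: same-identity pairs are pulled toward high affinity and mismatched pairs are pushed below a margin. This is a minimal illustration, not the authors' released implementation; the function name `affinity_loss`, the hinge formulation, and the margin value are assumptions made for exposition.

```python
# Minimal sketch (assumed, not the paper's official code) of a cross-modal
# affinity objective: visual/textual embeddings of the same identity should
# have high cosine affinity, mismatched pairs should stay below a margin.
import torch
import torch.nn.functional as F


def affinity_loss(visual_feats, text_feats, labels, margin=0.3):
    """visual_feats, text_feats: (N, D) embeddings; labels: (N,) person IDs."""
    v = F.normalize(visual_feats, dim=1)
    t = F.normalize(text_feats, dim=1)
    affinity = v @ t.t()  # (N, N) cosine affinity matrix

    pos_mask = labels.unsqueeze(1).eq(labels.unsqueeze(0)).float()
    neg_mask = 1.0 - pos_mask

    # Pull same-identity pairs toward affinity 1, push others below the margin.
    pos_term = ((1.0 - affinity) * pos_mask).sum() / pos_mask.sum().clamp(min=1)
    neg_term = (F.relu(affinity - margin) * neg_mask).sum() / neg_mask.sum().clamp(min=1)
    return pos_term + neg_term


if __name__ == "__main__":
    # Toy usage with random features and identities.
    v = torch.randn(8, 256)
    t = torch.randn(8, 256)
    ids = torch.randint(0, 4, (8,))
    print(affinity_loss(v, t, ids))
```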