CAP-Net:用于3D形状分析的通信-警告点观点融合网络 (CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis) - 专知论文

会员服务 ·

0

3D · 可约的 · 塑造 · 模态 · Networking ·

2021 年 9 月 3 日

CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis

翻译：CAP-Net:用于3D形状分析的通信-警告点观点融合网络

Xinwei He,Silin Cheng,Song Bai,Xiang Bai

Learning 3D representations by fusing point cloud and multi-view data has been proven to be fairly effective. While prior works typically focus on exploiting global features of the two modalities, in this paper we argue that more discriminative features can be derived by modeling "where to fuse". To investigate this, we propose a novel Correspondence-Aware Point-view Fusion Net (CAPNet). The core element of CAP-Net is a module named Correspondence-Aware Fusion (CAF) which integrates the local features of the two modalities based on their correspondence scores. We further propose to filter out correspondence scores with low values to obtain salient local correspondences, which reduces redundancy for the fusion process. In our CAP-Net, we utilize the CAF modules to fuse the multi-scale features of the two modalities both bidirectionally and hierarchically in order to obtain more informative features. Comprehensive evaluations on popular 3D shape benchmarks covering 3D object classification and retrieval show the superiority of the proposed framework.

翻译：通过发泡点云和多视图数据进行学习的3D表达方式已证明相当有效。虽然先前的工作通常侧重于利用两种模式的全球特征,但我们在本文中认为,通过“连接点”的建模可以产生更多的歧视性特征。为了调查这一点,我们提议建立一个新型的通信-软件点视图融合网(CAPNet),CAP-Net的核心元素是一个名为“通信-软件融合”的模块,该模块根据通信分数整合两种模式的本地特征。我们进一步建议过滤低值的通信分数,以获得显著的本地通信,从而减少聚合过程的冗余。在我们CAP-Net中,我们利用CAF模块将两种模式的多尺度特征双向和分级连接起来,以获取更多信息。对涵盖3D对象分类和检索的通用3D形状基准的全面评价显示了拟议框架的优越性。

0

相关内容

3D是英文“Three Dimensions”的简称，中文是指三维、三个维度、三个坐标，即有长、有宽、有高，换句话说，就是立体的，是相对于只有长和宽的平面（2D）而言。

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【WWW2021】神经协同推理

专知会员服务

58+阅读 · 2021年5月17日

【CVPR2021】LiDAR R-CNN：一种快速、通用的二阶段3D检测器

专知会员服务

16+阅读 · 2021年4月3日

最新《深度学习视频超分》综述论文，30页pdf，Video Super Resolution Based on Deep Learning: A comprehensive survey

最新《深度学习视频超分》综述论文，30页pdf，Video Super Resolution Based on Deep Learning: A comprehensive survey

专知会员服务

24+阅读 · 2020年7月28日

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

专知会员服务

60+阅读 · 2020年7月14日

3D目标检测进展综述

3D目标检测进展综述

专知会员服务

193+阅读 · 2020年4月24日

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

专知会员服务

39+阅读 · 2020年2月21日

八篇 ICCV 2019 【图神经网络（GNN）+CV】相关论文

八篇 ICCV 2019 【图神经网络（GNN）+CV】相关论文

专知会员服务

30+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

极市平台

7+阅读 · 2019年1月31日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【泡泡点云时空】PU-Net：点云上采样网络（CVPR2018-6）

【泡泡点云时空】PU-Net：点云上采样网络（CVPR2018-6）

泡泡机器人SLAM

6+阅读 · 2018年8月16日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新七篇行人再识别相关论文—深度排序、风格自适应、对抗、重排序、多层次相似性、深度空间特征重构、图对应迁移

【论文推荐】最新七篇行人再识别相关论文—深度排序、风格自适应、对抗、重排序、多层次相似性、深度空间特征重构、图对应迁移

专知

6+阅读 · 2018年4月14日

视频超分辨 Detail-revealing Deep Video Super-resolution 论文笔记

视频超分辨 Detail-revealing Deep Video Super-resolution 论文笔记

统计学习与视觉计算组

17+阅读 · 2018年3月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

3D Pose Transfer with Correspondence Learning and Mesh Refinement

Arxiv

0+阅读 · 2021年10月26日

$C$^{4}$Net: Contextual Compression and Complementary Combination Network for Salient Object Detection$

C$^{4}$Net: Contextual Compression and Complementary Combination Network for Salient Object Detection

Arxiv

0+阅读 · 2021年10月22日

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Arxiv

4+阅读 · 2021年1月17日

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

Arxiv

9+阅读 · 2019年9月16日

Multimodal Semantic Attention Network for Video Captioning

Arxiv

4+阅读 · 2019年5月8日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Dual Memory Network Model for Biased Product Review Classification

Dual Memory Network Model for Biased Product Review Classification

Arxiv

3+阅读 · 2018年9月16日

Recurrent Fusion Network for Image Captioning

Recurrent Fusion Network for Image Captioning

Arxiv

3+阅读 · 2018年7月31日

Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

Arxiv

6+阅读 · 2018年4月15日

Deep Learning for Video Classification and Captioning

Arxiv

9+阅读 · 2018年2月22日

VIP会员

文章信息

相关主题

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【WWW2021】神经协同推理

专知会员服务

58+阅读 · 2021年5月17日

【CVPR2021】LiDAR R-CNN：一种快速、通用的二阶段3D检测器

专知会员服务

16+阅读 · 2021年4月3日

最新《深度学习视频超分》综述论文，30页pdf，Video Super Resolution Based on Deep Learning: A comprehensive survey

最新《深度学习视频超分》综述论文，30页pdf，Video Super Resolution Based on Deep Learning: A comprehensive survey

专知会员服务

24+阅读 · 2020年7月28日

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

专知会员服务

60+阅读 · 2020年7月14日

3D目标检测进展综述

3D目标检测进展综述

专知会员服务

193+阅读 · 2020年4月24日

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

专知会员服务

39+阅读 · 2020年2月21日

八篇 ICCV 2019 【图神经网络（GNN）+CV】相关论文

八篇 ICCV 2019 【图神经网络（GNN）+CV】相关论文

专知会员服务

30+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

极市平台

7+阅读 · 2019年1月31日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【泡泡点云时空】PU-Net：点云上采样网络（CVPR2018-6）

【泡泡点云时空】PU-Net：点云上采样网络（CVPR2018-6）

泡泡机器人SLAM

6+阅读 · 2018年8月16日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新七篇行人再识别相关论文—深度排序、风格自适应、对抗、重排序、多层次相似性、深度空间特征重构、图对应迁移

【论文推荐】最新七篇行人再识别相关论文—深度排序、风格自适应、对抗、重排序、多层次相似性、深度空间特征重构、图对应迁移

专知

6+阅读 · 2018年4月14日

视频超分辨 Detail-revealing Deep Video Super-resolution 论文笔记

视频超分辨 Detail-revealing Deep Video Super-resolution 论文笔记

统计学习与视觉计算组

17+阅读 · 2018年3月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

3D Pose Transfer with Correspondence Learning and Mesh Refinement

Arxiv

0+阅读 · 2021年10月26日

$C$^{4}$Net: Contextual Compression and Complementary Combination Network for Salient Object Detection$

C$^{4}$Net: Contextual Compression and Complementary Combination Network for Salient Object Detection

Arxiv

0+阅读 · 2021年10月22日

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Arxiv

4+阅读 · 2021年1月17日

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

Arxiv

9+阅读 · 2019年9月16日

Multimodal Semantic Attention Network for Video Captioning

Arxiv

4+阅读 · 2019年5月8日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Dual Memory Network Model for Biased Product Review Classification

Dual Memory Network Model for Biased Product Review Classification

Arxiv

3+阅读 · 2018年9月16日

Recurrent Fusion Network for Image Captioning

Recurrent Fusion Network for Image Captioning

Arxiv

3+阅读 · 2018年7月31日

Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

Arxiv

6+阅读 · 2018年4月15日

Deep Learning for Video Classification and Captioning

Arxiv

9+阅读 · 2018年2月22日

微信扫码咨询专知VIP会员