Visual object recognition systems need to generalize from a set of 2D training views to novel views. The question of how the human visual system generalizes to novel views has been studied and modeled in psychology, computer vision, and neuroscience. Modern deep learning architectures for object recognition generalize well to novel views, but the underlying mechanisms are not well understood. In this paper, we characterize the ability of common deep learning architectures to generalize to novel views. We formulate this as a supervised classification task where labels correspond to unique 3D objects and examples correspond to 2D views of the objects at different 3D orientations. We consider three common models of generalization to novel views: (i) full 3D generalization, (ii) pure 2D matching, and (iii) matching based on a linear combination of views. We find that deep models generalize well to novel views, but they do so in a way that differs from all of these existing models. Extrapolation beyond the range spanned by the training views is limited, and extrapolation to novel rotation axes is even more limited, implying that the networks neither infer full 3D structure nor rely on linear combinations of views. Yet generalization is far superior to pure 2D matching. These findings inform the design of training datasets whose 2D views suffice for 3D generalization. Code to reproduce our experiments is publicly available: https://github.com/shoaibahmed/investigating_3d_generalization.git
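The two non-3D reference models named above can be made concrete with simple baselines: pure 2D matching (ii) reduces to a nearest-neighbor distance in pixel space, while linear combination of views (iii) reduces to a least-squares projection onto the span of the stored training views. The following is a minimal sketch under those assumptions; the data, variable names, and dimensions are purely illustrative and are not from the paper's experiments.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative toy data: each "view" is a flattened 2D image of one object
# rendered at some orientation (shapes are arbitrary for this sketch).
n_train_views, n_pixels = 5, 64
train_views = rng.normal(size=(n_train_views, n_pixels))
# A novel view that lies near the span of the training views.
novel_view = train_views.mean(axis=0) + 0.1 * rng.normal(size=n_pixels)

# (ii) Pure 2D matching: distance from the novel view to the single
# closest stored training view.
match_2d_error = np.min(np.linalg.norm(train_views - novel_view, axis=1))

# (iii) Linear combination of views: residual after projecting the novel
# view onto the linear span of all training views (least squares).
coeffs, *_ = np.linalg.lstsq(train_views.T, novel_view, rcond=None)
lin_comb_error = np.linalg.norm(train_views.T @ coeffs - novel_view)

# Each stored view is itself a (trivial) linear combination, so the
# projection residual can never exceed the best single-view match.
print(lin_comb_error <= match_2d_error)
```

Under either baseline, a smaller residual means the novel view is better "explained" by the stored views; the paper's observation is that trained networks behave better than (ii) but fail the extrapolation tests that (iii) and full 3D inference (i) would pass.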