GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation - 专知 Papers


July 21, 2022

GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation


Shi Gong,Xiaoqing Ye,Xiao Tan,Jingdong Wang,Errui Ding,Yu Zhou,Xiang Bai

from arxiv, ECCV 2022

Birds-eye-view (BEV) semantic segmentation is critical for autonomous driving because of its powerful spatial representation. Estimating BEV semantic maps from monocular images is challenging because of the spatial gap between the two views: the model must implicitly perform both the perspective-to-BEV transformation and the segmentation. We present a novel two-stage Geometric Prior-based Transformation framework named GitNet, consisting of (i) geometry-guided pre-alignment and (ii) a ray-based transformer. In the first stage, we decouple BEV segmentation into perspective image segmentation and geometric prior-based mapping. Explicit supervision, obtained by projecting the BEV semantic labels onto the image plane, lets the network learn visibility-aware features and a learnable geometry for translating them into BEV space. In the second stage, the pre-aligned coarse BEV features are further deformed by ray-based transformers to take visibility knowledge into account. GitNet achieves leading performance on the challenging nuScenes and Argoverse datasets.
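The abstract's first stage relies on projecting BEV semantic labels onto the image plane. The sketch below illustrates that idea with a plain pinhole camera model; it is not the authors' implementation. It assumes a flat ground plane, a camera with no rotation relative to the ground, and a BEV grid whose rows run forward from the camera. All names (`project_ground_point`, `project_bev_labels`, `cell_size`, `cam_height`) and the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Tuple


def project_ground_point(
    x: float, z: float, cam_height: float,
    fx: float, fy: float, cx: float, cy: float,
) -> Optional[Tuple[float, float]]:
    """Project a ground-plane point to pixel coordinates (u, v).

    Assumed camera frame: x right, y down, z forward, so the ground
    plane sits at y = cam_height. Points at or behind the camera
    (z <= 0) cannot be projected and yield None.
    """
    if z <= 0:
        return None
    u = fx * x / z + cx
    v = fy * cam_height / z + cy
    return u, v


def project_bev_labels(
    bev: List[List[int]], cell_size: float, cam_height: float,
    fx: float, fy: float, cx: float, cy: float,
    width: int, height: int,
) -> List[List[int]]:
    """Paint each BEV cell's label at the pixel where its centre lands.

    bev[i][j] is the label of the cell in row i (forward distance) and
    column j (lateral offset, centred on the camera). Pixels that no
    cell reaches keep label 0; cells outside the image are skipped.
    """
    image = [[0] * width for _ in range(height)]
    n_cols = len(bev[0])
    for i, row in enumerate(bev):
        z = (i + 0.5) * cell_size  # forward distance of the cell centre
        for j, label in enumerate(row):
            x = (j - n_cols / 2 + 0.5) * cell_size  # lateral offset
            pixel = project_ground_point(x, z, cam_height, fx, fy, cx, cy)
            if pixel is None:
                continue
            u, v = int(round(pixel[0])), int(round(pixel[1]))
            if 0 <= u < width and 0 <= v < height:
                image[v][u] = label
    return image
```

Under these assumptions, nearer cells land lower in the image and farther cells move toward the principal point row `cy`, which is the perspective-versus-BEV gap the abstract describes. A real pipeline would use calibrated extrinsics and fill whole cell footprints rather than single pixels.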

