SVT-Net: 用于大规模地点识别的超光光重散射Voxel变异器 (SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition) - 专知论文

会员服务 ·

0

CSVT · 稀疏 · MoDELS · 缩放 · state-of-the-art ·

2021 年 12 月 13 日

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition

翻译：SVT-Net: 用于大规模地点识别的超光光重散射Voxel变异器

Zhaoxin Fan,Zhenbo Song,Hongyan Liu,Zhiwu Lu,Jun He,Xiaoyong Du

from arxiv, accepted to AAAI 2022

Point cloud-based large scale place recognition is fundamental for many applications like Simultaneous Localization and Mapping (SLAM). Although many models have been proposed and have achieved good performance by learning short-range local features, long-range contextual properties have often been neglected. Moreover, the model size has also become a bottleneck for their wide applications. To overcome these challenges, we propose a super light-weight network model termed SVT-Net for large scale place recognition. Specifically, on top of the highly efficient 3D Sparse Convolution (SP-Conv), an Atom-based Sparse Voxel Transformer (ASVT) and a Cluster-based Sparse Voxel Transformer (CSVT) are proposed to learn both short-range local features and long-range contextual features in this model. Consisting of ASVT and CSVT, SVT-Net can achieve state-of-the-art on benchmark datasets in terms of both accuracy and speed with a super-light model size (0.9M). Meanwhile, two simplified versions of SVT-Net are introduced, which also achieve state-of-the-art and further reduce the model size to 0.8M and 0.4M respectively.

翻译：以云为主的大型云点定位对于许多应用来说至关重要,如同声相向的本地化和绘图(SLAM)等。虽然提出了许多模型,并且通过学习短距离本地特征取得了良好的绩效,但长距离背景属性往往被忽视。此外,模型大小也成为其广泛应用的瓶颈。为了克服这些挑战,我们提出了一个超轻量网络模型,称为SVT-Net,用于大规模位置识别。具体地说,除了高效的 3D Sparse Convolution(SP-Conv)、以Atom为主的Sparse Voxel变异器(ASVT)和以集群为基础的Sparse Voxel变异器(CSVT)之外,还提议在该模型中学习短距离本地特征和长距离背景特征。 ASVT和CSVT的结合,SVT-Net可以实现超光速精确度和速度基准数据集的状态(0.9M),同时,还引入了两个SVT-Net的简化版本,分别实现0.18M和0.8M的状态和进一步缩小模型。

0

相关内容

CSVT

阿里巴巴发布最新《时间序列Transformer建模》综述论文

阿里巴巴发布最新《时间序列Transformer建模》综述论文

专知会员服务

137+阅读 · 2022年2月16日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

【CVPR2020】时序分组注意力视频超分

【CVPR2020】时序分组注意力视频超分

专知会员服务

31+阅读 · 2020年7月1日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

【Amazon】使用预先训练的Transformer模型进行数据增强

【Amazon】使用预先训练的Transformer模型进行数据增强

专知会员服务

58+阅读 · 2020年3月6日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

一文读懂Faster RCNN

一文读懂Faster RCNN

极市平台

5+阅读 · 2020年1月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR2019| 05-20更新17篇点云相关论文及代码合集

CVPR2019| 05-20更新17篇点云相关论文及代码合集

极市平台

23+阅读 · 2019年5月20日

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

极市平台

7+阅读 · 2019年1月31日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Arxiv

0+阅读 · 2022年2月15日

A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification

Arxiv

0+阅读 · 2022年2月15日

Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition

Arxiv

0+阅读 · 2022年2月14日

LighTN: Light-weight Transformer Network for Performance-overhead Tradeoff in Point Cloud Downsampling

Arxiv

0+阅读 · 2022年2月13日

Tiny Object Tracking: A Large-scale Dataset and A Baseline

Arxiv

0+阅读 · 2022年2月11日

Transformer in Transformer

Arxiv

11+阅读 · 2021年10月26日

ResT: An Efficient Transformer for Visual Recognition

Arxiv

3+阅读 · 2021年10月14日

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

Arxiv

4+阅读 · 2021年5月12日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

3D Face Modeling from Diverse Raw Scan Data

3D Face Modeling from Diverse Raw Scan Data

Arxiv

5+阅读 · 2019年2月13日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

阿里巴巴发布最新《时间序列Transformer建模》综述论文

阿里巴巴发布最新《时间序列Transformer建模》综述论文

专知会员服务

137+阅读 · 2022年2月16日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

【CVPR2020】时序分组注意力视频超分

【CVPR2020】时序分组注意力视频超分

专知会员服务

31+阅读 · 2020年7月1日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

【Amazon】使用预先训练的Transformer模型进行数据增强

【Amazon】使用预先训练的Transformer模型进行数据增强

专知会员服务

58+阅读 · 2020年3月6日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

一文读懂Faster RCNN

一文读懂Faster RCNN

极市平台

5+阅读 · 2020年1月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR2019| 05-20更新17篇点云相关论文及代码合集

CVPR2019| 05-20更新17篇点云相关论文及代码合集

极市平台

23+阅读 · 2019年5月20日

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

视频理解 S3D，I3D-GCN，SlowFastNet, LFB

极市平台

7+阅读 · 2019年1月31日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

相关论文

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Arxiv

0+阅读 · 2022年2月15日

A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification

Arxiv

0+阅读 · 2022年2月15日

Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition

Arxiv

0+阅读 · 2022年2月14日

LighTN: Light-weight Transformer Network for Performance-overhead Tradeoff in Point Cloud Downsampling

Arxiv

0+阅读 · 2022年2月13日

Tiny Object Tracking: A Large-scale Dataset and A Baseline

Arxiv

0+阅读 · 2022年2月11日

Transformer in Transformer

Arxiv

11+阅读 · 2021年10月26日

ResT: An Efficient Transformer for Visual Recognition

Arxiv

3+阅读 · 2021年10月14日

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

Arxiv

4+阅读 · 2021年5月12日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

3D Face Modeling from Diverse Raw Scan Data

3D Face Modeling from Diverse Raw Scan Data

Arxiv

5+阅读 · 2019年2月13日

微信扫码咨询专知VIP会员