Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU - Zhuanzhi Papers (专知论文)


July 23, 2021

Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU

Napat Wanchaitanawong, Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

From arXiv; accepted by MVA 2021

The combined use of multiple modalities enables accurate pedestrian detection under poor lighting conditions by exploiting the regions that are highly visible in each modality. The key assumption behind this combined use is that there is no misalignment, or only weak misalignment, between the two modalities. In practice, however, this assumption often breaks down. When it does, the positions of the bounding boxes do not match between the two modalities, and detection accuracy drops significantly, especially in regions where the misalignment is large. In this paper, we propose a multi-modal Faster R-CNN that is robust against large misalignment. The keys are 1) modal-wise regression and 2) multi-modal IoU for mini-batch sampling. To deal with large misalignment, we perform bounding box regression for both the RPN and the detection head in both modalities. We also propose a new sampling strategy called "multi-modal mini-batch sampling" that integrates the IoU of both modalities. Through experiments on real images, we demonstrate that the proposed method performs much better than state-of-the-art methods on data with large misalignment.
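The abstract does not give the formula for the multi-modal IoU, so the following is only a minimal sketch of one way the IoU of two modalities could be integrated into a single score for mini-batch sampling. It assumes that each proposal and each ground truth has a separate box per modality (as modal-wise regression would produce), and that the scores are combined by summing the per-modality intersections and unions. The function names, the box layout `(x1, y1, x2, y2)`, and the 0.5 foreground threshold are all hypothetical choices made for illustration, not the authors' implementation.

```python
from typing import Tuple

# Axis-aligned box as (x1, y1, x2, y2); this layout is an assumption.
Box = Tuple[float, float, float, float]


def area(box: Box) -> float:
    """Area of a box; degenerate boxes count as zero."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def intersection(a: Box, b: Box) -> float:
    """Area of the overlap between two boxes."""
    overlap = (max(a[0], b[0]), max(a[1], b[1]),
               min(a[2], b[2]), min(a[3], b[3]))
    return area(overlap)


def multimodal_iou(box_rgb: Box, gt_rgb: Box,
                   box_thermal: Box, gt_thermal: Box) -> float:
    """Assumed combination rule: sum of intersections over sum of unions."""
    inter_rgb = intersection(box_rgb, gt_rgb)
    inter_thermal = intersection(box_thermal, gt_thermal)
    union_rgb = area(box_rgb) + area(gt_rgb) - inter_rgb
    union_thermal = area(box_thermal) + area(gt_thermal) - inter_thermal
    total_union = union_rgb + union_thermal
    if total_union <= 0.0:
        return 0.0
    return (inter_rgb + inter_thermal) / total_union


def is_foreground(score: float, fg_thresh: float = 0.5) -> bool:
    """Hypothetical sampling rule: a proposal is a positive sample
    when its multi-modal IoU reaches the threshold."""
    return score >= fg_thresh


# The proposal matches the ground truth perfectly in one modality,
# but is shifted by half a box width in the other.
score = multimodal_iou(
    box_rgb=(0, 0, 10, 10), gt_rgb=(0, 0, 10, 10),
    box_thermal=(0, 0, 10, 10), gt_thermal=(5, 0, 15, 10),
)
print(round(score, 3), is_foreground(score))
```

In this example the shifted modality alone would score 50 / 150, which is below the threshold, whereas the combined score is 150 / 250 = 0.6, so the proposal would still be sampled as a positive. This illustrates why a score that looks at both modalities can behave differently from a single-modality IoU when the boxes are misaligned.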

