交并比平滑在边界框回归中的应用 (Intersection over Union with smoothing for bounding box regression) - 专知论文

会员服务 ·

0

平滑 · 边界框 · 损失函数 · 损失 · 搜索空间 ·

2023 年 3 月 27 日

Intersection over Union with smoothing for bounding box regression

翻译：交并比平滑在边界框回归中的应用

Petra Števuliáková,Petr Hurtik

from arxiv, 11 pages, 4 figures, 4 tables, IWANN2023 conference

We focus on the construction of a loss function for the bounding box regression. The Intersection over Union (IoU) metric is improved to converge faster, to make the surface of the loss function smooth and continuous over the whole searched space, and to reach a more precise approximation of the labels. The main principle is adding a smoothing part to the original IoU, where the smoothing part is given by a linear space with values that increases from the ground truth bounding box to the border of the input image, and thus covers the whole spatial search space. We show the motivation and formalism behind this loss function and experimentally prove that it outperforms IoU, DIoU, CIoU, and SIoU by a large margin. We experimentally show that the proposed loss function is robust with respect to the noise in the dimension of ground truth bounding boxes. The reference implementation is available at gitlab.com/irafm-ai/smoothing-iou.

翻译：我们关注于构建边界框回归的损失函数。我们改进了交并比（IoU）度量标准，使其更快地收敛，使损失函数在整个搜索空间上变得平滑和连续，并使其更准确地逼近标签。主要原则是向原始IoU添加一个平滑部分，其中平滑部分由一个线性空间给出，其值从真实边界框到输入图像的边缘逐渐增加，因此覆盖了整个空间搜索空间。我们展示了这个损失函数背后的动机和形式化，并通过实验证明，它比IoU、DIoU、CIoU和SIoU的效果要好得多。我们通过实验证明，所提出的损失函数对于标注的噪声具有鲁棒性。参考实现可在 gitlab.com/irafm-ai/smoothing-iou 上找到。

0

相关内容

从30+场秋招面试中总结出的超强面经——目标检测篇（含答案）

从30+场秋招面试中总结出的超强面经——目标检测篇（含答案）

专知会员服务

16+阅读 · 2023年1月10日

【干货书】计算优化:实践中的成功，415页pdf

【干货书】计算优化:实践中的成功，415页pdf

专知会员服务

70+阅读 · 2022年12月29日

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

88+阅读 · 2021年12月9日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【开放新书】可验证深度学习，91页pdf阐述Deep Learning的鲁棒性，提升安全可靠性

【开放新书】可验证深度学习，91页pdf阐述Deep Learning的鲁棒性，提升安全可靠性

专知会员服务

59+阅读 · 2020年4月11日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

CVPR 2019：精确目标检测的不确定边界框回归

CVPR 2019：精确目标检测的不确定边界框回归

AI科技评论

13+阅读 · 2019年9月16日

初学者系列：基于Keras的Faster-RCNN的代码学习

初学者系列：基于Keras的Faster-RCNN的代码学习

专知

17+阅读 · 2019年8月9日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

一文读懂目标检测：R-CNN、Fast R-CNN、Faster R-CNN、YOLO、SSD

一文读懂目标检测：R-CNN、Fast R-CNN、Faster R-CNN、YOLO、SSD

七月在线实验室

11+阅读 · 2018年7月18日

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

AI研习社

10+阅读 · 2018年5月12日

讲透RCNN, Fast-RCNN, Faster-RCNN，将CNN用于目标检测

讲透RCNN, Fast-RCNN, Faster-RCNN，将CNN用于目标检测

数据挖掘入门与实战

18+阅读 · 2018年4月20日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

【CNN】一文读懂卷积神经网络CNN

【CNN】一文读懂卷积神经网络CNN

产业智能官

18+阅读 · 2018年1月2日

改进型网络模型中若干组合优化问题的复杂性理论与算法设计研究

国家自然科学基金

0+阅读 · 2014年12月31日

PARP-1调控急性肺损伤中中性粒细胞浸润和活化的作用及其分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

陶瓷产业集群与区域经济发展实证研究——以江西景德镇为例

国家自然科学基金

0+阅读 · 2012年12月31日

脂肪因子chemerin通过ChemR23依赖性途径对动脉粥样硬化发生、发展和斑块稳定性影响及其作用机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向安全等级的安全需求工程方法与环境

国家自然科学基金

0+阅读 · 2012年12月31日

多维高次有限元超收敛后处理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PPARγ调控PI3K/Akt在胰岛素抵抗中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

线性积分方程的Galerkin快速谱方法

国家自然科学基金

0+阅读 · 2009年12月31日

Erbin在细胞分裂周期中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Intermedin调节低氧性肺血管改建的作用及分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

Deep Fourier Residual method for solving time-harmonic Maxwell's equations

Arxiv

0+阅读 · 2023年5月16日

Weighted Intersection over Union (wIoU): A New Evaluation Metric for Image Segmentation

Arxiv

0+阅读 · 2023年5月16日

Wavelet-Based Density Estimation for Persistent Homology

Arxiv

0+阅读 · 2023年5月15日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

CLRerNet: Improving Confidence of Lane Detection with LaneIoU

Arxiv

0+阅读 · 2023年5月15日

Subspace Culling for Ray-Box Intersection

Arxiv

0+阅读 · 2023年5月15日

Power Allocation for the Base Matrix of Spatially Coupled Sparse Regression Codes

Arxiv

0+阅读 · 2023年5月13日

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

Arxiv

11+阅读 · 2018年12月8日

Rotation-Sensitive Regression for Oriented Scene Text Detection

Arxiv

13+阅读 · 2018年3月14日

Additive Margin Softmax for Face Verification

Arxiv

11+阅读 · 2018年1月18日

VIP会员

文章信息

相关主题

相关VIP内容

从30+场秋招面试中总结出的超强面经——目标检测篇（含答案）

从30+场秋招面试中总结出的超强面经——目标检测篇（含答案）

专知会员服务

16+阅读 · 2023年1月10日

【干货书】计算优化:实践中的成功，415页pdf

【干货书】计算优化:实践中的成功，415页pdf

专知会员服务

70+阅读 · 2022年12月29日

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

88+阅读 · 2021年12月9日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【开放新书】可验证深度学习，91页pdf阐述Deep Learning的鲁棒性，提升安全可靠性

【开放新书】可验证深度学习，91页pdf阐述Deep Learning的鲁棒性，提升安全可靠性

专知会员服务

59+阅读 · 2020年4月11日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《社交媒体信息作战》最新48页技术报告

《美空军条令出版物：战略打击》最新条令

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

相关资讯

CVPR 2019：精确目标检测的不确定边界框回归

CVPR 2019：精确目标检测的不确定边界框回归

AI科技评论

13+阅读 · 2019年9月16日

初学者系列：基于Keras的Faster-RCNN的代码学习

初学者系列：基于Keras的Faster-RCNN的代码学习

专知

17+阅读 · 2019年8月9日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

一文读懂目标检测：R-CNN、Fast R-CNN、Faster R-CNN、YOLO、SSD

一文读懂目标检测：R-CNN、Fast R-CNN、Faster R-CNN、YOLO、SSD

七月在线实验室

11+阅读 · 2018年7月18日

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

AI研习社

10+阅读 · 2018年5月12日

讲透RCNN, Fast-RCNN, Faster-RCNN，将CNN用于目标检测

讲透RCNN, Fast-RCNN, Faster-RCNN，将CNN用于目标检测

数据挖掘入门与实战

18+阅读 · 2018年4月20日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

【CNN】一文读懂卷积神经网络CNN

【CNN】一文读懂卷积神经网络CNN

产业智能官

18+阅读 · 2018年1月2日

相关论文

Deep Fourier Residual method for solving time-harmonic Maxwell's equations

Arxiv

0+阅读 · 2023年5月16日

Weighted Intersection over Union (wIoU): A New Evaluation Metric for Image Segmentation

Arxiv

0+阅读 · 2023年5月16日

Wavelet-Based Density Estimation for Persistent Homology

Arxiv

0+阅读 · 2023年5月15日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

CLRerNet: Improving Confidence of Lane Detection with LaneIoU

Arxiv

0+阅读 · 2023年5月15日

Subspace Culling for Ray-Box Intersection

Arxiv

0+阅读 · 2023年5月15日

Power Allocation for the Base Matrix of Spatially Coupled Sparse Regression Codes

Arxiv

0+阅读 · 2023年5月13日

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

Arxiv

11+阅读 · 2018年12月8日

Rotation-Sensitive Regression for Oriented Scene Text Detection

Arxiv

13+阅读 · 2018年3月14日

Additive Margin Softmax for Face Verification

Arxiv

11+阅读 · 2018年1月18日

相关基金

改进型网络模型中若干组合优化问题的复杂性理论与算法设计研究

国家自然科学基金

0+阅读 · 2014年12月31日

PARP-1调控急性肺损伤中中性粒细胞浸润和活化的作用及其分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

陶瓷产业集群与区域经济发展实证研究——以江西景德镇为例

国家自然科学基金

0+阅读 · 2012年12月31日

脂肪因子chemerin通过ChemR23依赖性途径对动脉粥样硬化发生、发展和斑块稳定性影响及其作用机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向安全等级的安全需求工程方法与环境

国家自然科学基金

0+阅读 · 2012年12月31日

多维高次有限元超收敛后处理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PPARγ调控PI3K/Akt在胰岛素抵抗中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

线性积分方程的Galerkin快速谱方法

国家自然科学基金

0+阅读 · 2009年12月31日

Erbin在细胞分裂周期中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Intermedin调节低氧性肺血管改建的作用及分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员