Jaccard度量损失：利用软标签优化Jaccard指数 (Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels) - 专知论文

会员服务 ·

0

损失 · 软标签 · 度量 · 交叉熵 · SOFT ·

2023 年 3 月 28 日

Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels

翻译：Jaccard度量损失：利用软标签优化Jaccard指数

Zifu Wang,Matthew B. Blaschko

from arxiv, Submitted to ICML2023. Code is available at https://github.com/zifuwanggg/JDTLosses

IoU losses are surrogates that directly optimize the Jaccard index. In semantic segmentation, leveraging IoU losses as part of the loss function is shown to perform better with respect to the Jaccard index measure than optimizing pixel-wise losses such as the cross-entropy loss alone. The most notable IoU losses are the soft Jaccard loss and the Lovasz-Softmax loss. However, these losses are incompatible with soft labels which are ubiquitous in machine learning. In this paper, we propose Jaccard metric losses (JMLs), which are identical to the soft Jaccard loss in a standard setting with hard labels, but are compatible with soft labels. With JMLs, we study two of the most popular use cases of soft labels: label smoothing and knowledge distillation. With a variety of architectures, our experiments show significant improvements over the cross-entropy loss on three semantic segmentation datasets (Cityscapes, PASCAL VOC and DeepGlobe Land), and our simple approach outperforms state-of-the-art knowledge distillation methods by a large margin. Code is available at: \href{https://github.com/zifuwanggg/JDTLosses}{https://github.com/zifuwanggg/JDTLosses}.

翻译：IoU（交并比）损失是直接优化Jaccard指数的替代品。在语义分割中，将IoU损失作为损失函数的一部分比仅优化像素值的损失（如交叉熵损失）在Jaccard指数测量方面表现更好。最显著的IoU损失是soft Jaccard损失和Lovasz-Softmax损失。然而，这些损失函数不兼容在机器学习中普遍使用的软标签。在本文中，我们提出Jaccard度量损失（JML），它们在硬标签的标准情况下与soft Jaccard损失相同，但对软标签兼容。通过JML，我们研究了软标签的两个最流行的用例：标签平滑和知识蒸馏。通过各种体系结构，我们的实验表明，在三个语义分割数据集（Cityscapes、PASCAL VOC和DeepGlobe Land）上，与交叉熵损失相比，我们的简单方法显著提高了性能，且超过了最先进的知识蒸馏方法很大的范围。代码可在以下网址中获得：\href{https://github.com/zifuwanggg/JDTLosses}{https://github.com/zifuwanggg/JDTLosses}。

0

相关内容

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

《JADC2 Update—— The What to the How》美国国防信息系统局（DISA）10页slides

《JADC2 Update—— The What to the How》美国国防信息系统局（DISA）10页slides

专知会员服务

49+阅读 · 2022年6月8日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】基于自适应均衡学习的半监督语义分割

专知会员服务

14+阅读 · 2021年10月13日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

专知会员服务

116+阅读 · 2020年2月10日

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

专知会员服务

24+阅读 · 2019年11月4日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

ECCV 2022 | 修正FPN带来的大目标性能损害：You Should Look at All Objects

ECCV 2022 | 修正FPN带来的大目标性能损害：You Should Look at All Objects

PaperWeekly

0+阅读 · 2022年7月21日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

分享神经网络中设计loss function的一些技巧

分享神经网络中设计loss function的一些技巧

极市平台

35+阅读 · 2019年1月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

换个角度看GAN：另一种损失函数

换个角度看GAN：另一种损失函数

机器之心

16+阅读 · 2019年1月1日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

有限域上多项式的p-进与T-进指数和

国家自然科学基金

0+阅读 · 2013年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

椿皮中苦木内酯类成分抑制HER2的作用机制和构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

11C-PD153035 PET/CT筛选非小细胞肺癌EGFR突变和监测EGFR-TKIs疗效的研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

小模数弧齿锥齿轮粉末冶金近净成形齿面偏差控制机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

非参数CFAR检测理论及应用

国家自然科学基金

0+阅读 · 2011年12月31日

I型Ge基clathrate晶体生长及热电性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

电荷分布对聚电解质溶液性质的影响

国家自然科学基金

0+阅读 · 2008年12月31日

Double Robust Semi-Supervised Inference for the Mean: Selection Bias under MAR Labeling with Decaying Overlap

Arxiv

0+阅读 · 2023年5月18日

Robust inference of causality in high-dimensional dynamical processes from the Information Imbalance of distance ranks

Arxiv

0+阅读 · 2023年5月18日

Online List Labeling with Predictions

Arxiv

0+阅读 · 2023年5月17日

Logit-Based Ensemble Distribution Distillation for Robust Autoregressive Sequence Uncertainties

Arxiv

0+阅读 · 2023年5月17日

SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

Arxiv

0+阅读 · 2023年5月17日

Selecting the Number of Clusters $K$ with a Stability Trade-off: an Internal Validation Criterion

Arxiv

0+阅读 · 2023年5月16日

Heat diffusion distance processes: a statistically founded method to analyze graph data sets

Arxiv

0+阅读 · 2023年5月16日

Online Continual Learning Without the Storage Constraint

Arxiv

0+阅读 · 2023年5月16日

Contrastive Clustering

Arxiv

31+阅读 · 2020年9月21日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

《JADC2 Update—— The What to the How》美国国防信息系统局（DISA）10页slides

《JADC2 Update—— The What to the How》美国国防信息系统局（DISA）10页slides

专知会员服务

49+阅读 · 2022年6月8日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】基于自适应均衡学习的半监督语义分割

专知会员服务

14+阅读 · 2021年10月13日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

专知会员服务

116+阅读 · 2020年2月10日

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

专知会员服务

24+阅读 · 2019年11月4日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

ECCV 2022 | 修正FPN带来的大目标性能损害：You Should Look at All Objects

ECCV 2022 | 修正FPN带来的大目标性能损害：You Should Look at All Objects

PaperWeekly

0+阅读 · 2022年7月21日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

分享神经网络中设计loss function的一些技巧

分享神经网络中设计loss function的一些技巧

极市平台

35+阅读 · 2019年1月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

换个角度看GAN：另一种损失函数

换个角度看GAN：另一种损失函数

机器之心

16+阅读 · 2019年1月1日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Double Robust Semi-Supervised Inference for the Mean: Selection Bias under MAR Labeling with Decaying Overlap

Arxiv

0+阅读 · 2023年5月18日

Robust inference of causality in high-dimensional dynamical processes from the Information Imbalance of distance ranks

Arxiv

0+阅读 · 2023年5月18日

Online List Labeling with Predictions

Arxiv

0+阅读 · 2023年5月17日

Logit-Based Ensemble Distribution Distillation for Robust Autoregressive Sequence Uncertainties

Arxiv

0+阅读 · 2023年5月17日

SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

Arxiv

0+阅读 · 2023年5月17日

Selecting the Number of Clusters $K$ with a Stability Trade-off: an Internal Validation Criterion

Arxiv

0+阅读 · 2023年5月16日

Heat diffusion distance processes: a statistically founded method to analyze graph data sets

Arxiv

0+阅读 · 2023年5月16日

Online Continual Learning Without the Storage Constraint

Arxiv

0+阅读 · 2023年5月16日

Contrastive Clustering

Arxiv

31+阅读 · 2020年9月21日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

有限域上多项式的p-进与T-进指数和

国家自然科学基金

0+阅读 · 2013年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

椿皮中苦木内酯类成分抑制HER2的作用机制和构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

11C-PD153035 PET/CT筛选非小细胞肺癌EGFR突变和监测EGFR-TKIs疗效的研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

小模数弧齿锥齿轮粉末冶金近净成形齿面偏差控制机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

非参数CFAR检测理论及应用

国家自然科学基金

0+阅读 · 2011年12月31日

I型Ge基clathrate晶体生长及热电性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

电荷分布对聚电解质溶液性质的影响

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员