标题：探究Dice损失函数梯度并模仿其方式摘要：在过去几年里，针对全监督语义分割，在监督神经网络方面已经出现了几种局部标准，例如交叉熵损失函数和Dice损失函数等。Dice损失函数是一个有趣的例子，它来自于常见的Dice系数的松弛形式；Dice系数是医学影像应用中的主要评价指标之一。在本文中，我们首先理论研究了Dice损失函数的梯度，表明具体的结果是权重化负的的ground truth，且其动态范围非常小。这使得在本文的第二部分，我们可以通过简单的逐元素乘法将网络输出与梯度负权重ground truth相结合，以模仿Dice损失函数的监督。这个相当令人惊讶的结果揭示了Dice损失函数在梯度下降期间实际执行的监督方式。这可以帮助实践者理解和解释结果，并在研究人员设计新的损失函数时提供指引。 (On the dice loss gradient and the ways to mimic it)

2023 年 4 月 9 日

On the dice loss gradient and the ways to mimic it

翻译：标题：探究Dice损失函数梯度并模仿其方式摘要：在过去几年里，针对全监督语义分割，在监督神经网络方面已经出现了几种局部标准，例如交叉熵损失函数和Dice损失函数等。Dice损失函数是一个有趣的例子，它来自于常见的Dice系数的松弛形式；Dice系数是医学影像应用中的主要评价指标之一。在本文中，我们首先理论研究了Dice损失函数的梯度，表明具体的结果是权重化负的的ground truth，且其动态范围非常小。这使得在本文的第二部分，我们可以通过简单的逐元素乘法将网络输出与梯度负权重ground truth相结合，以模仿Dice损失函数的监督。这个相当令人惊讶的结果揭示了Dice损失函数在梯度下降期间实际执行的监督方式。这可以帮助实践者理解和解释结果，并在研究人员设计新的损失函数时提供指引。

Hoel Kervadec,Marleen de Bruijne

from arxiv, Currently under review

In the past few years, in the context of fully-supervised semantic segmentation, several losses -- such as cross-entropy and dice -- have emerged as de facto standards to supervise neural networks. The Dice loss is an interesting case, as it comes from the relaxation of the popular Dice coefficient; one of the main evaluation metric in medical imaging applications. In this paper, we first study theoretically the gradient of the dice loss, showing that concretely it is a weighted negative of the ground truth, with a very small dynamic range. This enables us, in the second part of this paper, to mimic the supervision of the dice loss, through a simple element-wise multiplication of the network output with a negative of the ground truth. This rather surprising result sheds light on the practical supervision performed by the dice loss during gradient descent. This can help the practitioner to understand and interpret results while guiding researchers when designing new losses.

翻译：注：在翻译中，不能采用中文的专业术语，需要使用英文进行标注（即：Dice，ground truth）。