Although ubiquitous in the sciences, histogram data have not received much attention by the Deep Learning community. Whilst regression and classification tasks for scalar and vector data are routinely solved by neural networks, a principled approach for estimating histogram labels as a function of an input vector or image is lacking in the literature. We present a dedicated method for Deep Learning-based histogram regression, which incorporates cross-bin information and yields distributions over possible histograms, expressed by $\tau$-quantiles of the cumulative histogram in each bin. The crux of our approach is a new loss function obtained by applying the pinball loss to the cumulative histogram, which for 1D histograms reduces to the Earth Mover's distance (EMD) in the special case of the median ($\tau = 0.5$), and generalizes it to arbitrary quantiles. We validate our method with an illustrative toy example, a football-related task, and an astrophysical computer vision problem. We show that with our loss function, the accuracy of the predicted median histograms is very similar to the standard EMD case (and higher than for per-bin loss functions such as cross-entropy), while the predictions become much more informative at almost no additional computational cost.
翻译:虽然在科学领域普遍存在,但深层学习界对直方图数据没有多少注意。 虽然神经网络经常解决卡路里和矢量数据的回归和分类任务,但文献中缺乏估算直方图标签作为输入矢量或图像函数的原则性方法。 我们提出了一个基于深层学习的直方图回归专门方法,该方法包含跨文献资料和可能直方图的产值分布,以每进书中累积直方图的等值表示。 我们的方法的重心是通过将弹丸损失应用到累积直方图获得的新损失函数,1D直方图在中位($\tau=0.5美元)的特殊情况下将直方图标签降低到地球移动器距离(EMD),并笼统地将其分为任意立方图。 我们用一个示例、一个与足球有关的任务和一个天体物理计算机视觉问题来验证我们的方法。 我们显示,随着我们的损失功能的提高,将预测的中位直方图的准确性值降低到每平方图的计算成本,而每平方图的计算法则几乎不比标准的计算成本。