The study of a machine learning problem is in many ways is difficult to separate from the study of the loss function being used. One avenue of inquiry has been to look at these loss functions in terms of their properties as scoring rules via the proper-composite representation, in which predictions are mapped to probability distributions which are then scored via a scoring rule. However, recent research so far has primarily been concerned with analysing the (typically) finite-dimensional conditional risk problem on the output space, leaving aside the larger total risk minimisation. We generalise a number of these results to an infinite dimensional setting and in doing so we are able to exploit the familial resemblance of density and conditional density estimation to provide a simple characterisation of the canonical link.
翻译:对机器学习问题的研究在许多方面都难以与正在使用的损失函数研究分开,调查的一个途径是通过适当的综合说明,将这些损失功能作为评分规则,通过适当的综合说明,将预测绘制成概率分布图,然后通过评分规则评分。然而,迄今为止最近的研究主要涉及分析产出空间的(典型的)有限尺寸有条件风险问题,而忽略了更大的全部风险最小化。我们将这些结果概括为无限的维度设置,从而我们能够利用密度和有条件密度估计的家庭相似性,以提供一个简单化的卡通联系特征。