统计差异的动态经济估计 (Neural Estimation of Statistical Divergences)

Statistical divergences (SDs), which quantify the dissimilarity between probability distributions, are a basic constituent of statistical inference and machine learning. A modern method for estimating those divergences relies on parametrizing an empirical variational form by a neural network (NN) and optimizing over parameter space. Such neural estimators are abundantly used in practice, but corresponding performance guarantees are partial and call for further exploration. In particular, there is a fundamental tradeoff between the two sources of error involved: approximation and empirical estimation. While the former needs the NN class to be rich and expressive, the latter relies on controlling complexity. We explore this tradeoff for an estimator based on a shallow NN by means of non-asymptotic error bounds, focusing on four popular $\mathsf{f}$-divergences -- Kullback-Leibler, chi-squared, squared Hellinger, and total variation. Our analysis relies on non-asymptotic function approximation theorems and tools from empirical process theory. The bounds reveal the tension between the NN size and the number of samples, and enable to characterize scaling rates thereof that ensure consistency. For compactly supported distributions, we further show that neural estimators of the first three divergences above with appropriate NN growth-rate are near minimax rate-optimal, achieving the parametric rate up to logarithmic factors.

翻译：量化概率分布差异的统计差异(SDs)是统计推断和机器学习的基本组成部分。一种现代方法是估算这些差异的现代方法,其依据是神经网络(NN)对实验变异形式进行假称,并优化参数空间。这种神经估计器在实践中使用得非常多,但相应的性能保障是局部的,需要进一步探索。特别是,两种错误来源之间有一个基本的权衡:近似值和实证估计。前者需要NNN类丰富和表达性,而后者则依赖复杂性控制。我们探索以浅 NNN值为基础的估测器的这一交易,其依据是非无线误差界限,重点是四个流行的 $\mathsf{f}$-diverence -- -- Kullback-Leiper、 chi- quald、 sqummicrialinger, 和总体变异。我们的分析依赖于非测试功能将参数和工具与实证过程理论的比对等。我们通过非默认的参数对准性能显示 Nemaryal 比例之间的紧张度,首先揭示了Negraphal 和显示Negraphal a ex latical ex amination lax amination lax the the slaxal aminational aminational aminations thes the supal lax the supal as the the supaltime axlations to the ex as to the the supal axildal axil ex the silds