It can be difficult to assess the quality of a fitted model when facing unsupervised learning problems. Latent variable models, such as variation autoencoders and Gaussian mixture models, are often trained with likelihood-based approaches. In scope of Goodhart's law, when a metric becomes a target it ceases to be a good metric and therefore we should not use likelihood to assess the quality of the fit of these models. The solution we propose is a new metric for model comparison or regularization that relies on moments. The concept is to study the difference between the data moments and the model moments using a matrix norm, such as the Frobenius norm. We show how to use this new metric for model comparison and then for regularization. It is common to draw samples from the fitted distribution when evaluating latent variable models and we show that our proposed metric is faster to compute and has a smaller variance that this alternative. We conclude this article with a proof of concept of both applications and we discuss future work.
翻译:在Goodhart的法律范围内,当一个指标成为目标时,它就不再是一个良好的衡量标准,因此我们不应该利用评估这些模型是否适合质量的可能性来评估这些模型的质量。我们提出的解决办法是模型比较或规范化的新衡量标准,它依赖于时空。概念是利用Frobenius规范等矩阵规范研究数据时刻与模型时刻之间的差异。我们展示如何使用这一新的指标进行模型比较,然后进行正规化。在评估潜在变量模型时,从合适的分布中抽取样本是常见的。我们表明,我们提议的衡量标准比较得更快,而且差异较小。我们通过证明应用概念和我们讨论未来工作来完成这一文章。