The ratio of two probability densities, called a density-ratio, is a vital quantity in machine learning. In particular, a relative density-ratio, which is a bounded extension of the density-ratio, has received much attention due to its stability and has been used in various applications such as outlier detection and dataset comparison. Existing methods for (relative) density-ratio estimation (DRE) require many instances from both densities. However, sufficient instances are often unavailable in practice. In this paper, we propose a meta-learning method for relative DRE, which estimates the relative density-ratio from a few instances by using knowledge in related datasets. Specifically, given two datasets that consist of a few instances, our model extracts the datasets' information by using neural networks and uses it to obtain instance embeddings appropriate for the relative DRE. We model the relative density-ratio by a linear model on the embedded space, whose global optimum solution can be obtained as a closed-form solution. The closed-form solution enables fast and effective adaptation to a few instances, and its differentiability enables us to train our model such that the expected test error for relative DRE can be explicitly minimized after adapting to a few instances. We empirically demonstrate the effectiveness of the proposed method by using three problems: relative DRE, dataset comparison, and outlier detection.
翻译:两种概率密度的比例(称为密度- 纬度)是机器学习的一个重要数量。 特别是,相对密度- 纬度(即密度- 纬度的结合延伸) 因其稳定性而引起人们的极大关注,并被用于外部探测和数据集比较等各种应用中。 现有的(相对的)密度- 纬度估计(DRE) 方法需要两种密度(DRE) 的多种情况。 但是,在实践中往往没有足够的例子。 在本文中,我们为相对的DRE提出一个元学习方法,该方法利用相关数据集的知识从几个例子估算相对密度- 。 具体地说,考虑到由少数实例组成的两个数据集,我们的模型通过神经网络和数据集比较等各种应用来提取数据集的信息。 我们用嵌入空间的线性模型来模拟相对密度- 相对密度- 鼠标(DE), 其全球最佳解决方案可以作为一种封闭式解决方案获得。 封闭式解决方案能够快速和有效地适应少数实例, 并且其相对的兼容性使得我们能够通过最小化的测试模型来演示三个模型, 。 我们用最差的实验性的方法来明确地展示我们的模型, 。 我们用最差的实验性测试 。