Many machine learning methods assume that the training and test data follow the same distribution. However, in the real world, this assumption is very often violated. In particular, the phenomenon that the marginal distribution of the data changes is called covariate shift, one of the most important research topics in machine learning. We show that the well-known family of covariate shift adaptation methods is unified in the framework of information geometry. Furthermore, we show that parameter search for geometrically generalized covariate shift adaptation method can be achieved efficiently. Numerical experiments show that our generalization can achieve better performance than the existing methods it encompasses.
翻译:许多机器学习方法假设训练数据和测试数据遵循相同的分布。然而,在现实世界中,这个假设经常被违反。特别地,数据的边缘分布变化的现象称为协变量转移,是机器学习中最重要的研究课题之一。我们展示了著名的协变量转移适应方法家族在信息几何框架下的统一性。此外,我们证明了几何广义协变量转移适应方法的参数搜索可以高效实现。数值实验表明,我们的推广可以取得比其所包含的现有方法更好的性能。