Researchers have been facing a difficult problem that data generation mechanisms could be influenced by internal or external factors leading to the training and test data with quite different distributions, consequently traditional classification or regression from the training set is unable to achieve satisfying results on test data. In this paper, we address this nontrivial domain generalization problem by finding a central subspace in which domain-based covariance is minimized while the functional relationship is simultaneously maximally preserved. We propose a novel variance measurement for multiple domains so as to minimize the difference between conditional distributions across domains with solid theoretical demonstration and supports, meanwhile, the algorithm preserves the functional relationship via maximizing the variance of conditional expectations given output. Furthermore, we also provide a fast implementation that requires much less computation and smaller memory for large-scale matrix operations, suitable for not only domain generalization but also other kernel-based eigenvalue decompositions. To show the practicality of the proposed method, we compare our methods against some well-known dimension reduction and domain generalization techniques on both synthetic data and real-world applications. We show that for small-scale datasets, we are able to achieve better quantitative results indicating better generalization performance over unseen test datasets. For large-scale problems, the proposed fast implementation maintains the quantitative performance but at a substantially lower computational cost.
翻译:研究人员一直面临一个难题,即数据生成机制可能受到内部或外部因素的影响,导致培训和测试数据分布差异很大,因此,培训组的传统分类或回归无法在测试数据方面取得令人满意的结果。在本文件中,我们通过找到一个中心子空间,将基于域的共变最小化最小化,同时最大限度地保持功能关系,从而解决这一非三边域一般化问题。我们提议对多个领域进行新的差异计量,以尽量减少有条件分布在有可靠理论演示和支持的各领域之间的差别,同时,算法通过尽可能扩大有条件期望产出的差异来保持功能关系。此外,我们还提供快速实施,这要求大规模矩阵操作的计算少得多,记忆小得多,不仅适合领域一般化,而且适合其他基于内核值的内核分解。为了显示拟议方法的实用性,我们将我们的方法与一些众所周知的减少尺寸和在合成数据和现实世界应用方面的域化技术进行比较。我们显示,对于小规模数据集,我们能够实现功能性关系保持最大程度的差异。此外,我们还提供快速实施,这要求减少大规模矩阵操作的计算,不仅适合领域,而且适合领域一般化,而且还适合其他以内核值为基础的计算。