To measure the difference between two probability distributions, we propose conditional transport (CT) as a new divergence and further approximate it with the amortized CT (ACT) cost to make it amenable to implicit distributions and stochastic gradient descent based optimization. ACT amortizes the computation of its conditional transport plans and comes with unbiased sample gradients that are straightforward to compute. When applied to train a generative model, ACT is shown to strike a good balance between mode covering and seeking behaviors and strongly resist mode collapse. On a wide variety of benchmark datasets for generative modeling, substituting the default statistical distance of an existing generative adversarial network with ACT is shown to consistently improve the performance.
翻译:为了衡量两种概率分布之间的差异,我们提议将有条件运输(CT)作为一种新的差异,并进一步将其与摊销的CT(CTCT)成本相近,使之适合隐含分布和基于梯度的梯度下降优化;ACT摊销其有条件运输计划的计算,并配有可直接计算的不带偏见的样本梯度;在用于培训基因化模型时,ACT显示在覆盖和寻求行为模式之间取得了良好的平衡,并强烈抵制模式崩溃;在用于基因化模型的多种基准数据集方面,用ACT取代现有的与ACT的基因化对立网络的默认统计距离,从而不断改善业绩。