Langevin Monte Carlo (LMC) is a popular Bayesian sampling method. For the log-concave distribution function, the method converges exponentially fast, up to a controllable discretization error. However, the method requires the evaluation of a full gradient in each iteration, and for a problem on $\mathbb{R}^d$, this amounts to $d$ times partial derivative evaluations per iteration. The cost is high when $d\gg1$. In this paper, we investigate how to enhance computational efficiency through the application of RCD (random coordinate descent) on LMC. There are two sides of the theory: 1 By blindly applying RCD to LMC, one surrogates the full gradient by a randomly selected directional derivative per iteration. Although the cost is reduced per iteration, the total number of iteration is increased to achieve a preset error tolerance. Ultimately there is no computational gain; 2 We then incorporate variance reduction techniques, such as SAGA (stochastic average gradient) and SVRG (stochastic variance reduced gradient), into RCD-LMC. It will be proved that the cost is reduced compared with the classical LMC, and in the underdamped case, convergence is achieved with the same number of iterations, while each iteration requires merely one-directional derivative. This means we obtain the best possible computational cost in the underdamped-LMC framework.
翻译:LAMC是流行的Bayesian Bayesian 抽样方法(LMC) Langevin Langevin Monte Carlo (LMC) 。 对于对日对流分配功能,该方法会迅速成倍地聚集,达到可控制的离散错误。然而,该方法要求对每迭代中完全梯度进行评估,对美元(mathbbb{R ⁇ ⁇ d$)的问题则需要评估完全梯度,而对于美元($mathbb{R ⁇ d$)的问题,每迭代中部分衍生物评价的总额是部分衍生物评价的两倍。当美元为美元($d\gg1美元)时,成本是很高的。在本文中,我们研究如何通过在LMC上应用刚果民盟(RCD-M)的(随机平均梯度协调下降梯度)来提高计算效率。 理论的两面有两面: 一是盲目地将刚果民盟(LMC)应用LDRD, 一种随机选择的衍生物来取代整个梯度。 虽然每迭代代代计算成本,但根据RCD-LMCL的公式,成本的计算方法需要降低成本。