We propose a solution strategy for linear systems arising in interior method optimization, which is suitable for implementation on hardware accelerators such as graphical processing units (GPUs). The current gold standard for solving these systems is the LDL^T factorization. However, LDL^T requires pivoting during factorization, which substantially increases communication cost and degrades performance on GPUs. Our novel approach solves a large indefinite system by solving multiple smaller positive definite systems, using an iterative solve for the Schur complement and an inner direct solve (via Cholesky factorization) within each iteration. Cholesky is stable without pivoting, thereby reducing communication and allowing reuse of the symbolic factorization. We demonstrate the practicality of our approach and show that on large systems it can efficiently utilize GPUs and outperform LDL^T factorization of the full system.
翻译:我们为内部方法优化产生的线性系统提出了一个解决方案战略,适合在图形处理器等硬件加速器上实施。目前的解决这些系统的黄金标准是LDL ⁇ T因子化。然而,LDL ⁇ T需要在因子化过程中进行分化,这大大增加了通信成本,降低了GPU的性能。我们的新办法解决了一个大型的不定期系统,方法是利用Schur补充的迭代式溶液和每个迭代内的内直接溶液(通过Cholesky因子化)解决多小的正性定型系统。Choolesky是稳定的,没有通过配对,从而减少通信,允许重复使用符号因子化。我们展示了我们的方法的实用性,并表明在大型系统中,它能够有效利用GUP,超越整个系统的LDL ⁇ T因子化。