In this paper, we study the application of quasi-Newton methods for solving empirical risk minimization (ERM) problems defined over a large dataset. Traditional deterministic and stochastic quasi-Newton methods can be executed to solve such problems; however, it is known that their global convergence rate may not be better than first-order methods, and their local superlinear convergence only appears towards the end of the learning process. In this paper, we use an adaptive sample size scheme that exploits the superlinear convergence of quasi-Newton methods globally and throughout the entire learning process. The main idea of the proposed adaptive sample size algorithms is to start with a small subset of data points and solve their corresponding ERM problem within its statistical accuracy, and then enlarge the sample size geometrically and use the optimal solution of the problem corresponding to the smaller set as an initial point for solving the subsequent ERM problem with more samples. We show that if the initial sample size is sufficiently large and we use quasi-Newton methods to solve each subproblem, the subproblems can be solved superlinearly fast (after at most three iterations), as we guarantee that the iterates always stay within a neighborhood that quasi-Newton methods converge superlinearly. Numerical experiments on various datasets confirm our theoretical results and demonstrate the computational advantages of our method.
翻译:在本文中,我们研究使用准纽顿方法解决大型数据集界定的实验风险最小化(ERM)问题。传统的确定性和随机准纽顿方法可以用来解决这些问题;然而,众所周知,其全球趋同率可能不会比一级方法好,其本地超级线性趋同率只是接近学习过程的终点。在本文件中,我们使用一个适应性样本规模方案,利用准纽顿方法在全球和整个学习过程中的超级线性趋同。拟议的适应性样本规模算法的主要想法是从一小部分数据点开始,在统计精确度范围内解决相应的机构风险管理问题,然后从几何角度扩大样本规模,并使用与较小规模方法相对的最佳解决办法,作为用更多样本解决随后的机构风险管理问题的初步点。我们表明,如果初始样本规模足够大,我们使用准纽顿方法解决每个子问题,则次质性标可被解决超线性解(在最多三个次等时),然后在统计精确度范围内解决其相应的机构风险管理问题,我们保证在各种理论化方法中总是能证实我们的最佳方法。