Developing high-performing predictive models for large tabular data sets is a challenging task. The state-of-the-art methods are based on expert-developed model ensembles from different supervised learning methods. Recently, automated machine learning (AutoML) is emerging as a promising approach to automate predictive model development. Neural architecture search (NAS) is an AutoML approach that generates and evaluates multiple neural network architectures concurrently and improves the accuracy of the generated models iteratively. A key issue in NAS, particularly for large data sets, is the large computation time required to evaluate each generated architecture. While data-parallel training is a promising approach that can address this issue, its use within NAS is difficult. For different data sets, the data-parallel training settings such as the number of parallel processes, learning rate, and batch size need to be adapted to achieve high accuracy and reduction in training time. To that end, we have developed AgEBO-Tabular, an approach to combine aging evolution (AgE), a parallel NAS method that searches over neural architecture space, and an asynchronous Bayesian optimization method for tuning the hyperparameters of the data-parallel training simultaneously. We demonstrate the efficacy of the proposed method to generate high-performing neural network models for large tabular benchmark data sets. Furthermore, we demonstrate that the automatically discovered neural network models using our method outperform the state-of-the-art AutoML ensemble models in inference speed by two orders of magnitude while reaching similar accuracy values.
翻译:开发大型表层数据集高性能预测模型是一项艰巨的任务。 最先进的方法基于专家开发的模型,来自不同监督的学习方法。 最近,自动化机器学习(Automil)正在成为自动预测模型开发的一个很有希望的方法。 神经结构搜索(NAS)是一种自动ML方法,它同时生成和评估多种神经网络结构,提高生成模型的迭接性准确性。NAS的一个关键问题是,特别是大型数据集,评估每个生成的架构所需的大量计算时间。 虽然数据单流培训是一种很有希望的方法,可以解决这一问题,但在NAS内部却很难使用。 对于不同的数据集,数据单数培训设置,如平行进程的数量、学习率和批量规模,需要调整,以达到高准确性和减少培训时间。 为此,我们开发了AGEBO-Tabulal, 一种将不断演变的进化(AgE)方法, 一种平行的NAS方法,可以搜索神经结构空间, 而在NAS培训中, 也很难使用一个类似的方法, 模拟模型, 用来演示我们高性系统模型, 以同步的系统化的方法, 模拟, 以演示我们高压式的系统模拟的系统, 以演示高压方法, 演示我们高压式的系统 。