The best neural architecture for a given machine learning problem depends on many factors: not only on the complexity and structure of the dataset, but also on resource constraints such as latency, compute, and energy consumption. Neural architecture search (NAS) for tabular datasets is an important but under-explored problem. Previous NAS algorithms designed for image search spaces incorporate resource constraints directly into the reinforcement learning (RL) rewards. In this paper, we argue that search spaces for tabular NAS pose considerable challenges for these existing reward-shaping methods, and propose a new RL controller to address these challenges. Motivated by rejection sampling, when we sample candidate architectures during a search, we immediately discard any architecture that violates the resource constraints. We use a Monte-Carlo-based correction to the RL policy gradient update to account for this extra filtering step. Results on several tabular datasets show that TabNAS, the proposed approach, efficiently finds high-quality models that satisfy the given resource constraints.
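To make the two ideas in the abstract concrete, below is a minimal Python sketch of a rejection-sampling RL controller with a Monte-Carlo correction, on an assumed toy search space (a single categorical choice of hidden-layer width; the `widths`, `cost`, `BUDGET`, and `reward` definitions are all hypothetical stand-ins, not the paper's setup). The key step is that the REINFORCE update ascends the gradient of log p(i)/P(feasible), i.e. the log-probability under the distribution conditioned on satisfying the constraint, with P(feasible) estimated by sampling.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy search space: one categorical choice of hidden width.
widths = np.array([32, 64, 128, 256, 512])
costs = widths.astype(float)      # assumed resource model: cost grows with width
BUDGET = 300.0                    # assumed resource constraint
feasible = costs <= BUDGET

def reward(w):
    # Assumed stand-in for validation quality; larger models score higher.
    return 1.0 - 1.0 / np.log2(w)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

logits = np.zeros(len(widths))    # RL controller parameters
lr, n_mc, baseline = 0.5, 256, 0.0

for step in range(300):
    p = softmax(logits)

    # Rejection sampling: resample until the candidate fits the budget.
    while True:
        i = rng.choice(len(widths), p=p)
        if feasible[i]:
            break

    # Monte-Carlo estimate of P(feasible): the probability mass the
    # controller currently puts on budget-respecting architectures.
    # (In a real, combinatorial search space the exact sum is intractable,
    # which is why a sampling-based estimate is used.)
    mc = rng.choice(len(widths), size=n_mc, p=p)
    p_v = max(feasible[mc].mean(), 1e-3)

    # Corrected REINFORCE step: for a softmax controller,
    # grad log[p(i) / P(feasible)] = onehot(i) - q, where q is the
    # conditional distribution over feasible choices; here q uses the
    # Monte-Carlo estimate p_v in place of the exact P(feasible).
    r = reward(widths[i])
    baseline = 0.9 * baseline + 0.1 * r   # moving-average reward baseline
    q = p * feasible / p_v
    grad = (r - baseline) * (np.eye(len(widths))[i] - q)
    logits += lr * grad

print("final sampling probabilities:", np.round(softmax(logits), 3))
```

Under these assumptions the controller concentrates its probability mass on the largest width that fits the budget (256 here); without the P(feasible) correction, the filtering step would bias the plain REINFORCE gradient toward whatever the rejection filter happens to pass.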