自动机器学习框架的实验评估：AutoMLBench的全面评估 (AutoMLBench: A Comprehensive Experimental Evaluation of Automated Machine Learning Frameworks) - 专知论文

会员服务 ·

0

机器学习框架 · AutoML · 设计决策 · 自动化机器学习 · 机器学习 ·

2023 年 4 月 12 日

AutoMLBench: A Comprehensive Experimental Evaluation of Automated Machine Learning Frameworks

翻译：自动机器学习框架的实验评估：AutoMLBench的全面评估

Hassan Eldeeb,Mohamed Maher,Radwa Elshawi,Sherif Sakr

With the booming demand for machine learning applications, it has been recognized that the number of knowledgeable data scientists can not scale with the growing data volumes and application needs in our digital world. In response to this demand, several automated machine learning (AutoML) frameworks have been developed to fill the gap of human expertise by automating the process of building machine learning pipelines. Each framework comes with different heuristics-based design decisions. In this study, we present a comprehensive evaluation and comparison of the performance characteristics of six popular AutoML frameworks, namely, AutoWeka, AutoSKlearn, TPOT, Recipe, ATM, and SmartML, across 100 data sets from established AutoML benchmark suites. Our experimental evaluation considers different aspects for its comparison, including the performance impact of several design decisions, including time budget, size of search space, meta-learning, and ensemble construction. The results of our study reveal various interesting insights that can significantly guide and impact the design of AutoML frameworks.

翻译：随着机器学习应用的需求增长，人们认识到知识丰富的数据科学家无法跟上数字世界中不断增长的数据和应用需求。为了满足这个需求，出现了几种自动化机器学习 (AutoML) 框架，通过自动化构建机器学习管道来填补人力专业知识的空缺。每个框架都有不同的基于启发式的设计决策。在本研究中，我们介绍了六种流行的自动化机器学习框架，分别是 AutoWeka、AutoSKlearn、TPOT、Recipe、ATM 和 SmartML，在已建立的 AutoML 基准测试套件中对 100 个数据集的性能特征进行了全面评估和对比。我们的实验评估考虑了不同方面的比较，包括时间预算、搜索空间大小、元学习和集合构建等设计决策的性能影响。我们研究的结果揭示了一些有趣的见解，这些见解可以对 AutoML 框架的设计产生重大的指导和影响。

0

相关内容

机器学习框架

机器学习框架

【Manning新书】自动机器学习实战，Automated Machine Learning in Action

【Manning新书】自动机器学习实战，Automated Machine Learning in Action

专知会员服务

95+阅读 · 2022年4月8日

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

专知会员服务

76+阅读 · 2022年3月24日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日

Into the Metaverse，93页ppt介绍元宇宙概念、应用、趋势

Into the Metaverse，93页ppt介绍元宇宙概念、应用、趋势

专知会员服务

49+阅读 · 2022年2月19日

[SIGIR2021]可复现推荐系统评估的全面和严谨的框架

[SIGIR2021]可复现推荐系统评估的全面和严谨的框架

专知会员服务

22+阅读 · 2021年4月30日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【KDD2019|讲座推荐】公平意识机器学习：现实挑战与经验教训：Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

专知会员服务

20+阅读 · 2019年12月9日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

【干货】AutoML自动机器学习：最新进展综述

【干货】AutoML自动机器学习：最新进展综述

专知

27+阅读 · 2019年8月9日

【AutoML干货】自动机器学习: 最新进展综述与开放挑战

【AutoML干货】自动机器学习: 最新进展综述与开放挑战

专知

25+阅读 · 2019年6月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

时序数据异常检测工具/数据集大列表

时序数据异常检测工具/数据集大列表

极市平台

65+阅读 · 2019年2月23日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

新型Plectin-1荧光、MRI靶向分子探针对胰腺癌早期诊断的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

小分子化合物组合诱导成纤维细胞转分化为神经干细胞

国家自然科学基金

0+阅读 · 2013年12月31日

基于融合智能算法斜拉桥振动控制Benchmark问题的混合控制策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Realized GARCH框架的波动率和相关性模型理论和应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

配位场调控的稀土基单分子磁体的设计、合成及磁性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MR凋亡分子成像评估曲妥珠单抗靶向治疗HER2阳性乳腺癌疗效的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

BEC的保几何结构数值模拟与研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

TfR报告基因表达活体MRI显像监测骨髓源性神经干细胞治疗缺氧缺血性脑损伤的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

新型手性N-Oxide金属化合物的合成与催化研究

国家自然科学基金

0+阅读 · 2008年12月31日

Real-Time Scheduling for Time-Sensitive Networking: A Systematic Review and Experimental Study

Arxiv

0+阅读 · 2023年5月26日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

An Overview on Machine Translation Evaluation

An Overview on Machine Translation Evaluation

Arxiv

14+阅读 · 2022年2月22日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

A Survey of Knowledge-Enhanced Text Generation

Arxiv

18+阅读 · 2020年10月9日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

AutoML: A Survey of the State-of-the-Art

AutoML: A Survey of the State-of-the-Art

Arxiv

74+阅读 · 2019年8月14日

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Arxiv

14+阅读 · 2019年1月17日

VIP会员

文章信息

相关主题

机器学习框架

自动化机器学习

相关VIP内容

【Manning新书】自动机器学习实战，Automated Machine Learning in Action

【Manning新书】自动机器学习实战，Automated Machine Learning in Action

专知会员服务

95+阅读 · 2022年4月8日

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

专知会员服务

76+阅读 · 2022年3月24日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日

Into the Metaverse，93页ppt介绍元宇宙概念、应用、趋势

Into the Metaverse，93页ppt介绍元宇宙概念、应用、趋势

专知会员服务

49+阅读 · 2022年2月19日

[SIGIR2021]可复现推荐系统评估的全面和严谨的框架

[SIGIR2021]可复现推荐系统评估的全面和严谨的框架

专知会员服务

22+阅读 · 2021年4月30日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【KDD2019|讲座推荐】公平意识机器学习：现实挑战与经验教训：Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

专知会员服务

20+阅读 · 2019年12月9日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

【干货】AutoML自动机器学习：最新进展综述

【干货】AutoML自动机器学习：最新进展综述

专知

27+阅读 · 2019年8月9日

【AutoML干货】自动机器学习: 最新进展综述与开放挑战

【AutoML干货】自动机器学习: 最新进展综述与开放挑战

专知

25+阅读 · 2019年6月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

时序数据异常检测工具/数据集大列表

时序数据异常检测工具/数据集大列表

极市平台

65+阅读 · 2019年2月23日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

相关论文

Real-Time Scheduling for Time-Sensitive Networking: A Systematic Review and Experimental Study

Arxiv

0+阅读 · 2023年5月26日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

An Overview on Machine Translation Evaluation

An Overview on Machine Translation Evaluation

Arxiv

14+阅读 · 2022年2月22日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

A Survey of Knowledge-Enhanced Text Generation

Arxiv

18+阅读 · 2020年10月9日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

AutoML: A Survey of the State-of-the-Art

AutoML: A Survey of the State-of-the-Art

Arxiv

74+阅读 · 2019年8月14日

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Arxiv

14+阅读 · 2019年1月17日

相关基金

新型Plectin-1荧光、MRI靶向分子探针对胰腺癌早期诊断的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

小分子化合物组合诱导成纤维细胞转分化为神经干细胞

国家自然科学基金

0+阅读 · 2013年12月31日

基于融合智能算法斜拉桥振动控制Benchmark问题的混合控制策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Realized GARCH框架的波动率和相关性模型理论和应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

配位场调控的稀土基单分子磁体的设计、合成及磁性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MR凋亡分子成像评估曲妥珠单抗靶向治疗HER2阳性乳腺癌疗效的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

BEC的保几何结构数值模拟与研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

TfR报告基因表达活体MRI显像监测骨髓源性神经干细胞治疗缺氧缺血性脑损伤的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

新型手性N-Oxide金属化合物的合成与催化研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员