In this paper, we present our vision of differentiable ML pipelines, called DiffML, to automate the construction of ML pipelines in an end-to-end fashion. The idea is that DiffML allows us to jointly train not just the ML model itself but the entire pipeline, including data preprocessing steps such as data cleaning and feature selection. Our core idea is to formulate all pipeline steps in a differentiable way such that the entire pipeline can be trained using backpropagation. However, this is a non-trivial problem and opens up many new research questions. To show the feasibility of this direction, we demonstrate initial ideas and a general principle for how typical preprocessing steps such as data cleaning, feature selection, and dataset selection can be formulated as differentiable programs and jointly learned with the ML model. Moreover, we discuss a research roadmap and the core challenges that have to be systematically tackled to enable fully differentiable ML pipelines.
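To make the core idea concrete, the following is a minimal, hypothetical sketch (not the paper's actual method) of one such differentiable preprocessing step: a soft feature-selection mask, parameterized by learnable gate logits, that is trained jointly with a linear model via backpropagation. All names and the toy data are illustrative assumptions; gradients are written out by hand to keep the example dependency-free beyond NumPy.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: only feature 0 is informative, feature 1 is pure noise.
X = rng.normal(size=(200, 2))
y = 3.0 * X[:, 0]

g = np.zeros(2)  # gate logits for the feature-selection step (learned)
w = np.zeros(2)  # model weights (learned jointly with the gates)

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

for _ in range(500):
    s = sigmoid(g)               # soft feature mask in (0, 1)
    Z = X * s                    # "preprocessing": gated features
    err = Z @ w - y              # prediction error of the downstream model
    # Backpropagate the loss into the model weights...
    grad_w = Z.T @ err / len(y)
    # ...and through the model into the preprocessing step's gates.
    grad_g = (err[:, None] * X * w * s * (1 - s)).mean(axis=0)
    w -= 0.1 * grad_w
    g -= 0.1 * grad_g

mask = sigmoid(g)
print(mask)  # the informative feature's gate grows past the noise feature's
```

Because the gate is differentiable, the selection decision receives gradient signal from the same loss that trains the model, so the two are optimized end to end rather than in separate pipeline stages; a hard 0/1 selection could be recovered afterwards by thresholding the learned mask.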