No free lunch theorems for supervised learning state that no learner can solve all problems, or that all learners achieve exactly the same accuracy on average over a uniform distribution on learning problems. Accordingly, these theorems are often referenced in support of the notion that individual problems require specially tailored inductive biases. While virtually all uniformly sampled datasets have high complexity, real-world problems disproportionately generate low-complexity data, and we argue that neural network models share this same preference, formalized using Kolmogorov complexity. Notably, we show that architectures designed for a particular domain, such as computer vision, can compress datasets from a variety of seemingly unrelated domains. Our experiments show that pre-trained and even randomly initialized language models prefer to generate low-complexity sequences. Whereas no free lunch theorems seemingly indicate that individual problems require specialized learners, we explain how tasks that often require human intervention, such as picking an appropriately sized model when labeled data is scarce or plentiful, can be automated into a single learning algorithm. These observations justify the trend in deep learning of unifying seemingly disparate problems with an increasingly small set of machine learning models.
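To make the complexity contrast concrete, the following minimal sketch (our illustration, not the paper's experimental setup) uses an off-the-shelf compressor as a computable upper bound on Kolmogorov complexity: a structured sequence compresses to a small fraction of its raw length, while a uniformly sampled sequence of the same length stays nearly incompressible. Treating gzip-compressed length as the complexity proxy is an assumption made here purely for illustration.

```python
import gzip
import os

def compressed_size(data: bytes) -> int:
    """Upper-bound the Kolmogorov complexity of `data` by its gzip-compressed length in bytes."""
    return len(gzip.compress(data, compresslevel=9))

# A structured ("low-complexity") sequence: generated by a short program, so it compresses well.
structured = b"0123456789" * 10_000

# A uniformly sampled sequence of the same length: almost surely close to incompressible.
uniform = os.urandom(len(structured))

print("raw length          :", len(structured), "bytes")
print("structured, gzipped :", compressed_size(structured), "bytes")
print("uniform, gzipped    :", compressed_size(uniform), "bytes")
# Expected outcome: the structured sequence shrinks by orders of magnitude,
# while the uniform sequence remains roughly its original size.
```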