基于主要渐变预期值的可转移性估计值 (Transferability Estimation Based On Principal Gradient Expectation) - 专知论文

会员服务 ·

0

估计/估计量 · Extensibility · ONCE · 规范化的 · SimPLe ·

2022 年 11 月 30 日

Transferability Estimation Based On Principal Gradient Expectation

翻译：基于主要渐变预期值的可转移性估计值

Huiyan Qi,Lechao Cheng,Jingjing Chen,Yue Yu,Zunlei Feng,Yu-Gang Jiang

from arxiv, 13 pages, 3 figures, 9 tables

Deep transfer learning has been widely used for knowledge transmission in recent years. The standard approach of pre-training and subsequently fine-tuning, or linear probing, has shown itself to be effective in many down-stream tasks. Therefore, a challenging and ongoing question arises: how to quantify cross-task transferability that is compatible with transferred results while keeping self-consistency? Existing transferability metrics are estimated on the particular model by conversing source and target tasks. They must be recalculated with all existing source tasks whenever a novel unknown target task is encountered, which is extremely computationally expensive. In this work, we highlight what properties should be satisfied and evaluate existing metrics in light of these characteristics. Building upon this, we propose Principal Gradient Expectation (PGE), a simple yet effective method for assessing transferability across tasks. Specifically, we use a restart scheme to calculate every batch gradient over each weight unit more than once, and then we take the average of all the gradients to get the expectation. Thus, the transferability between the source and target task is estimated by computing the distance of normalized principal gradients. Extensive experiments show that the proposed transferability metric is more stable, reliable and efficient than SOTA methods.

翻译：近年来,在知识传输方面广泛使用了深层转移学习。培训前和随后微调的标准方法,或线性测试的标准方法,已经表明在许多下游任务中行之有效。因此,一个具有挑战性和持续的问题产生:如何量化与转移的结果相兼容的跨任务转移性,同时保持自一致性?现有的可转让性指标是通过对源和目标任务进行调和来对特定模型进行估计的。当遇到新的未知目标任务时,必须对所有现有的源任务进行重新计算,这种任务在计算上极其昂贵。在这项工作中,我们强调哪些属性应该得到满足,并根据这些特点评估现有的指标。在此基础上,我们提议了 " 首席渐进期望 " (PGE),这是评估各项任务之间可转让性的一个简单而有效的方法。具体地说,我们使用重新启动的办法计算每个重量单位的每批次梯度,一次以上,然后用所有梯度的平均值来获得预期。因此,源和目标任务之间的可转移性是通过计算归正化本位梯度的距离来估计的。广泛的实验表明,拟议的可转让性比标准更稳定。

0

相关内容

估计/估计量

估计/估计量

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

专知会员服务

28+阅读 · 2022年12月26日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

基于Ag纳米结构阵列对N2H4的痕量检测及其SERS增强机制研究

国家自然科学基金

0+阅读 · 2017年12月31日

界面调配提高纳米晶复合永磁材料矫顽力的研究

国家自然科学基金

0+阅读 · 2014年12月31日

非晶复合材料设计和调控制备的科学基础

国家自然科学基金

0+阅读 · 2014年12月31日

DRM1/2、CMT3调节植物PTI信号通路的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

microRNA参与调控新生儿支气管肺发育不良的分子机制研究

国家自然科学基金

0+阅读 · 2010年12月31日

TiO2(ZnO)/Cu2O薄膜阵列异质结的结构调控及光电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

Gaussian Noise is Nearly Instance Optimal for Private Unbiased Mean Estimation

Arxiv

0+阅读 · 2023年1月31日

On the Stability of General Bayesian Inference

Arxiv

0+阅读 · 2023年1月31日

Fast Optimal Estimation with Intractable Models using Permutation-Invariant Neural Networks

Arxiv

0+阅读 · 2023年1月31日

Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks

Arxiv

0+阅读 · 2023年1月31日

Structure Learning and Parameter Estimation for Graphical Models via Penalized Maximum Likelihood Methods

Arxiv

0+阅读 · 2023年1月30日

A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence

Arxiv

0+阅读 · 2023年1月30日

Using modified intention-to-treat as a principal stratum estimator for failure to initiate treatment

Arxiv

0+阅读 · 2023年1月30日

Optimal Decision Trees For Interpretable Clustering with Constraints

Arxiv

0+阅读 · 2023年1月30日

Zero-Shot Transfer of Haptics-Based Object Insertion Policies

Arxiv

0+阅读 · 2023年1月29日

Optimal Rate for Parameter Estimation in Matrix-variate Deviated Models

Arxiv

0+阅读 · 2023年1月27日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

专知会员服务

28+阅读 · 2022年12月26日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《战区安全决策课程体系》最新244页

《"无人机航母"原型平台》

任务规划与地形分析：现代复杂环境作战导航体系

《攻击场景描述形式化模型研究》

相关资讯

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Gaussian Noise is Nearly Instance Optimal for Private Unbiased Mean Estimation

Arxiv

0+阅读 · 2023年1月31日

On the Stability of General Bayesian Inference

Arxiv

0+阅读 · 2023年1月31日

Fast Optimal Estimation with Intractable Models using Permutation-Invariant Neural Networks

Arxiv

0+阅读 · 2023年1月31日

Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks

Arxiv

0+阅读 · 2023年1月31日

Structure Learning and Parameter Estimation for Graphical Models via Penalized Maximum Likelihood Methods

Arxiv

0+阅读 · 2023年1月30日

A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence

Arxiv

0+阅读 · 2023年1月30日

Using modified intention-to-treat as a principal stratum estimator for failure to initiate treatment

Arxiv

0+阅读 · 2023年1月30日

Optimal Decision Trees For Interpretable Clustering with Constraints

Arxiv

0+阅读 · 2023年1月30日

Zero-Shot Transfer of Haptics-Based Object Insertion Policies

Arxiv

0+阅读 · 2023年1月29日

Optimal Rate for Parameter Estimation in Matrix-variate Deviated Models

Arxiv

0+阅读 · 2023年1月27日

相关基金

基于Ag纳米结构阵列对N2H4的痕量检测及其SERS增强机制研究

国家自然科学基金

0+阅读 · 2017年12月31日

界面调配提高纳米晶复合永磁材料矫顽力的研究

国家自然科学基金

0+阅读 · 2014年12月31日

非晶复合材料设计和调控制备的科学基础

国家自然科学基金

0+阅读 · 2014年12月31日

DRM1/2、CMT3调节植物PTI信号通路的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

microRNA参与调控新生儿支气管肺发育不良的分子机制研究

国家自然科学基金

0+阅读 · 2010年12月31日

TiO2(ZnO)/Cu2O薄膜阵列异质结的结构调控及光电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员