通才和专家预测交叉学习 (Cross-study learning for generalist and specialist predictions) - 专知论文

会员服务 ·

0

预测器/决策函数 · Performer · 预测准确率 · 估计/估计量 · Weight ·

2021 年 6 月 17 日

Cross-study learning for generalist and specialist predictions

翻译：通才和专家预测交叉学习

Boyu Ren,Prasad Patil,Francesca Dominici,Giovanni Parmigiani,Lorenzo Trippa

The integration and use of data from multiple studies, for the development of prediction models is an important task in several scientific fields. We propose a framework for generalist and specialist predictions that leverages multiple datasets, with potential differences in the relationships between predictors and outcomes. Our framework uses stacking, and it includes three major components: 1) an ensemble of prediction models trained on one or more datasets, 2) task-specific utility functions and 3) a no-data-reuse technique for estimating stacking weights. We illustrate that under mild regularity conditions the framework produces stacked PFs with oracle properties. In particular we show that the the stacking weights are nearly optimal. We also characterize the scenario where the proposed no-data-reuse technique increases prediction accuracy compared to stacking with data reuse in a special case.We perform a simulation study to illustrate these results. We apply our framework to predict mortality using a collection of datasets on long-term exposure to air pollutants.

翻译：将多种研究的数据综合起来并加以使用,以开发预测模型,是若干科学领域的一项重要任务。我们提出了一个通用和专家预测框架,利用多种数据集,在预测数据和结果之间的关系上可能存在差异。我们的框架使用堆叠,包括三个主要组成部分:1)一组经过一个或多个数据集培训的预测模型,2)任务特有的实用功能,3)用于估计堆叠重量的无数据重复使用技术。我们说明,在温和的常规条件下,框架产生堆叠的有骨骼特性的PF。我们特别表明,堆叠的权重几乎是最佳的。我们还说明了拟议中的不使用技术与特殊情况下的数据再利用相比,提高了预测的准确性。我们进行了模拟研究,以说明这些结果。我们运用我们的框架,利用收集的关于空气污染物长期暴露的数据集来预测死亡率。

0

相关内容

预测器/决策函数

预测器/决策函数

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

最新《机器学习最优化》课程笔记，36页pdf，Optimization for Machine Learning

专知会员服务

171+阅读 · 2020年5月10日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【机器学习最优化课程笔记】Optimization for Machine Learning，36页pdf

【机器学习最优化课程笔记】Optimization for Machine Learning，36页pdf

专知会员服务

117+阅读 · 2020年3月25日

【金融机器学习课程资料】Financial Machine Learning

专知会员服务

119+阅读 · 2019年12月24日

【电子书推荐】机器学习课程，A Course in Machine Learning，Hal Daumé III

【电子书推荐】机器学习课程，A Course in Machine Learning，Hal Daumé III

专知会员服务

28+阅读 · 2019年11月19日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

春节充电系列：李宏毅2017机器学习课程学习笔记05之Logistic 回归

春节充电系列：李宏毅2017机器学习课程学习笔记05之Logistic 回归

专知

5+阅读 · 2018年2月18日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【推荐】Python机器学习生态圈(Scikit-Learn相关项目)

【推荐】Python机器学习生态圈(Scikit-Learn相关项目)

机器学习研究会

6+阅读 · 2017年8月23日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting

Arxiv

0+阅读 · 2021年8月18日

spectrai: A deep learning framework for spectral data

Arxiv

0+阅读 · 2021年8月17日

Retrieval & Interaction Machine for Tabular Data Prediction

Arxiv

5+阅读 · 2021年8月11日

Multi-Interactive Attention Network for Fine-grained Feature Learning in CTR Prediction

Arxiv

9+阅读 · 2020年12月13日

Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs

Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs

Arxiv

4+阅读 · 2020年1月8日

Generative Adversarial Networks and Conditional Random Fields for Hyperspectral Image Classification

Arxiv

3+阅读 · 2019年5月12日

Learning to Predict the Cosmological Structure Formation

Arxiv

3+阅读 · 2018年11月15日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

A Study on Overfitting in Deep Reinforcement Learning

Arxiv

7+阅读 · 2018年4月20日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

VIP会员

文章信息

相关主题

预测器/决策函数

预测准确率

估计/估计量

相关VIP内容

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

最新《机器学习最优化》课程笔记，36页pdf，Optimization for Machine Learning

专知会员服务

171+阅读 · 2020年5月10日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【机器学习最优化课程笔记】Optimization for Machine Learning，36页pdf

【机器学习最优化课程笔记】Optimization for Machine Learning，36页pdf

专知会员服务

117+阅读 · 2020年3月25日

【金融机器学习课程资料】Financial Machine Learning

专知会员服务

119+阅读 · 2019年12月24日

【电子书推荐】机器学习课程，A Course in Machine Learning，Hal Daumé III

【电子书推荐】机器学习课程，A Course in Machine Learning，Hal Daumé III

专知会员服务

28+阅读 · 2019年11月19日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体化人工智能：架构、应用及未来发展方向的综合综述

《自主武器》365页书籍

联邦学习综述：多层次聚合技术的系统分类、实验洞察与未来前沿

人工智能在空战中的局限及其真正适用领域

相关资讯

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

春节充电系列：李宏毅2017机器学习课程学习笔记05之Logistic 回归

春节充电系列：李宏毅2017机器学习课程学习笔记05之Logistic 回归

专知

5+阅读 · 2018年2月18日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【推荐】Python机器学习生态圈(Scikit-Learn相关项目)

【推荐】Python机器学习生态圈(Scikit-Learn相关项目)

机器学习研究会

6+阅读 · 2017年8月23日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting

Arxiv

0+阅读 · 2021年8月18日

spectrai: A deep learning framework for spectral data

Arxiv

0+阅读 · 2021年8月17日

Retrieval & Interaction Machine for Tabular Data Prediction

Arxiv

5+阅读 · 2021年8月11日

Multi-Interactive Attention Network for Fine-grained Feature Learning in CTR Prediction

Arxiv

9+阅读 · 2020年12月13日

Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs

Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs

Arxiv

4+阅读 · 2020年1月8日

Generative Adversarial Networks and Conditional Random Fields for Hyperspectral Image Classification

Arxiv

3+阅读 · 2019年5月12日

Learning to Predict the Cosmological Structure Formation

Arxiv

3+阅读 · 2018年11月15日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

A Study on Overfitting in Deep Reinforcement Learning

Arxiv

7+阅读 · 2018年4月20日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

微信扫码咨询专知VIP会员