以预测方式选择 (Selection by Prediction with Conformal p-values) - 专知论文

会员服务 ·

0

Conformer · 短列表 · 阈值 · MoDELS · CASES ·

2022 年 10 月 4 日

Selection by Prediction with Conformal p-values

翻译：以预测方式选择

Ying Jin,Emmanuel J. Candès

Decision making or scientific discovery pipelines such as job hiring and drug discovery often involve multiple stages: before any resource-intensive step, there is often an initial screening that uses predictions from a machine learning model to shortlist a few candidates from a large pool. We study screening procedures that aim to select candidates whose unobserved outcomes exceed user-specified values. We develop a method that wraps around any prediction model to produce a subset of candidates while controlling the proportion of falsely selected units. Building upon the conformal inference framework, our method first constructs p-values that quantify the statistical evidence for large outcomes; it then determines the shortlist by comparing the p-values to a threshold introduced in the multiple testing literature. In many cases, the procedure selects candidates whose predictions are above a data-dependent threshold. We demonstrate the empirical performance of our method via simulations, and apply it to job hiring and drug discovery datasets.

翻译：决策或科学发现管道,如招工和毒品发现,往往涉及多个阶段:在任何资源密集型步骤之前,往往先进行初步筛选,利用机器学习模型的预测,从大型人才库中将少数候选人排入最后名单;我们研究筛选程序,目的是挑选未观察到的结果超过用户指定值的候选人;我们开发一种环绕任何预测模型的方法,以产生一组候选人,同时控制不实选择单位的比例;根据一致的推断框架,我们的方法首先构建了将统计证据量化为大结果的p值;然后通过将P值与多个测试文献中引入的阈值进行比较来确定短名单;在许多情况下,该程序挑选的人选的预测高于数据依赖阈值;我们通过模拟来展示我们方法的经验性表现,并将其应用于招聘和药物发现数据集。

0

相关内容

Conformer

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Fe掺杂CuGaS2中间带薄膜材料的制备及光电特性

国家自然科学基金

0+阅读 · 2014年12月31日

基于水资源系统演变不确定性的水资源短缺风险评估

国家自然科学基金

0+阅读 · 2013年12月31日

热-机械疲劳载荷下抗高温材料表面冷却孔的变形研究

国家自然科学基金

0+阅读 · 2013年12月31日

含氮杂噻蒽环高折射率聚芳硫醚的分子设计与性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

Trx1/FOXO1信号通路调控肝癌耐药的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

丹参联合化疗和VEGF靶向药物对结肠癌的协同作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

MRP1/ABCC1基因3＇UTR单核苷酸多态性介导miRNA对原发性肝癌多药耐药性的影响

国家自然科学基金

0+阅读 · 2012年12月31日

CD45及剪接变构体在骨髓造血干细胞层面始动银屑病发病的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

强非线性椭圆问题

国家自然科学基金

0+阅读 · 2009年12月31日

一类necroptosis诱导剂抗肿瘤干细胞的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Zero-Label Prompt Selection

Arxiv

0+阅读 · 2022年11月9日

Cold Start Streaming Learning for Deep Networks

Arxiv

0+阅读 · 2022年11月9日

Learning to Follow Instructions in Text-Based Games

Arxiv

0+阅读 · 2022年11月8日

Flexible variable selection in the presence of missing data

Arxiv

0+阅读 · 2022年11月8日

Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

Arxiv

0+阅读 · 2022年11月8日

Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following

Arxiv

0+阅读 · 2022年11月7日

A Dynamic Spatiotemporal Stochastic Volatility Model with an Application to Environmental Risks

Arxiv

0+阅读 · 2022年11月6日

SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic Scenes

Arxiv

0+阅读 · 2022年11月3日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

IEOPF: An Active Contour Model for Image Segmentation with Inhomogeneities Estimated by Orthogonal Primary Functions

Arxiv

10+阅读 · 2018年1月20日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Zero-Label Prompt Selection

Arxiv

0+阅读 · 2022年11月9日

Cold Start Streaming Learning for Deep Networks

Arxiv

0+阅读 · 2022年11月9日

Learning to Follow Instructions in Text-Based Games

Arxiv

0+阅读 · 2022年11月8日

Flexible variable selection in the presence of missing data

Arxiv

0+阅读 · 2022年11月8日

Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

Arxiv

0+阅读 · 2022年11月8日

Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following

Arxiv

0+阅读 · 2022年11月7日

A Dynamic Spatiotemporal Stochastic Volatility Model with an Application to Environmental Risks

Arxiv

0+阅读 · 2022年11月6日

SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic Scenes

Arxiv

0+阅读 · 2022年11月3日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

IEOPF: An Active Contour Model for Image Segmentation with Inhomogeneities Estimated by Orthogonal Primary Functions

Arxiv

10+阅读 · 2018年1月20日

相关基金

Fe掺杂CuGaS2中间带薄膜材料的制备及光电特性

国家自然科学基金

0+阅读 · 2014年12月31日

基于水资源系统演变不确定性的水资源短缺风险评估

国家自然科学基金

0+阅读 · 2013年12月31日

热-机械疲劳载荷下抗高温材料表面冷却孔的变形研究

国家自然科学基金

0+阅读 · 2013年12月31日

含氮杂噻蒽环高折射率聚芳硫醚的分子设计与性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

Trx1/FOXO1信号通路调控肝癌耐药的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

丹参联合化疗和VEGF靶向药物对结肠癌的协同作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

MRP1/ABCC1基因3＇UTR单核苷酸多态性介导miRNA对原发性肝癌多药耐药性的影响

国家自然科学基金

0+阅读 · 2012年12月31日

CD45及剪接变构体在骨髓造血干细胞层面始动银屑病发病的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

强非线性椭圆问题

国家自然科学基金

0+阅读 · 2009年12月31日

一类necroptosis诱导剂抗肿瘤干细胞的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员