October 23, 2022

The Terminating-Random Experiments Selector: Fast High-Dimensional Variable Selection with False Discovery Rate Control


Jasin Machkour, Michael Muma, Daniel P. Palomar

From arXiv; 32 pages, 24 figures, 2 tables; R packages 'TRexSelector' and 'tlars' on CRAN

We propose the Terminating-Random Experiments (T-Rex) selector, a fast variable selection method for high-dimensional data. The T-Rex selector controls a user-defined target false discovery rate (FDR) while maximizing the number of selected variables. This is achieved by fusing the solutions of multiple early terminated random experiments. The experiments are conducted on a combination of the original predictors and multiple sets of randomly generated dummy predictors. A finite sample proof based on martingale theory for the FDR control property is provided. Numerical simulations confirm that the FDR is controlled at the target level while allowing for a high power. We prove under mild conditions that the dummies can be sampled from any univariate probability distribution with finite expectation and variance. The computational complexity of the proposed method is linear in the number of variables. The T-Rex selector outperforms state-of-the-art methods for FDR control on a simulated genome-wide association study (GWAS), while its sequential computation time is more than two orders of magnitude lower than that of the strongest benchmark methods. The open source R package TRexSelector containing the implementation of the T-Rex selector is available on CRAN.
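To make the two quantities in the abstract concrete, the following is a minimal sketch of how FDR and power are usually estimated in a simulation where the true active variables are known. It uses the standard definitions (false discovery proportion = false selections / max(1, number of selections); FDR = its average over repetitions). It is not the T-Rex selector and not code from the TRexSelector package; the function names `fdp_and_tpp` and `empirical_fdr_and_power` and the toy index sets are hypothetical and chosen only for illustration.

```python
def fdp_and_tpp(selected, true_support):
    """Return (FDP, TPP) for one selected set of variable indices.

    FDP = (# selected nulls) / max(1, # selected)
    TPP = (# selected true actives) / max(1, # true actives)
    """
    selected = set(selected)
    true_support = set(true_support)
    false_discoveries = len(selected - true_support)
    true_discoveries = len(selected & true_support)
    fdp = false_discoveries / max(1, len(selected))
    tpp = true_discoveries / max(1, len(true_support))
    return fdp, tpp


def empirical_fdr_and_power(selections, true_support):
    """Average FDP and TPP over repeated runs (Monte Carlo estimates of FDR and power)."""
    pairs = [fdp_and_tpp(s, true_support) for s in selections]
    fdr = sum(p[0] for p in pairs) / len(pairs)
    power = sum(p[1] for p in pairs) / len(pairs)
    return fdr, power


# Toy data (made up): variables 0..3 are truly active; four simulated runs.
true_support = {0, 1, 2, 3}
selections = [{0, 1, 2, 7}, {0, 1, 2, 3}, {0, 1}, set()]

fdr, power = empirical_fdr_and_power(selections, true_support)
print(f"empirical FDR = {fdr}, empirical power = {power}")
# prints: empirical FDR = 0.0625, empirical power = 0.5625
```

A selector "controls the FDR at the target level" when this average FDP stays at or below the user-defined target; among such selectors, the one with the higher average TPP has more power.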

