FDR 联合Null假假设的受控多重测试:一种以淘汰为基础的方法 (FDR Controlled Multiple Testing for Union Null Hypotheses: A Knockoff-based Approach) - 专知论文

会员服务 ·

0

可辨认的 · 控制器 · 相互独立的 · 统计量 · 情景 ·

2022 年 10 月 3 日

FDR Controlled Multiple Testing for Union Null Hypotheses: A Knockoff-based Approach

翻译：FDR 联合Null假假设的受控多重测试:一种以淘汰为基础的方法

Ran Dai,Cheng Zheng

False discovery rate (FDR) controlling procedures provide important statistical guarantees for the replicability in signal identification based on multiple hypotheses testing. In many fields of study, FDR controlling procedures are used in high-dimensional (HD) analyses to discover features that are truly associated with the outcome. In some recent applications, data on the same set of candidate features are independently collected in multiple different studies. For example, gene expression data are collected at different facilities and with different cohorts, to identify the genetic biomarkers of multiple types of cancers. These studies provide us opportunities to identify signals by considering information from different sources (with potential heterogeneity) jointly. This paper is about how to provide FDR control guarantees for the tests of union null hypotheses of conditional independence. We present a knockoff-based variable selection method (\textit{Simultaneous knockoffs}) to identify mutual signals from multiple independent data sets, providing exact FDR control guarantees under finite sample settings. This method can work with very general model settings and test statistics. We demonstrate the performance of this method with extensive numerical studies and two real data examples.

翻译：假发现率(FDR)控制程序为基于多种假设测试的信号识别可复制性提供了重要的统计保障。在许多研究领域,FDR控制程序用于高维(HD)分析,以发现与结果真正相关的特征。在最近的一些应用中,同一套候选特征的数据通过多种不同的研究独立收集。例如,在不同设施内收集基因表达数据,并与不同的组群收集这些数据,以查明多种类型癌症的遗传生物标志。这些研究为我们提供了机会,通过共同考虑不同来源(潜在异质)的信息来识别信号。本文涉及如何为测试有条件独立的无关联假设提供FDR控制保障。我们提出了一个基于敲击的变量选择方法(\ textit{Simultaney knoff}),以识别来自多个独立数据集的相互信号,在有限的抽样环境中提供准确的FDR控制保证。这种方法可以与非常普遍的模型设置和测试统计数据合作。我们用大量的数字研究和两个真实数据实例来展示这一方法的绩效。

0

相关内容

可辨认的

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

专知会员服务

33+阅读 · 2022年6月13日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

弱辛Banach空间上的Maslov指标的研究

国家自然科学基金

0+阅读 · 2014年12月31日

脂肪细胞来源Microparticles介导新生血管生成在2型糖尿病易损斑块形成中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

TMS1基因响应高温胁迫和ER Stress的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

转录因子Ikaros在抑制肝癌干细胞增殖中的功能和分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

WC-CoCr热喷涂过程界面润湿特性及微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于子空间分析的LiDAR数据处理及表面重建关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

AtWRKY28转录因子在锯齿状缺刻叶发育及形态建成中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

机载InSAR区域网平差方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

几个非线性Schrodinger方程组模型及相关问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

乙烯反应转录因子OsERF2调控水稻根发育的分子基础

国家自然科学基金

0+阅读 · 2011年12月31日

A Local Search-Based Approach for Set Covering

Arxiv

0+阅读 · 2022年11月8日

Exponential Euler and backward Euler methods for nonlinear heat conduction problems

Arxiv

0+阅读 · 2022年11月8日

Abstraction-Based Verification of Approximate Pre-Opacity for Control Systems

Arxiv

0+阅读 · 2022年11月8日

Te Test: A New Non-asymptotic T-test for Behrens-Fisher Problems

Arxiv

0+阅读 · 2022年11月8日

On the amortized complexity of approximate counting

Arxiv

0+阅读 · 2022年11月8日

A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems

Arxiv

0+阅读 · 2022年11月6日

A Data-Driven Evolutionary Transfer Optimization for Expensive Problems in Dynamic Environments

Arxiv

0+阅读 · 2022年11月5日

Near-optimal multiple testing in Bayesian linear models with finite-sample FDR control

Arxiv

0+阅读 · 2022年11月4日

Verification of the busy-forbidden protocol (using an extension of the cones and foci framework)

Arxiv

0+阅读 · 2022年11月4日

Improved Image Segmentation via Cost Minimization of Multiple Hypotheses

Arxiv

14+阅读 · 2018年1月31日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

专知会员服务

33+阅读 · 2022年6月13日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

A Local Search-Based Approach for Set Covering

Arxiv

0+阅读 · 2022年11月8日

Exponential Euler and backward Euler methods for nonlinear heat conduction problems

Arxiv

0+阅读 · 2022年11月8日

Abstraction-Based Verification of Approximate Pre-Opacity for Control Systems

Arxiv

0+阅读 · 2022年11月8日

Te Test: A New Non-asymptotic T-test for Behrens-Fisher Problems

Arxiv

0+阅读 · 2022年11月8日

On the amortized complexity of approximate counting

Arxiv

0+阅读 · 2022年11月8日

A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems

Arxiv

0+阅读 · 2022年11月6日

A Data-Driven Evolutionary Transfer Optimization for Expensive Problems in Dynamic Environments

Arxiv

0+阅读 · 2022年11月5日

Near-optimal multiple testing in Bayesian linear models with finite-sample FDR control

Arxiv

0+阅读 · 2022年11月4日

Verification of the busy-forbidden protocol (using an extension of the cones and foci framework)

Arxiv

0+阅读 · 2022年11月4日

Improved Image Segmentation via Cost Minimization of Multiple Hypotheses

Arxiv

14+阅读 · 2018年1月31日

相关基金

弱辛Banach空间上的Maslov指标的研究

国家自然科学基金

0+阅读 · 2014年12月31日

脂肪细胞来源Microparticles介导新生血管生成在2型糖尿病易损斑块形成中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

TMS1基因响应高温胁迫和ER Stress的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

转录因子Ikaros在抑制肝癌干细胞增殖中的功能和分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

WC-CoCr热喷涂过程界面润湿特性及微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于子空间分析的LiDAR数据处理及表面重建关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

AtWRKY28转录因子在锯齿状缺刻叶发育及形态建成中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

机载InSAR区域网平差方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

几个非线性Schrodinger方程组模型及相关问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

乙烯反应转录因子OsERF2调控水稻根发育的分子基础

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员