高多层随机森林回归性六相置热性测试 (Testing for Regression Heteroskedasticity with High-Dimensional Random Forests) - 专知论文

会员服务 ·

0

随机森林 · 方差 · 统计量 · 均值 · INTERACT ·

2022 年 12 月 5 日

Testing for Regression Heteroskedasticity with High-Dimensional Random Forests

翻译：高多层随机森林回归性六相置热性测试

from arxiv, 113 pages, 1 figure, 9 tables

Statistical inference for high-dimensional regression heteroskedasticity is an important but under-explored problem. The current paper aims at filling this gap by proposing two tests, namely the variance difference test and the variance difference Breusch-Pagan test, for assessing high-dimensional regression heteroskedasticity. The former tests whether an explanatory feature of interest is associated with the conditional variance of a response variable, while the latter tests heteroskedasticity in the regression, which is known to be the Breusch-Pagan test problem. To formally establish the tests, we have derived rigorous P-values and test sizes, and analyzed the test power under a nonparametric heteroskedastic data generating model with high-dimensional input features. Such a model setting takes into account high-dimensional applications with flexible structures of heteroskedasticity and features having interaction effects on the mean of the response; these are common applications in many fields such as biology. Our methods leverage machine learning mean prediction methods such as random forests and use knockoff variables as negative controls. Particularly, the definition of knockoffs for our test statistics is more flexible than the original definition of knockoffs, and we give a detailed comparison of these two definitions and discuss the advantages of our knockoffs. The satisfactory empirical performance of the proposed tests is illustrated with simulation results and an HIV (Human Immunodeficiency Virus) case study.

翻译：高维回归层的统计推断值是一个重要但尚未得到充分探讨的问题。本文的目的是通过提出两个测试来填补这一差距, 即差异差异测试和差异差异Breusch- Pagan测试, 用于评估高维回归层的三重风险测试。以前的测试是, 关注的解释性特征是否与反应变量的有条件差异相关联, 而后一种测试则是, 回归层的三重风险测试, 即众所周知的布雷什- 帕根测试问题。为了正式建立测试, 我们得出了严格的P值和测试尺寸, 分析了在非对等异差异差异异异异异异异异异异异异异异异异异异异的测试中测试能力。这种模型设置考虑到高维异异异异的高度应用, 对反应的平均值具有互动效应; 这些都是许多领域( 如生物学等) 常见的应用。我们的方法利用随机森林等机器的预测方法, 以及将击倒变量作为负面控制。特别是,, 在非对高维度数据生成模型的模型数据进行测试模型进行分析时, 我们提出的模拟测试定义比我们最初的模型测试, 的模拟测试的模型分析, 的逻辑分析是更灵活地讨论。

0

相关内容

随机森林

随机森林指的是利用多棵树对样本进行训练并预测的一种分类器。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Calmodulin的N环和C环与心肌CaV1.2钙通道的多个结合位点交互作用介导其Ca2+依赖性失活的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

光周期调节基因GmFT4在大豆中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

构建快速检测降钙素原的高灵敏量子点免疫层析技术的研究

国家自然科学基金

0+阅读 · 2013年12月31日

MicRNA107调控BACE1mRNA基因与阿尔茨海默病内质网应激病理机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

银屑病中皮肤DC的免疫调节机制

国家自然科学基金

0+阅读 · 2012年12月31日

棉花中一个成花素同源基因GhFTL1调节开花的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

III-V族半导体异质结构二维电子气的自旋输运特性

国家自然科学基金

0+阅读 · 2012年12月31日

量子散射中的异常现象、Levinson 定理及其它

国家自然科学基金

0+阅读 · 2011年12月31日

半Heusler合金型拓扑绝缘体材料的制备和物性研究

国家自然科学基金

0+阅读 · 2011年12月31日

探寻与高功能孤独症和Asperger综合征相关的拷贝数变异

国家自然科学基金

0+阅读 · 2009年12月31日

Maximum likelihood estimation and prediction error for a Mat{é}rn model on the circle

Arxiv

0+阅读 · 2023年2月6日

A Log-Linear Non-Parametric Online Changepoint Detection Algorithm based on Functional Pruning

Arxiv

0+阅读 · 2023年2月6日

Estimating Time-Varying Networks for High-Dimensional Time Series

Arxiv

0+阅读 · 2023年2月5日

Scalable inference in functional linear regression with streaming data

Arxiv

0+阅读 · 2023年2月5日

$\ell_1$-penalized Multinomial Regression: Estimation, inference, and prediction, with an application to risk factor identification for different dementia subtypes

Arxiv

0+阅读 · 2023年2月5日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Characterization and estimation of high dimensional sparse regression parameters under linear inequality constraints

Arxiv

0+阅读 · 2023年2月3日

Sparse High-Dimensional Vector Autoregressive Bootstrap

Arxiv

0+阅读 · 2023年2月2日

Robust multi-item auction design using statistical learning: Overcoming uncertainty in bidders' types distributions

Arxiv

0+阅读 · 2023年2月2日

High-dimensional variable clustering based on sub-asymptotic maxima of a weakly dependent random process

Arxiv

0+阅读 · 2023年2月2日

VIP会员

文章信息

相关主题

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】基础模型训练中网络规模数据的负责任与高效使用

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

人工智能时代背景下的未来海战

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Maximum likelihood estimation and prediction error for a Mat{é}rn model on the circle

Arxiv

0+阅读 · 2023年2月6日

A Log-Linear Non-Parametric Online Changepoint Detection Algorithm based on Functional Pruning

Arxiv

0+阅读 · 2023年2月6日

Estimating Time-Varying Networks for High-Dimensional Time Series

Arxiv

0+阅读 · 2023年2月5日

Scalable inference in functional linear regression with streaming data

Arxiv

0+阅读 · 2023年2月5日

$\ell_1$-penalized Multinomial Regression: Estimation, inference, and prediction, with an application to risk factor identification for different dementia subtypes

Arxiv

0+阅读 · 2023年2月5日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Characterization and estimation of high dimensional sparse regression parameters under linear inequality constraints

Arxiv

0+阅读 · 2023年2月3日

Sparse High-Dimensional Vector Autoregressive Bootstrap

Arxiv

0+阅读 · 2023年2月2日

Robust multi-item auction design using statistical learning: Overcoming uncertainty in bidders' types distributions

Arxiv

0+阅读 · 2023年2月2日

High-dimensional variable clustering based on sub-asymptotic maxima of a weakly dependent random process

Arxiv

0+阅读 · 2023年2月2日

相关基金

Calmodulin的N环和C环与心肌CaV1.2钙通道的多个结合位点交互作用介导其Ca2+依赖性失活的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

光周期调节基因GmFT4在大豆中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

构建快速检测降钙素原的高灵敏量子点免疫层析技术的研究

国家自然科学基金

0+阅读 · 2013年12月31日

MicRNA107调控BACE1mRNA基因与阿尔茨海默病内质网应激病理机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

银屑病中皮肤DC的免疫调节机制

国家自然科学基金

0+阅读 · 2012年12月31日

棉花中一个成花素同源基因GhFTL1调节开花的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

III-V族半导体异质结构二维电子气的自旋输运特性

国家自然科学基金

0+阅读 · 2012年12月31日

量子散射中的异常现象、Levinson 定理及其它

国家自然科学基金

0+阅读 · 2011年12月31日

半Heusler合金型拓扑绝缘体材料的制备和物性研究

国家自然科学基金

0+阅读 · 2011年12月31日

探寻与高功能孤独症和Asperger综合征相关的拷贝数变异

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员