高差异后勤倒退中统计推断的快速方法 (SLOE: A Faster Method for Statistical Inference in High-Dimensional Logistic Regression) - 专知论文

会员服务 ·

0

估计/估计量 · 对数几率回归 · 可约的 · 统计量 · 极大似然估计 ·

2021 年 3 月 23 日

SLOE: A Faster Method for Statistical Inference in High-Dimensional Logistic Regression

翻译：高差异后勤倒退中统计推断的快速方法

Steve Yadlowsky,Taedong Yun,Cory McLean,Alexander D'Amour

Logistic regression remains one of the most widely used tools in applied statistics, machine learning and data science. Practical datasets often have a substantial number of features $d$ relative to the sample size $n$. In these cases, the logistic regression maximum likelihood estimator (MLE) is biased, and its standard large-sample approximation is poor. In this paper, we develop an improved method for debiasing predictions and estimating frequentist uncertainty for such datasets. We build on recent work characterizing the asymptotic statistical behavior of the MLE in the regime where the aspect ratio $d / n$, instead of the number of features $d$, remains fixed as $n$ grows. In principle, this approximation facilitates bias and uncertainty corrections, but in practice, these corrections require an estimate of the signal strength of the predictors. Our main contribution is SLOE, an estimator of the signal strength with convergence guarantees that reduces the computation time of estimation and inference by orders of magnitude. The bias correction that this facilitates also reduces the variance of the predictions, yielding narrower confidence intervals with higher (valid) coverage of the true underlying probabilities and parameters. We provide an open source package for this method, available at https://github.com/google-research/sloe-logistic.

翻译：在应用统计、机器学习和数据科学方面,物流回归仍然是最广泛使用的工具之一。实用数据集通常具有与抽样规模相对相当的大量特征。在这些案例中,后勤回归最大可能性估计仪(MLE)存在偏差,其标准大范围抽样近似值很低。在本文中,我们开发了一种更好的方法,用以减少预测的偏差,并估计这类数据集的常态不确定性。我们以最近的工作为基础,将MLE在制度下的无约束统计行为定性为标准,因为制度内方位比率为$/n美元,而不是特征数目为美元,但随着美元的增长而固定不变。原则上,这种近似可促进偏差和不确定性的纠正,但在实践中,这些更正需要估计预测仪的信号强度。我们的主要贡献是SLOE,这是信号强度的衡量标准,保证会减少估算的计算时间和数量级的推断。纠正偏差还有助于减少预测的差异,产生较窄的互信度间隔期,产生较窄的间隔期,在可获取的精确度/精确的参数中,我们提供这种精确的精确的源。

0

相关内容

估计/估计量

估计/估计量

【斯坦福大学博士论文】大规模和高维统计学习方法和算法，147页pdf， Large-scale and high-dimensional statistical learning methods and algorithms

专知会员服务

26+阅读 · 2020年6月13日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

专知会员服务

16+阅读 · 2019年11月30日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

逻辑回归（Logistic Regression）模型简介

逻辑回归（Logistic Regression）模型简介

全球人工智能

5+阅读 · 2017年11月1日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

Robust approach for variable selection with high dimensional Logitudinal data analysis

Arxiv

0+阅读 · 2021年5月18日

High-Dimensional Sparse Single-Index Regression Via Hilbert-Schmidt Independence Criterion

Arxiv

0+阅读 · 2021年5月18日

Smoothed Quantile Regression with Large-Scale Inference

Arxiv

0+阅读 · 2021年5月18日

Uniform-in-Submodel Bounds for Linear Regression in a Model Free Framework

Arxiv

0+阅读 · 2021年5月17日

Eigenvalue distribution of a high-dimensional distance covariance matrix with application

Arxiv

0+阅读 · 2021年5月17日

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks

Arxiv

0+阅读 · 2021年5月17日

Multi-Agent Low-Dimensional Linear Bandits

Arxiv

0+阅读 · 2021年5月16日

Statistical inference for stationary linear models with tapered data

Arxiv

0+阅读 · 2021年5月14日

A multilevel Monte Carlo method for asymptotic-preserving particle schemes in the diffusive limit

Arxiv

0+阅读 · 2021年5月14日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

VIP会员

文章信息

相关主题

估计/估计量

对数几率回归

极大似然估计

相关VIP内容

【斯坦福大学博士论文】大规模和高维统计学习方法和算法，147页pdf， Large-scale and high-dimensional statistical learning methods and algorithms

专知会员服务

26+阅读 · 2020年6月13日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

专知会员服务

16+阅读 · 2019年11月30日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

逻辑回归（Logistic Regression）模型简介

逻辑回归（Logistic Regression）模型简介

全球人工智能

5+阅读 · 2017年11月1日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Robust approach for variable selection with high dimensional Logitudinal data analysis

Arxiv

0+阅读 · 2021年5月18日

High-Dimensional Sparse Single-Index Regression Via Hilbert-Schmidt Independence Criterion

Arxiv

0+阅读 · 2021年5月18日

Smoothed Quantile Regression with Large-Scale Inference

Arxiv

0+阅读 · 2021年5月18日

Uniform-in-Submodel Bounds for Linear Regression in a Model Free Framework

Arxiv

0+阅读 · 2021年5月17日

Eigenvalue distribution of a high-dimensional distance covariance matrix with application

Arxiv

0+阅读 · 2021年5月17日

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks

Arxiv

0+阅读 · 2021年5月17日

Multi-Agent Low-Dimensional Linear Bandits

Arxiv

0+阅读 · 2021年5月16日

Statistical inference for stationary linear models with tapered data

Arxiv

0+阅读 · 2021年5月14日

A multilevel Monte Carlo method for asymptotic-preserving particle schemes in the diffusive limit

Arxiv

0+阅读 · 2021年5月14日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

微信扫码咨询专知VIP会员