具有越境随机效应的可缩缩后勤回归 (Scalable logistic regression with crossed random effects) - 专知论文

会员服务 ·

0

对数几率回归 · 可约的 · 欠估计 · 代价 · 样本方差 ·

2021 年 5 月 28 日

Scalable logistic regression with crossed random effects

翻译：具有越境随机效应的可缩缩后勤回归

Swarnadip Ghosh,Trevor Hastie,Art B. Owen

The cost of both generalized least squares (GLS) and Gibbs sampling in a crossed random effects model can easily grow faster than $N^{3/2}$ for $N$ observations. Ghosh et al. (2020) develop a backfitting algorithm that reduces the cost to $O(N)$. Here we extend that method to a generalized linear mixed model for logistic regression. We use backfitting within an iteratively reweighted penalized least square algorithm. The specific approach is a version of penalized quasi-likelihood due to Schall (1991). A straightforward version of Schall's algorithm would also cost more than $N^{3/2}$ because it requires the trace of the inverse of a large matrix. We approximate that quantity at cost $O(N)$ and prove that this substitution makes an asymptotically negligible difference. Our backfitting algorithm also collapses the fixed effect with one random effect at a time in a way that is analogous to the collapsed Gibbs sampler of Papaspiliopoulos et al. (2020). We use a symmetric operator that facilitates efficient covariance computation. We illustrate our method on a real dataset from Stitch Fix. By properly accounting for crossed random effects we show that a naive logistic regression could underestimate sampling variances by several hundred fold.

翻译：在宽度随机效应模型中,一般最低方(GLS)和Gibbs抽样的成本都很容易比美元3/2美元(美元)的观察成本增长更快。Ghosh等人(202020年)开发了一种将成本降低至美元(美元)的回调算法。在这里,我们将这种方法推广到普遍线性混合模式,以利后勤回归。我们用的是迭代再加权的、受处罚的最低方算法。具体的方法是Schall(1991年)的受罚准相似性(准相似性)的版本。一个直接版本的Schall的算法成本也高于美元(N3/2美元),因为它需要大矩阵的反差迹。我们以美元估算该数量,并证明这种替代可以产生无微小的差别。我们的后补算法也使固定效果崩溃,一次随机效果与Scapspililiopouls等人(202020年)的崩溃的Gibs采样器类似。我们使用一个配方操作器来便利高效的逆差计算。我们用一种方法,我们用100次的精确度计算方法来说明我们从Stellimregregregregrequest进行精确的精确分析。我们可以正确分析。

0

相关内容

对数几率回归

对数几率回归

【CVPR2021】用于目标检测的通用实例蒸馏

【CVPR2021】用于目标检测的通用实例蒸馏

专知会员服务

24+阅读 · 2021年3月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

126+阅读 · 2020年8月2日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

已删除

将门创投

3+阅读 · 2020年8月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

数据分析师应该知道的16种回归技术：Lasso回归

数据分析师应该知道的16种回归技术：Lasso回归

数萃大数据

16+阅读 · 2018年8月13日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Logistic回归第一弹——二项Logistic Regression

Logistic回归第一弹——二项Logistic Regression

机器学习深度学习实战原创交流

3+阅读 · 2015年10月22日

On Estimating Rank-One Spiked Tensors in the Presence of Heavy Tailed Errors

Arxiv

0+阅读 · 2021年7月20日

Using Shape Constraints for Improving Symbolic Regression Models

Arxiv

0+阅读 · 2021年7月20日

Generalized maximum likelihood estimation of the mean of parameters of mixtures, with applications to sampling

Arxiv

0+阅读 · 2021年7月20日

Fast compression of MCMC output

Arxiv

0+阅读 · 2021年7月19日

Multivariate Convex Regression at Scale

Arxiv

0+阅读 · 2021年7月19日

Distributional Anchor Regression

Arxiv

0+阅读 · 2021年7月19日

Estimating Linear Mixed Effects Models with Truncated Normally Distributed Random Effects

Arxiv

0+阅读 · 2021年7月17日

Subset-of-Data Variational Inference for Deep Gaussian-Processes Regression

Arxiv

0+阅读 · 2021年7月17日

Nonlinear Invariant Risk Minimization: A Causal Approach

Arxiv

0+阅读 · 2021年7月16日

Subspace Shrinkage in Conjugate Bayesian Vector Autoregressions

Arxiv

0+阅读 · 2021年7月16日

VIP会员

文章信息

相关主题

对数几率回归

相关VIP内容

【CVPR2021】用于目标检测的通用实例蒸馏

【CVPR2021】用于目标检测的通用实例蒸馏

专知会员服务

24+阅读 · 2021年3月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

126+阅读 · 2020年8月2日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】面向企业的图学习扩展：生产级图学习与推理，485页pdf

AI智能体编程：技术、挑战与机遇综述

【国家标准】数据安全技术数据安全风险评估方法

【CMU博士论文】交互式学习的进展：替代性反馈机制与自适应因果推理

相关资讯

已删除

将门创投

3+阅读 · 2020年8月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

数据分析师应该知道的16种回归技术：Lasso回归

数据分析师应该知道的16种回归技术：Lasso回归

数萃大数据

16+阅读 · 2018年8月13日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Logistic回归第一弹——二项Logistic Regression

Logistic回归第一弹——二项Logistic Regression

机器学习深度学习实战原创交流

3+阅读 · 2015年10月22日

相关论文

On Estimating Rank-One Spiked Tensors in the Presence of Heavy Tailed Errors

Arxiv

0+阅读 · 2021年7月20日

Using Shape Constraints for Improving Symbolic Regression Models

Arxiv

0+阅读 · 2021年7月20日

Generalized maximum likelihood estimation of the mean of parameters of mixtures, with applications to sampling

Arxiv

0+阅读 · 2021年7月20日

Fast compression of MCMC output

Arxiv

0+阅读 · 2021年7月19日

Multivariate Convex Regression at Scale

Arxiv

0+阅读 · 2021年7月19日

Distributional Anchor Regression

Arxiv

0+阅读 · 2021年7月19日

Estimating Linear Mixed Effects Models with Truncated Normally Distributed Random Effects

Arxiv

0+阅读 · 2021年7月17日

Subset-of-Data Variational Inference for Deep Gaussian-Processes Regression

Arxiv

0+阅读 · 2021年7月17日

Nonlinear Invariant Risk Minimization: A Causal Approach

Arxiv

0+阅读 · 2021年7月16日

Subspace Shrinkage in Conjugate Bayesian Vector Autoregressions

Arxiv

0+阅读 · 2021年7月16日

微信扫码咨询专知VIP会员