改进半监督高维回归模型的预测器 (Improved Estimators for Semi-supervised High-dimensional Regression Model) - 专知论文

会员服务 ·

0

估计/估计量 · 可约的 · 向量化 · 方差 · MoDELS ·

2021 年 8 月 30 日

Improved Estimators for Semi-supervised High-dimensional Regression Model

翻译：改进半监督高维回归模型的预测器

Ilan Livne,David Azriel,Yair Goldberg

We study a linear high-dimensional regression model in a semi-supervised setting, where for many observations only the vector of covariates $X$ is given with no response $Y$. We do not make any sparsity assumptions on the vector of coefficients, and aim at estimating Var$(Y|X)$. We propose an estimator, which is unbiased, consistent, and asymptotically normal. This estimator can be improved by adding zero-estimators arising from the unlabelled data. Adding zero-estimators does not affect the bias and potentially can reduce variance. In order to achieve optimal improvement, many zero-estimators should be used, but this raises the problem of estimating many parameters. Therefore, we introduce covariate selection algorithms that identify which zero-estimators should be used in order to improve the above estimator. We further illustrate our approach for other estimators, and present an algorithm that improves estimation for any given variance estimator. Our theoretical results are demonstrated in a simulation study.

翻译：我们在一个半监督的环境中研究一个线性高维回归模型,对于许多观测来说,只有共变的矢量X$得到没有响应的美元。我们没有对系数的矢量做任何夸度假设,我们的目标是估计Var$(Y ⁇ X),我们建议一个不偏袒、一致和无症状的测算器。这个测算器可以通过增加无标签数据产生的零估计值来改进。添加零估计值不会影响偏差,而且有可能减少差异。为了实现最佳的改进,应当使用许多零估计值,但这提出了估计许多参数的问题。因此,我们采用共变选择算法,确定应使用哪些零估计值来改进上述估计值。我们进一步为其他估计者说明我们的方法,并提出一种改进任何差异估计值的算法。我们的理论结果在模拟研究中得到了证明。

0

相关内容

估计/估计量

估计/估计量

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

专知会员服务

20+阅读 · 2020年6月23日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

专知会员服务

33+阅读 · 2020年4月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

已删除

将门创投

5+阅读 · 2019年4月4日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

On Optimal Interpolation In Linear Regression

Arxiv

0+阅读 · 2021年10月21日

Distribution-Free Robust Linear Regression

Arxiv

0+阅读 · 2021年10月21日

A Parsimonious Personalized Dose Finding Model via Dimension Reduction

Arxiv

0+阅读 · 2021年10月20日

Testing for long-range dependence in non-stationary time series time-varying regression

Arxiv

0+阅读 · 2021年10月20日

Two-Stage TMLE to Reduce Bias and Improve Efficiency in Cluster Randomized Trials

Arxiv

0+阅读 · 2021年10月20日

Heterogeneous Treatment Effects in Regression Discontinuity Designs

Arxiv

0+阅读 · 2021年10月19日

A General Modeling Framework for Network Autoregressive Processes

Arxiv

0+阅读 · 2021年10月18日

Quantile Regression by Dyadic CART

Arxiv

0+阅读 · 2021年10月16日

Generalized regression operator estimation for continuous time functional data processes with missing at random response

Arxiv

0+阅读 · 2021年10月15日

Strong $L^p$-error analysis of nonlinear Monte Carlo approximations for high-dimensional semilinear partial differential equations

Arxiv

0+阅读 · 2021年10月15日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

专知会员服务

20+阅读 · 2020年6月23日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

专知会员服务

33+阅读 · 2020年4月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

新书册《几何深度学习的数学基础》

中程单向攻击无人机的战略意义：俄乌战争启示

在无标注条件下适配视觉—语言模型：全面综述

面向视觉语言模型的持续学习：遗忘之外的综述与分类体系

相关资讯

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

已删除

将门创投

5+阅读 · 2019年4月4日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

相关论文

On Optimal Interpolation In Linear Regression

Arxiv

0+阅读 · 2021年10月21日

Distribution-Free Robust Linear Regression

Arxiv

0+阅读 · 2021年10月21日

A Parsimonious Personalized Dose Finding Model via Dimension Reduction

Arxiv

0+阅读 · 2021年10月20日

Testing for long-range dependence in non-stationary time series time-varying regression

Arxiv

0+阅读 · 2021年10月20日

Two-Stage TMLE to Reduce Bias and Improve Efficiency in Cluster Randomized Trials

Arxiv

0+阅读 · 2021年10月20日

Heterogeneous Treatment Effects in Regression Discontinuity Designs

Arxiv

0+阅读 · 2021年10月19日

A General Modeling Framework for Network Autoregressive Processes

Arxiv

0+阅读 · 2021年10月18日

Quantile Regression by Dyadic CART

Arxiv

0+阅读 · 2021年10月16日

Generalized regression operator estimation for continuous time functional data processes with missing at random response

Arxiv

0+阅读 · 2021年10月15日

Strong $L^p$-error analysis of nonlinear Monte Carlo approximations for high-dimensional semilinear partial differential equations

Arxiv

0+阅读 · 2021年10月15日

微信扫码咨询专知VIP会员