通过成果适应性、阿盟SO和协作性倾向分分数方法整合数据 (Data Integration through outcome adaptive LASSO and a collaborative propensity score approach) - 专知论文

会员服务 ·

0

估计/估计量 · INFORMS · Performer · Integration · 得分 ·

2021 年 3 月 28 日

Data Integration through outcome adaptive LASSO and a collaborative propensity score approach

翻译：通过成果适应性、阿盟SO和协作性倾向分分数方法整合数据

Asma Bahamyirou,Mireille E. Schnitzer

Administrative data, or non-probability sample data, are increasingly being used to obtain official statistics due to their many benefits over survey methods. In particular, they are less costly, provide a larger sample size, and are not reliant on the response rate. However, it is difficult to obtain an unbiased estimate of the population mean from such data due to the absence of design weights. Several estimation approaches have been proposed recently using an auxiliary probability sample which provides representative covariate information of the target population. However, when this covariate information is high-dimensional, variable selection is not a straight-forward task even for a subject matter expert. In the context of efficient and doubly robust estimation approaches for estimating a population mean, we develop two data adaptive methods for variable selection using the outcome adaptive LASSO and a collaborative propensity score, respectively. Simulation studies are performed in order to verify the performance of the proposed methods versus competing methods. Finally, we presented an anayisis of the impact of Covid-19 on Canadians.

翻译：行政数据,或非概率抽样数据,由于在调查方法上有许多好处,正越来越多地被用于获取官方统计,特别是,这些数据费用较低,抽样规模较大,不依赖答复率;然而,由于缺乏设计权重,很难从这些数据中获得对人口平均数的公正估计;最近提出了几种估算方法,采用辅助概率抽样,为目标人口提供具有代表性的共变信息;然而,当这种共变信息为高维度时,即使对主题专家来说,选择变量也不是直截了当的任务。在对人口值进行高效和加倍有力的估计时,我们分别利用适应LASSO的结果和协作性适应性分数,为变量选择制定了两种数据适应性方法。进行了模拟研究,以核实拟议方法的绩效和相互竞争的方法。最后,我们介绍了Covid-19对加拿大人的影响。

0

相关内容

估计/估计量

估计/估计量

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【CVPR2021】现实世界域泛化的自适应方法

【CVPR2021】现实世界域泛化的自适应方法

专知会员服务

58+阅读 · 2021年3月31日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

Model-Assisted Inference for Covariate-Specific Treatment Effects with High-dimensional Data

Arxiv

0+阅读 · 2021年5月24日

Computational Efficient Approximations of the Concordance Probability in a Big Data Setting

Arxiv

0+阅读 · 2021年5月21日

Estimating Unknown Time-Varying Parameters in Uncertain Differential Equation

Arxiv

0+阅读 · 2021年5月21日

Profile Matching for the Generalization and Personalization of Causal Inferences

Arxiv

0+阅读 · 2021年5月20日

Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective

Arxiv

0+阅读 · 2021年5月20日

Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

Arxiv

0+阅读 · 2021年5月20日

A Robust Score-Driven Filter for Multivariate Time Series

Arxiv

0+阅读 · 2021年5月20日

A data-driven approach to the forecasting of ground-level ozone concentration

Arxiv

0+阅读 · 2021年5月20日

Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding

Arxiv

0+阅读 · 2021年5月20日

Performance of Empirical Risk Minimization for Linear Regression with Dependent Data

Arxiv

0+阅读 · 2021年5月19日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【CVPR2021】现实世界域泛化的自适应方法

【CVPR2021】现实世界域泛化的自适应方法

专知会员服务

58+阅读 · 2021年3月31日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

NeurIPS 2025 | 自动化所新作速览（一）

大型语言模型（LLM）赋能的知识图谱构建：综述

NeurIPS 2025 | 自动化所新作速览（二）

领域特定文本分类中的预训练语言模型新进展：系统综述

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

Model-Assisted Inference for Covariate-Specific Treatment Effects with High-dimensional Data

Arxiv

0+阅读 · 2021年5月24日

Computational Efficient Approximations of the Concordance Probability in a Big Data Setting

Arxiv

0+阅读 · 2021年5月21日

Estimating Unknown Time-Varying Parameters in Uncertain Differential Equation

Arxiv

0+阅读 · 2021年5月21日

Profile Matching for the Generalization and Personalization of Causal Inferences

Arxiv

0+阅读 · 2021年5月20日

Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective

Arxiv

0+阅读 · 2021年5月20日

Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

Arxiv

0+阅读 · 2021年5月20日

A Robust Score-Driven Filter for Multivariate Time Series

Arxiv

0+阅读 · 2021年5月20日

A data-driven approach to the forecasting of ground-level ozone concentration

Arxiv

0+阅读 · 2021年5月20日

Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding

Arxiv

0+阅读 · 2021年5月20日

Performance of Empirical Risk Minimization for Linear Regression with Dependent Data

Arxiv

0+阅读 · 2021年5月19日

微信扫码咨询专知VIP会员