Optimal Cox Regression Subsampling Procedure with Rare Events - 专知 (Zhuanzhi) Papers

December 3, 2020

Optimal Cox Regression Subsampling Procedure with Rare Events

Nir Keret, Malka Gorfine

Massive survival datasets are becoming increasingly prevalent with the development of the healthcare industry. Such datasets pose computational challenges unprecedented in traditional survival analysis use cases. A popular way of coping with massive datasets is to downsample them to a more manageable size, so that the required computational resources are within the researcher's reach. Cox proportional hazards regression remains one of the most popular statistical models for the analysis of survival data to date. This work addresses the setting of right-censored and possibly left-truncated data with rare events, such that the observed failure times constitute only a small portion of the overall sample. We propose Cox regression subsampling-based estimators that approximate their full-data partial-likelihood-based counterparts by assigning optimal sampling probabilities to censored observations and including all observed failures in the analysis. Asymptotic properties of the proposed estimators are established under suitable regularity conditions, and simulation studies are carried out to evaluate the finite-sample performance of the estimators. We further apply our procedure to UK Biobank data on genetic and environmental risk factors for colorectal cancer.
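
The sampling scheme described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it defaults to uniform sampling probabilities in place of the optimal ones (which the abstract does not specify), draws censored rows with replacement, assumes inverse-probability weights of 1 / (q * p_i) for sampled censored rows and weight 1 for failures, and ignores left truncation and tied failure times. The function names `subsample_rare_events` and `weighted_cox_loglik` are hypothetical.

```python
import numpy as np


def subsample_rare_events(event, q, probs=None, seed=0):
    """Keep every observed failure and draw q censored rows with replacement.

    probs: sampling probabilities over the censored rows (assumed uniform
    when omitted; the paper's optimal probabilities are not reproduced here).
    Returns row indices into the full data and their analysis weights.
    """
    event = np.asarray(event, dtype=bool)
    rng = np.random.default_rng(seed)
    failure_idx = np.flatnonzero(event)
    censored_idx = np.flatnonzero(~event)
    if probs is None:
        probs = np.full(censored_idx.size, 1.0 / censored_idx.size)
    probs = np.asarray(probs, dtype=float)
    drawn = rng.choice(censored_idx.size, size=q, replace=True, p=probs)
    idx = np.concatenate([failure_idx, censored_idx[drawn]])
    # Assumed weighting: 1 for failures, 1 / (q * p_i) for sampled censored rows.
    weights = np.concatenate(
        [np.ones(failure_idx.size), 1.0 / (q * probs[drawn])]
    )
    return idx, weights


def weighted_cox_loglik(beta, time, event, X, weights):
    """Weighted Cox partial log-likelihood (no ties, no left truncation)."""
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=bool)
    X = np.asarray(X, dtype=float)
    weights = np.asarray(weights, dtype=float)
    eta = X @ np.asarray(beta, dtype=float)
    weighted_risk = weights * np.exp(eta)
    loglik = 0.0
    for i in np.flatnonzero(event):
        at_risk = time >= time[i]  # risk set at the i-th failure time
        loglik += eta[i] - np.log(weighted_risk[at_risk].sum())
    return loglik
```

A subsample-based estimate would then maximize `weighted_cox_loglik` over `beta` on the rows in `idx`, for example with a general-purpose numerical optimizer. With uniform probabilities each sampled censored row stands in for (number of censored rows) / q rows of the full data.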
