纵向数据分布情况:可视化、递减和预测 (Distributional Representation of Longitudinal Data: Visualization, Regression and Prediction) - 专知论文

会员服务 ·

0

稀疏 · 分布式表示 · 得分 · 泛函 · 样本 ·

2021 年 9 月 6 日

Distributional Representation of Longitudinal Data: Visualization, Regression and Prediction

翻译：纵向数据分布情况:可视化、递减和预测

Álvaro Gajardo,Xiongtao Dai,Hans-Georg Müller

We develop a representation of Gaussian distributed sparsely sampled longitudinal data whereby the data for each subject are mapped to a multivariate Gaussian distribution; this map is entirely data-driven. The proposed method utilizes functional principal component analysis and is nonparametric, assuming no prior knowledge of the covariance or mean structure of the longitudinal data. This approach naturally connects with a deeper investigation of the behavior of the functional principal component scores obtained for longitudinal data, as the number of observations per subject increases from sparse to dense. We show how this is reflected in the shrinkage of the distribution of the conditional scores given noisy longitudinal observations towards a point mass located at the true but unobservable FPCs. Mapping each subject's sparse observations to the corresponding conditional score distribution leads to useful visualizations and representations of sparse longitudinal data. Asymptotic rates of convergence as sample size increases are obtained for the 2-Wasserstein metric between the true and estimated conditional score distributions, both for a $K$-truncated functional principal component representation as well as for the case when $K=K(n)$ diverges with sample size $n\to\infty$. We apply these ideas to construct predictive distributions aimed at predicting outcomes given sparse longitudinal data.

翻译：我们开发了高森分散的分散抽样纵向数据代表, 将每个主题的数据映射成多变量高斯分布; 这张地图完全是数据驱动的。拟议的方法使用功能性主要成分分析, 并且是非参数性, 假设事先对纵向数据的共差或中值结构没有了解, 假设事先对纵向数据的共差或中值结构没有了解。这个方法自然与更深入地调查从纵向数据中获得的功能性主要组成部分分数的行为联系起来, 因为每个主题的观测数从稀疏到密度增加。我们展示了这一点如何反映在条件性分数分布的缩缩缩中, 原因是对位于真实但不可观测的FPCs的点质量进行了激烈的纵向观察。绘制每个对象对相应条件性分数分布的稀少观察, 导致对微长的长度数据的可视化和表达。当样本大小增加时, 在真实和估计的分数分布之间, 实际和估计性主要功能分数的比值增加, 两者的比重均表示为$- K= K= 以恒度预测的数值为预测结果。

0

相关内容

【开放书】数据可视化基础，《Fundamentals of Data Visualization》

专知会员服务

65+阅读 · 2021年6月13日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

专知会员服务

17+阅读 · 2020年5月19日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

【IC2S2 2019教程】社会信息网络分析的计算模型（Computational Models for Social and Information Network Analysis），清华大学计算机系教授唐杰

【IC2S2 2019教程】社会信息网络分析的计算模型（Computational Models for Social and Information Network Analysis），清华大学计算机系教授唐杰

专知会员服务

40+阅读 · 2019年11月16日

TensorFlow官方开源的神经结构学习（Neural Structured Learning）库

TensorFlow官方开源的神经结构学习（Neural Structured Learning）库

专知会员服务

18+阅读 · 2019年10月18日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

9+阅读 · 2017年10月17日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Stateful Strategic Regression

Arxiv

0+阅读 · 2021年10月26日

Random matrices based schemes for stable and robust nonparametric and functional regression estimators

Arxiv

0+阅读 · 2021年10月26日

Optimal Bayesian Estimation of a Regression Curve, a Conditional Density and a Conditional Distribution

Arxiv

0+阅读 · 2021年10月26日

Communication-Efficient Distributed Quantile Regression with Optimal Statistical Guarantees

Communication-Efficient Distributed Quantile Regression with Optimal Statistical Guarantees

Arxiv

0+阅读 · 2021年10月25日

Sufficient reductions in regression with mixed predictors

Arxiv

0+阅读 · 2021年10月25日

Applying Regression Conformal Prediction with Nearest Neighbors to time series data

Arxiv

0+阅读 · 2021年10月25日

Poisson-modification of the Quasi Lindley distribution and its zero modification for over-dispersed count data

Arxiv

0+阅读 · 2021年10月25日

Conjugate priors for count and rounded data regression

Arxiv

0+阅读 · 2021年10月23日

Bayesian Shrinkage for Functional Network Models, with Applications to Longitudinal Item Response Data

Arxiv

0+阅读 · 2021年10月22日

Premise selection with neural networks and distributed representation of features

Arxiv

3+阅读 · 2018年7月26日

VIP会员

文章信息

相关主题

分布式表示

相关VIP内容

【开放书】数据可视化基础，《Fundamentals of Data Visualization》

专知会员服务

65+阅读 · 2021年6月13日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

【ACL2020】Span-ConveRT：预训练对话表示小样本跨度提取，Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

专知会员服务

17+阅读 · 2020年5月19日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

【IC2S2 2019教程】社会信息网络分析的计算模型（Computational Models for Social and Information Network Analysis），清华大学计算机系教授唐杰

【IC2S2 2019教程】社会信息网络分析的计算模型（Computational Models for Social and Information Network Analysis），清华大学计算机系教授唐杰

专知会员服务

40+阅读 · 2019年11月16日

TensorFlow官方开源的神经结构学习（Neural Structured Learning）库

TensorFlow官方开源的神经结构学习（Neural Structured Learning）库

专知会员服务

18+阅读 · 2019年10月18日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《代码、指挥与冲突：描绘军事人工智能的未来》报告

【斯坦福博士论文】面向地理空间数据的多模态与多尺度建模：时空生成式人工智能

美国启动“自有军事人工智能计划”：采用谷歌Gemini以推动全军人工智能应用

《创新与适应性作为军事成功的关键因素：来自俄乌战争的战略洞见》报告

相关资讯

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

9+阅读 · 2017年10月17日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Stateful Strategic Regression

Arxiv

0+阅读 · 2021年10月26日

Random matrices based schemes for stable and robust nonparametric and functional regression estimators

Arxiv

0+阅读 · 2021年10月26日

Optimal Bayesian Estimation of a Regression Curve, a Conditional Density and a Conditional Distribution

Arxiv

0+阅读 · 2021年10月26日

Communication-Efficient Distributed Quantile Regression with Optimal Statistical Guarantees

Communication-Efficient Distributed Quantile Regression with Optimal Statistical Guarantees

Arxiv

0+阅读 · 2021年10月25日

Sufficient reductions in regression with mixed predictors

Arxiv

0+阅读 · 2021年10月25日

Applying Regression Conformal Prediction with Nearest Neighbors to time series data

Arxiv

0+阅读 · 2021年10月25日

Poisson-modification of the Quasi Lindley distribution and its zero modification for over-dispersed count data

Arxiv

0+阅读 · 2021年10月25日

Conjugate priors for count and rounded data regression

Arxiv

0+阅读 · 2021年10月23日

Bayesian Shrinkage for Functional Network Models, with Applications to Longitudinal Item Response Data

Arxiv

0+阅读 · 2021年10月22日

Premise selection with neural networks and distributed representation of features

Arxiv

3+阅读 · 2018年7月26日

微信扫码咨询专知VIP会员