A Hamiltonian Monte Carlo Model for Imputation and Augmentation of Healthcare Data - 专知 (Zhuanzhi) Papers

March 3, 2021

Narges Pourshahrokhi, Samaneh Kouchaki, Kord M. Kober, Christine Miaskowski, Payam Barnaghi

Missing values occur in nearly all clinical studies because data for a variable or question are not collected or not available. Inadequate handling of missing values can lead to biased results and loss of statistical power in analysis. Existing models usually do not consider privacy concerns or do not utilise the inherent correlations across multiple features to impute the missing values. In healthcare applications, we are usually confronted with high-dimensional and sometimes small-sample-size datasets that need more effective augmentation or imputation techniques. Moreover, imputation and augmentation are traditionally conducted separately, although imputing missing values and augmenting data can significantly improve generalisation and avoid bias in machine learning models. This work proposes a Bayesian approach to imputing missing values and creating augmented samples in high-dimensional healthcare data. We propose folded Hamiltonian Monte Carlo (F-HMC) with Bayesian inference as a more practical approach to processing the cross-dimensional relations, applying a random walk and Hamiltonian dynamics to adapt the posterior distribution and generate large-scale samples. The proposed method is applied to a cancer symptom assessment dataset and is confirmed to improve data quality in terms of precision, accuracy, recall, F1 score, and the propensity metric.
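To make the sampling idea concrete, the following is a minimal sketch of plain Hamiltonian Monte Carlo used to impute one missing feature from a correlated observed one. It is not the folded variant (F-HMC) proposed in the paper, whose details the abstract does not give. The bivariate Gaussian model, the correlation of 0.8, the observed value `X1_OBS`, and all function names and tuning parameters are assumptions chosen purely for illustration.

```python
import numpy as np

# Assumed toy model: two standardised features with correlation 0.8.
COV = np.array([[1.0, 0.8], [0.8, 1.0]])
PREC = np.linalg.inv(COV)
X1_OBS = 1.0  # hypothetical observed feature; the second feature is missing


def log_prob(x_missing):
    """Unnormalised log density of the missing feature given the observed one."""
    z = np.array([X1_OBS, x_missing[0]])
    return -0.5 * z @ PREC @ z


def grad_log_prob(x_missing):
    """Gradient of log_prob with respect to the missing feature."""
    z = np.array([X1_OBS, x_missing[0]])
    return np.array([-(PREC @ z)[1]])


def hmc_sample(x0, n_samples=2000, step_size=0.15, n_leapfrog=20, seed=0):
    """Standard HMC: leapfrog integration plus a Metropolis accept/reject step."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    samples = np.empty((n_samples, x.size))
    accepted = 0
    for i in range(n_samples):
        p0 = rng.standard_normal(x.size)  # resample the auxiliary momentum
        x_new, p = x.copy(), p0.copy()
        # Leapfrog: half momentum step, alternating full steps, half momentum step.
        p += 0.5 * step_size * grad_log_prob(x_new)
        for _ in range(n_leapfrog - 1):
            x_new += step_size * p
            p += step_size * grad_log_prob(x_new)
        x_new += step_size * p
        p += 0.5 * step_size * grad_log_prob(x_new)
        # Accept or reject based on the change in total energy (Hamiltonian).
        h_old = -log_prob(x) + 0.5 * p0 @ p0
        h_new = -log_prob(x_new) + 0.5 * p @ p
        if np.log(rng.uniform()) < h_old - h_new:
            x = x_new
            accepted += 1
        samples[i] = x
    return samples, accepted / n_samples


draws, accept_rate = hmc_sample(x0=[0.0])
imputations = draws[200:, 0]  # discard the first 200 draws as burn-in
print("acceptance rate:", round(accept_rate, 2))
print("imputed mean and variance:", imputations.mean(), imputations.var())
```

For this toy model the exact conditional distribution of the missing feature is Gaussian with mean 0.8 and variance 0.36, so the sample mean and variance can be checked against those values. Each retained draw is one plausible imputation; drawing many of them is also one simple way to produce augmented samples under the same assumed model.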
