Invariant Risk Minimisation for Cross-Organism Inference: Substituting Mouse Data for Human Data in Human Risk Factor Discovery


14 November 2021


Odhran O'Donoghue, Paul Duckworth, Giuseppe Ughi, Linus Scheibenreif, Kia Khezeli, Adrienne Hoarfrost, Samuel Budd, Patrick Foley, Nicholas Chia, John Kalantari, Graham Mackintosh, Frank Soboczenski, Lauren Sanders

From arXiv; Machine Learning for Health (ML4H), Extended Abstract

Human medical data can be challenging to obtain due to data privacy concerns, difficulties conducting certain types of experiments, or prohibitive associated costs. In many settings, data from animal models or in-vitro cell lines are available to help augment our understanding of human data. However, these data are known to have low etiological validity in comparison to human data. In this work, we augment small human medical datasets with in-vitro and animal-model data. We use Invariant Risk Minimisation (IRM) to elucidate invariant features by treating cross-organism data as belonging to different data-generating environments. Our models identify genes of relevance to human cancer development. We observe a degree of consistency in the results as the amounts of human and mouse data are varied; however, further work is required to obtain conclusive insights. As a secondary contribution, we enhance existing open source datasets and provide two uniformly processed, cross-organism, homologue gene-matched datasets to the community.
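To make the idea of treating each organism as a separate data-generating environment concrete, the following is a minimal sketch of an IRMv1-style objective: the per-environment risk plus a penalty equal to the squared gradient of that risk with respect to a fixed scalar multiplier on the predictor's output. It is not the authors' implementation. The linear predictor, the squared-error loss, the function names, and the toy environments `env_a` and `env_b` (stand-ins for two organisms) are all assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple

# One example is (features, label). All data below are toy values, not real gene data.
Example = Tuple[Sequence[float], float]


def predict(weights: Sequence[float], features: Sequence[float]) -> float:
    """Linear predictor (an assumption; any differentiable model could be used)."""
    return sum(w * x for w, x in zip(weights, features))


def risk_and_penalty(weights: Sequence[float], env: List[Example]) -> Tuple[float, float]:
    """Mean squared error in one environment, plus the IRMv1-style penalty.

    The penalty is the squared derivative of mean((s * p - y)^2) with respect
    to the scalar s, evaluated at s = 1, which equals mean(2 * (p - y) * p).
    """
    n = len(env)
    risk = 0.0
    grad = 0.0
    for features, y in env:
        p = predict(weights, features)
        risk += (p - y) ** 2
        grad += 2.0 * (p - y) * p
    return risk / n, (grad / n) ** 2


def irm_objective(weights: Sequence[float], envs: List[List[Example]], lam: float) -> float:
    """Sum over environments of risk + lam * penalty."""
    total = 0.0
    for env in envs:
        risk, penalty = risk_and_penalty(weights, env)
        total += risk + lam * penalty
    return total


# Toy environments: feature 0 equals the label in both environments (invariant);
# feature 1 equals the label in env_a but its negation in env_b (spurious).
labels = [1.0, 2.0, 3.0, -1.0]
env_a: List[Example] = [((y, y), y) for y in labels]
env_b: List[Example] = [((y, -y), y) for y in labels]

invariant = irm_objective((1.0, 0.0), [env_a, env_b], lam=1.0)
spurious = irm_objective((0.0, 1.0), [env_a, env_b], lam=1.0)
print(invariant, spurious)
```

In this toy setting, the predictor that relies only on the invariant feature attains a zero objective, while the predictor that relies on the environment-dependent feature is penalised in the environment where the relationship flips. The weight `lam` controls how strongly invariance is traded against in-environment fit.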

