Improving generalization of machine learning-identified biomarkers with causal modeling: an investigation into immune receptor diagnostics

April 3, 2023

Milena Pavlović, Ghadi S. Al Hajj, Chakravarthi Kanduri, Johan Pensar, Mollie Wood, Ludvig M. Sollid, Victor Greiff, Geir Kjetil Sandve

Machine learning is increasingly used to discover diagnostic and prognostic biomarkers from high-dimensional molecular data. However, a variety of factors related to experimental design may affect the ability to learn generalizable and clinically applicable diagnostics. Here, we argue that a causal perspective improves the identification of these challenges and formalizes their relation to the robustness and generalization of machine learning-based diagnostics. To make the discussion concrete, we focus on a specific, recently established high-dimensional biomarker: adaptive immune receptor repertoires (AIRRs). Through simulations, we illustrate how major biological and experimental factors of the AIRR domain may influence the learned biomarkers. In conclusion, we argue that causal modeling improves machine learning-based biomarker robustness by identifying stable relations between variables and by guiding the adjustment of the relations and variables that vary between populations.
