Modelling fertility potential in survivors of childhood cancer: An introduction to modern statistical and computational methods

Statistical and computational methods are widely used in today's scientific studies. Using female fertility potential in childhood cancer survivors as an example, we illustrate how these methods can be used to extract insight into biological processes from noisy observational data in order to inform decision making. We start by contextualizing the computational methods with the working example: modelling acute ovarian failure risk in female childhood cancer survivors, to quantify the risk of permanent ovarian failure due to exposure to lifesaving but nonetheless toxic cancer treatments. This is followed by a description of the general framework of classification problems. We provide an overview of the modelling algorithms employed in our example, including one classic model (logistic regression) and two popular modern learning methods (random forest and support vector machines). Using the working example, we show the general steps of data preparation for modelling, the variable selection steps for the classic model, and how model performance might be improved using visualization tools. We end with a note on the importance of model evaluation.
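To make the classification framing concrete, the following is a minimal sketch of the classic model named above (logistic regression), fitted by plain gradient descent and checked on held-out data. It is not the authors' implementation and uses no clinical data: the two features (a "dose-like" and an "age-like" value), the coefficients used to simulate outcomes, and every function name are hypothetical choices made only for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def make_synthetic_data(n, rng):
    # Hypothetical data, NOT clinical: two features scaled to [0, 1] and a
    # binary outcome drawn from an assumed logistic risk model.
    rows = []
    for _ in range(n):
        dose = rng.uniform(0.0, 1.0)
        age = rng.uniform(0.0, 1.0)
        risk = sigmoid(6.0 * dose + 2.0 * age - 4.0)  # assumed coefficients
        label = 1 if rng.random() < risk else 0
        rows.append(([dose, age], label))
    return rows


def fit_logistic(rows, lr=0.5, epochs=500):
    # Batch gradient descent on the mean logistic loss.
    n_features = len(rows[0][0])
    w = [0.0] * n_features
    b = 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in rows:
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_proba(model, x):
    w, b = model
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def accuracy(model, rows, threshold=0.5):
    # Fraction of rows whose thresholded prediction matches the label.
    correct = sum((predict_proba(model, x) >= threshold) == bool(y)
                  for x, y in rows)
    return correct / len(rows)


rng = random.Random(0)              # fixed seed for reproducibility
data = make_synthetic_data(400, rng)
train, test = data[:300], data[300:]  # simple hold-out split
model = fit_logistic(train)
test_accuracy = accuracy(model, test)
print(f"held-out accuracy: {test_accuracy:.2f}")
```

Evaluating on rows the model never saw during fitting is the simplest form of the model evaluation the abstract closes on. In practice a library implementation, along with random forest and support vector machine classifiers, would normally replace this hand-written loop.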

