Deep neural networks with controlled variable selection for the identification of putative causal genetic variants

Deep neural networks (DNNs) have been used successfully in many scientific problems for their high prediction accuracy, but their application to genetic studies remains challenging due to their poor interpretability. In this paper, we consider the problem of scalable, robust variable selection in DNNs for the identification of putative causal genetic variants in genome sequencing studies. We identified pronounced randomness in feature selection in DNNs, arising from their stochastic nature, which may hinder interpretability and give rise to misleading results. We propose an interpretable neural network model, stabilized using ensembling, with controlled variable selection for genetic studies. The merits of the proposed method include: (1) flexible modelling of the non-linear effects of genetic variants to improve statistical power; (2) multiple knockoffs in the input layer to rigorously control the false discovery rate; (3) hierarchical layers to substantially reduce the number of weight parameters and activations and so improve computational efficiency; (4) de-randomized feature selection to stabilize identified signals. We evaluated the proposed method in extensive simulation studies and applied it to the analysis of Alzheimer's disease genetics. We showed that the proposed method, when compared to conventional linear and nonlinear methods, can lead to substantially more discoveries.
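To make item (2) concrete, the following is a minimal sketch of the standard single-knockoff "knockoff+" selection rule, which picks a data-dependent threshold so that the estimated false discovery proportion stays at or below a target level q. It is not the authors' implementation: the abstract describes multiple knockoffs inside a network's input layer, whereas this sketch assumes a single knockoff copy per feature and assumes that importance scores for the original and knockoff features are already available. The function names and the example scores are hypothetical.

```python
import numpy as np


def knockoff_statistics(z_original, z_knockoff):
    """Feature statistic W_j = |Z_j| - |Z~_j| (assumed importance scores).

    A large positive W_j means the original feature looks more important
    than its knockoff copy.
    """
    z_original = np.asarray(z_original, dtype=float)
    z_knockoff = np.asarray(z_knockoff, dtype=float)
    return np.abs(z_original) - np.abs(z_knockoff)


def knockoff_plus_threshold(w, q):
    """Smallest t > 0 with (1 + #{W_j <= -t}) / max(1, #{W_j >= t}) <= q.

    Returns np.inf when no candidate threshold satisfies the bound.
    """
    w = np.asarray(w, dtype=float)
    # Candidate thresholds are the distinct nonzero magnitudes of W.
    candidates = np.unique(np.abs(w[w != 0]))
    for t in candidates:  # np.unique returns sorted values
        fdp_estimate = (1 + np.sum(w <= -t)) / max(1, np.sum(w >= t))
        if fdp_estimate <= q:
            return float(t)
    return np.inf


def select_features(w, q=0.1):
    """Indices of features whose statistic reaches the knockoff+ threshold."""
    w = np.asarray(w, dtype=float)
    tau = knockoff_plus_threshold(w, q)
    return np.flatnonzero(w >= tau)


if __name__ == "__main__":
    # Hypothetical importance scores for six features and their knockoffs.
    z = [3.0, 2.5, 0.5, 4.0, 1.0, 5.0]
    z_tilde = [0.0, 0.5, 1.5, 0.0, 0.5, 0.0]
    w = knockoff_statistics(z, z_tilde)
    print("W:", w)
    print("selected:", select_features(w, q=0.5))
```

In a de-randomized variant, one plausible approach (again an assumption, not taken from the abstract) would be to aggregate the statistics over several independently trained networks before applying the threshold, so that the selected set depends less on any single random initialization.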

