Quantification of heritability is a fundamental desideratum in genetics, which allows an assessment of the contribution of additive genetic variation to the variability of a trait of interest. The traditional computational approaches for assessing the heritability of a trait have been developed in the field of quantitative genetics. However, the rise of modern population genomics with large sample sizes has led to the development of several new machine learning based approaches to inferring heritability. In this paper, we systematically summarize recent advances in machine learning which can be used to infer heritability. We focus on an application of these methods to bacterial genomes, where heritability plays a key role in understanding phenotypes such as antibiotic resistance and virulence, which are particularly important due to the rising frequency of antimicrobial resistance. By designing a heritability model incorporating realistic patterns of genome-wide linkage disequilibrium for a frequently recombining bacterial pathogen, we test the performance of a wide spectrum of different inference methods, including also GCTA. In addition to the synthetic data benchmark, we present a comparison of the methods for antibiotic resistance traits for multiple bacterial pathogens. Insights from the benchmarking and real data analyses indicate a highly variable performance of the different methods and suggest that heritability inference would likely benefit from tailoring of the methods to the specific genetic architecture of the target organism.
翻译:在遗传学领域,已经发展了传统的计算方法来评估某种特性的遗传性。但是,由于抗生素抗药性和剧变的频率不断上升,这一点特别重要。通过设计一种基于遗传学的模型,将基因组联系的不均匀性的现实模式纳入一个经常重新组合的细菌病原体,我们系统地总结了机器学习的最新进展,可以用来推断遗传性。我们注重将这些方法应用于细菌基因组,在了解抗生素抗药性和剧变性等人型方面发挥着关键作用。除了合成数据基准外,我们还比较了从抗生素抗体抗体抗体和抗体抗体抗变性等不同类型类型类型类型基因学方法的可变性模型。我们用该模型设计出一种包含基因组全局性联系不均匀性模型,用于经常重新组合细菌病原体的基因组,我们有系统地测试各种不同发酵方法的性能,包括GCTA。除了合成数据基准外,我们还比较了从抗生素抗体抗力性和易变性研究方法的多变性模型分析方法。