Sex difference in allele frequency is an emerging topic that is critical to our understanding of ascertainment bias, as well as data quality particularly of the largely overlooked X chromosome. To detect sex difference in allele frequency for both X chromosomal and autosomal variants, existing methods are conservative when applied to samples from multiple ancestral populations, such as African and European populations. Additionally, it remains unexplored whether the sex difference in allele frequency differs between populations, which is important to trans-ancestral genetic studies. We thus developed a novel retrospective regression-based testing framework to provide interpretable and easy-to-implement solutions to answer these questions. We then applied the proposed methods to the high-coverage whole genome sequence data of the 1000 Genomes Project, robustly analyzing all samples available from the five super-populations. We had 76 novel findings by recognizing and modeling ancestral differences.
翻译:不同频度的性别差异是一个新出现的主题,对于我们了解确定偏差以及数据质量至关重要,特别是在很大程度上被忽视的X染色体的数据质量至关重要。为了发现X染色体变异物和自相色谱变异物的异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异于异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异于异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异异