We propose a novel way of representing and analysing single-cell genomic count data, by modelling the observed data count matrix as a network adjacency matrix. This perspective enables theory from stochastic networks modelling to be applied in a principled way to this type of data, providing new ways to view and analyse these data, and giving first-principles theoretical justification to established, successful methods. We show the success of this approach in the context of three cell-biological contexts, from the epiblast/epithelial/neural lineage. New technology has made it possible to gather genomic data from single cells at unprecedented scale, and this brings with it new challenges to deal with much higher levels of heterogeneity than expected between individual cells. Novel, tailored, computational-statistical methodology is needed to make the most of these new types of data, involving collaboration between mathematical and biomedical scientists.
翻译:我们建议一种代表和分析单细胞基因组计数数据的新方式,将观察到的数据计数矩阵建模成一个网络相邻矩阵,以此为模式,从随机网络建模的理论以有原则的方式应用于这类数据,为观察和分析这些数据提供新的方法,为既定的成功方法提供第一原则的理论依据。我们从上流/下水/内河线的三个细胞生物背景中展示了这一方法的成功。新技术使得有可能以前所未有的规模从单细胞中收集基因组数据,这带来了新的挑战,要处理比个体细胞之间预期高得多的异质性。需要新颖、定制、计算-统计方法,才能使这些新类型的数据得到充分利用,其中涉及数学和生物医学科学家之间的合作。</s>