Evolutionary scenarios describing the evolution of a family of genes within a collection of species comprise the mapping of the vertices of a gene tree $T$ to vertices and edges of a species tree $S$. The relative timing of the last common ancestor of two extant genes (leaves of $T$) and the last common ancestor of the two species (leaves of $S$) in which they reside is indicative of horizontal gene transfers (HGT) and ancient duplications. Orthologous gene pairs, on the other hand, require that their last common ancestor coincides with a corresponding speciation event. The relative timing information of gene and species divergences is captured by three colored graphs that have the extant genes as vertices and the species in which the genes are found as vertex colors: the equal-divergence-time (EDT) graph, the later-divergence-time (LDT) graph, and the prior-divergence-time (PDT) graph, which together form an edge partition of the complete graph. Here we give a complete characterization in terms of informative and forbidden triples that can be read off the three graphs and provide a polynomial-time algorithm for constructing an evolutionary scenario that explains the graphs, provided such a scenario exists. While both LDT and PDT graphs are cographs, this is not true for the EDT graph in general. We show, however, that every EDT graph is perfect. While the information about LDT and PDT graphs is necessary to recognize EDT graphs in polynomial time for general scenarios, this extra information can be dropped in the HGT-free case. In contrast, recognition of EDT graphs without knowledge of putative LDT and PDT graphs is NP-complete for general scenarios. Finally, we connect the EDT graph to the alternative definitions of orthology that have been proposed for scenarios with horizontal gene transfer. With one exception, the corresponding graphs are shown to be colored cographs.
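
To make the three graphs concrete, the following minimal sketch classifies every pair of extant genes by comparing the age of their last common ancestor in $T$ with the age of the last common ancestor of their species in $S$. It is an illustration under stated assumptions, not the recognition or construction algorithm announced above: the trees are assumed to be given as child-to-parent dictionaries with exact vertex ages (time before present, leaves at age 0), and all names (`divergence_graphs`, `gene_parent`, `sigma`, the toy vertex labels) are hypothetical. Under this age convention, two genes residing in the same species always have a gene divergence older than the (age 0) species leaf, so such pairs land in the PDT graph.

```python
from itertools import combinations


def ancestors(parent, v):
    """Return the path from v up to the root, starting with v itself."""
    path = [v]
    while v in parent:
        v = parent[v]
        path.append(v)
    return path


def lca(parent, u, v):
    """Last common ancestor of u and v in a tree given as child -> parent."""
    anc_u = set(ancestors(parent, u))
    for w in ancestors(parent, v):
        if w in anc_u:
            return w
    raise ValueError("vertices are not in the same tree")


def divergence_graphs(gene_parent, gene_age, species_parent, species_age, sigma):
    """Split all gene pairs into EDT, LDT and PDT edge sets.

    Assumption: ages are exact and measured backwards from the present,
    so a smaller age means a later (more recent) divergence.
    """
    edt, ldt, pdt = set(), set(), set()
    for x, y in combinations(sorted(sigma), 2):
        t_gene = gene_age[lca(gene_parent, x, y)]
        t_species = species_age[lca(species_parent, sigma[x], sigma[y])]
        edge = frozenset((x, y))
        if t_gene == t_species:
            edt.add(edge)   # genes diverged together with their species
        elif t_gene < t_species:
            ldt.add(edge)   # genes diverged later than their species
        else:
            pdt.add(edge)   # genes diverged before their species
    return edt, ldt, pdt


# Toy species tree ((A,B)u,C)r with hypothetical ages.
species_parent = {"A": "u", "B": "u", "u": "r", "C": "r"}
species_age = {"A": 0, "B": 0, "C": 0, "u": 1, "r": 2}

# Toy gene tree: g0 is an old duplication, g1 and g2 coincide with the
# speciations r and u, and h models a recent transfer that places c2 in C.
gene_parent = {"a2": "g0", "g1": "g0", "c1": "g1", "g2": "g1",
               "a1": "g2", "h": "g2", "b1": "h", "c2": "h"}
gene_age = {"g0": 3, "g1": 2, "g2": 1, "h": 0.5}
sigma = {"a1": "A", "a2": "A", "b1": "B", "c1": "C", "c2": "C"}

edt, ldt, pdt = divergence_graphs(
    gene_parent, gene_age, species_parent, species_age, sigma
)
```

In this toy input the pairs `a1`-`c2` and `b1`-`c2` fall into the LDT graph, reflecting the transfer at `h`, while every pair involving the old duplicate `a2` falls into the PDT graph. The three edge sets are disjoint by construction and cover all pairs, mirroring the edge partition of the complete graph.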