Stemmatology is a subfield of philology where one approach to understand the copy-history of textual variants of a text (witnesses of a tradition) is to generate an evolutionary tree. Computational methods are partly shared between the sister discipline of phylogenetics and stemmatology. In 2022, a surveypaper in nature communications found that Deep Learning (DL), which otherwise has brought about major improvements in many fields (Krohn et al 2020) has had only minor successes in phylogenetics and that "it is difficult to conceive of an end-to-end DL model to directly estimate phylogenetic trees from raw data in the near future"(Sapoval et al. 2022, p.8). In stemmatology, there is to date no known DL approach at all. In this paper, we present a new DL approach to placement of manuscripts on a stemma and demonstrate its potential. This could be extended to phylogenetics where the universal code of DNA might be an even better prerequisite for the method using sequence to sequence based neural networks in order to retrieve tree distances.
翻译:2022年,自然交流中的一份调查文件发现,“深学”(DL)在许多领域(Krohn等人,2020年)取得了重大改进,但在植物学方面只取得了微小的成功,而且“很难设想一种从终点到终点的DL模型能够直接从近期的原始数据中估算出植物基因树木”(Sapoval等人,2022年,第8页)。 在干线学方面,迄今为止还没有已知的DL方法。在本论文中,我们提出了一个新的DL方法,将手稿放在干线上,并展示其潜力。这可以扩大到植物学,在植物学方面,通用的DNA编码可能是使用基于神经网络的序列来检索树迹距离的方法的更好的先决条件。