The theoretical analysis of performance has been an important tool in the engineering of algorithms in many application domains. Its goals are to predict the empirical performance of an algorithm and to be a yardstick that drives the design of novel algorithms that perform well in practice. While these goals have been achieved in many instances, they have not been achieved in some other crucial application domains. In this paper, I focus on the example of sequencing bioinformatics, an inter-disciplinary field that uses algorithms to extract biological meaning from genome sequencing data. I will demonstrate two concrete examples of how theoretical analysis has failed to achieve its goals but also give one encouraging example of success. I will then catalog some of the challenges of applying theoretical analysis to sequencing bioinformatics, argue why empirical analysis is not enough, and give a vision for improving the relevance of theoretical analysis to sequencing bioinformatics and other application domains. By recognizing the problem, understanding its roots, and providing potential solutions, this work can hopefully be a crucial first step towards making theoretical analysis more relevant in modern application domains.
翻译:业绩的理论分析是在许多应用领域的算法工程中的一个重要工具。它的目标是预测算法的经验性表现,并成为推动设计实际运作良好的新算法的标准。虽然这些目标在许多情况下已经实现,但在其他一些关键应用领域却未能实现。在本文中,我重点介绍了生物信息学排序的例子,这是一个跨学科领域,利用算法从基因组测序数据中提取生物意义。我将展示两个具体的例子,说明理论分析如何未能实现其目标,但也举了一个令人鼓舞的成功的例子。然后,我将列举应用理论分析来测序生物信息学的一些挑战,说明为什么经验性分析还不够,并为改进理论分析对生物信息学和其他应用领域排序的相关性提出了愿景。通过认识问题,了解其根源,提供潜在的解决办法,这项工作有望成为使理论分析在现代应用领域更具有相关性的关键的第一步。