Recent advances in whole slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence (AI) based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilize information embedded in pathology WSIs beyond what we obtain through visual assessment. For automated analysis of WSIs and validation of machine learning (ML) models, annotations at the slide, tissue and cellular levels are required. The annotation of important visual constructs in pathology images is an important component of CPath projects. Improper annotations can result in algorithms which are hard to interpret and can potentially produce inaccurate and inconsistent results. Despite the crucial role of annotations in CPath projects, there are no well-defined guidelines or best practices on how annotations should be carried out. In this paper, we address this shortcoming by presenting the experience and best practices acquired during the execution of a large-scale annotation exercise involving a multidisciplinary team of pathologists, ML experts and researchers as part of the Pathology image data Lake for Analytics, Knowledge and Education (PathLAKE) consortium. We present a real-world case study along with examples of different types of annotations, diagnostic algorithm, annotation data dictionary and annotation constructs. The analyses reported in this work highlight best practice recommendations that can be used as annotation guidelines over the lifecycle of a CPath project.
翻译:整个幻灯片成像(WSI)技术的最近进步导致大量基于计算机视觉和人工智能的诊断、预测和预测算法的开发。计算病理学(CPath)提供了一种综合解决办法,利用病理学WSI系统所包含的信息,超出我们通过视觉评估所获得的信息。为了自动分析和验证机器学习模型(ML)模型,需要提供幻灯片、组织和细胞层次的说明。在病理图象中的重要视觉构造的说明是CPath项目的一个重要组成部分。不正确说明可产生难以解释的算法,并可能产生不准确和不一致的结果。尽管CPath项目的说明具有关键作用,但在如何进行说明方面没有明确的指南或最佳做法。在本文件中,我们通过介绍在进行大规模说明活动过程中获得的经验和最佳做法,由病理学家、ML专家和研究人员作为病理学图解数据湖的一部分,作为分析分析分析、知识与教育领域最佳分析方法的一部分。我们用这一方法,可以建立一份真实周期分析模型,并用数据分析系统模型,作为目前使用的方法。