Due to morphological similarities, the differentiation of histologic sections of cutaneous tumors into individual subtypes can be challenging. Recently, deep learning-based approaches have proven their potential for supporting pathologists in this regard. However, many of these supervised algorithms require a large amount of annotated data for robust development. We present a publicly available dataset of 350 whole slide images of seven different canine cutaneous tumors complemented by 12,424 polygon annotations for 13 histologic classes, including seven cutaneous tumor subtypes. In inter-rater experiments, we show a high consistency of the provided labels, especially for tumor annotations. We further validate the dataset by training a deep neural network for the task of tissue segmentation and tumor subtype classification. We achieve a class-averaged Jaccard coefficient of 0.7047, and 0.9044 for tumor in particular. For classification, we achieve a slide-level accuracy of 0.9857. Since canine cutaneous tumors possess various histologic homologies to human tumors the added value of this dataset is not limited to veterinary pathology but extends to more general fields of application.
翻译:由于形态相似性,将切腹肿瘤的骨骼部分分为个别亚型可能具有挑战性。最近,深层次的学习方法证明了它们在这方面支持病理学家的潜力。然而,许多这些受监督的算法需要大量的附加说明数据来进行稳健的发展。我们为13个直系类,包括7个切腹肿瘤子类,提供了由12,424个多边形说明补充的七种不同犬类整张幻灯片数据集。在跨河实验中,我们显示了所提供的标签的高度一致性,特别是肿瘤说明。我们通过对组织分解和肿瘤亚型分类任务进行深层神经网络培训,进一步验证了数据集。我们取得了0.7047和0.9044的类平均计价系数,特别是肿瘤0.9044。关于分类,我们取得了0.9857的幻灯片精度。由于罐切度对人类肿瘤具有各种其基因特征,因此这一数据集的增加值不限于兽医病理学,而是扩大到更一般的应用领域。