[18F]-Fluorodeoxyglucose (FDG) positron emission tomography - computed tomography (PET-CT) has become the imaging modality of choice for diagnosing many cancers. Co-learning complementary PET-CT imaging features is a fundamental requirement for automatic tumor segmentation and for developing computer aided cancer diagnosis systems. We propose a hyper-connected transformer (HCT) network that integrates a transformer network (TN) with a hyper connected fusion for multi-modality PET-CT images. The TN was leveraged for its ability to provide global dependencies in image feature learning, which was achieved by using image patch embeddings with a self-attention mechanism to capture image-wide contextual information. We extended the single-modality definition of TN with multiple TN based branches to separately extract image features. We introduced a hyper connected fusion to fuse the contextual and complementary image features across multiple transformers in an iterative manner. Our results with two non-small cell lung cancer and soft-tissue sarcoma datasets show that HCT achieved better performance in segmentation accuracy when compared to state-of-the-art methods. We also show that HCT produces consistent performance across various image fusion strategies and network backbones.
翻译:[18F]-Fluodooxyglucose (FDG)- 红外线发射断层成像——计算成的断层成像(PET-CT)已成为诊断许多癌症的首选成像模式。共同学习辅助的 PET-CT成像功能是自动肿瘤分解和开发计算机辅助癌症诊断系统的基本要求。我们建议建立一个超链接变压器网络,将变压器网络(TN)与超链接的多式 PET-CT 图像融合在一起。TN因其在图像特征学习方面提供全球依赖性的能力而得到利用,而这种能力是通过使用带有自我感知机制的图像补丁嵌入来捕捉全图像背景信息而实现的。我们将多TN的多端TN的单一模式定义扩展为单独提取图像特征。我们引入了一个超链接的网络,以迭接方式将多个变压器的背景和互补图像特性结合在一起。我们通过两个非小型细胞肺癌和软质质的图像数据集得出的结果,通过使用自读机制的图像补补补的功能也显示,我们实现了各种分段的准确性表现。