The standard diagnostic procedures for targeted therapies in lung cancer treatment involve histological subtyping and subsequent detection of key driver mutations, such as EGFR. Even though molecular profiling can uncover the driver mutation, the process is often expensive and time-consuming. Deep learning-oriented image analysis offers a more economical alternative for discovering driver mutations directly from whole slide images (WSIs). In this work, we used customized deep learning pipelines with weak supervision to identify the morphological correlates of EGFR mutation from hematoxylin and eosin-stained WSIs, in addition to detecting tumor and histologically subtyping it. We demonstrate the effectiveness of our pipeline by conducting rigorous experiments and ablation studies on two lung cancer datasets - TCGA and a private dataset from India. With our pipeline, we achieved an average area under the curve (AUC) of 0.964 for tumor detection, and 0.942 for histological subtyping between adenocarcinoma and squamous cell carcinoma on the TCGA dataset. For EGFR detection, we achieved an average AUC of 0.864 on the TCGA dataset and 0.783 on the dataset from India. Our key learning points include the following. Firstly, there is no particular advantage of using a feature extractor layers trained on histology, if one is going to fine-tune the feature extractor on the target dataset. Secondly, selecting patches with high cellularity, presumably capturing tumor regions, is not always helpful, as the sign of a disease class may be present in the tumor-adjacent stroma.
翻译:肺癌治疗中定向治疗的标准诊断程序涉及对关键驱动变异进行生理亚质分析并随后检测主要驱动变异,如EGFR。尽管分子特征分析可以发现驱动变异,但这一过程往往费用昂贵且耗时。深入学习的图像分析为直接从整个幻灯片图像中发现驱动变异提供了更经济的替代办法。在这项工作中,我们使用定制的深度学习管道,但监督薄弱,以辨别血氧素和皮肤染色体等关键驱动变异的形态学关联,以及检测肿瘤和遗传变异。我们通过对两套肺癌数据集 -- -- TCGA和印度的私人数据集进行严格的实验和减缩研究,展示了我们输血管道的有效性。我们通过输油管道实现了0.964的曲线下平均区域肿瘤检测区域,以及0.942的皮肤变异细胞细胞变异异体在TC数据集中,我们通过精细的精益性检测,我们从0.883的直径直径直径直径直到直径直的直径直径细胞变色细胞变色细胞变色细胞变色细胞变色细胞变。在TRSyGEFR的检测中取得了一个普通数据,如果从0.83的直径测图图中,那么的直径直径测,那么的直径测,那么的直地测,那么的直的直的直径直径直径直径直地测,那么的直径直径直径直径直的直到直的直到直的直到直径直径直径直径,那么的直图,我们的直径直径直径直径直径直径直径直径直径直径直径直到直到直的直的直的直的直的直的直到直到直的直的直的直的直的直的直的直的直到直到直到直到直,我们的测距,我们的直到直的直到直到直到直到直路的直径直路的直径直路的直系系,直系的直系的直系,直系的直系的直系的直系的直系的直系的直系列列列列列列列列列列列列列列列列列列列列列列列列列列列的