Objectives: To develop and validate a deep learning (DL)-based primary tumor biopsy signature for predicting axillary lymph node (ALN) metastasis preoperatively in early breast cancer (EBC) patients with clinically negative ALN. Methods: A total of 1,058 EBC patients with pathologically confirmed ALN status were enrolled from May 2010 to August 2020. A DL core-needle biopsy (DL-CNB) model was built on the attention-based multiple instance-learning (AMIL) framework to predict ALN status utilizing the DL features, which were extracted from the cancer areas of digitized whole-slide images (WSIs) of breast CNB specimens annotated by two pathologists. Accuracy, sensitivity, specificity, receiver operating characteristic (ROC) curves, and areas under the ROC curve (AUCs) were analyzed to evaluate our model. Results: The best-performing DL-CNB model with VGG16_BN as the feature extractor achieved an AUC of 0.816 (95% confidence interval (CI): 0.758, 0.865) in predicting positive ALN metastasis in the independent test cohort. Furthermore, our model incorporating the clinical data, which was called DL-CNB+C, yielded the best accuracy of 0.831 (95%CI: 0.775, 0.878), especially for patients younger than 50 years (AUC: 0.918, 95%CI: 0.825, 0.971). The interpretation of DL-CNB model showed that the top signatures most predictive of ALN metastasis were characterized by the nucleus features including density ($p$ = 0.015), circumference ($p$ = 0.009), circularity ($p$ = 0.010), and orientation ($p$ = 0.012). Conclusion: Our study provides a novel DL-based biomarker on primary tumor CNB slides to predict the metastatic status of ALN preoperatively for patients with EBC. The codes and dataset are available at https://github.com/bupt-ai-cz/BALNMP
翻译:目标:开发并验证一个基于深度学习(DL)的初级肿瘤生物测试(DL)模型,用于预测在早期乳腺癌患者临床阴性ALN(EBC)临床阴性ALN(ALN)中轴心淋巴淋巴结(ALN)发酵。方法:2010年5月至2020年8月,共注册了1,058名有病理确认ALN地位的EB(EBC)病人。 DL核心-针头生物测试(DL-CNB)模型基于关注的多个实例学习(AMIL)框架,以预测ALN(ALN)特征:DL(ALN)的DL(AL)值为0.00(AL) 淋巴(ALN)值(ALN),从数字化整流化的全滑动图像(ASI)的癌症领域提取的(ALL),由两位病理学家的标注显示:准确度、敏感度、敏感度、特质、特质、直径(OL)的ALS(AL)数据(O)为直径直径、直径直径比(O),由直径直径直径)。