Recent advances in spatial omics methods enable the molecular composition of human tumors to be imaged at micron-scale resolution across hundreds of patients and ten to thousands of molecular imaging channels. Large-scale molecular imaging datasets offer a new opportunity to understand how the spatial organization of proteins and cell types within a tumor modulate the response of a patient to different therapeutic strategies and offer potential insights into the design of novel therapies to increase patient response. However, spatial omics datasets require computational analysis methods that can scale to incorporate hundreds to thousands of imaging channels (ie colors) while enabling the extraction of molecular patterns that correlate with treatment responses across large number of patients with potentially heterogeneous tumors presentations. Here, we have develop a machine learning strategy for the identification and design of signaling molecule combinations that predict the degree of immune system engagement with a specific patient tumors. We specifically train a classifier to predict T cell distribution in patient tumors using the images from 30-40 molecular imaging channels. Second, we apply a gradient descent based counterfactual reasoning strategy to the classifier and discover combinations of signaling molecules predicted to increase T cell infiltration. Applied to spatial proteomics data of melanoma tumor, our model predicts that increasing the level of CXCL9, CXCL10, CXCL12, CCL19 and decreasing the level of CCL8 in melanoma tumor will increase T cell infiltration by 10-fold across a cohort of 69 patients. The model predicts that the combination is many fold more effective than single target perturbations. Our work provides a paradigm for machine learning based prediction and design of cancer therapeutics based on classification of immune system activity in spatial omics data.
翻译:空间肿瘤方法的最近进步使得人类肿瘤分子构成能够以微幅分辨率在数百个病人和10至数千个分子成像信道的微幅分辨率上映成像。大型分子成像数据集为了解肿瘤内蛋白和细胞类型的空间组织如何在肿瘤内调节病人对不同治疗战略的反应提供了新的机会,并为新疗法的设计提供了潜在的洞察力,以增加病人的反应。然而,空间血管数据集需要计算分析方法,可以将成像频道(伊色)纳入成像频道的成像比例,同时能够提取分子模式,与大量可能出现变异肿瘤的病人的治疗反应相联系。在这里,我们开发了一个机器学习战略,用于识别和设计信号分子组合的空间组织,以预测免疫系统与特定病人肿瘤的接触程度。我们专门培训一个分类器,利用30-40个分子成像频道的图像来预测病人细胞肿瘤的T分布。 其次,我们用基于梯度的血基血基血基血基反位推理学战略来测量和发现T型分子的信号组合,预测增加CLX的心基内流数据。