The discovery of patient-specific imaging markers that are predictive of future disease outcomes can help us better understand individual-level heterogeneity of disease evolution. In fact, deep learning models that can provide data-driven personalized markers are much more likely to be adopted in medical practice. In this work, we demonstrate that data-driven biomarker discovery can be achieved through a counterfactual synthesis process. We show how a deep conditional generative model can be used to perturb local imaging features in baseline images that are pertinent to subject-specific future disease evolution and result in a counterfactual image that is expected to have a different future outcome. Candidate biomarkers, therefore, result from examining the set of features that are perturbed in this process. Through several experiments on a large-scale, multi-scanner, multi-center multiple sclerosis (MS) clinical trial magnetic resonance imaging (MRI) dataset of relapsing-remitting (RRMS) patients, we demonstrate that our model produces counterfactuals with changes in imaging features that reflect established clinical markers predictive of future MRI lesional activity at the population level. Additional qualitative results illustrate that our model has the potential to discover novel and subject-specific predictive markers of future activity.
翻译:在这项工作中,我们证明数据驱动生物标志的发现可以通过反事实综合过程实现。我们展示了如何使用一个深度的有条件的基因化模型来干扰与特定对象未来疾病演变有关的基线图像中的当地图像特征,并导致一个反现实的图像,预计这种图像将产生不同的未来结果。因此,候选生物标志是审查在这一过程中渗透的一组特征的结果。通过对大规模、多扫描器、多中心多重感应器(MS)的临床磁共振成像试验(MRI)数据集,我们展示了我们的模型产生了反事实,其成像特征的变化反映了预测未来MRI红外线特征的既定临床标志,并展示了今后人口水平的定性模型活动潜力。