In many medical applications, interpretable models with high prediction performance are sought. Often, those models are required to handle semi-structured data like tabular and image data. We show how to apply deep transformation models (DTMs) for distributional regression which fulfill these requirements. DTMs allow the data analyst to specify (deep) neural networks for different input modalities making them applicable to various research questions. Like statistical models, DTMs can provide interpretable effect estimates while achieving the state-of-the-art prediction performance of deep neural networks. In addition, the construction of ensembles of DTMs that retain model structure and interpretability allows quantifying epistemic and aleatoric uncertainty. In this study, we compare several DTMs, including baseline-adjusted models, trained on a semi-structured data set of 407 stroke patients with the aim to predict ordinal functional outcome three months after stroke. We follow statistical principles of model-building to achieve an adequate trade-off between interpretability and flexibility while assessing the relative importance of the involved data modalities. We evaluate the models for an ordinal and dichotomized version of the outcome as used in clinical practice. We show that both, tabular clinical and brain imaging data, are useful for functional outcome prediction, while models based on tabular data only outperform those based on imaging data only. There is no substantial evidence for improved prediction when combining both data modalities. Overall, we highlight that DTMs provide a powerful, interpretable approach to analyzing semi-structured data and that they have the potential to support clinical decision making.
翻译:在许多医学应用中,寻求具有高预测性能的可解释性模型,这些模型往往需要处理表格和图像数据等半结构化数据。我们展示了如何应用深度转换模型(DTMs)来进行符合这些要求的分布式回归。DTM系统使数据分析员能够为不同输入模式指定(深)神经网络,使其适用于各种研究问题。与统计模型一样,DTM系统可以提供可解释性影响估计,同时实现深神经网络的最新预测性能。此外,还需要建立保留模型结构和可解释性等半结构化数据。我们展示了如何应用深度转换模型(DDMs)来量化累积性和感知性不确定性。在这个研究中,我们比较了几个DTM系统,包括基线调整模型,培训了407个中风病人的半结构数据集,目的是预测各种研究问题。我们遵循了模型建设的统计原则,以实现解释性和灵活性之间的适当平衡,同时评估了所涉数据模式的相对重要性。我们评估了用于一次或一次或两次进行分解分析的不确定性不确定性。我们比较了几个DTM,包括基线调整模型,培训了40调整后的模型数据是用于临床分析结果的模型,而我们则根据临床分析数据提供了一个有用的分析结果的模型数据。