For many medical applications, interpretable models with a high prediction performance are sought. Often, those models are required to handle semi-structured data like tabular and image data. We show how to apply deep transformation models (DTMs) for distributional regression which fulfill these requirements. DTMs allow the data analyst to specify (deep) neural networks for different input modalities making them applicable to various research questions. DTMs can provide interpretable effect estimates like statistical models while they can achieve the state-of-the-art prediction performance of deep neural networks. In addition, the construction of ensembles of DTMs that retain model structure and interpretability allows quantifying epistemic and aleatoric uncertainty. In this study, we compare several DTMs, including baseline-adjusted models, trained on a semi-structured data set of 407 stroke patients with the aim to predict ordinal functional outcome three months after stroke. We follow statistical principles of model-building to achieve an adequate trade-off between interpretability and flexibility while we assess the relative importance of the involved data modalities. We evaluate the models for an ordinal and dichotomized version of the outcome as used in clinical practice. We show that both, tabular clinical and brain imaging data, are useful for functional outcome prediction, while models based on tabular data only outperform those based on imaging data only. There is no substantial evidence for improved prediction when combining both data modalities. Overall, we highlight that DTMs provide a powerful, interpretable approach to analyzing semi-structured data and that they have the potential to support clinical decision making.
翻译:对于许多医疗应用,都寻求具有高预测性能的可解释性模型。通常,这些模型需要处理表格和图像数据等半结构化数据。我们展示了如何应用深度转换模型(DTMs)来进行符合这些要求的分布式回归。DTM系统使数据分析员能够为不同输入模式指定(深度)神经网络,使其适用于各种研究问题。DTM系统可以提供像统计模型那样的可解释性估计,同时它们能够达到深神经网络的最新预测性能。此外,这些模型的构建可以保留模型结构和可解释性,从而可以量化缩放式结构的缩放性数据。在这个研究中,我们比较了几个DTM系统,包括基线调整模型,培训了407个中风病人的半结构化数据集,目的是在中风网络运行三个月后预测或正常功能性结果。我们遵循模型建设的统计原则,以便实现解释性和灵活性之间的适当取舍。我们评估了相关数据模式的相对重要性。我们评估了一个或多结构化的模型,可以量化缩算和透性不确定性。我们比较了一些DTM系统化的模型,同时将那些功能性数据作为基础数据作为基础分析结果的基础数据作为基础,我们使用。我们使用的基础,在临床分析结果中的数据格式上没有使用。