Deep neural networks are often applied to medical images to automate the problem of medical diagnosis. However, a more clinically relevant question that practitioners usually face is how to predict the future trajectory of a disease. Current methods for prognosis or disease trajectory forecasting often require domain knowledge and are complicated to apply. In this paper, we formulate the prognosis prediction problem as a one-to-many prediction problem. Inspired by a clinical decision-making process with two agents -- a radiologist and a general practitioner -- we predict prognosis with two transformer-based components that share information with each other. The first transformer in this framework aims to analyze the imaging data, and the second one leverages its internal states as inputs, also fusing them with auxiliary clinical data. The temporal nature of the problem is modeled within the transformer states, allowing us to treat the forecasting problem as a multi-task classification, for which we propose a novel loss. We show the effectiveness of our approach in predicting the development of structural knee osteoarthritis changes and forecasting Alzheimer's disease clinical status directly from raw multi-modal data. The proposed method outperforms multiple state-of-the-art baselines with respect to performance and calibration, both of which are needed for real-world applications. An open-source implementation of our method is made publicly available at \url{https://github.com/Oulu-IMEDS/CLIMATv2}.
翻译:深心神经网络往往用于医疗图像,使医学诊断问题自动化。然而,开业医生通常面临的一个更具有临床相关性的问题是如何预测疾病的未来轨迹。目前的预测或疾病轨迹预测方法往往需要领域知识,而且需要复杂应用。在本文中,我们把预测预测预测预测问题作为一个一至多种预测问题进行设计。受由两个代理 -- -- 放射学家和一般开业医生 -- -- 组成的临床决策过程的启发,我们用两个基于变压器的组件预测预后期,这两个组件彼此共享信息。这个框架中的第一个变压器旨在分析成像数据,第二个变压器将内部状态作为投入,同时用辅助临床数据加以利用。问题的时间性质在变压器状态内建模,让我们将预测问题作为多塔级分类处理,为此我们提议了新的损失。我们展示了我们预测结构膝盖/变压/脑炎变化和直接从原始多模调数据直接预测阿尔茨海默氏病临床状况的方法的有效性。拟议的方法是公开校验标准,这是我们用于公开校正的多种标准。