Although recent multi-task learning methods have shown to be effective in improving the generalization of deep neural networks, they should be used with caution for safety-critical applications, such as clinical risk prediction. This is because even if they achieve improved task-average performance, they may still yield degraded performance on individual tasks, which may be critical (e.g., prediction of mortality risk). Existing asymmetric multi-task learning methods tackle this negative transfer problem by performing knowledge transfer from tasks with low loss to tasks with high loss. However, using loss as a measure of reliability is risky since it could be a result of overfitting. In the case of time-series prediction tasks, knowledge learned for one task (e.g., predicting the sepsis onset) at a specific timestep may be useful for learning another task (e.g., prediction of mortality) at a later timestep, but lack of loss at each timestep makes it difficult to measure the reliability at each timestep. To capture such dynamically changing asymmetric relationships between tasks in time-series data, we propose a novel temporal asymmetric multi-task learning model that performs knowledge transfer from certain tasks/timesteps to relevant uncertain tasks, based on feature-level uncertainty. We validate our model on multiple clinical risk prediction tasks against various deep learning models for time-series prediction, which our model significantly outperforms, without any sign of negative transfer. Further qualitative analysis of learned knowledge graphs by clinicians shows that they are helpful in analyzing the predictions of the model. Our final code is available at https://github.com/anhtuan5696/TPAMTL.
翻译:尽管最近的多任务学习方法表明,在改进深层神经网络的普及方面,最近多任务学习方法是有效的,但应当谨慎地用于安全关键应用,如临床风险预测。这是因为,即使它们实现改进了平均任务绩效,它们仍可能在个别任务中产生退化的绩效,而这也许至关重要(例如,死亡率风险预测)。现有的不对称多任务学习方法通过执行从低损失任务到高损失任务的知识转移来解决这一负面转移问题。然而,使用损失作为可靠度的衡量标准是危险的,因为这可能是过度适应的结果。在时间序列预测任务中,为一项任务(例如,预测 sepsis起点)所学的知识在特定时间步骤中对于学习另一项任务(例如,死亡率预测)可能仍然会降低业绩。 现有的不对称多任务学习方法在每一时间阶段都难以衡量可靠性。 要通过时间序列数据中这种动态变化的不对称任务关系,我们建议采用新的时间不对称多任务学习模型,在某个时间序列中进行从某些任务/时间周期预测中进行知识转移的标志性任务/时间模型,在某个阶段预测中,我们关于多任务/周期预测的模型将大量学习各种任务。