为提高具有大病人索赔数据库的跨机构风险模型的可移动性而进行蒸馏 (Distillation to Enhance the Portability of Risk Models Across Institutions with Large Patient Claims Database)

Artificial intelligence, and particularly machine learning (ML), is increasingly developed and deployed to support healthcare in a variety of settings. However, clinical decision support (CDS) technologies based on ML need to be portable if they are to be adopted on a broad scale. In this respect, models developed at one institution should be reusable at another. Yet there are numerous examples of portability failure, particularly due to naive application of ML models. Portability failure can lead to suboptimal care and medical errors, which ultimately could prevent the adoption of ML-based CDS in practice. One specific healthcare challenge that could benefit from enhanced portability is the prediction of 30-day readmission risk. Research to date has shown that deep learning models can be effective at modeling such risk. In this work, we investigate the practicality of model portability through a cross-site evaluation of readmission prediction models. To do so, we apply a recurrent neural network, augmented with self-attention and blended with expert features, to build readmission prediction models for two independent large scale claims datasets. We further present a novel transfer learning technique that adapts the well-known method of born-again network (BAN) training. Our experiments show that direct application of ML models trained at one institution and tested at another institution perform worse than models trained and tested at the same institution. We further show that the transfer learning approach based on the BAN produces models that are better than those trained on just a single institution's data. Notably, this improvement is consistent across both sites and occurs after a single retraining, which illustrates the potential for a cheap and general model transfer mechanism of readmission risk prediction.

翻译：人工智能,特别是机器学习(ML)日益得到发展和部署,用于支持各种环境中的医疗保健。然而,如果广泛采用基于ML的临床决策支持技术,则需要将其移植。在这方面,一个机构开发的模型应可在另一个机构重新使用。然而,有许多可移植性失败的例子,特别是由于天真的应用ML模型。可移植性失灵可能导致不优化的护理和医疗错误,最终可能阻止在实际中采用基于ML的可移植性CDS。从增强可移动性中受益的一个具体医疗保健挑战就是预测30天的可移动性风险。迄今为止的研究表明,深层次学习模型能够有效地模拟这种风险。在这项工作中,我们通过跨地点评估可移动性模型的可移植性来调查模型的实用性。为了做到这一点,我们采用了一个经常性的神经网络,通过自我保存和与专家特征相结合的模型,为两个独立的大型改进数据机制建立可读性预测模型。我们进一步介绍了一种新型的可转移技术,在经过培训的单一机构系统进行更精确的系统测试后,在经过我们经过培训的系统进行更精确的系统测试的网络上展示了另一种更精确的系统。