Recurrent Neural Networks (RNNs) are often used for sequential modeling of adverse outcomes in electronic health records (EHRs) due to their ability to encode past clinical states. These deep, recurrent architectures have displayed increased performance compared to other modeling approaches in a number of tasks, fueling the interest in deploying deep models in clinical settings. One of the key elements in ensuring safe model deployment and building user trust is model explainability. Testing with Concept Activation Vectors (TCAV) has recently been introduced as a way of providing human-understandable explanations by comparing high-level concepts to the network's gradients. While the technique has shown promising results in real-world imaging applications, it has not been applied to structured temporal inputs. To enable an application of TCAV to sequential predictions in the EHR, we propose an extension of the method to time series data. We evaluate the proposed approach on an open EHR benchmark from the intensive care unit, as well as synthetic data where we are able to better isolate individual effects.
翻译:经常性神经网络(NNNs)经常用于对电子健康记录(EHRs)中的不利结果进行顺序建模,因为它们能够对过去的临床状态进行编码。这些深度的、经常性的建筑在许多任务中比其他建模方法表现出更大的性能,从而激发了在临床环境中部署深层模型的兴趣。确保安全模型部署和建立用户信任的关键要素之一是模型解释性。最近采用了“概念活性矢量”测试,通过将高级概念与网络的梯度进行比较,为人类提供可理解的解释。虽然该技术在现实世界成像应用中显示了有希望的结果,但并未应用于结构化的时间投入。为了能够将TCAV应用于在EHR中的连续预测,我们提议将这种方法扩大到时间序列数据。我们评估了从密集护理单位获得的开放式 EHR基准的拟议方法,以及我们能够更好地分离个体效应的合成数据。