Current approaches to explaining the decisions of deep learning systems for medical tasks have focused on visualising the elements that have contributed to each decision. We argue that such approaches are not enough to "open the black box" of medical decision making systems because they are missing a key component that has been used as a standard communication tool between doctors for centuries: language. We propose a model-agnostic interpretability method that involves training a simple recurrent neural network model to produce descriptive sentences to clarify the decision of deep learning classifiers. We test our method on the task of detecting hip fractures from frontal pelvic x-rays. This process requires minimal additional labelling despite producing text containing elements that the original deep learning classification model was not specifically trained to detect. The experimental results show that: 1) the sentences produced by our method consistently contain the desired information, 2) the generated sentences are preferred by doctors compared to current tools that create saliency maps, and 3) the combination of visualisations and generated text is better than either alone.
翻译:目前用于解释医疗任务深层学习系统决定的方法侧重于直观了解促成每项决定的因素。我们认为,这些方法不足以“打开医疗决策系统黑盒”的“打开黑匣子”,因为它们缺少一个关键组成部分,而这个组成部分在几个世纪以来一直被用作医生之间的标准通信工具:语言。我们建议了一种模型――不可知的解释方法,它涉及培训一个简单的经常性神经网络模型,以生成描述性句子来澄清深层学习分类师的决定。我们测试了我们从前骨盆X光中检测臀部骨裂的任务的方法。这个过程需要最低限度的额外标签,尽管其内容含有原始深层学习分类模式没有经过专门培训来检测的要素。实验结果显示:(1) 我们的方法所产生的句子始终包含想要的信息,(2) 医生们倾向于使用生成的句子,而目前的工具则创建突出的图子,(3) 视觉和生成的文本的组合比单有的要好。