The increasing popularity of attention mechanisms in deep learning algorithms for computer vision and natural language processing made these models attractive to other research domains. In healthcare, there is a strong need for tools that may improve the routines of the clinicians and the patients. Naturally, the use of attention-based algorithms for medical applications occurred smoothly. However, being healthcare a domain that depends on high-stake decisions, the scientific community must ponder if these high-performing algorithms fit the needs of medical applications. With this motto, this paper extensively reviews the use of attention mechanisms in machine learning (including Transformers) for several medical applications. This work distinguishes itself from its predecessors by proposing a critical analysis of the claims and potentialities of attention mechanisms presented in the literature through an experimental case study on medical image classification with three different use cases. These experiments focus on the integrating process of attention mechanisms into established deep learning architectures, the analysis of their predictive power, and a visual assessment of their saliency maps generated by post-hoc explanation methods. This paper concludes with a critical analysis of the claims and potentialities presented in the literature about attention mechanisms and proposes future research lines in medical applications that may benefit from these frameworks.
翻译:在保健方面,非常需要一些工具来改进临床医生和病人的日常工作。自然,对医疗应用采用基于关注的算法很顺利。然而,由于保健是一个取决于高决策的领域,科学界必须思考这些高性能算法是否符合医疗应用的需要。根据这个座右铭,本文件广泛审查了在机器学习(包括变换器)中使用关注机制的情况,以用于若干医疗应用。这项工作与其他前身相比有所区别,通过对三种不同使用案例的医学图像分类进行实验性案例研究,对文献中所提出的关注机制的主张和潜力进行批判性分析,并提出了医学应用的未来研究路线,这些实验侧重于将关注机制纳入既定的深层次学习结构,分析其预测能力,以及对其由事后解释方法产生的突出的地图进行直观评估。本文件最后对文献中关于关注机制的主张和潜力进行了批判性分析,并提出了可能受益于这些框架的医学应用的未来研究路线。