辐射图像的内在外形及其在深层学习中的作用 (The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning)

from arxiv, preprint version, accepted for MICCAI 2022 (25th International Conference on Medical Image Computing and Computer Assisted Intervention). 8 pages (+ author names + references + supplementary), 4 figures. Code available at https://github.com/mazurowski-lab/radiologyintrinsicmanifolds

The manifold hypothesis is a core mechanism behind the success of deep learning, so understanding the intrinsic manifold structure of image data is central to studying how neural networks learn from the data. Intrinsic dataset manifolds and their relationship to learning difficulty have recently begun to be studied for the common domain of natural images, but little such research has been attempted for radiological images. We address this here. First, we compare the intrinsic manifold dimensionality of radiological and natural images. We also investigate the relationship between intrinsic dimensionality and generalization ability over a wide range of datasets. Our analysis shows that natural image datasets generally have a higher number of intrinsic dimensions than radiological images. However, the relationship between generalization ability and intrinsic dimensionality is much stronger for medical images, which could be explained as radiological images having intrinsic features that are more difficult to learn. These results give a more principled underpinning for the intuition that radiological images can be more challenging to apply deep learning to than natural image datasets common to machine learning research. We believe rather than directly applying models developed for natural images to the radiological imaging domain, more care should be taken to developing architectures and algorithms that are more tailored to the specific characteristics of this domain. The research shown in our paper, demonstrating these characteristics and the differences from natural images, is an important first step in this direction.

翻译：多重假设是深层学习成功背后的核心机制, 因此理解图像数据的内在多重结构是研究神经网络如何从数据中学习的核心。自然数据集元件及其与学习困难的关系最近开始为自然图像的共同领域进行研究, 但很少尝试对辐射图像进行研究。我们在这里讨论这个问题。首先, 我们比较辐射图像和自然图像的内在多元性。我们还调查各种数据集的内在维度和一般化能力之间的关系。我们的分析显示, 自然图像数据集的内在维度通常高于辐射图像。然而, 医学图像的概括化能力和内在维度之间的关系要大得多, 这可能会被解释为具有更难于学习的内在特征的辐射图像。这些结果为以下直觉提供了更具有原则性的基础: 辐射图像的内在多元性比机器学习研究通常的自然图像数据集更具挑战性。我们相信, 而不是直接应用为自然图像开发的模型到辐射成像域, 更需要注意的是, 开发建筑架构和算法之间的关系要大得多, 并且要从这个领域展示一个更符合自然特性的物理特性。