Exploration of bias has significant impact on the transparency and applicability of deep learning pipelines in medical settings, yet is so far woefully understudied. In this paper, we consider two separate groups for which training data is only available at differing image resolutions. For group H, available images and labels are at the preferred high resolution while for group L only deprecated lower resolution data exist. We analyse how this resolution-bias in the data distribution propagates to systematically biased predictions for group L at higher resolutions. Our results demonstrate that single-resolution training settings result in significant loss of volumetric group differences that translate to erroneous segmentations as measured by DSC and subsequent classification failures on the low resolution group. We further explore how training data across resolutions can be used to combat this systematic bias. Specifically, we investigate the effect of image resampling, scale augmentation and resolution independence and demonstrate that biases can effectively be reduced with multi-resolution approaches.
翻译:对偏见的探索对医疗环境中深层学习管道的透明度和适用性有重大影响,但迄今研究不足,令人痛心。本文认为,只有不同的图像分辨率才能提供培训数据的两个不同组别。H组的可用图像和标签为首选高分辨率,L组的可用图像和标签为首选高分辨率,L组只有折旧低分辨率数据存在。我们分析数据分布中的这一分辨率-偏差如何传播到对高分辨率L组的系统性偏差预测中。我们的结果显示,单分辨率培训环境导致大量体积差异,转化为DSC测量的错误分解,以及低分辨率组随后的分类失败。我们进一步探讨如何利用跨决议的培训数据来消除这种系统性的偏差。具体地说,我们调查图像重塑、比例扩大和分辨率独立性的影响,并表明通过多分辨率方法可以有效减少偏差。