机器翻译的结果如下：标题：应对患病率变化的图像分析算法部署摘要：领域差距是医学图像分析基于机器学习的解决方案临床应用的最重要的障碍之一，虽然当前的研究集中在新的训练范例和网络体系结构上，但对于部署在实践中的算法的患病率变化的特定影响却鲜有关注。这种算法在用于开发/验证数据和部署环境之间的类别频率差异可能很大，例如在人工智能（AI）民主化的背景下，疾病患病率可能在时间和地点上变化很大。我们的贡献是双重的。首先，我们通过实证研究论证错误处理患病率的潜在重大后果：（i）失真的程度，（ii）启动结果和最佳结果的偏离程度，以及（iii）验证度量用作神经网络性能反映部署人口的能力与发展和部署患病率差异的函数。其次，我们提出了一种适应不同环境的患病率感知的图像分类工作流程，该工作流程使用估计的部署患病率来调整训练的分类器，而不需要附加注释的部署数据。基于30个医学分类任务的广泛实验展示了所提出的工作流程比当前实践能够获得更好的分类器决策和更可靠的性能估计的优点。 (Deployment of Image Analysis Algorithms under Prevalence Shifts)

翻译：机器翻译的结果如下：标题：应对患病率变化的图像分析算法部署摘要：领域差距是医学图像分析基于机器学习的解决方案临床应用的最重要的障碍之一，虽然当前的研究集中在新的训练范例和网络体系结构上，但对于部署在实践中的算法的患病率变化的特定影响却鲜有关注。这种算法在用于开发/验证数据和部署环境之间的类别频率差异可能很大，例如在人工智能（AI）民主化的背景下，疾病患病率可能在时间和地点上变化很大。我们的贡献是双重的。首先，我们通过实证研究论证错误处理患病率的潜在重大后果：（i）失真的程度，（ii）启动结果和最佳结果的偏离程度，以及（iii）验证度量用作神经网络性能反映部署人口的能力与发展和部署患病率差异的函数。其次，我们提出了一种适应不同环境的患病率感知的图像分类工作流程，该工作流程使用估计的部署患病率来调整训练的分类器，而不需要附加注释的部署数据。基于30个医学分类任务的广泛实验展示了所提出的工作流程比当前实践能够获得更好的分类器决策和更可靠的性能估计的优点。

Patrick Godau,Piotr Kalinowski,Evangelia Christodoulou,Annika Reinke,Minu Tizabi,Luciana Ferrer,Paul Jäger,Lena Maier-Hein

Domain gaps are among the most relevant roadblocks in the clinical translation of machine learning (ML)-based solutions for medical image analysis. While current research focuses on new training paradigms and network architectures, little attention is given to the specific effect of prevalence shifts on an algorithm deployed in practice. Such discrepancies between class frequencies in the data used for a method's development/validation and that in its deployment environment(s) are of great importance, for example in the context of artificial intelligence (AI) democratization, as disease prevalences may vary widely across time and location. Our contribution is twofold. First, we empirically demonstrate the potentially severe consequences of missing prevalence handling by analyzing (i) the extent of miscalibration, (ii) the deviation of the decision threshold from the optimum, and (iii) the ability of validation metrics to reflect neural network performance on the deployment population as a function of the discrepancy between development and deployment prevalence. Second, we propose a workflow for prevalence-aware image classification that uses estimated deployment prevalences to adjust a trained classifier to a new environment, without requiring additional annotated deployment data. Comprehensive experiments based on a diverse set of 30 medical classification tasks showcase the benefit of the proposed workflow in generating better classifier decisions and more reliable performance estimates compared to current practice.

翻译：使用监督学习的机器翻译仍然会有一定的不准确性和错误，因此仅供参考。