最后批次正常化层的不合理效力 (The Unreasonable Effectiveness of the Final Batch Normalization Layer)

from arxiv, Accepted for the 16th International Symposium on Visual Computing (ISVC 2021). arXiv admin note: substantial text overlap with arXiv:2011.06319

Early-stage disease indications are rarely recorded in real-world domains, such as Agriculture and Healthcare, and yet, their accurate identification is critical in that point of time. In this type of highly imbalanced classification problems, which encompass complex features, deep learning (DL) is much needed because of its strong detection capabilities. At the same time, DL is observed in practice to favor majority over minority classes and consequently suffer from inaccurate detection of the targeted early-stage indications. In this work, we extend the study done by Kocaman et al., 2020, showing that the final BN layer, when placed before the softmax output layer, has a considerable impact in highly imbalanced image classification problems as well as undermines the role of the softmax outputs as an uncertainty measure. This current study addresses additional hypotheses and reports on the following findings: (i) the performance gain after adding the final BN layer in highly imbalanced settings could still be achieved after removing this additional BN layer in inference; (ii) there is a certain threshold for the imbalance ratio upon which the progress gained by the final BN layer reaches its peak; (iii) the batch size also plays a role and affects the outcome of the final BN application; (iv) the impact of the BN application is also reproducible on other datasets and when utilizing much simpler neural architectures; (v) the reported BN effect occurs only per a single majority class and multiple minority classes i.e., no improvements are evident when there are two majority classes; and finally, (vi) utilizing this BN layer with sigmoid activation has almost no impact when dealing with a strongly imbalanced image classification tasks.

翻译：在现实世界领域,如农业和保健领域,很少记录早期疾病迹象,然而,准确识别这些疾病迹象在那个时间点是关键。在这种高度不平衡的分类问题中,包括复杂的特征,由于探测能力强,极需要深层次学习(DL),因为其检测能力强。与此同时,在实践中观察到DL偏重少数阶层,因此在目标早期迹象的检测中受到不准确的检测。在这项工作中,我们扩展了Kocaman等人(Kocaman等人(Kocaman等人)(2020年)的研究,表明最终的BN层(放在软max输出层之前)对高度不平衡的图像分类问题有相当大的影响,并破坏了软分子产出作为不确定性衡量标准的作用。当前研究还涉及额外的假设和报告以下结论:(一) 在高度不平衡的环境中添加最终的BN层(N) 增加的B层后,业绩增益仍可能实现。 (二) 在最终的B层(N层) 进步到达顶峰前,最终的B级(N级) 降级(B级) 最终的图像作用也会影响B级和B级结果。