In this work we use variational inference to quantify the degree of epistemic uncertainty in model predictions of radio galaxy classification and show that the level of model posterior variance for individual test samples is correlated with human uncertainty when labelling radio galaxies. We explore the model performance and uncertainty calibration for a variety of different weight priors and suggest that a sparse prior produces more well-calibrated uncertainty estimates. Using the posterior distributions for individual weights, we show that signal-to-noise ratio (SNR) ranking allows pruning of the fully-connected layers to the level of 30\% without significant loss of performance, and that this pruning increases the predictive uncertainty in the model. Finally we show that, like other work in this field, we experience a cold posterior effect. We examine whether adapting the cost function in our model to accommodate model misspecification can compensate for this effect, but find that it does not make a significant difference. We also examine the effect of principled data augmentation and find that it improves upon the baseline but does not compensate for the observed effect fully. We interpret this as the cold posterior effect being due to the overly effective curation of our training sample leading to likelihood misspecification, and raise this as a potential issue for Bayesian deep learning approaches to radio galaxy classification in future.
翻译:在这项工作中,我们使用变式推论来量化无线电星系分类模型预测中的缩写不确定性程度,并表明,在给无线电星系贴上标签时,单个测试样品的模型外表差异程度与人类的不确定性相关。我们探索模型性能和各种不同重量前科的不确定性校准模型和不确定性校准方法,并表明,稀疏的先前的模型性能和不确定性的校准方法可得出更精确的不确定性估计值。我们使用单个重量的事后分配方法,显示信号对噪音比(SNR)的排名可以使完全连接的层缩小至30 ⁇ 的水平,而不会严重丧失性能,而且这种偏差增加了模型中的预测不确定性。我们最后表明,与该领域的其他工作一样,我们经历着一种寒冷的外表效应。我们审视了我们模型中的成本功能是否能够适应这种偏差,但发现它并没有产生显著的差别。我们还研究了有原则的数据增强的效果,发现它在基线上有所改进,但不能充分弥补所观察到的效果。我们将此解释为,冷海后后海层影响是深层学的概率,从而有可能进行深层学。