The true posterior distribution of a Bayesian neural network is massively multimodal. Whilst most of these modes are functionally equivalent, we demonstrate that there remains a level of real multimodality that manifests in even the simplest neural network setups. It is only by fully marginalising over all posterior modes, using appropriate Bayesian sampling tools, that we can capture the split personalities of the network. The ability of a network trained in this manner to reason between multiple candidate solutions dramatically improves the generalisability of the model, a feature we contend is not consistently captured by alternative approaches to the training of Bayesian neural networks. We provide a concise minimal example of this, which can provide lessons and a future path forward for correctly utilising the explainability and interpretability of Bayesian neural networks.
翻译:Bayesian神经网络的真正后天分布是大规模多式联运。 虽然这些模式大多在功能上等同,但我们证明,甚至最简单的神经网络设置中也仍然存在着某种程度的真实多式联运,只有通过利用适当的Bayesian取样工具,在所有后天模式上充分边缘化,我们才能捕捉网络的分裂性。以这种方式训练的网络在多个候选解决方案之间解释的功能,大大提高了模型的可普及性,我们所争论的一个特征并不是通过培训Bayesian神经网络的替代方法来一致体现的。我们提供了一个简明的简单例子,为正确利用Bayesian神经网络的可解释性和可解释性提供了经验教训和未来的道路。