Deep ensemble is a simple and straightforward approach for approximating Bayesian inference and has been successfully applied to many classification tasks. This study aims to comprehensively investigate this approach in the multi-output regression task to predict the aerodynamic performance of a missile configuration. By scrutinizing the effect of the number of neural networks used in the ensemble, an obvious trend toward underconfidence in estimated uncertainty is observed. In this context, we propose the deep ensemble framework that applies the post-hoc calibration method, and its improved uncertainty quantification performance is demonstrated. It is compared with Gaussian process regression, the most prevalent model for uncertainty quantification in engineering, and is proven to have superior performance in terms of regression accuracy, reliability of estimated uncertainty, and training efficiency. Finally, the impact of the suggested framework on the results of Bayesian optimization is examined, showing that whether or not the deep ensemble is calibrated can result in completely different exploration characteristics. This framework can be seamlessly applied and extended to any regression task, as no special assumptions have been made for the specific problem used in this study.
翻译:深度集成是用于近似贝叶斯推断的一种简单明了的方法,已成功应用于许多分类任务中。本研究旨在全面调查这种方法在多输出回归任务中的应用,以预测导弹构型的空气动力学性能。通过审查集成中所用神经网络数量的影响,发现估计不确定性的低置信度趋势很明显。在这种情况下,我们提出了一种基于后处理校准方法的深度集成框架,展示了其改进的不确定性量化性能。与高斯过程回归相比,后者是工程不确定性量化的最常见模型,并且已被证明在回归准确性、估计不确定性的可靠性和训练效率方面具有卓越的性能。最后,我们考察了所提出框架对贝叶斯优化结果的影响,结果表明,深度集成是否被校准将导致完全不同的探索特性。这种框架可无缝应用和扩展到任何回归任务中,因为我们没有针对本研究中使用的特定问题做出任何特殊假设。