Recently, there has been a growing interest in applying machine learning methods to problems in engineering mechanics. In particular, there has been significant interest in applying deep learning techniques to predicting the mechanical behavior of heterogeneous materials and structures. Researchers have shown that deep learning methods are able to effectively predict mechanical behavior with low error for systems ranging from engineered composites, to geometrically complex metamaterials, to heterogeneous biological tissue. However, there has been comparatively little attention paid to deep learning model calibration, i.e., the match between predicted probabilities of outcomes and the true probabilities of outcomes. In this work, we perform a comprehensive investigation into ML model calibration across seven open access engineering mechanics datasets that cover three distinct types of mechanical problems. Specifically, we evaluate both model and model calibration error for multiple machine learning methods, and investigate the influence of ensemble averaging and post hoc model calibration via temperature scaling. Overall, we find that ensemble averaging of deep neural networks is both an effective and consistent tool for improving model calibration, while temperature scaling has comparatively limited benefits. Looking forward, we anticipate that this investigation will lay the foundation for future work in developing mechanics specific approaches to deep learning model calibration.
翻译:最近,人们越来越有兴趣对工程机械学问题采用机械学习方法,特别是应用深层次的学习技术来预测各种材料和结构的机械行为,研究人员已经表明深层次的学习方法能够有效地预测机械行为,从工程合成物到几何复杂的元材料,到混合生物组织,从设计合成物到几何复杂的元材料等系统,低误差的系统,但相对较少注意深层次的学习模型校准,即结果的预测概率和结果的真实概率之间的匹配。在这项工作中,我们对7个开放存取工程机械学数据集的ML模型校准进行了全面调查,涵盖三种不同的机械问题。具体地说,我们评估了多机学习方法的模型和模型校准错误,并调查了通过温度缩放来计算共性平均和后临时模型校准的影响。总体而言,我们发现深层神经网络的均匀度是改进模型校准的有效和一致的工具,而温度校准则有相对有限的效益。展望,我们预计这一调查将为今后的具体工作提供校准基础。</s>