In recent years, the use of machine learning techniques as surrogate models for computational fluid dynamics (CFD) simulations has emerged as a promising method for reducing the computational cost associated with engine design optimization. However, such methods still suffer from drawbacks. One main disadvantage of such methods is that the default machine learning hyperparameters are often severely suboptimal for a given problem. This has often been addressed by manually trying out different hyperparameter settings, but this solution is ineffective in a high-dimensional hyperparameter space. Besides this problem, the amount of data needed for training is also not known a priori. In response to these issues which need to be addressed, this work describes and validates an automated active learning approach for surrogate-based optimization of internal combustion engines, AutoML-GA. In this approach, a Bayesian optimization technique is used to find the best machine learning hyperparameters based on an initial dataset obtained from a small number of CFD simulations. Subsequently, a genetic algorithm is employed to locate the design optimum on the surrogate surface trained with the optimal hyperparameters. In the vicinity of the design optimum, the solution is refined by repeatedly running CFD simulations at the projected optimum and adding the newly obtained data to the training dataset. It is shown that this approach leads to a better optimum with a lower number of CFD simulations, compared to the use of default hyperparameters. The developed approach offers the advantage of being a more hands-off approach that can be easily applied by researchers and engineers in industry who do not have a machine learning background.
翻译:近年来,使用机器学习技术作为计算流体动态(CFD)模拟的代用模型,作为计算流体动态(CFD)模拟的代用模型,已成为减少与发动机设计优化有关的计算成本的一个很有希望的方法,然而,这些方法仍然有缺陷。这些方法的一个主要缺点是,默认机器学习超参数对于某个特定问题来说往往极不理想。这通常通过人工尝试不同的超参数设置来解决,但在高维超参数空间中,这一解决方案是无效的。除了这个问题之外,培训所需的数据数量也并非事先就已知的。为了应对需要解决的问题,这项工作描述并验证了一种自动主动学习方法,用于模拟优化内部燃烧引擎(AutomalML-GA)的代用。在这个方法中,采用一种巴耶斯最优化技术来找到最佳的机器学习超参数。随后,采用一种遗传算法,在经过最优超标准超标准超标准比准的表面上找到最佳设计。在最优的C类比度上,在最优化的模拟中,在最优化的模型中,通过不断改进的模拟方法来改进数据。