基于大语言模型的求解器无关自动化问题建模方法用于高成本仿真驱动设计 (Solver-Independent Automated Problem Formulation via LLMs for High-Cost Simulation-Driven Design)

In the high-cost simulation-driven design domain, translating ambiguous design requirements into a mathematical optimization formulation is a bottleneck for optimizing product performance. This process is time-consuming and heavily reliant on expert knowledge. While large language models (LLMs) offer potential for automating this task, existing approaches either suffer from poor formalization that fails to accurately align with the design intent or rely on solver feedback for data filtering, which is unavailable due to the high simulation costs. To address this challenge, we propose APF, a framework for solver-independent, automated problem formulation via LLMs designed to automatically convert engineers' natural language requirements into executable optimization models. The core of this framework is an innovative pipeline for automatically generating high-quality data, which overcomes the difficulty of constructing suitable fine-tuning datasets in the absence of high-cost solver feedback with the help of data generation and test instance annotation. The generated high-quality dataset is used to perform supervised fine-tuning on LLMs, significantly enhancing their ability to generate accurate and executable optimization problem formulations. Experimental results on antenna design demonstrate that APF significantly outperforms the existing methods in both the accuracy of requirement formalization and the quality of resulting radiation efficiency curves in meeting the design goals.

翻译：在高成本仿真驱动设计领域，将模糊的设计需求转化为数学优化模型是优化产品性能的关键瓶颈。这一过程耗时且高度依赖专家知识。尽管大语言模型（LLMs）为自动化此任务提供了潜力，但现有方法要么存在形式化质量差、无法准确对齐设计意图的问题，要么依赖求解器反馈进行数据筛选，而这在高仿真成本下无法实现。为应对这一挑战，我们提出了APF，一种基于大语言模型的、求解器无关的自动化问题建模框架，旨在自动将工程师的自然语言需求转换为可执行的优化模型。该框架的核心是一个创新的高质量数据自动生成流程，它借助数据生成与测试实例标注，克服了在缺乏高成本求解器反馈情况下构建合适微调数据集的困难。生成的高质量数据集用于对大语言模型进行监督微调，显著提升了其生成准确且可执行的优化问题模型的能力。在天线设计上的实验结果表明，APF在需求形式化的准确性以及所得辐射效率曲线满足设计目标的质量方面，均显著优于现有方法。