目标 - 分散设计剂:将视觉模拟与独步观光头优化生成设计相结合 (Goal-Directed Design Agents: Integrating Visual Imitation with One-Step Lookahead Optimization for Generative Design)

Engineering design problems often involve large state and action spaces along with highly sparse rewards. Since an exhaustive search of those spaces is not feasible, humans utilize relevant domain knowledge to condense the search space. Previously, deep learning agents (DLAgents) were introduced to use visual imitation learning to model design domain knowledge. This note builds on DLAgents and integrates them with one-step lookahead search to develop goal-directed agents capable of enhancing learned strategies for sequentially generating designs. Goal-directed DLAgents can employ human strategies learned from data along with optimizing an objective function. The visual imitation network from DLAgents is composed of a convolutional encoder-decoder network, acting as a rough planning step that is agnostic to feedback. Meanwhile, the lookahead search identifies the fine-tuned design action guided by an objective. These design agents are trained on an unconstrained truss design problem that is modeled as a sequential, action-based configuration design problem. The agents are then evaluated on two versions of the problem: the original version used for training and an unseen constrained version with an obstructed construction space. The goal-directed agents outperform the human designers used to train the network as well as the previous objective-agnostic versions of the agent in both scenarios. This illustrates a design agent framework that can efficiently use feedback to not only enhance learned design strategies but also adapt to unseen design problems.

翻译：工程设计问题往往涉及巨大的状态和行动空间以及高度稀少的奖励。由于彻底搜索这些空间不可行,因此人类利用相关域知识来压缩搜索空间。以前,引入了深学习代理(DLAgents)来使用视觉仿真学习来模拟设计域知识。本说明以DLAgents为基础,并将它们与一步骤的外观搜索结合起来,以开发目标导向的代理物,从而能够加强按顺序制作设计的知识型战略。目标导向的DLAgents可以使用从数据中学习的人类战略,同时优化一个目标功能。DLAgents的视觉仿真网络由同级编码脱coder网络组成,作为粗略的规划步骤,对反馈具有不可知性。与此同时,外观搜索确定了一个目标指导下的精细调整设计动作。这些设计代理物受过培训,它只能以顺序、基于行动的方式配置设计成问题。然后对问题的两个版本进行了评估:用于培训的原始版本,DLAgents的视觉仿真模仿网络网络网络网络的视觉仿照网络,同时使用一种精细设计型设计图案。目的设计图则用于提高人类设计师的虚拟设计图案。前设计图案,可以用来改进结构结构图案,将人类设计图案的精准结构图案。前的精准结构图案,作为人类设计图案的精准结构图案。前的精准结构图案,作为人类设计图案的精准的精准设计图。前设计图。前的精准,用于。