Process synthesis experiences a disruptive transformation accelerated by digitization and artificial intelligence. We propose a reinforcement learning algorithm for chemical process design based on a state-of-the-art actor-critic logic. Our proposed algorithm represents chemical processes as graphs and uses graph convolutional neural networks to learn from process graphs. In particular, the graph neural networks are implemented within the agent architecture to process the states and make decisions. Moreover, we implement a hierarchical and hybrid decision-making process to generate flowsheets, where unit operations are placed iteratively as discrete decisions and corresponding design variables are selected as continuous decisions. We demonstrate the potential of our method to design economically viable flowsheets in an illustrative case study comprising equilibrium reactions, azeotropic separation, and recycles. The results show quick learning in discrete, continuous, and hybrid action spaces. Due to the flexible architecture of the proposed reinforcement learning agent, the method is predestined to include large action-state spaces and an interface to process simulators in future research.
翻译:通过数字化和人工智能加速了过程合成过程的破坏性转变。我们提议了化学过程设计强化学习算法,该算法以最新的行为者-批评逻辑为基础。我们提议的算法将化学过程作为图表,并使用图形进化神经网络从过程图中学习。特别是,图形神经网络是在代理结构内实施的,用于处理各州和作出决定。此外,我们实施一个等级和混合决策程序,以生成流程表,将单元操作置于迭接状态,因为独立决定和相应的设计变量被选择为连续决定。我们展示了我们设计经济可行的流程表的方法在包括均衡反应、天体分离和循环循环的示例案例研究中的潜力。结果显示在离散、连续和混合行动空间中快速学习。由于拟议增强学习剂的灵活结构,我们预想的方法是将大型行动状态空间和处理模拟器的接口纳入未来的研究中。