Generative Flow Networks (or GFlowNets for short) are a family of probabilistic agents that learn to sample complex combinatorial structures through the lens of "inference as control". They have shown great potential in generating high-quality and diverse candidates from a given energy landscape. However, existing GFlowNets can be applied only to deterministic environments, and fail in more general tasks with stochastic dynamics, which can limit their applicability. To overcome this challenge, this paper introduces Stochastic GFlowNets, a new algorithm that extends GFlowNets to stochastic environments. By decomposing state transitions into two steps, Stochastic GFlowNets isolate environmental stochasticity and learn a dynamics model to capture it. Extensive experimental results demonstrate that Stochastic GFlowNets offer significant advantages over standard GFlowNets as well as MCMC- and RL-based approaches, on a variety of standard benchmarks with stochastic dynamics.
翻译:生成流网络(GFlowNets)是一种概率代理,通过“推理即控制”来学习从给定能量景观中采样复杂的组合结构。它们已经在生成高质量和多样性的候选项方面展现出巨大潜力。然而,现有的GFlowNets仅适用于确定性环境,并在具有随机动态的更一般任务中失败,这可能限制它们的适用性。为了克服这个挑战,本文介绍了随机GFlowNets,一种将GFlowNets扩展到随机环境的新算法。通过将状态转换分解为两个步骤,随机GFlowNets将环境随机性隔离并学习动态模型来捕获它。广泛的实验结果表明,随机GFlowNets在具有随机动态的各种标准基准测试中,相对于标准GFlowNets以及MCMC和RL的方法,都具有显著的优势。