In many applications of machine learning, like drug discovery and material design, the goal is to generate candidates that simultaneously maximize a set of objectives. As these objectives are often conflicting, there is no single candidate that simultaneously maximizes all objectives, but rather a set of Pareto-optimal candidates where one objective cannot be improved without worsening another. Moreover, in practice, these objectives are often under-specified, making the diversity of candidates a key consideration. The existing multi-objective optimization methods focus predominantly on covering the Pareto front, failing to capture diversity in the space of candidates. Motivated by the success of GFlowNets for generation of diverse candidates in a single objective setting, in this paper we consider Multi-Objective GFlowNets (MOGFNs). MOGFNs consist of a novel Conditional GFlowNet which models a family of single-objective sub-problems derived by decomposing the multi-objective optimization problem. Our work is the first to empirically demonstrate conditional GFlowNets. Through a series of experiments on synthetic and benchmark tasks, we empirically demonstrate that MOGFNs outperform existing methods in terms of Hypervolume, R2-distance and candidate diversity. We also demonstrate the effectiveness of MOGFNs over existing methods in active learning settings. Finally, we supplement our empirical results with a careful analysis of each component of MOGFNs.
翻译:在许多机器学习应用中,如药物发现和材料设计,目标是产生同时实现一套目标最大化的候选人,这些目标往往相互冲突,因此没有一个同时实现所有目标最大化的单一候选人,而是一组Pareto最佳候选人,其中一个目标不能在不使另一个目标恶化的情况下得到改进。此外,在实践中,这些目标往往没有得到充分确定,使候选人的多样性成为一个关键考虑因素。现有的多目标优化方法主要侧重于覆盖Pareto前线,未能抓住候选人空间的多样性。受在单一目标环境中为不同候选人创造的GFlowNet的成功激励,我们在本文件中考虑的是多目标GFlowNet(MOGFNs)。MOGFNs是一个新型的Conditional GFlowNet(GFNs)模型,该模型通过分解多目标优化问题来模拟单一目标子问题。我们的工作首先从经验角度展示了GFlowNets。通过一系列合成和基准任务实验,我们从实验的角度展示了MOGFN的每个版本,我们从目前的经验分析中展示了我们目前学习模式中的现有方法。