We study the problem of learning the objective functions or constraints of a multiobjective decision making model, based on a set of sequentially arrived decisions. In particular, these decisions might not be exact and possibly carry measurement noise or are generated with the bounded rationality of decision makers. In this paper, we propose a general online learning framework to deal with this learning problem using inverse multiobjective optimization. More precisely, we develop two online learning algorithms with implicit update rules which can handle noisy data. Numerical results show that both algorithms can learn the parameters with great accuracy and are robust to noise.
翻译:我们研究学习多目标决策模式的客观功能或制约因素的问题,这些模式以一系列按顺序作出的决定为基础。特别是,这些决定可能并不准确,而且可能带有测量噪音,或是由决策者的狭隘理性产生。在本文中,我们提议一个通用的在线学习框架,用反向多目标优化处理这一学习问题。更准确地说,我们开发了两种在线学习算法,其中含有可以处理吵闹数据的隐含更新规则。数字结果显示,两种算法都能非常精确地学习参数,并且对噪音具有很强的说服力。