Online learning with expert advice is widely used in various machine learning tasks. It considers the problem where a learner chooses one from a set of experts to take advice and make a decision. In many learning problems, experts may be related, henceforth the learner can observe the losses associated with a subset of experts that are related to the chosen one. In this context, the relationship among experts can be captured by a feedback graph, which can be used to assist the learner's decision making. However, in practice, the nominal feedback graph often entails uncertainties, which renders it impossible to reveal the actual relationship among experts. To cope with this challenge, the present work studies various cases of potential uncertainties, and develops novel online learning algorithms to deal with uncertainties while making use of the uncertain feedback graph. The proposed algorithms are proved to enjoy sublinear regret under mild conditions. Experiments on real datasets are presented to demonstrate the effectiveness of the novel algorithms.
翻译:专家咨询在线学习被广泛用于各种机器学习任务。它考虑到学习者从一组专家中选择一个专家来提供咨询和作出决定的问题。在许多学习问题中,专家可能具有关联性,从此以后,学习者可以观察与所选专家有关的一组专家有关的损失。在这方面,专家之间的关系可以通过反馈图来捕捉,该图可用于协助学习者的决策。然而,在实际中,名义反馈图往往带有不确定性,因此无法揭示专家之间的实际关系。为了应对这一挑战,目前的工作研究各种潜在的不确定性案例,并开发新的在线学习算法,以便在使用不确定的反馈图的同时处理不确定性。拟议的算法证明在温和的条件下享有亚线性遗憾。对真实数据集的实验是为了证明新算法的有效性。