Bayesian persuasion studies how an informed sender should influence beliefs of rational receivers who take decisions through Bayesian updating of a common prior. We focus on the online Bayesian persuasion framework, in which the sender repeatedly faces one or more receivers with unknown and adversarially selected types. First, we show how to obtain a tight $\tilde O(T^{1/2})$ regret bound in the case in which the sender faces a single receiver and has partial feedback, improving over the best previously known bound of $\tilde O(T^{4/5})$. Then, we provide the first no-regret guarantees for the multi-receiver setting under partial feedback. Finally, we show how to design no-regret algorithms with polynomial per-iteration running time by exploiting type reporting, thereby circumventing known intractability results on online Bayesian persuasion. We provide efficient algorithms guaranteeing a $O(T^{1/2})$ regret upper bound both in the single- and multi-receiver scenario when type reporting is allowed.
翻译:Bayesian 说服力研究知情发送者如何影响通过Bayesian更新一个共同的先前版本作出决定的理性接收者的信仰。 我们侧重于在线Bayesian说服框架,让发送者反复面对一个或一个以上未知和敌对选择型的接收者。 首先,我们展示如何在发送者面临一个单一接收者且有部分反馈的情况下获得紧凑的 $\ tilde O(T<unk> 1/2}) 美元(t<unk> 4/5} 美元) 的遗憾。 然后,我们在部分反馈下为多接收者设置提供了第一个无记录保证。 最后,我们展示了如何通过利用类型报告设计无记录算法,从而绕过在线Bayesian 说服中已知的不可吸引结果。 我们提供了高效的算法,保证在允许类型报告时,在单一和多接收者假设中,美元(T<unk> 1/2} ($) 都为上限。</s>