In this paper, we study a distributed privacy-preserving learning problem in social networks with general topology. The agents can communicate with each other over the network, which may result in privacy disclosure, since the trustworthiness of the agents cannot be guaranteed. Given a set of options which yield unknown stochastic rewards, each agent is required to learn the best one, aiming at maximizing the resulting expected average cumulative reward. To serve the above goal, we propose a four-staged distributed algorithm which efficiently exploits the collaboration among the agents while preserving the local privacy for each of them. In particular, our algorithm proceeds iteratively, and in every round, each agent i) randomly perturbs its adoption for the privacy-preserving purpose, ii) disseminates the perturbed adoption over the social network in a nearly uniform manner through random walking, iii) selects an option by referring to the perturbed suggestions received from its peers, and iv) decides whether or not to adopt the selected option as preference according to its latest reward feedback. Through solid theoretical analysis, we quantify the trade-off among the number of agents (or communication overhead), privacy preserving and learning utility. We also perform extensive simulations to verify the efficacy of our proposed social learning algorithm.
翻译:在本文中,我们研究了社交网络中一般地形学的分散式隐私保护学习问题。代理商可以在网络上相互沟通,这可能导致隐私披露,因为代理商的可信度得不到保障。鉴于一系列选择,产生未知的随机性奖赏,每个代理商都必须学习最好的选择,以最大限度地实现预期平均累积奖赏。为了实现上述目标,我们提出一个四级分布式算法,有效地利用代理商之间的合作,同时保护每个代理商的当地隐私。特别是,我们的算法反复进行,在每轮交易中,每个代理商(i)随机地干扰其为隐私保护目的的采用。 (ii)通过随机行走,以几乎统一的方式在社交网络上传播受干扰的收养。 (iii) 选择一种选择,参照其同龄人提出的被包围的建议,以及(iv) 决定是否根据最近的报酬反馈,将选定的选择作为优先选择。我们通过可靠的理论分析,对代理商(或通信管理商)的数量进行量化,对隐私的保密性模拟和学习社会效用进行广泛的学习。