Privacy-aware multiagent systems must protect agents' sensitive data while simultaneously ensuring that agents accomplish their shared objectives. Towards this goal, we propose a framework to privatize inter-agent communications in cooperative multiagent decision-making problems. We study sequential decision-making problems formulated as cooperative Markov games with reach-avoid objectives. We apply a differential privacy mechanism to privatize agents' communicated symbolic state trajectories, and then we analyze tradeoffs between the strength of privacy and the team's performance. For a given level of privacy, this tradeoff is shown to depend critically upon the total correlation among agents' state-action processes. We synthesize policies that are robust to privacy by reducing the value of the total correlation. Numerical experiments demonstrate that the team's performance under these policies decreases by only 3 percent when comparing private versus non-private implementations of communication. By contrast, the team's performance decreases by roughly 86 percent when using baseline policies that ignore total correlation and only optimize team performance.
翻译:私隐性多试剂系统必须保护代理人的敏感数据,同时确保代理人实现其共同目标。为实现这一目标,我们提议了一个框架,在合作性多剂决策问题上将代理人之间的通信私有化。我们研究作为合作性马可夫游戏而形成的连续决策问题,目标是达不到的。我们应用一种差别性隐私机制将代理人的象征性国家轨迹私有化,然后我们分析隐私强度与团队业绩之间的权衡。对于某种程度的隐私,这种权衡显示关键取决于代理人国家行动进程之间的总相关性。我们通过减少总体相关性的价值,综合了对隐私具有活力的政策。数字实验表明,在比较私营和非私营的通信执行情况时,这些政策小组的业绩仅下降了3%。相比之下,小组在使用忽视完全相关性和只优化团队业绩的基准政策时,其业绩下降了大约86%。