This paper studies \emph{differential privacy (DP)} and \emph{local differential privacy (LDP)} in cascading bandits. Under DP, we propose an algorithm which guarantees $\epsilon$-indistinguishability and a regret of $\mathcal{O}((\frac{\log T}{\epsilon})^{1+\xi})$ for an arbitrarily small $\xi$. This is a significant improvement from the previous work of $\mathcal{O}(\frac{\log^3 T}{\epsilon})$ regret. Under ($\epsilon$,$\delta$)-LDP, we relax the $K^2$ dependence through the tradeoff between privacy budget $\epsilon$ and error probability $\delta$, and obtain a regret of $\mathcal{O}(\frac{K\log (1/\delta) \log T}{\epsilon^2})$, where $K$ is the size of the arm subset. This result holds for both Gaussian mechanism and Laplace mechanism by analyses on the composition. Our results extend to combinatorial semi-bandit. We show respective lower bounds for DP and LDP cascading bandits. Extensive experiments corroborate our theoretic findings.
翻译:本文研究 \ emph{ 不同隐私 (DP) 和\ emph{ 地方差异隐私 (LDP) 。 根据 DP, 我们提议一种算法, 保证 $\ epsilon $- indididibilable 和 $\ mathcal{ O} ((\\ fracxlog Tunpsilon} ) 和 \ emph{ 地方差异隐私 (LDP) 。 这比 $\ mathcal{ O} (\\ frac) 3 Tunepsilon} 的以往工作大有改进。 在 $\ epsilon $ ($\ delta$) - LDP 下, 我们通过 隐私预算 $\ epslon 和 错误概率 $\ delta} 来放松2美元的依赖, 并获得 $\ maxcalcal devolution, $K$K$ 和 Labro degistrateal rogradual ro res res resmagistrual res resmagistrisal res res resmagistrismabal 。 这个结果, 我们的构成和我们的底的图像分析结果, DP 和我们的底底底底底的 和底底底的仪, DP 和底的仪表, 我们的仪表, 我们的仪表, 我们的仪表,我们的仪表,我们的仪表和底的仪表,我们的仪表的仪表。