We introduce a new privacy model relying on bistochastic matrices, that is, matrices whose components are nonnegative and sum to 1 both row-wise and column-wise. This class of matrices is used to both define privacy guarantees and a tool to apply protection on a data set. The bistochasticity assumption happens to connect several fields of the privacy literature, including the two most popular models, k-anonymity and differential privacy. Moreover, it establishes a bridge with information theory, which simplifies the thorny issue of evaluating the utility of a protected data set. Bistochastic privacy also clarifies the trade-off between protection and utility by using bits, which can be viewed as a natural currency to comprehend and operationalize this trade-off, in the same way than bits are used in information theory to capture uncertainty. A discussion on the suitable parameterization of bistochastic matrices to achieve the privacy guarantees of this new model is also provided.
翻译:我们引入了一种新的隐私模式,依靠二审制矩阵,即其组成部分非负数和总和等于一行和一列的矩阵。这一类矩阵既用于界定隐私保障,又用作对数据集实行保护的工具。二审制假设恰好连接了隐私文献的多个领域,包括两种最受欢迎的模型,即k-匿名和不同的隐私。此外,它与信息理论建立了桥梁,简化了评估受保护数据集效用的棘手问题。双审制隐私权还通过使用比特来澄清保护与实用之间的权衡,可被视为一种用于理解和操作这一权衡的自然货币,与信息理论中用来捕捉不确定性的比特一样。还就双审制矩阵的适当参数化问题进行了讨论,以实现这一新模型的隐私保障。