Quasi-identifier-based deidentification techniques (QI-deidentification) are widely used in practice, including $k$-anonymity, $\ell$-diversity, and $t$-closeness. We present three new attacks on QI-deidentification: two theoretical attacks and one practical attack on a real dataset. In contrast to prior work, our theoretical attacks work even if every attribute is a quasi-identifier. Hence, they apply to $k$-anonymity, $\ell$-diversity, $t$-closeness, and most other QI-deidentification techniques. First, we introduce a new class of privacy attacks called downcoding attacks, and prove that every QI-deidentification scheme is vulnerable to downcoding attacks if it is minimal and hierarchical. Second, we convert the downcoding attacks into powerful predicate singling-out (PSO) attacks, which were recently proposed as a way to demonstrate that a privacy mechanism fails to legally anonymize under Europe's General Data Protection Regulation. Third, we use LinkedIn.com to reidentify 3 students in a $k$-anonymized dataset published by EdX (and show thousands are potentially vulnerable), undermining EdX's claimed compliance with the Family Educational Rights and Privacy Act. The significance of this work is both scientific and political. Our theoretical attacks demonstrate that QI-deidentification may offer no protection even if every attribute is treated as a quasi-identifier. Our practical attack demonstrates that even deidentification experts acting in accordance with strict privacy regulations fail to prevent real-world reidentification. Together, they rebut a foundational tenet of QI-deidentification and challenge the actual arguments made to justify the continued use of $k$-anonymity and other QI-deidentification techniques.
翻译:QI- 身份识别技术( QI- dedemination) 在实践上被广泛使用, 包括美元匿名、 美元多样性和美元亲密。 我们对 QI- 身份识别提出了三次新的攻击: 两次理论攻击和一次实际攻击真实数据集。 与先前的工作不同, 我们的理论攻击工作, 即使每个属性都是准识别器。 因此, 它们适用于 $- 匿名、 美元多样性、 美元- 披露, 以及大多数其他 QI- 身份识别技术。 首先, 我们引入了一个新的隐私攻击类别, 称为低码攻击, 并证明每一个 QI- 身份识别系统如果是最小的和等级的, 很容易被下码攻击。 其次, 我们把下码攻击转换成强大的前缀加点( PSOO) 攻击。 最近有人提议, 以证明隐私机制无法根据欧洲数据保护总条例在法律上对美元进行匿名。 第三, 我们使用链接- Q- 提供真实的直立性攻击, 我们的直立性解释3学生的行为基础 。