Deep learning models are nowadays broadly deployed to solve an incredibly large variety of tasks. However, little attention has been devoted to the connected legal aspects. In 2016, the European Union approved the General Data Protection Regulation (GDPR), which entered into force in 2018. Its main rationale is to protect the privacy and the personal data of EU citizens within the operation of the so-called "Data Economy". As data is the fuel of modern Artificial Intelligence, it can be argued that the GDPR is partly applicable to a series of algorithmic decision-making tasks until a more structured AI Regulation enters into force. In the meantime, AI should not allow undesired information leakage deviating from the purpose for which it was created. In this work we propose DisP, an approach for deep learning models that disentangles the information related to classes we wish to keep private from the data processed by the AI. In particular, DisP is a regularization strategy that de-correlates, at training time, the features belonging to the same private class, thereby hiding information about private-class membership. Our experiments on state-of-the-art deep learning models show the effectiveness of DisP, minimizing the risk of extraction of the classes we wish to keep private.
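The abstract does not spell out the exact form of the DisP loss; the following is a minimal sketch of one plausible de-correlation regularizer consistent with the description above, assuming a PyTorch setup. The function name `disp_penalty`, the weighting factor `lambda_disp`, and the choice of cosine similarity as the de-correlation measure are illustrative assumptions, not the paper's actual formulation.

```python
import torch
import torch.nn.functional as F

def disp_penalty(features: torch.Tensor, private_labels: torch.Tensor) -> torch.Tensor:
    """Illustrative de-correlation penalty in the spirit of DisP (not the paper's loss).

    Penalizes pairwise cosine similarity between feature vectors of samples
    that share the same private class, so that private-class membership is
    harder to recover from feature similarity.

    features: (batch, dim) activations from an intermediate layer.
    private_labels: (batch,) ids of the private classes.
    """
    z = F.normalize(features, dim=1)                 # unit-norm feature vectors
    sim = z @ z.t()                                  # pairwise cosine similarity
    same = private_labels.unsqueeze(0) == private_labels.unsqueeze(1)
    same.fill_diagonal_(False)                       # ignore self-similarity
    if same.any():
        return sim[same].pow(2).mean()               # push within-class similarity toward 0
    return features.new_zeros(())                    # no same-class pair in this batch

# Hypothetical usage: added to the task loss at training time.
# total_loss = task_loss + lambda_disp * disp_penalty(feats, private_y)
```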