We present PORT, a software platform for local data extraction and analysis of digital trace data. While digital trace data collected by private and public parties hold a huge potential for social-scientific discovery, their most useful parts have been unattainable for academic researchers due to privacy concerns and prohibitive API access. However, the EU General Data Protection Regulation (GDPR) grants all citizens the right to an electronic copy of their personal data. All major data controllers, such as social media platforms, banks, online shops, loyalty card systems and public transportation cards comply with this right by providing their clients with a `Data Download Package' (DDP). Previously, a conceptual workflow was introduced allowing citizens to donate their data to scientific- researchers. In this workflow, citizens' DDPs are processed locally on their machines before they are asked to provide informed consent to share a subset of the processed data with the researchers. In this paper, we present the newly developed software PORT that implements the local processing part of this workflow, protecting privacy by shielding sensitive data from any contact with outside observers -- including the researchers themselves. Thus, PORT enables a host of potential applications of social data science to hitherto unobtainable data.
翻译:我们提出了本地数据提取和分析数字跟踪数据的软件平台 -- -- PORT -- -- 当地数据提取和分析数字跟踪数据的软件平台 -- -- 私营和公营各方收集的数字跟踪数据具有巨大的社会科学发现潜力,但由于隐私问题和难以进入API,因此学术研究人员无法获得其最有用的部分。然而,欧盟一般数据保护条例(GDPR)赋予所有公民个人数据电子副本的权利。所有主要数据控制器,如社交媒体平台、银行、在线商店、忠诚卡系统和公共交通卡,都遵守这一权利,向其客户提供“数据下载包”(DDP)。以前,引入了概念工作流程,允许公民将其数据捐赠给科研人员。在这一工作流程中,公民的DDP公司在当地处理,然后才要求他们提供知情同意,与研究人员分享经过处理的一组数据。本文介绍了新开发的软件TRP,通过保护隐私,防止与外部观察员 -- 包括研究人员本身 -- 接触任何敏感数据,从而保护隐私。因此,PORP使得社会数据的潜在应用在迄今无法保存。