Dr.Aid:支持数据治理规则的遵守情况,促进以自动化方式进行分散化协作 (Dr.Aid: Supporting Data-governance Rule Compliance for Decentralized Collaboration in an Automated Way)

Collaboration across institutional boundaries is widespread and increasing today. It depends on federations sharing data that often have governance rules or external regulations restricting their use. However, the handling of data governance rules (aka. data-use policies) remains manual, time-consuming and error-prone, limiting the rate at which collaborations can form and respond to challenges and opportunities, inhibiting citizen science and reducing data providers' trust in compliance. Using an automated system to facilitate compliance handling reduces substantially the time needed for such non-mission work, thereby accelerating collaboration and improving productivity. We present a framework, Dr.Aid, that helps individuals, organisations and federations comply with data rules, using automation to track which rules are applicable as data is passed between processes and as derived data is generated. It encodes data-governance rules using a formal language and performs reasoning on multi-input-multi-output data-flow graphs in decentralised contexts. We test its power and utility by working with users performing cyclone tracking and earthquake modelling to support mitigation and emergency response. We query standard provenance traces to detach Dr.Aid from details of the tools and systems they are using, as these inevitably vary across members of a federation and through time. We evaluate the model in three aspects by encoding real-life data-use policies from diverse fields, showing its capability for real-world usage and its advantages compared with traditional frameworks. We argue that this approach will lead to more agile, more productive and more trustworthy collaborations and show that the approach can be adopted incrementally. This, in-turn, will allow more appropriate data policies to emerge opening up new forms of collaboration.

翻译：今天,跨机构边界的合作十分广泛,而且日益扩大,这取决于联合会分享往往有治理规则或限制使用这些规则的外部规章的数据。然而,数据治理规则(数据使用政策)的处理仍然是手工、耗时和容易出错的,限制了合作能够形成和应对挑战和机遇的速度,阻碍了公民科学,降低了数据提供者对合规的信任。使用自动化系统便利合规处理,从而大大缩短了非特派团工作所需的时间,从而加快了合作并提高了生产率。我们提出了一个框架,即Dr.Aid博士,帮助个人、组织和联合会遵守数据规则,利用自动化来跟踪在流程之间和生成数据时适用哪些规则的情况。它用正式语言编码数据治理规则,在分散的背景下对多投入数据流图进行推理。我们通过与用户进行旋风跟踪和地震建模以支持缓解和应急反应,测试其实力和效用。我们从他们使用的工具和系统的细节向分解调出一个框架,通过自动化来跟踪哪些规则在流程和衍生数据过程中适用哪些规则,因为这些规则在流程中不可避免地会使用一种正式语言,从而显示其真实使用框架的优势。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

专知会员服务

51+阅读 · 2020年2月24日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日