We consider a project (model) owner that would like to train a model by utilizing the local private data and compute power of interested data owners, i.e., trainers. Our goal is to design a data marketplace for such decentralized collaborative/federated learning applications that simultaneously provides i) proof-of-contribution based reward allocation so that the trainers are compensated based on their contributions to the trained model; ii) privacy-preserving decentralized model training by avoiding any data movement from data owners; iii) robustness against malicious parties (e.g., trainers aiming to poison the model); iv) verifiability in the sense that the integrity, i.e., correctness, of all computations in the data market protocol including contribution assessment and outlier detection are verifiable through zero-knowledge proofs; and v) efficient and universal design. We propose a blockchain-based marketplace design to achieve all five objectives mentioned above. In our design, we utilize a distributed storage infrastructure and an aggregator aside from the project owner and the trainers. The aggregator is a processing node that performs certain computations, including assessing trainer contributions, removing outliers, and updating hyper-parameters. We execute the proposed data market through a blockchain smart contract. The deployed smart contract ensures that the project owner cannot evade payment, and honest trainers are rewarded based on their contributions at the end of training. Finally, we implement the building blocks of the proposed data market and demonstrate their applicability in practical scenarios through extensive experiments.
翻译:我们考虑一个项目(模范)所有人,希望通过利用当地私人数据来培训模型,并计算感兴趣的数据所有人(即培训员)的能力。我们的目标是为这种分散化的协作/联合学习应用程序设计一个数据市场,同时提供:(一) 以捐款证明为基础的奖赏分配,以便根据培训员对培训模式的贡献补偿;(二) 通过避免数据拥有者的任何数据移动,保护隐私,保留分散模式培训;(三) 防止恶意方(例如,旨在毒害模型的培训员)的稳健性;(四) 从某种意义上说,可以核实数据市场协议中所有计算的完整性,即准确性,包括缴款评估和外部检测,通过零知识证明进行核查;(五) 高效和通用的设计。我们建议采用基于街区的市场设计,以实现上述所有五个目标。在我们的设计中,我们使用分布式的储存基础设施,以及项目所有者和培训员以外的隔离器。聚合是一个处理节点,进行某些计算,包括评估培训者的贡献,去除所有者的外额,通过智能合同更新项目部署后,我们无法通过智能合同执行。</s>