As Machine Learning (ML) models are becoming increasingly complex, one of the central challenges is their deployment at scale, such that companies and organizations can create value through Artificial Intelligence (AI). An emerging paradigm in ML is a federated approach where the learning model is delivered to a group of heterogeneous agents partially, allowing agents to train the model locally with their own data. However, the problem of valuation of models, as well the questions of incentives for collaborative training and trading of data/models, have received limited treatment in the literature. In this paper, a new ecosystem of ML model trading over a trusted Blockchain-based network is proposed. The buyer can acquire the model of interest from the ML market, and interested sellers spend local computations on their data to enhance that model's quality. In doing so, the proportional relation between the local data and the quality of trained models is considered, and the valuations of seller's data in training the models are estimated through the distributed Data Shapley Value (DSV). At the same time, the trustworthiness of the entire trading process is provided by the distributed Ledger Technology (DLT). Extensive experimental evaluation of the proposed approach shows a competitive run-time performance, with a 15\% drop in the cost of execution, and fairness in terms of incentives for the participants.
翻译:随着机器学习(ML)模式日益复杂,中心挑战之一是其规模的部署,使公司和组织能够通过人工智能创造价值。在ML中,正在形成一个联合模式,将学习模式部分交付给一组不同代理人,使代理商能够用自己的数据在当地对模型进行培训;然而,模型的估价问题以及鼓励合作培训和数据/模型交易的奖励问题在文献中得到的处理有限。在本文中,提议在信任的链式黑锁网络上建立新的ML模式交易生态系统。买方可以从ML市场获得利益模型,感兴趣的卖方则用当地计算数据来提高模型质量。在这样做时,考虑了当地数据与经过培训的模式质量之间的比例关系,通过分发的数据沙皮价值(DSV)对卖方培训模型的数据估值进行了估计。与此同时,分发的Ledger技术(DLT)为整个交易过程提供了信任度。买方可以从MLL市场获得利益模型中获取利益模型,而感兴趣的卖方则用当地计算其数据来提高模型质量。为此,考虑了当地数据与经过培训的模型质量之间的相称性关系。在拟议的业绩评估中进行了广泛的试验性评估。