Machine learning (ML) has been pervasively researched nowadays and it has been applied in many aspects of real life. Nevertheless, issues of model and data still accompany the development of ML. For instance, training of traditional ML models is limited to the access of data sets, which are generally proprietary; published ML models may soon be out of date without an update of new data and continuous training; malicious data contributors may upload wrongly labeled data that leads to undesirable training results; and the abuse of private data and data leakage also exit. With the utilization of blockchain, an emerging and swiftly developing technology, these problems can be efficiently solved. In this paper, we survey the convergence of collaborative ML and blockchain. Different ways of the combination of these two technologies are investigated and their fields of application are examined. Discussion on the limitations of current research and their future directions are also included.
翻译:如今,对机器学习(ML)进行了广泛的研究,并应用于现实生活的许多方面,然而,模型和数据问题仍然伴随着ML的发展。例如,传统ML模型的培训限于数据集的存取,这些数据集一般是专有的;已公布的ML模型可能很快就过时,而没有更新新的数据和不断的培训;恶意数据提供者可能上传标签错误的数据,导致不良的培训结果;滥用私人数据和数据渗漏也退出。随着对正在形成和迅速发展的链条技术的利用,这些问题是可以有效解决的。在本文件中,我们调查了合作ML和块链的结合情况,调查了这两种技术的各种结合方式,并审查了其应用领域。还讨论了当前研究的局限性及其未来方向。