Machine learning (ML) has been pervasively researched nowadays and it has been applied in many aspects of real life. Nevertheless, issues of model and data still accompany the development of ML. For instance, training of traditional ML models is limited to the access of data sets, which are generally proprietary; published ML models may soon be out of date without update of new data and continuous training; malicious data contributors may upload wrongly labeled data that leads to undesirable training results; and the abuse of private data and data leakage also exit. With the utilization of blockchain, an emerging and swiftly developing technology, these problems can be efficiently solved. In this paper, we conduct a survey of the convergence of collaborative ML and blockchain. We investigate different ways of combination of these two technologies, and their fields of application. We also discuss the limitations of current research and their future directions.
翻译:如今,对机器学习(ML)进行了广泛的研究,并应用于现实生活的许多方面,然而,模型和数据问题仍然伴随着ML的发展。例如,传统ML模型的培训仅限于数据集的存取,这些数据集一般是专有的;已公布的ML模型可能很快就过时,而没有更新新的数据和持续培训;恶意数据提供者可能上传标签错误的数据,导致不良的培训结果;滥用私人数据和数据渗漏也随之退出。利用块链这一新兴和迅速发展的技术,这些问题是可以有效解决的。我们在本文中调查了合作ML和块链的趋同。我们调查了这两种技术的不同组合方式及其应用领域。我们还讨论了当前研究的局限性及其未来方向。