A vast amount of data is created every minute, both in the private sector and industry. Whereas it is often easy to get hold of data in the private entertainment sector, in the industrial production environment it is much more difficult due to laws, preservation of intellectual property, and other factors. However, most machine learning methods require a data source that is sufficient in terms of quantity and quality. A suitable way to bring both requirements together is federated learning where learning progress is aggregated, but everyone remains the owner of their data. Federate learning was first proposed by Google researchers in 2016 and is used for example in the improvement of Google's keyboard Gboard. In contrast to billions of android users, comparable machinery is only used by few companies. This paper examines which other constraints prevail in production and which federated learning approaches can be considered as a result.
翻译:私营部门和行业每分钟都会产生大量数据,私营部门和产业部门的数据往往很容易在私营娱乐部门获得数据,而工业生产环境则由于法律、知识产权保护以及其他因素而困难得多,然而,大多数机器学习方法需要足够数量和质量的数据源,将这两种要求结合起来的适当方式是将学习进展汇总起来的联邦学习,但每个人都是其数据的拥有者。 联邦学习是谷歌研究人员在2016年首次提出的,并被用于改进谷歌键盘Gboard。 与数十亿用户和机器人用户相比,只有很少的公司使用可比机械。 本文考察了生产中存在哪些其他限制因素,以及可以考虑采用何种联结式学习方法。