Federated Learning (FL) is a setting where multiple parties with distributed data collaborate in training a joint Machine Learning (ML) model while keeping all data local at the parties. Federated clustering is an area of research within FL that is concerned with grouping together data that is globally similar while keeping all data local. We describe how this area of research can be of interest in itself, or how it helps addressing issues like non-independently-identically-distributed (i.i.d.) data in supervised FL frameworks. The focus of this work, however, is an extension of the federated fuzzy $c$-means algorithm to the FL setting (FFCM) as a contribution towards federated clustering. We propose two methods to calculate global cluster centers and evaluate their behaviour through challenging numerical experiments. We observe that one of the methods is able to identify good global clusters even in challenging scenarios, but also acknowledge that many challenges remain open.
翻译:联邦学习联盟(FL)是一个背景,拥有分布数据的多方在培训联合机器学习(ML)模型方面进行合作,同时在缔约方保持所有本地数据; 联邦集群是FL内部的一个研究领域,涉及将全球相似的数据组合起来,同时保持所有本地数据; 我们描述了这一研究领域本身如何能引起兴趣,或如何有助于解决诸如非独立分散的(一.d.)在受监督的FL框架中的数据等问题; 然而,这项工作的重点是将FL设置的FZY $CC$-平均算法扩展为FL设置(FFCM),作为对联合集群的贡献。 我们提出了计算全球集群中心并通过具有挑战性的数字实验评价其行为的两个方法。 我们观察到,其中一种方法即使在具有挑战性的情况下也能确定良好的全球集群,但也承认许多挑战依然存在。