While machine learning techniques are being applied to various fields for their exceptional ability to find complex relations in large datasets, the strengthening of regulations on data ownership and privacy is causing increasing difficulty in its application to medical data. In light of this, Federated Learning has recently been proposed as a solution to train on private data without breach of confidentiality. This conservation of privacy is particularly appealing in the field of healthcare, where patient data is highly confidential. However, many studies have shown that its assumption of Independent and Identically Distributed data is unrealistic for medical data. In this paper, we propose Personalized Federated Cluster Models, a hierarchical clustering-based FL process, to predict Major Depressive Disorder severity from Heart Rate Variability. By allowing clients to receive more personalized model, we address problems caused by non-IID data, showing an accuracy increase in severity prediction. This increase in performance may be sufficient to use Personalized Federated Cluster Models in many existing Federated Learning scenarios.
翻译:虽然由于在大型数据集中发现复杂关系的超常能力,机械学习技术正应用于各个领域,因为其能在大型数据集中找到复杂的关系,但数据所有权和隐私条例的加强在医疗数据应用方面造成越来越多的困难。有鉴于此,最近有人提议将联邦学习作为在不违反保密规定的情况下培训私人数据的解决办法。这种保护隐私在保健领域特别具有吸引力,因为病人数据高度保密。然而,许多研究表明,它假定独立和同种分布的数据对医疗数据是不现实的。在本文件中,我们提议采用个性化联邦分组模型,一个基于分级集群的FL程序,从心脏速变异性中预测重大压抑性障碍严重性。我们允许客户获得更个性化模型,从而解决非IID数据造成的问题,显示强度预测的准确性提高。这种性提高可能足以在许多现有的联邦学习假设中使用个化联邦组合模型。