FLAMby:现实保健环境中跨西罗联邦学习的数据集和基准 (FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings) - 专知论文

会员服务 ·

0

Learning · 联邦学习 · 数据集 · 情景 · 评论员 ·

2022 年 10 月 17 日

FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings

翻译：FLAMby:现实保健环境中跨西罗联邦学习的数据集和基准

Jean Ogier du Terrail,Samy-Safwan Ayed,Edwige Cyffers,Felix Grimberg,Chaoyang He,Regis Loeb,Paul Mangold,Tanguy Marchand,Othmane Marfoq,Erum Mushtaq,Boris Muzellec,Constantin Philippenko,Santiago Silva,Maria Teleńczuk,Shadi Albarqouni,Salman Avestimehr,Aurélien Bellet,Aymeric Dieuleveut,Martin Jaggi,Sai Praneeth Karimireddy,Marco Lorenzi,Giovanni Neglia,Marc Tommasi,Mathieu Andreux

from arxiv, Accepted to NeurIPS, Datasets and Benchmarks Track

Federated Learning (FL) is a novel approach enabling several clients holding sensitive data to collaboratively train machine learning models, without centralizing data. The cross-silo FL setting corresponds to the case of few ($2$--$50$) reliable clients, each holding medium to large datasets, and is typically found in applications such as healthcare, finance, or industry. While previous works have proposed representative datasets for cross-device FL, few realistic healthcare cross-silo FL datasets exist, thereby slowing algorithmic research in this critical application. In this work, we propose a novel cross-silo dataset suite focused on healthcare, FLamby (Federated Learning AMple Benchmark of Your cross-silo strategies), to bridge the gap between theory and practice of cross-silo FL. FLamby encompasses 7 healthcare datasets with natural splits, covering multiple tasks, modalities, and data volumes, each accompanied with baseline training code. As an illustration, we additionally benchmark standard FL algorithms on all datasets. Our flexible and modular suite allows researchers to easily download datasets, reproduce results and re-use the different components for their research. FLamby is available at~\url{www.github.com/owkin/flamby}.

翻译：联邦学习联盟(FL)是一种新颖的方法,使持有敏感数据的多个客户能够将敏感数据用于合作培训机器学习模式,而没有集中数据。跨SIlo FL设置符合少数($-50美元)可靠客户的情况,每个客户都持有中、大数据集,通常在保健、金融或行业等应用中找到。虽然以前的工作为交叉设计FL提出了具有代表性的数据集,但很少有现实的跨sil保健跨SIlo FL数据集存在,从而减缓了这一关键应用程序的算法研究。在这项工作中,我们提议建立一个新的跨SIlo数据集套件,侧重于医疗保健、FLamby(你跨SIlo战略的联邦学习基准),以弥合跨SIlo FL的理论与实践之间的差距。FL.FLamby包含7个保健数据集,包含多种任务、模式和数据量,每个数据都附有基线培训代码。举例说,我们对所有数据集的标准 FLL的算法进行了进一步基准。我们的灵活和模块化套件可以让研究人员在Fset、复制结果和Re-lab/reus 不同的研究组成部分。Frbby。

0

相关内容

Learning

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

245+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

核因子NF90在肝癌细胞中稳定细胞周期蛋白Cyclin E1 mRNA的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

听力损伤评价方法及计算模型

国家自然科学基金

0+阅读 · 2014年12月31日

深海浮力材料压缩损伤机理及模拟动态服役性能预报

国家自然科学基金

0+阅读 · 2014年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

GIT1CC2结构域在保护脊髓缺血再灌注损伤（SCII）中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

聚集型DNA损伤在不同细胞周期中的修复及抑癌机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于振动响应分析的海洋平台结构损伤检测研究

国家自然科学基金

0+阅读 · 2011年12月31日

小电导Ca2+激活K+通道与ryanodine受体功能性偶联的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup

Arxiv

0+阅读 · 2022年11月21日

Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness

Arxiv

0+阅读 · 2022年11月20日

Scalable Collaborative Learning via Representation Sharing

Arxiv

0+阅读 · 2022年11月20日

Can Differential Privacy Practically Protect Collaborative Deep Learning Inference for the Internet of Things?

Arxiv

0+阅读 · 2022年11月19日

Federated Learning for Healthcare Domain - Pipeline, Applications and Challenges

Arxiv

1+阅读 · 2022年11月19日

FedMT: Federated Learning with Mixed-type Labels

Arxiv

0+阅读 · 2022年11月18日

FedAudio: A Federated Learning Benchmark for Audio Tasks

Arxiv

0+阅读 · 2022年11月18日

Deep Learning for Optimal Volt/VAR Control using Distributed Energy Resources

Arxiv

0+阅读 · 2022年11月17日

Targeted Attention for Generalized- and Zero-Shot Learning

Arxiv

1+阅读 · 2022年11月17日

A Synthetic Dataset for 5G UAV Attacks Based on Observable Network Parameters

Arxiv

0+阅读 · 2022年11月5日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

245+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《全谱战争——从拓宽工具到思考不可思考之事》

《FPV武装无人机的战斗飞行艺术与科学》最新报告

无人机作战：演进、创新与未来战场

《反无人机：用于无人机探测与定位的多输入多输出雷达》最新69页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup

Arxiv

0+阅读 · 2022年11月21日

Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness

Arxiv

0+阅读 · 2022年11月20日

Scalable Collaborative Learning via Representation Sharing

Arxiv

0+阅读 · 2022年11月20日

Can Differential Privacy Practically Protect Collaborative Deep Learning Inference for the Internet of Things?

Arxiv

0+阅读 · 2022年11月19日

Federated Learning for Healthcare Domain - Pipeline, Applications and Challenges

Arxiv

1+阅读 · 2022年11月19日

FedMT: Federated Learning with Mixed-type Labels

Arxiv

0+阅读 · 2022年11月18日

FedAudio: A Federated Learning Benchmark for Audio Tasks

Arxiv

0+阅读 · 2022年11月18日

Deep Learning for Optimal Volt/VAR Control using Distributed Energy Resources

Arxiv

0+阅读 · 2022年11月17日

Targeted Attention for Generalized- and Zero-Shot Learning

Arxiv

1+阅读 · 2022年11月17日

A Synthetic Dataset for 5G UAV Attacks Based on Observable Network Parameters

Arxiv

0+阅读 · 2022年11月5日

相关基金

核因子NF90在肝癌细胞中稳定细胞周期蛋白Cyclin E1 mRNA的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

听力损伤评价方法及计算模型

国家自然科学基金

0+阅读 · 2014年12月31日

深海浮力材料压缩损伤机理及模拟动态服役性能预报

国家自然科学基金

0+阅读 · 2014年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

GIT1CC2结构域在保护脊髓缺血再灌注损伤（SCII）中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

聚集型DNA损伤在不同细胞周期中的修复及抑癌机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于振动响应分析的海洋平台结构损伤检测研究

国家自然科学基金

0+阅读 · 2011年12月31日

小电导Ca2+激活K+通道与ryanodine受体功能性偶联的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员