Variance reduced Shapley value estimation for trustworthy data valuation - 专知论文

会员服务 ·

0

可约的 · 估计/估计量 · 方差 · Shapley value · 样本 ·

2023 年 5 月 22 日

Variance reduced Shapley value estimation for trustworthy data valuation

翻译：暂无翻译

Mengmeng Wu,Ruoxi Jia,Changle Lin,Wei Huang,Xiangyu Chang

Data valuation, especially quantifying data value in algorithmic prediction and decision-making, is a fundamental problem in data trading scenarios. The most widely used method is to define the data Shapley and approximate it by means of the permutation sampling algorithm. To make up for the large estimation variance of the permutation sampling that hinders the development of the data marketplace, we propose a more robust data valuation method using stratified sampling, named variance reduced data Shapley (VRDS for short). We theoretically show how to stratify, how many samples are taken at each stratum, and the sample complexity analysis of VRDS. Finally, the effectiveness of VRDS is illustrated in different types of datasets and data removal applications.

翻译：暂无翻译

0

相关内容

可约的

【2022新书】Python数据分析第三版，579页pdf

【2022新书】Python数据分析第三版，579页pdf

专知会员服务

254+阅读 · 2022年8月31日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

TRAP1在赭曲霉毒素A干扰肾细胞凋亡与自噬内稳态中的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

TRPV4在Aβ诱导星形胶质细胞活化及介导神经元死亡中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

甘露消毒丹及其挥发油对IV感染小鼠粘膜免疫及RIG-Ⅰ/NF-κB信号通路作用机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

以优化类石墨烯VIB族过渡金属硫属化合物的光电性质为目标的材料设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于量子点敏化的透明型纳米管阵列基固态太阳能电池

国家自然科学基金

0+阅读 · 2008年12月31日

On the Pointwise Behavior of Recursive Partitioning and Its Implications for Heterogeneous Causal Effect Estimation

Arxiv

0+阅读 · 2023年7月9日

The Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity

Arxiv

0+阅读 · 2023年7月8日

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation

Arxiv

0+阅读 · 2023年7月7日

Reducing Network Load via Message Utility Estimation for Decentralized Multirobot Teams

Arxiv

0+阅读 · 2023年7月6日

Approximating the Shapley Value without Marginal Contributions

Arxiv

0+阅读 · 2023年7月6日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【2022新书】Python数据分析第三版，579页pdf

【2022新书】Python数据分析第三版，579页pdf

专知会员服务

254+阅读 · 2022年8月31日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

因果强化学习的统一框架：综述、分类体系、算法与应用

《无人机系统 - 反无人机系统：测试方法》364页

【MIT博士论文】语言模型的推理时学习算法

美军低成本无人作战攻击系统（LUCAS）：扩大无人机战争规模

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

On the Pointwise Behavior of Recursive Partitioning and Its Implications for Heterogeneous Causal Effect Estimation

Arxiv

0+阅读 · 2023年7月9日

The Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity

Arxiv

0+阅读 · 2023年7月8日

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation

Arxiv

0+阅读 · 2023年7月7日

Reducing Network Load via Message Utility Estimation for Decentralized Multirobot Teams

Arxiv

0+阅读 · 2023年7月6日

Approximating the Shapley Value without Marginal Contributions

Arxiv

0+阅读 · 2023年7月6日

相关基金

TRAP1在赭曲霉毒素A干扰肾细胞凋亡与自噬内稳态中的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

TRPV4在Aβ诱导星形胶质细胞活化及介导神经元死亡中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

甘露消毒丹及其挥发油对IV感染小鼠粘膜免疫及RIG-Ⅰ/NF-κB信号通路作用机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

以优化类石墨烯VIB族过渡金属硫属化合物的光电性质为目标的材料设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于量子点敏化的透明型纳米管阵列基固态太阳能电池

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员