估算基于交叉估价的分类员业绩估计标准误差 (Estimating the standard error of cross-Validation-Based estimators of classifier performance) - 专知论文

会员服务 ·

0

估计/估计量 · Performer · 方差 · 有偏 · Better ·

2021 年 11 月 9 日

Estimating the standard error of cross-Validation-Based estimators of classifier performance

翻译：估算基于交叉估价的分类员业绩估计标准误差

Waleed A. Yousef

First, we analyze the variance of the Cross Validation (CV)-based estimators used for estimating the performance of classification rules. Second, we propose a novel estimator to estimate this variance using the Influence Function (IF) approach that had been used previously very successfully to estimate the variance of the bootstrap-based estimators. The motivation for this research is that, as the best of our knowledge, the literature lacks a rigorous method for estimating the variance of the CV-based estimators. What is available is a set of ad-hoc procedures that have no mathematical foundation since they ignore the covariance structure among dependent random variables. The conducted experiments show that the IF proposed method has small RMS error with some bias. However, surprisingly, the ad-hoc methods still work better than the IF-based method. Unfortunately, this is due to the lack of enough smoothness if compared to the bootstrap estimator. This opens the research for three points: (1) more comprehensive simulation study to clarify when the IF method win or loose; (2) more mathematical analysis to figure out why the ad-hoc methods work well; and (3) more mathematical treatment to figure out the connection between the appropriate amount of "smoothness" and decreasing the bias of the IF method.

翻译：首先,我们分析跨度校验(CV)依据的估测标准的差异。第二,我们提出一个新的估计标准,用以前非常成功地用来估计基于靴的测算器差异的“影响函数(IF)”方法来估计这种差异。首先,我们分析基于跨度校验(CV)的估测标准的差异。第二,我们提出一个新的估计标准,以利用以前非常成功地用来估计基于靴的测算器差异的“影响函数(IF)”方法来估计这种差异。研究的动机是,据我们所知,文献缺乏一种严格的方法来估计基于基于CV(CV)的估测仪的差异。可用的是一套没有数学基础的特设程序,因为它们忽视了依赖性随机变量之间的差异结构。我们进行的实验表明,IFFS建议的方法有小的“RMS”错误,带有某些偏差。然而,令人惊讶的是,基于靴的测算器仍然比基于IFP的方法工作得更好。不幸的是,这是由于与“测算器”的测算器相比缺乏足够的顺畅通性。这为研究打开了三点:(1) :(1)更全面的模拟研究,以澄清IFFFS方法何时获胜或松;(2)更数学分析,以说明为什么“方法与IFFFFFFFFFS的偏差的程度越来越小。

0

相关内容

估计/估计量

估计/估计量

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

专知会员服务

235+阅读 · 2020年1月21日

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

专知会员服务

44+阅读 · 2019年11月20日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

已删除

将门创投

5+阅读 · 2020年3月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Statistically Optimal First Order Algorithms: A Proof via Orthogonalization

Arxiv

0+阅读 · 2022年1月13日

Understanding tree: a tool for estimating one's understanding of conceptual knowledge

Understanding tree: a tool for estimating one's understanding of conceptual knowledge

Arxiv

0+阅读 · 2022年1月13日

A Non-Classical Parameterization for Density Estimation Using Sample Moments

Arxiv

0+阅读 · 2022年1月13日

Certifiable Robustness for Nearest Neighbor Classifiers

Arxiv

0+阅读 · 2022年1月13日

A Method for Estimating the Entropy of Time Series Using Artificial Neural Networks

Arxiv

0+阅读 · 2022年1月13日

On generalization bounds for deep networks based on loss surface implicit regularization

Arxiv

0+阅读 · 2022年1月12日

A comparison of maximum likelihood and absolute moments for the estimation of Hurst exponents in a stationary framework

A comparison of maximum likelihood and absolute moments for the estimation of Hurst exponents in a stationary framework

Arxiv

0+阅读 · 2022年1月11日

Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate

Arxiv

0+阅读 · 2022年1月11日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

专知会员服务

235+阅读 · 2020年1月21日

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

专知会员服务

44+阅读 · 2019年11月20日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】以人为中心的强化学习

任务规划与地形分析：现代复杂环境作战导航体系

认知优势：人工智能在国家安全决策中的核心作用

大模型赋能的具身智能：决策与具身学习综述

相关资讯

已删除

将门创投

5+阅读 · 2020年3月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Statistically Optimal First Order Algorithms: A Proof via Orthogonalization

Arxiv

0+阅读 · 2022年1月13日

Understanding tree: a tool for estimating one's understanding of conceptual knowledge

Understanding tree: a tool for estimating one's understanding of conceptual knowledge

Arxiv

0+阅读 · 2022年1月13日

A Non-Classical Parameterization for Density Estimation Using Sample Moments

Arxiv

0+阅读 · 2022年1月13日

Certifiable Robustness for Nearest Neighbor Classifiers

Arxiv

0+阅读 · 2022年1月13日

A Method for Estimating the Entropy of Time Series Using Artificial Neural Networks

Arxiv

0+阅读 · 2022年1月13日

On generalization bounds for deep networks based on loss surface implicit regularization

Arxiv

0+阅读 · 2022年1月12日

A comparison of maximum likelihood and absolute moments for the estimation of Hurst exponents in a stationary framework

A comparison of maximum likelihood and absolute moments for the estimation of Hurst exponents in a stationary framework

Arxiv

0+阅读 · 2022年1月11日

Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate

Arxiv

0+阅读 · 2022年1月11日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员