排名环境中的交叉估价变式测试 (Testing Cross-Validation Variants in Ranking Environments) - 专知论文

会员服务 ·

0

秩 · 统计量 · 回合 · Performer · Better ·

2021 年 5 月 25 日

Testing Cross-Validation Variants in Ranking Environments

翻译：排名环境中的交叉估价变式测试

Balázs R. Sziklai,Máté Baranyi,Károly Héberger

This research investigates how to determine whether two rankings can come from the same distribution. We evaluate three hybrid tests: Wilcoxon's, Dietterich's, and Alpaydin's statistical tests combined with cross-validation, each operating with folds ranging from 5 to 10, thus altogether 18 variants. We have used the framework of a popular comparative statistical test, the Sum of Ranking Differences, but our results are representative of all ranking environments. To compare these methods, we have followed an innovative approach borrowed from Economics. We designed eight scenarios for testing type I and II errors. These represent typical situations (i.e., different data structures) that cross-validation (CV) tests face routinely. The optimal CV method depends on the preferences regarding the minimization of type I/II errors, size of the input, and expected patterns in the data. The Wilcoxon method with eight folds proved to be the best under all three investigated input sizes, although there were scenarios and decision aspects where other methods, namely Wilcoxon~10 and Alpaydin~10, performed better.

翻译：这项研究调查了如何确定两个排名是否来自同一分布。我们评估了三种混合测试: Wilcoxon, Dittelich's 和 Alpaydin 的统计测试,加上交叉校验,每个测试都以5到10的折叠进行,因此总共是18种变量。我们使用了流行的比较统计测试框架,即分级差异总和,但我们的结果代表了所有排名环境。为了比较这些方法,我们采用了从经济学中借用的创新方法。我们设计了8种测试I类和II错误的假设和决定方面。这些是交叉校验(CV)通常面临的典型情况(即不同的数据结构 ) 。最佳的CV方法取决于关于尽可能减少I/II型误差、输入大小和数据预期模式的偏好。使用8个折的Wilcoxon 方法证明在所有3个调查的投入大小中都是最好的, 尽管存在其他方法( Wilcoxon~ 10 和 Alpaydin~ 10) 的假想和决定方面, 表现得更好。

0

相关内容

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

最近几种小样本元学习简明综述，A Concise Review of Recent Few-shot Meta-learning Methods

最近几种小样本元学习简明综述，A Concise Review of Recent Few-shot Meta-learning Methods

专知会员服务

35+阅读 · 2020年5月25日

【ICML2019 tutorial】主动假设检验:一个信息论的观点（Active Hypothesis Testing: An Information Theoretic (re)View），Tara Javidi

【ICML2019 tutorial】主动假设检验:一个信息论的观点（Active Hypothesis Testing: An Information Theoretic (re)View），Tara Javidi

专知会员服务

8+阅读 · 2019年12月7日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

7+阅读 · 2018年10月12日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

[DLdigest-8] 每日一道算法

[DLdigest-8] 每日一道算法

深度学习每日摘要

4+阅读 · 2017年11月2日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Group Testing in the High Dilution Regime

Arxiv

0+阅读 · 2021年7月15日

Mismatched Binary Hypothesis Testing: Error Exponent Sensitivity

Arxiv

0+阅读 · 2021年7月14日

Spectral Recovery of Binary Censored Block Models

Arxiv

0+阅读 · 2021年7月13日

Exact Recovery of Clusters in Finite Metric Spaces Using Oracle Queries

Arxiv

0+阅读 · 2021年7月13日

The Element Extraction Problem and the Cost of Determinism and Limited Adaptivity in Linear Queries

Arxiv

0+阅读 · 2021年7月13日

Heterogeneous Effects in the Built Environment

Arxiv

0+阅读 · 2021年7月13日

End to end learning and optimization on graphs

Arxiv

7+阅读 · 2019年5月31日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Learning to Importance Sample in Primary Sample Space

Learning to Importance Sample in Primary Sample Space

Arxiv

5+阅读 · 2018年8月23日

A Survey on Multi-Task Learning

Arxiv

5+阅读 · 2017年7月25日

VIP会员

文章信息

相关主题

相关VIP内容

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

最近几种小样本元学习简明综述，A Concise Review of Recent Few-shot Meta-learning Methods

最近几种小样本元学习简明综述，A Concise Review of Recent Few-shot Meta-learning Methods

专知会员服务

35+阅读 · 2020年5月25日

【ICML2019 tutorial】主动假设检验:一个信息论的观点（Active Hypothesis Testing: An Information Theoretic (re)View），Tara Javidi

【ICML2019 tutorial】主动假设检验:一个信息论的观点（Active Hypothesis Testing: An Information Theoretic (re)View），Tara Javidi

专知会员服务

8+阅读 · 2019年12月7日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

7+阅读 · 2018年10月12日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

[DLdigest-8] 每日一道算法

[DLdigest-8] 每日一道算法

深度学习每日摘要

4+阅读 · 2017年11月2日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Group Testing in the High Dilution Regime

Arxiv

0+阅读 · 2021年7月15日

Mismatched Binary Hypothesis Testing: Error Exponent Sensitivity

Arxiv

0+阅读 · 2021年7月14日

Spectral Recovery of Binary Censored Block Models

Arxiv

0+阅读 · 2021年7月13日

Exact Recovery of Clusters in Finite Metric Spaces Using Oracle Queries

Arxiv

0+阅读 · 2021年7月13日

The Element Extraction Problem and the Cost of Determinism and Limited Adaptivity in Linear Queries

Arxiv

0+阅读 · 2021年7月13日

Heterogeneous Effects in the Built Environment

Arxiv

0+阅读 · 2021年7月13日

End to end learning and optimization on graphs

Arxiv

7+阅读 · 2019年5月31日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Learning to Importance Sample in Primary Sample Space

Learning to Importance Sample in Primary Sample Space

Arxiv

5+阅读 · 2018年8月23日

A Survey on Multi-Task Learning

Arxiv

5+阅读 · 2017年7月25日

微信扫码咨询专知VIP会员